Aggregation¶
On this page
Aggregation operations process data records and return computed results. Aggregation operations group values from multiple documents together, and can perform a variety of operations on the grouped data to return a single result. MongoDB provides three ways to perform aggregation: the aggregation pipeline, the map-reduce function, and single purpose aggregation methods.
Aggregation Pipeline¶
MongoDB’s aggregation framework is modeled on the concept of data processing pipelines. Documents enter a multi-stage pipeline that transforms the documents into an aggregated result. For example:
In the example,
First Stage: The $match
stage filters the documents by
the status
field and passes to the next stage those documents that
have status
equal to "A"
.
Second Stage: The $group
stage groups the documents by
the cust_id
field to calculate the sum of the amount for each
unique cust_id
.
The most basic pipeline stages provide filters that operate like queries and document transformations that modify the form of the output document.
Other pipeline operations provide tools for grouping and sorting documents by specific field or fields as well as tools for aggregating the contents of arrays, including arrays of documents. In addition, pipeline stages can use operators for tasks such as calculating the average or concatenating a string.
The pipeline provides efficient data aggregation using native operations within MongoDB, and is the preferred method for data aggregation in MongoDB.
The aggregation pipeline can operate on a sharded collection.
The aggregation pipeline can use indexes to improve its performance during some of its stages. In addition, the aggregation pipeline has an internal optimization phase. See Pipeline Operators and Indexes and Aggregation Pipeline Optimization for details.
Map-Reduce¶
Tip
Aggregation pipeline provides better performance and a more coherent interface than map-reduce.
For examples of aggregation alternatives to map-reduce operations, see Map-Reduce Examples. See also Map-Reduce to Aggregation Pipeline.
MongoDB also provides map-reduce operations to perform aggregation. Map-reduce uses custom JavaScript functions to perform the map and reduce operations, as well as the optional finalize operation.
Single Purpose Aggregation Operations¶
MongoDB also provides db.collection.estimatedDocumentCount()
,
db.collection.count()
and db.collection.distinct()
.
All of these operations aggregate documents from a single collection. While these operations provide simple access to common aggregation processes, they lack the flexibility and capabilities of the aggregation pipeline and map-reduce.
Additional Features and Behaviors¶
For a feature comparison of the aggregation pipeline, map-reduce, and the special group functionality, see Aggregation Commands Comparison.