IMPORTANT: No additional bug fixes or documentation updates will be released for this version. For the latest information, see the current release documentation.

« Children aggregation Date histogram aggregation »

› › ›

Composite aggregation

edit

Composite aggregation

edit

The composite aggregation is expensive. Load test your application before deploying a composite aggregation in production.

A multi-bucket aggregation that creates composite buckets from different sources.

Unlike the other multi-bucket aggregations, you can use the composite aggregation to paginate all buckets from a multi-level aggregation efficiently. This aggregation provides a way to stream all buckets of a specific aggregation, similar to what scroll does for documents.

The composite buckets are built from the combinations of the values extracted/created for each document and each combination is considered as a composite bucket.

For example, consider the following document:

{
  "keyword": ["foo", "bar"],
  "number": [23, 65, 76]
}

Using keyword and number as source fields for the aggregation results in the following composite buckets:

{ "keyword": "foo", "number": 23 }
{ "keyword": "foo", "number": 65 }
{ "keyword": "foo", "number": 76 }
{ "keyword": "bar", "number": 23 }
{ "keyword": "bar", "number": 65 }
{ "keyword": "bar", "number": 76 }

Value sources

edit

The sources parameter defines the source fields to use when building composite buckets. The order that the sources are defined controls the order that the keys are returned.

You must use a unique name when defining sources.

The sources parameter can be any of the following types:

Terms

edit

The terms value source is similar to a simple terms aggregation. The values are extracted from a field exactly like the terms aggregation.

Example:

response = client.search(
  body: {
    size: 0,
    aggregations: {
      my_buckets: {
        composite: {
          sources: [
            {
              product: {
                terms: {
                  field: 'product'
                }
              }
            }
          ]
        }
      }
    }
  }
)
puts response

GET /_search
{
  "size": 0,
  "aggs": {
    "my_buckets": {
      "composite": {
        "sources": [
          { "product": { "terms": { "field": "product" } } }
        ]
      }
    }
  }
}

	This index is sorted by `username` first then by `timestamp`.
	… in ascending order for the `username` field and in descending order for the `timestamp` field. could be used to optimize these composite aggregations:

	`user_name` is a prefix of the index sort and the order matches (`asc`).
	`timestamp` matches also the prefix and the order matches (`desc`).

The Search AI Company

Generative AI

Search

Security

Observability

By solution

Industries

Composite aggregation

Composite aggregation

Value sources

Terms

Histogram

Date histogram

GeoTile grid

Mixing different value sources

Order

Missing bucket

Size

Pagination

Early termination

Sub-aggregations

Pipeline aggregations

Follow us

About us

Join us

Partners

Trust & Security

Investor relations

Excellence Awards