Parse data using ingest node pipelines

You can configure APM Server to use an ingest node to pre-process documents before indexing them in Elasticsearch. A pipeline definition specifies the series of pipelines or processors that will transform each document in a specific way. For example, a pipeline might define one processor that removes a field, followed by another that renames a field.
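
For example, a minimal sketch of such a definition, chaining a remove processor and a rename processor. The field names here (host.ip, user.email) are placeholders for illustration only, not fields the default APM pipelines touch:

{
    "description": "Example only: removes one field, then renames another",
    "processors": [
        {
            "remove": {
                "field": "host.ip"
            }
        },
        {
            "rename": {
                "field": "user.email",
                "target_field": "user.contact"
            }
        }
    ]
}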

Pipelines can be used to ensure data security by removing or obfuscating sensitive information. See data security for an example.

Default ingest pipeline

By default, APM Server registers the apm pipeline definition with Elasticsearch on startup. The apm pipeline defines the following inner pipelines:

apm_user_agent

Adds user_agent information for APM events.

apm_user_geo

Enriches Elastic’s JavaScript RUM Agent data by adding user Geo-IP data to the client.geo field.

apm_ingest_timestamp

Adds an ingest timestamp for APM events.

apm_remove_span_metadata

Added in 7.7. Upgrading? See upgrading to 7.7. To save storage, removes metadata fields, like host, kubernetes, and container, that are already available on the parent transaction.

In previous versions of APM Server, this functionality was hardcoded internally. Switching metadata cleanup from an internal process to a processor allows you to keep any span metadata that is important in your architecture.

apm_error_grouping_name

Added in 7.13. Adds error.grouping_name to error documents for use in the Kibana APM UI.

apm_opentelemetry_metrics

Added in 7.13. Copies well-known OpenTelemetry metrics to their Elastic APM counterparts for visualization in the Kibana APM UI. For example, the OpenTelemetry metric field runtime.jvm.gc.time is copied to the Elastic APM metric field jvm.gc.time.

Metrics are duplicated so you can refer to them by either the OpenTelemetry or Elastic APM metric name.
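
Since this is plain field duplication, one such mapping could be expressed with the set processor's copy_from option (available in Elasticsearch 7.10 and later). This is an illustrative sketch, not the actual apm_opentelemetry_metrics definition:

{
    "set": {
        "field": "jvm.gc.time",
        "copy_from": "runtime.jvm.gc.time",
        "ignore_empty_value": true
    }
}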

See the complete pipeline definition by navigating to the APM Server’s home directory and viewing ingest/pipeline/definition.json.
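
For example, on a Linux DEB or RPM installation (the home directory varies by installation method, so adjust the path accordingly):

cat /usr/share/apm-server/ingest/pipeline/definition.json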

To disable this, or any other pipeline, set output.elasticsearch.pipeline: _none.
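
For example, in apm-server.yml:

output.elasticsearch:
  pipeline: _none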

Custom pipelines

Using custom pipelines involves two steps:

  1. Register the pipeline in Elasticsearch.
  2. Apply the pipeline during data ingestion.

Register pipelines in Elasticsearch

To register a pipeline in Elasticsearch, you can either configure APM Server to register pipelines on startup, or you can manually upload a pipeline definition.

Register pipelines on APM Server startup

Automatic pipeline registration requires output.elasticsearch to be enabled and configured.

Navigate to APM Server’s home directory and find the default pipeline configuration at ingest/pipeline/definition.json. To add, change, or remove pipelines in Elasticsearch, change the definitions in this file and restart your APM Server or run apm-server setup --pipelines.
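
For example, after changing the definitions, re-register them without a restart:

apm-server setup --pipelines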

By default, pipeline registration is enabled.
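
If you manage pipelines yourself, registration can be turned off. A sketch in apm-server.yml, assuming the register.ingest.pipeline settings documented for 7.x; verify the exact keys against your version's configuration reference:

apm-server:
  register.ingest.pipeline:
    enabled: false  # skip pipeline registration on startup (assumed setting)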

Manually upload pipeline definitions

You can manually upload pipeline definitions by describing them in a file. Consider the following sample pipeline in a file named pipeline.json. This pipeline definition converts the value of beat.name to lowercase before indexing each document.

{
    "description": "Test pipeline",
    "processors": [
        {
            "lowercase": {
                "field": "beat.name"
            }
        }
    ]
}

To register this pipeline, run:

curl -H 'Content-Type: application/json' -XPUT 'http://localhost:9200/_ingest/pipeline/test-pipeline' -d @pipeline.json
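
To verify that the pipeline was registered, fetch it back with the get pipeline API:

curl -XGET 'http://localhost:9200/_ingest/pipeline/test-pipeline'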

Apply pipelines during data ingestion

To specify which pipelines to apply during data ingestion, add the pipeline IDs to the pipelines option under output.elasticsearch in the apm-server.yml file:

output.elasticsearch:
  pipelines:
  - pipeline: "test-pipeline"
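
The pipelines option also accepts conditions, so different pipelines can be applied to different event types. A sketch assuming the libbeat-style when conditions supported by the Elasticsearch output, with a hypothetical transaction-pipeline ID:

output.elasticsearch:
  pipelines:
  - pipeline: "transaction-pipeline"  # hypothetical ID, registered separately
    when.contains:
      processor.event: "transaction"
  - pipeline: "test-pipeline"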

More information and examples for applying pipelines are available in the Elasticsearch output pipeline documentation.