- Observability: other versions:
- Get started
- What is Elastic Observability?
- What’s new in 8.17
- Quickstart: Monitor hosts with Elastic Agent
- Quickstart: Monitor your Kubernetes cluster with Elastic Agent
- Quickstart: Monitor hosts with OpenTelemetry
- Quickstart: Unified Kubernetes Observability with Elastic Distributions of OpenTelemetry (EDOT)
- Quickstart: Collect data with AWS Firehose
- Add data from Splunk
- Applications and services
- Application performance monitoring (APM)
- Get started
- Learn about data types
- Collect application data
- View and analyze data
- Act on data
- Use APM securely
- Manage storage
- Configure APM Server
- Monitor APM Server
- APM APIs
- Troubleshooting
- Upgrade
- Release notes
- Known issues
- Synthetic monitoring
- Get started
- Scripting browser monitors
- Configure lightweight monitors
- Manage monitors
- Work with params and secrets
- Analyze monitor data
- Monitor resources on private networks
- Use the CLI
- Configure projects
- Multi-factor Authentication
- Configure Synthetics settings
- Grant users access to secured resources
- Manage data retention
- Use Synthetics with traffic filters
- Migrate from the Elastic Synthetics integration
- Scale and architect a deployment
- Synthetics support matrix
- Synthetics Encryption and Security
- Troubleshooting
- Real user monitoring
- Uptime monitoring (deprecated)
- Tutorial: Monitor a Java application
- Application performance monitoring (APM)
- CI/CD
- Cloud
- Infrastructure and hosts
- Logs
- Troubleshooting
- Incident management
- Data set quality
- Observability AI Assistant
- Reference
Data set quality
editData set quality
edit[beta] This functionality is in beta and is subject to change. The design and code is less mature than official GA features and is being provided as-is with no warranties. Beta features are not subject to the support SLA of official GA features.
The Data Set Quality page provides an overview of your log, metric, trace, and synthetic data sets. Use this information to get an idea of your overall data set quality and find data sets that contain incorrectly parsed documents.
To open Data Set Quality, find Stack Management in the main menu or use the global search field. By default, the page only shows log data sets. To see other data set types, select them from the Type menu.
Requirements
Users with the viewer
role can view the Data Sets Quality summary. To view the Active Data Sets and Estimated Data summaries, users need the monitor
index privilege for the logs-*-*
index.
The quality of your data sets is based on the percentage of degraded documents in each data set. A degraded document in a data set contains the _ignored property because one or more of its fields were ignored during indexing. Fields are ignored for a variety of reasons. For example, when the ignore_malformed parameter is set to true, if a document field contains the wrong data type, the malformed field is ignored and the rest of the document is indexed.
From the data set table, you’ll find information for each data set such as its namespace, size, when the data set was last active, and the percentage of degraded docs. The percentage of degraded documents determines the data set’s quality according to the following scale:
- Good (): 0% of the documents in the data set are degraded.
- Degraded (): Greater than 0% and up to 3% of the documents in the data set are degraded.
- Poor (): Greater than 3% of the documents in the data set are degraded.
Opening the details of a specific data set shows the degraded documents history, a summary for the data set, and other details that can help you determine if you need to investigate any issues.
Investigate issues
editThe Data Set Quality page has a couple of different ways to help you find ignored fields and investigate issues. From the data set table, you can open the data set’s details page, and view commonly ignored fields and information about those fields. Open a logs data set in Logs Explorer or other data set types in Discover to find ignored fields in individual documents.
Find ignored fields in data sets
editTo open the details page for a data set with poor or degraded quality and view ignored fields:
- From the data set table, click next to a data set with poor or degraded quality.
- From the details page, scroll down to Quality issues.
The Quality issues section shows fields that were ignored during ingest, the number of documents that contain ignored fields, and the timestamp of last occurrence of the field being ignored.
Find ignored fields in individual documents
editTo use Logs Explorer or Discover to find ignored fields in individual documents:
- Find data sets with degraded documents using the Degraded Docs column of the data sets table.
- Click the percentage in the Degraded Docs column to open the data set in Logs Explorer or Discover.
The Documents table in Logs Explorer or Discover is automatically filtered to show documents that were not parsed correctly. Under the actions column, you’ll find the degraded document icon.
Now that you know which documents contain ignored fields, examine them more closely to find the origin of the issue:
- Under the actions column, click to open the document details.
- Select the JSON tab.
-
Scroll towards the end of the JSON to find the
ignored_field_values
.
Here, you’ll find all of the _ignored
fields in the document and their values, which should provide some clues as to why the fields were ignored.
On this page