IMPORTANT: No additional bug fixes or documentation updates will be released for this version. For the latest information, see the current release documentation.

ICU Transform Token Filter

Transforms are used to process Unicode text in many different ways, such as case mapping, normalization, transliteration and bidirectional text handling.

Use the id parameter to select the transform to apply (defaults to Null), and the dir parameter to set the direction: forward (the default) for LTR or reverse for RTL. Custom rulesets are not yet supported.

For example:

PUT icu_sample
{
  "settings": {
    "index": {
      "analysis": {
        "analyzer": {
          "latin": {
            "tokenizer": "keyword",
            "filter": [
              "myLatinTransform"
            ]
          }
        },
        "filter": {
          "myLatinTransform": {
            "type": "icu_transform",
            "id": "Any-Latin; NFD; [:Nonspacing Mark:] Remove; NFC" 
          }
        }
      }
    }
  }
}

GET icu_sample/_analyze?analyzer=latin
{
  "text": "你好" 
}

GET icu_sample/_analyze?analyzer=latin
{
  "text": "здравствуйте" 
}

GET icu_sample/_analyze?analyzer=latin
{
  "text": "こんにちは" 
}

The transform in the id setting transliterates characters to Latin, separates accents from their base characters, removes the accents, and then recomposes the remaining text into an unaccented form.
The first request returns `ni hao`.
The second request returns `zdravstvujte`.
The third request returns `kon'nichiha`.
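
To illustrate what the `NFD; [:Nonspacing Mark:] Remove; NFC` part of the transform id does, here is a minimal Python sketch that approximates those three steps with the standard unicodedata module. It does not use ICU and does not cover the Any-Latin transliteration step. The function name strip_accents is hypothetical, and the input `nǐ hǎo` is an assumed intermediate result of Any-Latin, not output taken from the plugin.

```python
import unicodedata


def strip_accents(text: str) -> str:
    """Approximate "NFD; [:Nonspacing Mark:] Remove; NFC" without ICU."""
    # NFD: split each accented character into its base character
    # followed by separate combining marks.
    decomposed = unicodedata.normalize("NFD", text)
    # [:Nonspacing Mark:] Remove: drop every character whose Unicode
    # general category is Mn (nonspacing mark).
    without_marks = "".join(
        ch for ch in decomposed if unicodedata.category(ch) != "Mn"
    )
    # NFC: recompose what is left into canonical composed form.
    return unicodedata.normalize("NFC", without_marks)


if __name__ == "__main__":
    # Assumed intermediate output of the Any-Latin step for the first request.
    print(strip_accents("nǐ hǎo"))  # prints "ni hao"
```

Text that carries no combining marks, such as `zdravstvujte`, passes through this sketch unchanged.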

For more information, see the ICU Transform user guide.
