Stemmer token filter
editStemmer token filter
editA filter that provides access to (almost) all of the available stemming token filters through a single unified interface. For example:
PUT /my_index { "settings": { "analysis" : { "analyzer" : { "my_analyzer" : { "tokenizer" : "standard", "filter" : ["lowercase", "my_stemmer"] } }, "filter" : { "my_stemmer" : { "type" : "stemmer", "name" : "light_german" } } } } }
The language
/name
parameter controls the stemmer with the following
available values (the preferred filters are marked in bold):
Arabic |
|
Armenian |
|
Basque |
|
Bengali |
|
Brazilian Portuguese |
|
Bulgarian |
|
Catalan |
|
Czech |
|
Danish |
|
Dutch |
|
English |
|
Finnish |
|
French |
|
Galician |
|
German |
|
Greek |
|
Hindi |
|
Hungarian |
|
Indonesian |
|
Irish |
|
Italian |
|
Kurdish (Sorani) |
|
Latvian |
|
Lithuanian |
|
Norwegian (Bokmål) |
|
Norwegian (Nynorsk) |
|
Portuguese |
|
Romanian |
|
Russian |
|
Spanish |
|
Swedish |
|
Turkish |