WARNING: Version 2.0 of Elasticsearch has passed its EOL date.
This documentation is no longer being maintained and may be removed. If you are running this version, we strongly advise you to upgrade. For the latest information, see the current release documentation.
Stemmer Token Filter
editStemmer Token Filter
editA filter that provides access to (almost) all of the available stemming token filters through a single unified interface. For example:
{ "index" : { "analysis" : { "analyzer" : { "my_analyzer" : { "tokenizer" : "standard", "filter" : ["standard", "lowercase", "my_stemmer"] } }, "filter" : { "my_stemmer" : { "type" : "stemmer", "name" : "light_german" } } } } }
The language
/name
parameter controls the stemmer with the following
available values (the preferred filters are marked in bold):
Arabic |
|
Armenian |
|
Basque |
|
Brazilian Portuguese |
|
Bulgarian |
|
Catalan |
|
Czech |
|
Danish |
|
Dutch |
|
English |
|
Finnish |
|
French |
|
Galician |
|
German |
|
Greek |
|
Hindi |
|
Hungarian |
|
Indonesian |
|
Irish |
|
Italian |
|
Kurdish (Sorani) |
|
Latvian |
|
Norwegian (Bokmål) |
|
Norwegian (Nynorsk) |
|
Portuguese |
|
Romanian |
|
Russian |
|
Spanish |
|
Swedish |
|
Turkish |