Stop Token Filter
A token filter of type stop that removes stop words from token streams.
The following settings can be set for a stop token filter type:
|
A list of stop words to use. Defaults to |
|
A path (either relative to |
|
Set to |
|
Set to |
The stopwords parameter accepts either an array of stopwords:
PUT /my_index
{
  "settings": {
    "analysis": {
      "filter": {
        "my_stop": {
          "type": "stop",
          "stopwords": ["and", "is", "the"]
        }
      }
    }
  }
}
or a predefined language-specific list:
PUT /my_index
{
  "settings": {
    "analysis": {
      "filter": {
        "my_stop": {
          "type": "stop",
          "stopwords": "_english_"
        }
      }
    }
  }
}
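Defining the filter does not apply it on its own; an analyzer has to reference it. The following is a minimal sketch of one way to wire it up and check the result. The analyzer name my_analyzer, the choice of the standard tokenizer and the lowercase filter, and the sample text are all illustrative assumptions rather than part of the settings above, and the exact request shape can vary between Elasticsearch versions.

```console
# my_analyzer is a hypothetical name; it chains lowercase before my_stop
PUT /my_index
{
  "settings": {
    "analysis": {
      "filter": {
        "my_stop": {
          "type": "stop",
          "stopwords": ["and", "is", "the"]
        }
      },
      "analyzer": {
        "my_analyzer": {
          "type": "custom",
          "tokenizer": "standard",
          "filter": ["lowercase", "my_stop"]
        }
      }
    }
  }
}

# Run sample text through the analyzer to see which tokens survive
GET /my_index/_analyze
{
  "analyzer": "my_analyzer",
  "text": "The fox is quick and clever"
}
```

Under these assumptions the response should list only the tokens fox, quick and clever. Placing lowercase ahead of my_stop is a design choice in this sketch: it lets the capitalized "The" match the lowercase entry in the stopwords array.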
Elasticsearch provides the following predefined list of languages:
_arabic_, _armenian_, _basque_, _bengali_, _brazilian_, _bulgarian_, _catalan_, _czech_, _danish_, _dutch_, _english_, _finnish_, _french_, _galician_, _german_, _greek_, _hindi_, _hungarian_, _indonesian_, _irish_, _italian_, _latvian_, _norwegian_, _persian_, _portuguese_, _romanian_, _russian_, _sorani_, _spanish_, _swedish_, _thai_, _turkish_.
To use an empty stopwords list (that is, to disable stop word removal), specify _none_.
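As an illustration, a filter configured with _none_ could be declared in the same way as the earlier examples. This is only a sketch: the index name my_unfiltered_index and the filter name my_no_stop are hypothetical and chosen here to avoid clashing with the names used above.

```console
# Hypothetical index and filter names; the filter keeps every token
PUT /my_unfiltered_index
{
  "settings": {
    "analysis": {
      "filter": {
        "my_no_stop": {
          "type": "stop",
          "stopwords": "_none_"
        }
      }
    }
  }
}
```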