NOTE: You are looking at documentation for an older release. For the latest information, see the current release documentation.
Stop Token Filter
editStop Token Filter
editA token filter of type stop
that removes stop words from token
streams.
The following are settings that can be set for a stop
token filter
type:
|
A list of stop words to use. Defaults to |
|
A path (either relative to |
|
Set to |
|
Set to |
The stopwords
parameter accepts either an array of stopwords:
PUT /my_index { "settings": { "analysis": { "filter": { "my_stop": { "type": "stop", "stopwords": ["and", "is", "the"] } } } } }
or a predefined language-specific list:
PUT /my_index { "settings": { "analysis": { "filter": { "my_stop": { "type": "stop", "stopwords": "_english_" } } } } }
Elasticsearch provides the following predefined list of languages:
_arabic_
, _armenian_
, _basque_
, _bengali_
, _brazilian_
, _bulgarian_
,
_catalan_
, _czech_
, _danish_
, _dutch_
, _english_
, _finnish_
,
_french_
, _galician_
, _german_
, _greek_
, _hindi_
, _hungarian_
,
_indonesian_
, _irish_
, _italian_
, _latvian_
, _norwegian_
, _persian_
,
_portuguese_
, _romanian_
, _russian_
, _sorani_
, _spanish_
,
_swedish_
, _thai_
, _turkish_
.
For the empty stopwords list (to disable stopwords) use: _none_
.