WARNING: Version 2.0 of Elasticsearch has passed its EOL date.
This documentation is no longer being maintained and may be removed. If you are running this version, we strongly advise you to upgrade. For the latest information, see the current release documentation.
Keep Words Token Filter
editKeep Words Token Filter
editA token filter of type keep
that only keeps tokens with text contained in a
predefined set of words. The set of words can be defined in the settings or
loaded from a text file containing one word per line.
Options
edit
keep_words |
a list of words to keep |
keep_words_path |
a path to a words file |
keep_words_case |
a boolean indicating whether to lower case the words (defaults to |
Settings example
edit{ "index" : { "analysis" : { "analyzer" : { "my_analyzer" : { "tokenizer" : "standard", "filter" : ["standard", "lowercase", "words_till_three"] }, "my_analyzer1" : { "tokenizer" : "standard", "filter" : ["standard", "lowercase", "words_on_file"] } }, "filter" : { "words_till_three" : { "type" : "keep", "keep_words" : [ "one", "two", "three"] }, "words_on_file" : { "type" : "keep", "keep_words_path" : "/path/to/word/file" } } } } }