IMPORTANT: No additional bug fixes or documentation updates
will be released for this version. For the latest information, see the
current release documentation.
polish_stop token filter
editpolish_stop
token filter
editThe polish_stop
token filter filters out Polish stopwords (_polish_
), and
any other custom stopwords specified by the user. This filter only supports
the predefined _polish_
stopwords list. If you want to use a different
predefined list, then use the
stop
token filter instead.
PUT /polish_stop_example { "settings": { "index": { "analysis": { "analyzer": { "analyzer_with_stop": { "tokenizer": "standard", "filter": [ "lowercase", "polish_stop" ] } }, "filter": { "polish_stop": { "type": "polish_stop", "stopwords": [ "_polish_", "jeść" ] } } } } } } GET polish_stop_example/_analyze { "analyzer": "analyzer_with_stop", "text": "Gdzie kucharek sześć, tam nie ma co jeść." }
The above request returns:
{ "tokens" : [ { "token" : "kucharek", "start_offset" : 6, "end_offset" : 14, "type" : "<ALPHANUM>", "position" : 1 }, { "token" : "sześć", "start_offset" : 15, "end_offset" : 20, "type" : "<ALPHANUM>", "position" : 2 } ] }