IMPORTANT: No additional bug fixes or documentation updates
will be released for this version. For the latest information, see the
current release documentation.
nori analyzer
editnori
analyzer
editThe nori
analyzer consists of the following tokenizer and token filters:
-
nori_tokenizer
-
nori_part_of_speech
token filter -
nori_readingform
token filter -
lowercase
token filter
It supports the decompound_mode
and user_dictionary
settings from
nori_tokenizer
and the stoptags
setting from
nori_part_of_speech
.