Smart Chinese analysis plugin
The Smart Chinese Analysis plugin integrates Lucene’s Smart Chinese analysis module into Elasticsearch.
It provides an analyzer for Chinese or mixed Chinese-English text. This analyzer uses probabilistic knowledge to find the optimal word segmentation for Simplified Chinese text. The text is first broken into sentences, then each sentence is segmented into words.
Installation
This plugin can be installed using the plugin manager:
sudo bin/elasticsearch-plugin install analysis-smartcn
The plugin must be installed on every node in the cluster, and each node must be restarted after installation.
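Because the plugin has to be present on every node, it can help to check the cluster after the restarts. The sketch below assumes you have already fetched the output of `GET _cat/plugins?format=json` and parsed it into a list of rows with `name` (node name) and `component` (plugin name) fields; the helper name `nodes_missing_plugin` and the sample node names are hypothetical.

```python
PLUGIN = "analysis-smartcn"


def nodes_missing_plugin(cat_plugins_rows, node_names, plugin=PLUGIN):
    """Return the sorted node names that do not report `plugin`.

    cat_plugins_rows: parsed JSON rows, assumed to look like
        {"name": <node name>, "component": <plugin name>, ...}
    node_names: every node expected to be in the cluster.
    """
    nodes_with_plugin = {
        row["name"] for row in cat_plugins_rows if row.get("component") == plugin
    }
    return sorted(set(node_names) - nodes_with_plugin)


# Hypothetical sample data standing in for a real cluster response.
rows = [
    {"name": "node-1", "component": "analysis-smartcn", "version": "8.17.0"},
    {"name": "node-2", "component": "analysis-icu", "version": "8.17.0"},
]
missing = nodes_missing_plugin(rows, ["node-1", "node-2", "node-3"])
print(missing)  # ['node-2', 'node-3']
```

Any node listed in the result still needs the plugin installed and a restart.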
You can download this plugin for offline install from https://artifacts.elastic.co/downloads/elasticsearch-plugins/analysis-smartcn/analysis-smartcn-8.17.0.zip. To verify the .zip file, use the SHA hash or ASC key.
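One way to check a downloaded archive against a published hash is sketched below. It assumes the hash is SHA-512 and that the published hash file holds the hex digest as its first whitespace-separated field; both are assumptions, so adjust `algorithm` to whatever the hash file you download actually uses. The function names are illustrative.

```python
import hashlib
import hmac


def file_digest_hex(path, algorithm="sha512", chunk_size=1 << 20):
    """Hash a file in chunks so large archives are not loaded into memory."""
    digest = hashlib.new(algorithm)
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, published_line, algorithm="sha512"):
    """Compare a file's digest with the first field of a published hash line.

    Assumption: the line looks like "<hex digest>  <file name>" or is just
    the hex digest on its own.
    """
    fields = published_line.split()
    if not fields:
        return False
    expected = fields[0].lower()
    return hmac.compare_digest(file_digest_hex(path, algorithm), expected)
```

For example, `matches_published_hash("analysis-smartcn-8.17.0.zip", open("analysis-smartcn-8.17.0.zip.sha512").read())` would return `True` only when the digests agree, where the `.sha512` file name is an assumed local copy of the published hash.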
Removal
The plugin can be removed with the following command:
sudo bin/elasticsearch-plugin remove analysis-smartcn
The node must be stopped before removing the plugin.
smartcn tokenizer and token filter
The plugin provides the smartcn analyzer, the smartcn_tokenizer tokenizer, and the smartcn_stop token filter, none of which are configurable.
The smartcn_word token filter and smartcn_sentence have been deprecated.
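Since the components themselves take no options, any customization has to come from how they are combined. As a rough sketch, the Python below builds two JSON request bodies: index settings that define a custom analyzer from smartcn_tokenizer and smartcn_stop, and a body for the `_analyze` API that uses the built-in smartcn analyzer. The analyzer name `my_smartcn` and the sample text are hypothetical, and the code only constructs the bodies; sending them to a cluster is left to whichever client you use.

```python
import json


def smartcn_index_settings(analyzer_name="my_smartcn"):
    """Index-creation body defining a custom analyzer from the plugin's parts.

    `analyzer_name` is a made-up name for illustration. Other token filters
    could be added to the "filter" list alongside smartcn_stop.
    """
    return {
        "settings": {
            "analysis": {
                "analyzer": {
                    analyzer_name: {
                        "type": "custom",
                        "tokenizer": "smartcn_tokenizer",
                        "filter": ["smartcn_stop"],
                    }
                }
            }
        }
    }


def analyze_request(text, analyzer="smartcn"):
    """Body for the _analyze API, using the plugin's built-in analyzer."""
    return {"analyzer": analyzer, "text": text}


print(json.dumps(smartcn_index_settings(), indent=2))
print(json.dumps(analyze_request("我爱北京天安门"), ensure_ascii=False))
```

The first body would be sent when creating an index, and the second to the `_analyze` endpoint to inspect how a piece of text is segmented.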