WARNING: Version 6.1 of Elasticsearch has passed its EOL date.
This documentation is no longer being maintained and may be removed. If you are running this version, we strongly advise you to upgrade. For the latest information, see the current release documentation.
CJK Width Token Filter
The cjk_width token filter normalizes CJK width differences:
- Folds fullwidth ASCII variants into the equivalent basic Latin
- Folds halfwidth Katakana variants into the equivalent Kana
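The two folds can be illustrated outside Elasticsearch with a small Python sketch. This is only an approximation for illustration, not the filter's actual implementation: the function name fold_cjk_width is hypothetical, the code point ranges (U+FF01 to U+FF5E for fullwidth ASCII, U+FF65 to U+FF9F for halfwidth Katakana) are taken from the Unicode Halfwidth and Fullwidth Forms block, and halfwidth voiced sound marks are mapped on their own rather than combined with the preceding kana.

```python
import unicodedata


def fold_cjk_width(text: str) -> str:
    """Approximate the two width folds (hypothetical helper, not Lucene's code)."""
    out = []
    for ch in text:
        cp = ord(ch)
        if 0xFF01 <= cp <= 0xFF5E:
            # Fullwidth ASCII variants sit at a fixed offset (0xFEE0)
            # from their basic Latin equivalents.
            out.append(chr(cp - 0xFEE0))
        elif 0xFF65 <= cp <= 0xFF9F:
            # Halfwidth Katakana variants: rely on the Unicode
            # compatibility mapping of the single character.
            out.append(unicodedata.normalize("NFKC", ch))
        else:
            # Everything else passes through unchanged.
            out.append(ch)
    return "".join(out)


latin = fold_cjk_width("\uFF25\uFF4C\uFF41\uFF53\uFF54\uFF49\uFF43\uFF11\uFF12\uFF13")
kana = fold_cjk_width("\uFF76\uFF80\uFF76\uFF85")
print(latin)  # Elastic123
print(kana)   # カタカナ
```

The first input is the fullwidth spelling of "Elastic123" and the second is the halfwidth spelling of "カタカナ". Text that is already in basic Latin or regular Kana is left as it is.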
This token filter can be viewed as a subset of NFKC/NFKD Unicode normalization. See the analysis-icu plugin for full normalization support.
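To give a rough sense of what "subset" means here, the sketch below applies NFKC using Python's standard unicodedata module. It does not call Elasticsearch or the analysis-icu plugin, and the sample names are made up for illustration. The first two samples are width variants of the kind listed above. The last two (a Latin ligature and a circled digit) are compatibility characters outside those two folds, so they show the sort of extra work full normalization does.

```python
import unicodedata

# Hypothetical sample inputs, written as escapes to keep the source ASCII-safe.
samples = {
    "fullwidth_latin": "\uFF21\uFF22\uFF23",  # fullwidth "ABC"
    "halfwidth_katakana": "\uFF76\uFF85",     # halfwidth "カナ"
    "ligature": "\uFB01",                     # the "fi" ligature
    "circled_digit": "\u2460",                # circled digit one
}

# NFKC replaces compatibility characters with their plain equivalents.
normalized = {
    name: unicodedata.normalize("NFKC", text) for name, text in samples.items()
}

for name, text in normalized.items():
    print(f"{name}: {text}")
```

Running this prints "ABC" and "カナ" for the width variants, and "fi" and "1" for the ligature and the circled digit.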