Normalization Token Filter

edit

There are several token filters available which try to normalize special characters of a certain language.

Arabic

arabic_normalization

German

german_normalization

Hindi

hindi_normalization

Indic

indic_normalization

Kurdish (Sorani)

sorani_normalization

Persian

persian_normalization

Scandinavian

scandinavian_normalization, scandinavian_folding

Serbian

not-released-yet[serbian_normalization],