There are several token filters available which try to normalize special characters of a certain language.
Arabic
arabic_normalization
German
german_normalization
Hindi
hindi_normalization
Indic
indic_normalization
Kurdish (Sorani)
sorani_normalization
Persian
persian_normalization
Scandinavian
scandinavian_normalization, scandinavian_folding
scandinavian_normalization
scandinavian_folding
Serbian
not-released-yet[serbian_normalization],
serbian_normalization
Most Popular
Video
Get Started with Elasticsearch
Intro to Kibana
ELK for Logs & Metrics