IMPORTANT: No additional bug fixes or documentation updates
will be released for this version. For the latest information, see the
current release documentation.
Simple Analyzer
editSimple Analyzer
editThe simple
analyzer breaks text into terms whenever it encounters a
character which is not a letter. All terms are lower cased.
Example output
editPOST _analyze { "analyzer": "simple", "text": "The 2 QUICK Brown-Foxes jumped over the lazy dog's bone." }
The above sentence would produce the following terms:
[ the, quick, brown, foxes, jumped, over, the, lazy, dog, s, bone ]
Configuration
editThe simple
analyzer is not configurable.
Definition
editThe simple
analzyer consists of:
- Tokenizer
If you need to customize the simple
analyzer then you need to recreate
it as a custom
analyzer and modify it, usually by adding token filters.
This would recreate the built-in simple
analyzer and you can use it as
a starting point for further customization: