IMPORTANT: No additional bug fixes or documentation updates
will be released for this version. For the latest information, see the
current release documentation.
Whitespace Tokenizer
editWhitespace Tokenizer
editThe whitespace
tokenizer breaks text into terms whenever it encounters a
whitespace character.
Example output
editPOST _analyze { "tokenizer": "whitespace", "text": "The 2 QUICK Brown-Foxes jumped over the lazy dog's bone." }
The above sentence would produce the following terms:
[ The, 2, QUICK, Brown-Foxes, jumped, over, the, lazy, dog's, bone. ]
Configuration
editThe whitespace
tokenizer is not configurable.