Class ExtendedWhitespaceTokenizer

java.lang.Object
org.carrot2.language.ExtendedWhitespaceTokenizer
All Implemented Interfaces:
Tokenizer

public final class ExtendedWhitespaceTokenizer
extends Object
implements Tokenizer
A tokenizer separating input characters on whitespace, but capable of extracting more complex tokens, such as URLs, e-mail addresses and sentence delimiters.