Class CompletePreprocessingPipeline

java.lang.Object
org.carrot2.attrs.AttrComposite
org.carrot2.text.preprocessing.CompletePreprocessingPipeline
All Implemented Interfaces:
AcceptingVisitor, ContextPreprocessor

public class CompletePreprocessingPipeline
extends AttrComposite
implements ContextPreprocessor
Performs a complete preprocessing on the provided documents. The preprocessing consists of the following steps:
  1. InputTokenizer
  2. CaseNormalizer
  3. LanguageModelStemmer
  4. StopListMarker
  5. PhraseExtractor
  6. LabelFilterProcessor
  7. DocumentAssigner