Package org.carrot2.text.preprocessing.filter

  • Interface Summary 
    Interface Description
    LabelFilter
    Defines the contract for label filtering components, which mark words and phrases that should not be considered as candidates for cluster labels.
  • Class Summary 
    Class Description
    CompleteLabelFilter
    A filter that removes "incomplete" labels.
    GenitiveLabelFilter
    Accepts labels that do not end in a word in the Saxon Genitive form.
    MinLengthLabelFilter
    Accepts labels whose length in characters is greater than or equal to the provided value.
    NumericLabelFilter
    Accepts labels that start with a non-numeric token.
    QueryLabelFilter
    Accepts labels that do not consist only of query words.
    SingleLabelFilterBase
    A base for LabelFilter implementations that handle each label independently.
    StopLabelFilter
    Accepts labels that are not declared as stop labels in the stoplabels.<lang> files.
    StopWordLabelFilter
    Accepts words that are not stop words and phrases that neither start nor end with a stop word.
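
The acceptance rules summarized above for MinLengthLabelFilter, NumericLabelFilter and StopWordLabelFilter can be illustrated with a small standalone sketch. It does not use the Carrot2 API: the class LabelFilterSketch, its method names, and the STOP_WORDS set are hypothetical, a label is modeled as a list of word strings, label length is assumed to be the length of the words joined by single spaces, and a "numeric token" is assumed to mean a token made up only of digits. The real classes may differ on any of these points.

```java
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Standalone illustration only; none of these names come from Carrot2.
public class LabelFilterSketch {

    // Hypothetical stop word list, standing in for a language resource.
    static final Set<String> STOP_WORDS = Set.of("the", "of", "and", "in");

    // Min-length rule: accept when the label has at least minLength characters.
    // Assumption: length is measured on the words joined by single spaces.
    static boolean hasMinLength(List<String> label, int minLength) {
        return String.join(" ", label).length() >= minLength;
    }

    // Numeric rule: accept when the first token is non-numeric.
    // Assumption: "numeric" means the token consists only of digits.
    static boolean startsWithNonNumeric(List<String> label) {
        return !label.isEmpty() && !label.get(0).matches("\\d+");
    }

    // Stop word rule: accept when neither the first nor the last word is a
    // stop word. For a single word, first and last are the same word.
    static boolean hasNoStopWordAtEdges(List<String> label) {
        if (label.isEmpty()) {
            return false;
        }
        String first = label.get(0).toLowerCase(Locale.ROOT);
        String last = label.get(label.size() - 1).toLowerCase(Locale.ROOT);
        return !STOP_WORDS.contains(first) && !STOP_WORDS.contains(last);
    }

    // A label is a candidate only when every rule accepts it.
    static boolean accept(List<String> label, int minLength) {
        return hasMinLength(label, minLength)
                && startsWithNonNumeric(label)
                && hasNoStopWordAtEdges(label);
    }

    public static void main(String[] args) {
        List<List<String>> labels = List.of(
                List.of("data", "mining"),
                List.of("2008", "election"),
                List.of("the", "internet"),
                List.of("of"));
        for (List<String> label : labels) {
            System.out.println(String.join(" ", label) + " -> " + accept(label, 3));
        }
    }
}
```

Each rule here looks at one label in isolation, which is the kind of per-label handling the SingleLabelFilterBase description refers to. Combining the rules with a logical AND is a choice made for this sketch, not a statement about how the package chains its filters.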