Package org.carrot2.text.preprocessing
Class LabelFilterProcessor
java.lang.Object
  org.carrot2.attrs.AttrComposite
    org.carrot2.text.preprocessing.LabelFilterProcessor

All Implemented Interfaces: AcceptingVisitor
public class LabelFilterProcessor extends AttrComposite
Applies basic filtering to words and phrases to produce candidates for cluster labels. Filtering is applied to PreprocessingContext.AllWords and PreprocessingContext.AllPhrases, and the results are saved to PreprocessingContext.AllLabels. The filters currently applied are those declared as fields of this class.
This class requires that InputTokenizer, CaseNormalizer, StopListMarker and PhraseExtractor be invoked first.
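The data flow described above (words and phrases in, label candidates out) can be pictured with a small self-contained sketch. It does not use the Carrot2 API: the class `LabelCandidateSketch`, the method `candidates` and the two predicates are hypothetical stand-ins, and the real processor works on the structures held in PreprocessingContext rather than on plain strings.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Predicate;

/** Illustrative only: none of these names belong to the Carrot2 API. */
class LabelCandidateSketch {

  /** Returns the words and phrases that every filter accepts, words first. */
  static List<String> candidates(
      List<String> words, List<String> phrases, List<Predicate<String>> filters) {
    List<String> labels = new ArrayList<>();
    for (String word : words) {
      if (acceptedByAll(word, filters)) {
        labels.add(word);
      }
    }
    for (String phrase : phrases) {
      if (acceptedByAll(phrase, filters)) {
        labels.add(phrase);
      }
    }
    return labels;
  }

  /** A candidate survives only if no filter rejects it. */
  private static boolean acceptedByAll(String candidate, List<Predicate<String>> filters) {
    return filters.stream().allMatch(f -> f.test(candidate));
  }

  public static void main(String[] args) {
    // Hypothetical stand-ins for a numeric filter and a minimum-length filter.
    Predicate<String> notNumeric = s -> !s.matches("\\d+");
    Predicate<String> minLength = s -> s.length() >= 3;

    List<String> labels = candidates(
        List.of("data", "2019", "ai"),
        List.of("data mining", "42"),
        List.of(notNumeric, minLength));
    System.out.println(labels); // prints [data, data mining]
  }
}
```

Each filter can only reject a candidate, so adding a filter never enlarges the result.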
Field Summary
Fields
Modifier and Type      Field                  Description
CompleteLabelFilter    completeLabelFilter    Truncated phrase filter for this processor.
GenitiveLabelFilter    genitiveLabelFilter    Genitive length label filter.
MinLengthLabelFilter   minLengthLabelFilter   Min length label filter.
NumericLabelFilter     numericLabelFilter     Numeric label filter for this processor.
QueryLabelFilter       queryLabelFilter       Query word label filter for this processor.
StopLabelFilter        stopLabelFilter        Stop label filter.
StopWordLabelFilter    stopWordLabelFilter    Stop word label filter for this processor.
Fields inherited from class org.carrot2.attrs.AttrComposite
attributes
Constructor Summary
Constructors
Constructor              Description
LabelFilterProcessor()
Method Summary
Modifier and Type   Method                                  Description
void                process(PreprocessingContext context)   Processes all filters declared as fields of this class.
Methods inherited from class org.carrot2.attrs.AttrComposite
accept
Field Detail
queryLabelFilter
public QueryLabelFilter queryLabelFilter
Query word label filter for this processor.

stopWordLabelFilter
public StopWordLabelFilter stopWordLabelFilter
Stop word label filter for this processor.

stopLabelFilter
public StopLabelFilter stopLabelFilter
Stop label filter.

numericLabelFilter
public NumericLabelFilter numericLabelFilter
Numeric label filter for this processor.

completeLabelFilter
public CompleteLabelFilter completeLabelFilter
Truncated phrase filter for this processor.

minLengthLabelFilter
public MinLengthLabelFilter minLengthLabelFilter
Min length label filter.

genitiveLabelFilter
public GenitiveLabelFilter genitiveLabelFilter
Genitive length label filter.
Method Detail
process
public void process(PreprocessingContext context)
Processes all filters declared as fields of this class.
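As a rough illustration of what "processes all filters declared as fields" could look like, the following self-contained sketch keeps each filter in a public field and runs them in turn over a shared array of acceptance flags. Everything here is hypothetical and simplified: `FieldFilterSketch`, its `Filter` interface and the two filter classes are not Carrot2 types, and treating a `null` field as "filter disabled" is an assumption of this sketch, not documented behavior of LabelFilterProcessor.

```java
import java.util.Arrays;

/** Illustrative only: a simplified stand-in, not the Carrot2 implementation. */
class FieldFilterSketch {

  /** Hypothetical contract: a filter may clear acceptance flags but never set them. */
  interface Filter {
    void filter(String[] candidates, boolean[] accepted);
  }

  /** Rejects candidates consisting only of digits. */
  static class NumericFilter implements Filter {
    @Override
    public void filter(String[] candidates, boolean[] accepted) {
      for (int i = 0; i < candidates.length; i++) {
        if (candidates[i].matches("\\d+")) {
          accepted[i] = false;
        }
      }
    }
  }

  /** Rejects candidates shorter than minLength characters. */
  static class MinLengthFilter implements Filter {
    public int minLength = 3;

    @Override
    public void filter(String[] candidates, boolean[] accepted) {
      for (int i = 0; i < candidates.length; i++) {
        if (candidates[i].length() < minLength) {
          accepted[i] = false;
        }
      }
    }
  }

  // Filters are configured through public fields.
  // Assumption of this sketch: setting a field to null skips that filter.
  public Filter numericFilter = new NumericFilter();
  public Filter minLengthFilter = new MinLengthFilter();

  /** Starts with every candidate accepted, then lets each declared filter reject some. */
  boolean[] process(String[] candidates) {
    boolean[] accepted = new boolean[candidates.length];
    Arrays.fill(accepted, true);
    for (Filter f : new Filter[] {numericFilter, minLengthFilter}) {
      if (f != null) {
        f.filter(candidates, accepted);
      }
    }
    return accepted;
  }
}
```

With the default fields, `process(new String[] {"data", "2019", "ai"})` leaves only the first candidate accepted. After setting `minLengthFilter = null`, the short candidate "ai" is accepted as well.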