Class LingoClusteringAlgorithm

  • All Implemented Interfaces:
    AcceptingVisitor, ClusteringAlgorithm

    public class LingoClusteringAlgorithm
    extends AttrComposite
    implements ClusteringAlgorithm
    Lingo clustering algorithm. Implementation as described in: Stanisław Osiński, Dawid Weiss: A Concept-Driven Algorithm for Clustering Search Results. IEEE Intelligent Systems, May/June, 3 (vol. 20), 2005, pp. 48—54.
    • Field Detail

      • scoreWeight

        public AttrDouble scoreWeight
        Balance between cluster score and size during cluster sorting. Value equal to 0.0 will cause Lingo to sort clusters based only on cluster size. Value equal to 1.0 will cause Lingo to sort clusters based only on cluster score.
      • desiredClusterCount

        public AttrInteger desiredClusterCount
        Determines number of clusters to create. The larger the value, the more clusters will be created. The number of clusters created by the algorithm will be proportional to the value of this parameter, but may be different.
      • matrixBuilder

        public TermDocumentMatrixBuilder matrixBuilder
        Configuration of the size and contents of the term-document matrix.
      • matrixReducer

        public TermDocumentMatrixReducer matrixReducer
        Configuration of the matrix decomposition method to use for clustering.
      • clusterBuilder

        public ClusterBuilder clusterBuilder
        Configuration of the structure and labels of clusters.
      • dictionaries

        public EphemeralDictionaries dictionaries
        Per-request overrides of language components (dictionaries).
        Since:
        4.1.0
      • queryHint

        public final AttrString queryHint
        Query terms used to retrieve documents being clustered. The query is used as a hint to avoid creating trivial clusters consisting only of query words.
    • Constructor Detail

      • LingoClusteringAlgorithm

        public LingoClusteringAlgorithm()