Class CompleteLabelFilter

java.lang.Object
org.carrot2.attrs.AttrComposite
org.carrot2.text.preprocessing.filter.ContextLabelFilter
org.carrot2.text.preprocessing.filter.CompleteLabelFilter
All Implemented Interfaces:
AcceptingVisitor

public class CompleteLabelFilter extends ContextLabelFilter
A filter that removes "incomplete" labels.

For example, in a collection of documents related to Data Mining, the phrase Conference on Data is incomplete in a sense that most likely it should be Conference on Data Mining or even Conference on Data Mining in Large Databases. When truncated phrase removal is enabled, the algorithm would try to remove the "incomplete" phrases like the former one and leave only the more informative variants.

See this document, page 31 for a definition of a complete phrase.