Carrot2 v3.10.4 API Documentation

Carrot2 is an Open Source Search Results Clustering Engine, which can automatically organize small collections of documents, for example search results, into thematic categories, see below for more.

See: Description

Carrot2 Core 
Package Description
org.carrot2.core
Definitions of Carrot2 core interfaces and their implementations.
org.carrot2.core.attribute
Attribute annotations for Carrot2 core interfaces.
Carrot2 Data Sources 
Package Description
org.carrot2.source
Base classes for implementing Carrot2 document sources.
org.carrot2.source.ambient
Serves documents from the Ambient test set.
org.carrot2.source.etools
Fetches documents from the eTools Metasearch Engine.
org.carrot2.source.google
Fetches documents from the Google JSON Search API.
org.carrot2.source.idol
Fetches documents from an Autonmomy IDOL Search engine with an OpenSearch-compliant feed.
org.carrot2.source.lucene
Fetches documents from a local Lucene index.
org.carrot2.source.microsoft
Fetches documents from the Bing search engine using its publicly available API.
org.carrot2.source.opensearch
Fetches documents from an OpenSearch-compliant search feed.
org.carrot2.source.pubmed
Fetches documents from the PubMed medical abstracts database.
org.carrot2.source.solr
Fetches documents from the Solr search engine.
org.carrot2.source.xml
Fetches documents from XML streams.
Carrot2 Clustering Algorithms 
Package Description
org.carrot2.clustering.kmeans
Implementation of the bisecting k-means clustering algorithm.
org.carrot2.clustering.lingo
Implementation of the Lingo clustering algorithm.
org.carrot2.clustering.stc
Implementation of the STC clustering algorithm.
org.carrot2.clustering.synthetic
Synthetic clustering algorithms.
Carrot2 Results post-processing 
Package Description
org.carrot2.output.metrics
Cluster quality metrics calculation utilities.
Carrot2 Text preprocessing utilities 
Package Description
org.carrot2.text.analysis
Lexical analysis utilities.
org.carrot2.text.clustering
Multilingual clustering utilities.
org.carrot2.text.linguistic
Shallow linguistic processing utilities.
org.carrot2.text.linguistic.lucene
Shallow linguistic processing utilities dependent on Lucene stemmers and analyzers.
org.carrot2.text.linguistic.morfologik
Shallow linguistic processing utilities dependent on the Morfologik stemming library.
org.carrot2.text.preprocessing
Contains the unified input preprocessing infrastructure (term indexing, stemming, label discovery).
org.carrot2.text.preprocessing.filter
Text feature filtering utilities.
org.carrot2.text.preprocessing.pipeline
Predefined preprocessing pipeline utilities.
org.carrot2.text.suffixtree
Implementation of the suffix tree data structure.
org.carrot2.text.util
Data structures for text preprocessing.
org.carrot2.text.vsm
Vector Space Model utilities.
Carrot2 Matrix utilities 
Package Description
org.carrot2.matrix
Matrix factorization routines.
org.carrot2.matrix.factorization
Matrix factorization implementations.
org.carrot2.matrix.factorization.seeding
Matrix seeding strategies.
Carrot2 Utility classes 
Package Description
org.carrot2.util
Common utility classes.
org.carrot2.util.annotations
Marker annotations.
org.carrot2.util.attribute
Attribute handling utilities.
org.carrot2.util.factory
A simple object factory.
org.carrot2.util.httpclient
Apache Commons HTTP client utilities.
org.carrot2.util.pool
A very simple unbounded pool implementation.
org.carrot2.util.resource
Resource location abstraction layer.
org.carrot2.util.simplexml
Utilities for working with the Simple XML framework.
org.carrot2.util.tests
Unit test utilities and annotations.
org.carrot2.util.xslt
XSLT handling utilities.
org.carrot2.util.xsltfilter
XSLT processor servlet filter.
Other Packages 
Package Description
org.carrot2.log4j
Log4J utilities.

Copyright (c) Dawid Weiss, Stanislaw Osinski