Palladian

Palladian is a Java-based toolkit which provides functionality to perform Internet Information Retrieval tasks such as crawling, classification, and extraction of various types of information. It provides a collection of algorithms for text processing focused on classification, extraction, and retrieval. The aim of Palladian is to reuse algorithms that are freely available and build upon them to drive research by providing unified interfaces. This way, new algorithms can be quickly compared to the state-of-the-art allowing other users to create more advanced programs in the future.

Links