Flick Club LogoFlick Club Logo

Taming Text

Drew Farris

Grant Ingersoll

Thomas S. Morton

Apache Solr Methods Information Storage And Retrieval

Taming Text provides hands-on examples for working with unstructured text data. It covers techniques such as full-text search, named entity recognition, clustering, tagging, information extraction, and summarization. These techniques are demonstrated using real-world applications built with open-source libraries like Solr and Mahout. The book guides readers through each topic and its foundations in a clear and concise style without requiring background in statistics or natural language processing. It is written for Java developers but the concepts can be applied in any language.