Taming Text: How to Find, Organize, and Manipulate It was published by Manning in 2013, with a foreword by Dr. Elizabeth Liddy. Grant Ingersoll is the lead author, alongside Thomas S. Morton and Andrew L. Farris. The book was written for engineers shipping real text-intelligent software — not researchers chasing papers.
Across its chapters, the book walks through full-text search with Apache Solr and Lucene, named entity recognition, clustering, classification, deduplication, question answering, and text summarization. Each topic is paired with working code and grounded in the realities of production systems. For a long stretch of the 2010s, it was one of the few hands-on references that connected academic NLP to the messy work of actually searching and organizing text at scale.
Grant went on to co-found Lucidworks, where he served as CTO, and later became CTO of the Wikimedia Foundation. The problems the book addresses — helping people find, organize, and make sense of text — are still the problems he and the Develomentor team work on today.
