The tutorial is prepared by Academic Technology Services in UCLA. It has a Starter Kit for beginners, and also topics for a range of data analysis models.
Includes learning modules, data analysis examples, and annotate example output.
Text Mining is the discovery by computer of new, previously unknown information, by automatically extracting information from different written resources... The difference between regular data mining and text mining is that in text mining the patterns are extracted from natural language text rather than from structured databases of facts.
(From What is Text Mining? by Marti Hearst)
The HathiTrust Research Center has developed a suite of tools and services for text data mining including web-based algorithms, freely-accessible datasets, and secure computing capsules.