Text analysis
Summary
Find patterns and statistics about textual data.Overview
Text analysis involves taking one or more text files (e.g., a poem, a book, a page from a book, and essay, etc.) and processing them to calculate statics or find patterns. Some common text analysis outcomes include:
![Thumb foundation l word cloud without headers and quotes](https://alice.endicott.edu/uploads/bootsy/image/12/large_Foundation-l_word_cloud_without_headers_and_quotes.png)
(Wikipedia: https://upload.wikimedia.org/wikipedia/commons/9/9e/Foundation-l_word_cloud_without_headers_and_quot...)
- word frequencies
- phrase frequencies
- word clouds (or tag clouds)
- text similarity (how similar are these two pieces of text?)
- language modeling (e.g., take all the poems of Shakespeare and create a new one that sounds like him)
![Thumb foundation l word cloud without headers and quotes](https://alice.endicott.edu/uploads/bootsy/image/12/large_Foundation-l_word_cloud_without_headers_and_quotes.png)
(Wikipedia: https://upload.wikimedia.org/wikipedia/commons/9/9e/Foundation-l_word_cloud_without_headers_and_quot...)