Text Analysis with R for Students of Literature


Text Analysis with R for Students of Literature (2014) .. by Matthew Jockers (@mljockers)


Contents

Part I Microanalysis
1 R Basics… 3
2 First Foray into Text Analysis with R … 11
3 Accessing and Comparing Word Frequency Data … 25
4 Token Distribution Analysis … 29
5 Correlation … 47

Part II Mesoanalysis
6 Measures of Lexical Variety … 59
7 Hapax Richness … 69
8 Do It KWIC … 73
9 Do It KWIC (Better) … 81
10 Text Quality, Text Variety, and Parsing XML … 89

Part III Macroanalysis
11 Clustering … 101
12 Classification … 119
13 Topic Modeling … 135

  • A Variable Scope Example … 161
  • B The LDA Buffet … 163
  • C Start up Code … 167
  • D R Resources for Further Reading … 171

Practice Exercise Solutions … 173

Index … 193