Ling467 - Data to Study Word Frequency Distributions

  • Word Frequencies from a 10-million-word sample of the GigaWord Corpus
  • Document Frequencies from a 1.6-million-document sample of the GigaWord Corpus
  • Sentence-Initial Word Frequencies from the same 10-million-word sample of the GigaWord Corpus
  • Word Distribution Data - Distinct word growth
  • Word Distribution Data - Zipfian Distribution
  • Word Distribution Data - Stopwords
  • Word Distribution Data - Day of Week Study
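
The quantities named in the list above can be illustrated with a small sketch. This is not the code used to produce the course data: the function names and the toy two-document input are hypothetical, and the tokenization is assumed to have been done already. It only shows, under those assumptions, what word frequency, document frequency, distinct word growth, and a rank-frequency (Zipf-style) table each measure.

```python
from collections import Counter


def word_frequencies(tokens):
    """Count how many times each word occurs in a flat token list."""
    return Counter(tokens)


def document_frequencies(documents):
    """Count in how many documents each word occurs at least once."""
    df = Counter()
    for doc in documents:
        # set() ensures a word counts once per document
        df.update(set(doc))
    return df


def distinct_word_growth(tokens):
    """Return the number of distinct words seen after each token."""
    seen = set()
    growth = []
    for tok in tokens:
        seen.add(tok)
        growth.append(len(seen))
    return growth


def rank_frequency(freqs):
    """Return (rank, word, count) triples, most frequent word first."""
    return [
        (rank, word, count)
        for rank, (word, count) in enumerate(freqs.most_common(), start=1)
    ]


# Toy input (hypothetical): two already-tokenized documents.
docs = [
    ["the", "cat", "sat"],
    ["the", "dog", "sat", "on", "the", "mat"],
]
tokens = [tok for doc in docs for tok in doc]

freqs = word_frequencies(tokens)
doc_freqs = document_frequencies(docs)
growth = distinct_word_growth(tokens)
table = rank_frequency(freqs)
```

On real data, plotting the log of rank against the log of count from `rank_frequency` is the usual way to inspect how closely a sample follows a Zipfian distribution, and plotting `distinct_word_growth` against token position shows how vocabulary size grows with sample size.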