Display concept models based on highly frequent lemmas in EEBO-TCP, by selecting a lemma below.
Concept models, or quads, are sets of four lemmas that occur together in spans of text up to 100 words long. Quads can be searched and ranked, and examples of each quad can be read in their original context.
On this site you will find concept models built from 1,000 of the most frequently occurring nouns, adjectives, and verbs in EEBO-TCP (the first word in the search interface). The second, third, and fourth words of each quad are restricted to nouns, adjectives, and verbs that occur at least 5,000 times in EEBO-TCP (but systematically excluding some high frequency words).1 The second, third, and fourth words of each quad occur within 50 words (tokens) to the left or right of the first word. We exclude quads that do not pass a Pearson’s Chi-Square test threshold of 2.706 (p<0.05). [Read more]
The Concept Modelling Demonstrator was developed by the Linguistic DNA project and The Digital Humanities Institute at the University of Sheffield. The original project developed billiions of rows of data, which can be downloaded from here.