As the project nears completion, researchers will produce a new online resource, freely available to all, which will provide users with access to the ‘Linguistic DNA’ of Early Modern printed discourse. This will enable users to configure and explore huge visualisations of the lexical structure of 28 million printed pages, locate specific regions within the model through a search and dictionary-based interface, as well as view every single word within the context of its original text. Further, a Web API will enable third parties to develop their own research projects and digital products using the underlying dataset.
In the future, the resulting data will underpin research projects on the linguistic and epistemological structures of modern knowledge. Moreover, findings may be used to drive the next generation of semantic search engines; enable publishers to develop finely tuned customer services which automatically classify and relate publications; inform public and corporate research into contemporary online discourse, including consumer trends and public opinions; and contribute to understanding trends, behaviour and demographic change.