Author Archives: cwong

Visualising Patterns in Historical Concept Mapping

Curating Patterns with Visualisations

To interpret the patterns emerging from our metric calculations, we’re deploying visualisations that make these complex relationships easier to grasp. Using tools like network graphs and heatmaps, we can identify clusters of words that form cohesive themes. These visualisations allow us to observe how concepts interrelate, helping us to curate distinct patterns within the constellations.

In this context, we’re using a Sankey diagram to illustrate the traffic  (i.e., the frequency or prominence of certain lemmas within the constellation), complexity, and interconnectedness of various lemmas within concept constellations. Specifically, we’re applying it to the ‘church’ constellation, where we previously discussed the high degree of lemma variability.

Sankey diagram of ‘church’ constellation

The Sankey diagram allows us to visualise the flow and complexity within the ‘church’ constellation:

  • The constellation is composed of lots of different lemmas, with a particularly high traffic around specific lemmas.
  • Key lemmas with heavier traffic include:
    • ‘bishop,’ ‘presbyter,’ and ‘apostle’ – suggesting a focus on hierarchical or ecclesiastical roles.
    • ‘faith,’ ‘heaven,’ and ‘believe’ – indicating central religious themes within the constellation.

This visualisation underscores the diversity within the ‘church’ constellation while highlighting focal points in religious discourse.

Thematic Clustering of Constellation Lemmas

To deepen our understanding of thematic structures, we’re clustering lemmas within constellations by theme. This thematic clustering allows us to examine the composition and distribution of lemmas, shedding light on the thematic constructs of different concepts.

Tree map of the distribution of lemmas in constellations of ‘church’, ‘heaven’ and ‘earth’ clustered by thematic categories

We applied this approach to three constellations, ‘church’, ‘heaven’, and ‘earth’, to examine the thematic composition of each. Using a tree diagram, we’ve visualised the distribution of lemmas clustered by thematic categories across these constellations. This helps us evaluate the thematic makeup of each constellation and observe how certain themes were interconnected within Early Modern English religious discourse.

  • The constellations for ‘church’, ‘heaven’, and ‘earth’ all relate to religious discourse yet represent different dimensions of it.
  • Themes such as ‘Roles and Hierarchy,’ ‘Nature,’ ‘Supernature,’ and ‘Body Parts‘ emerge as pivotal across these constellations.
  • For instance, ‘Roles and Hierarchy‘ is a recurring theme in all three constellations, with a particularly prominent role within the church constellation.

These visualisations and thematic clusters allow us to explore and interpret nuances within historical language that may otherwise remain hidden. As our analysis continues, we’re excited to see how these patterns evolve and what they reveal about Early Modern English concepts and their complex interrelations.

Mapping Historical Concepts with LDNA

Our current project employs the LDNA (Linguistic DNA) demonstrator to investigate historical texts, aiming to uncover how ideas evolved in Early Modern English. By analysing ‘concept constellations’ which are clusters of the quads (sets of four highly frequent common co-occurring lemmas, including a central node lemma), we gain insights into how groups of words represented broader themes and evolving concepts over time.

Understanding Quads and Lemmas in Concept Constellations

Two essential building blocks in our analysis are ‘quads’ and ‘lemmas.’ Quads are sets of frequently co-occurring lemmas that appear within a proximity of 100 words in a text, suggesting a thematic or conceptual link, while lemmas represent the root forms of words. Examining the relationships between quads and lemmas enables us to map out how key ideas, centring on lemmas such as ‘church,’ ‘heaven,’ and ‘earth,’ were expressed and discussed across historical texts.

As an example, we’re using the concept constellation centred on the node lemma ‘church’:

  • The constellation for ‘church’ is notably large, comprising 276 unique quads, each involving three other related words.
  • Altogether, this constellation includes 263 distinct lemmas that form various quads around the node lemma ‘church.’
Heatmap of metrics for 34 studied constellations generated by the LDNA demonstrator

Calculating Metrics of Quads and Lemmas

We’re employing several metrics to analyse patterns within these constellations. By measuring frequency, co-occurrence, and variability, we can identify concepts that were stable or prominent in historical discourse and others that were more complex and diverse. This analytical approach lays the groundwork for visualising these relationships and coding them against established historical linguistic frameworks.

In this study, we’ve reviewed 34 constellations of quads generated by the LDNA demonstrator. The metrics highlight:

  • The size of each constellation and the number of lemmas within each one,
  • The density and complexity of these constellations, revealing both the conceptual richness and how densely interconnected certain ideas were.

The metrics provide insights, such as:

  • The constellation for ‘church’ is not only large but also contains the most variable lemmas, suggesting that this concept is particularly complex or discursive as it occurs in discourses.
  • In contrast, while the constellation for ‘king’ is substantial (consisting of 116 quads), it contains only 27 unique lemmas. This may suggest a simpler or less varied concept, with a more consistent discourse surrounding it.

In the next blog entry, we’ll move beyond these metrics to explain how visualisations and thematic coding further enhance our understanding of concept patterns. In doing so, we’ll continue building a fuller picture of these historical constellations, bringing the past to life in new ways.