Digital Humanities Cheat Sheet
The core ideas of Digital Humanities distilled into a single, scannable reference — perfect for review or quick lookup.
Quick Reference
Distant Reading
A computational approach to literary analysis, coined by Franco Moretti, that uses quantitative methods to study large collections of texts rather than performing close reading of individual works. It reveals macro-level patterns in literary history that are invisible when reading single texts.
Text Encoding Initiative (TEI)
An international standard for representing texts in digital form using XML markup. TEI provides guidelines for encoding manuscripts, literary works, linguistic corpora, and other textual materials so they can be searched, analyzed, and displayed by computer programs.
Corpus Linguistics
The study of language through large, structured collections of texts (corpora) using computational tools. It allows researchers to identify patterns of word usage, collocations, and linguistic change over time that would be impossible to detect through manual reading.
GIS and Spatial Humanities
The application of Geographic Information Systems to humanities research, enabling scholars to create layered digital maps that visualize spatial relationships in historical, literary, or cultural data. Spatial humanities treats geography as a category of analysis equal to time and theme.
Digital Archives and Collections
Online repositories of digitized primary source materials such as manuscripts, photographs, audio recordings, and artifacts. These archives use metadata standards and digital preservation practices to make cultural heritage materials widely accessible for research and public engagement.
Network Analysis
A method drawn from graph theory and social network analysis used to model and visualize relationships between entities such as historical figures, literary characters, institutions, or concepts. Nodes represent entities and edges represent connections between them.
Topic Modeling
A machine learning technique, commonly using Latent Dirichlet Allocation (LDA), that automatically identifies clusters of co-occurring words (topics) across a large collection of documents. It helps researchers discover thematic structures without prior knowledge of the content.
Digital Scholarly Editing
The practice of creating critical editions of texts in digital formats that can represent textual variation, editorial annotations, and multimedia context in ways that print editions cannot. Digital editions often allow users to compare manuscript witnesses side by side.
Stylometry
The statistical analysis of literary style, typically measuring features like word frequency, sentence length, and function word usage to identify authorship, detect forgeries, or trace stylistic influence. It operates on the principle that every author has a measurable linguistic fingerprint.
Linked Open Data
A method for publishing structured, interoperable data on the web so that datasets from different institutions can be connected and queried together. In digital humanities, it allows cultural heritage data from museums, libraries, and archives to be cross-referenced automatically.
Key Terms at a Glance
Get study tips in your inbox
We'll send you evidence-based study strategies and new cheat sheets as they're published.
We'll notify you about updates. No spam, unsubscribe anytime.