The paper illustrates how the combination of frequency analysis and diachronic word embeddings can help to explore and understand the development of long-running discourses in large sets of historical data.
Latest Posts by Journal of Digital History
The paper shows how combining frequency measures with diachronic word embeddings helps trace long-running political discourses in large historical corpora, revealing how meanings and associations shift over decades.
To assess the semantic changes in how German politicians spoke about the Wehrmacht, we use overlapping diachronic word embeddings. An additional analysis of term frequency measures the level of interest in the Wehrmacht over time.
The Bundestag - (West) Germany’s federal legislature - has published its proceedings since 1949. Digitized editions of these proceedings contain over 900,000 speeches, questions, and interjections, forming a valuable but underutilized corpus of sources on post-war German politics.
How have German politicians dealt with the burdensome legacy of militarism after the Second World War? This paper explores the connotations of the Wehrmacht in political speech through word embeddings trained on parliamentary proceedings.
Many thanks for the team at @jdighist.bsky.social for everything to bring this article to fruition! Check it out to learn more the utility and limits of LLMs as digital tools for historical research - like using LLMs for OCR, which is a hot topic of late.
How did early modern people imagine the future—and who shaped it? Analyzing 4,900 English letters (1410–1695), Sara Budts shows how phrases like “God willing” slowly faded, revealing a shift from Providence toward human agency and a more uncertain, self-shaped future.
What can 7,500 divine references in early modern letters tell us about how people imagined the future? Using the CEEC, Sara Budts maps when and how writers invoked God, revealing a patchy decline: routine phrases faded, but appeals persisted in illness, crisis, and war.
From “God willing” to “I trust in God,” early modern letters show how people used divine appeals to navigate uncertainty. Through corpus analysis, Sara Budts traces England’s uneven secularisation: routine invocations faded, but personal faith deepened, especially in times of crisis.
Early modern letters are full of phrases like “God willing” or “By God’s grace.” Sara Budts’ analysis of 5,000 letters shows these weren’t clichés but ways to navigate faith, agency, and uncertainty. From 1450–1700, people balanced divine will with human action in shifting ways.
From tools to sources: By examining LLMs through source criticism, historians can evaluate generative AI's uneven potential while contributing to broader debates about these technologies' societal impact.
Mapping AI's historical knowledge: Benchmarking LLM performance on historical facts, oral history transcription accuracy, and OCR correction tasks reveals uneven capabilities. Results show whose history has been digitized and made computationally legible. #oralhistory #OCR #benchmarks
Examining LLMs as historical sources: This study applies source criticism to AI training data, revealing how patterns of digitization shape what these tools encode. Case studies map this 'jagged frontier' of capabilities across historical tasks, languages & time periods. #DigitalHistory #LLMs
History-focused: What happened to Swedish public service radio when commercial broadcasting arrived? Analyzing 1,600 hours of P1 & P3, this article uncovers shifts in sonic patterns & content variation, offering new insights into the sound history of broadcasting
Methodology-focused: How can historians work with thousands of hours of sound? This article introduces segmentation & dimensionality reduction methods for large-scale radio archives, showing how computational analysis reveals hidden structures in public service broadcasting
The research shows how digital tools like Aventura.js can mediate archival inquiry. Through an interactive atlas and the Carguero case study, it offers a new approach to reading visual history. journalofdigitalhist...
Inspired by Warburg’s principles, the article transforms static images into interlinked, interactive panels. Its case study of the 19th‐century Carguero deepens our understanding of visual symbolism.
By merging digital techniques with human interpretation, the article demonstrates how interactive panels and data visualizations offer fresh insights into visual archives via the Carguero case.
The study repurposes Aventura.js—a bilingual, open‐source tool—to build digital atlases that trace symbolic connections in historical images, exemplified by Colombia’s Carguero.
A forthcoming article by Rodríguez Gómez and Urueña presents a method for exploring visual collections using interactive panels, inspired by Warburg’s Mnemosyne Atlas, to analyze the 19th‐century Carguero case. journalofdigitalhist...
But these sources raise questions. Can we ethically use archived posts from forums & blogs without consent? How do we avoid overinterpreting what's preserved—and what's lost? The paper ends with a call for cautious, critical digital history.
From lean employment pressures to restrictive benefits, the 2000s marked a turning point. Web archives reveal raw, everyday reflections—from sick workers to HR managers—offering insight into how people understood these changes, not just endured them.
How did British citizens experience sickness & disability policy under New Labour? This article uses blogs, message boards & the Internet Archive to explore how workers, managers & families navigated welfare reform—and how they remembered the welfare state. journalofdigitalhist...
GaLiLeO is unfinished & its corpus incomplete. But that’s the point. It shows how partial, purpose-built tools can still yield insight—especially when they trace relationships that archives and metadata might miss. A third space between archive and algorithm.
Unlike most text mining tools, GaLiLeO doesn’t aim for sweeping models. Instead, it helps researchers identify meaningful links across Galileo’s library—prioritizing interpretive, humanistic inquiry over scale. It’s built for discovery, not datafication.
What if digital tools didn’t summarize a whole corpus—but helped historians find meaning in just a few documents? 📜 GaLiLeO (Galileo’s Library and Letters Online) is a prototype digital lab that builds interpretive paths through Galileo’s letters, books & notes. journalofdigitalhist...
We worked to recover and make accessible 10,000+ defaced websites, captured between 1995–2010. These archived hacks aren't just noise - they reveal political messages, digital protest, and forgotten web aesthetics. Here's how they help us understand web history: journalofdigitalhist...
What can hacked websites tell us about the history of the web?
From online activism to digital vandalism, political web defacements were part of shaping the early internet. We just published a new article in the JDH diving into this fascinating and underexplored archive. journalofdigitalhist...
Excited to introduce Arvest - the platform which will be at the core of the ERC 'From Stage to Data’ project. Designed to process and analyse digital traces of performing arts, it's transforming how we study theatrical heritage. Explore arvest.app #PerformingArts #DigitalArchives"
Discover Arvest, the cloud-based platform for IIIF image & video annotation. No server installation needed - just create an account and start annotating your digital resources immediately. Try it today at arvest.app #IIIF #DigitalHumanities #Annotation journalofdigitalhist...