Resources

Introduction to Programming for NLP with Python
The aim of this virtual course is to offer basic knowledge and skills in programming in Python. Target audiences are undergraduate and graduate students in the Humanities and Social Sciences who want to acquire hands-on knowledge and skills in working with textual data or quantitative data in language and humanities research.
Authors
Koenraad De Smedt
Read more →
Word Embeddings
Natural language processing is one of the most powerful concepts in modern linguistics and computer science, bridging the understanding of language from human to machine, and in turn programming machines so they can perform complex linguistic tasks on their own. This short video introduces learners to the key concepts of word embeddings and how they can be used in digital humanities projects.
Authors
Joseph Flanagan
Read more →
The Learning Curve in Sharing Data with the EHRI Project
A partnership between Kazerne Dossin and EHRI was established to enable sharing of metadata with a broader audience. This partnership resulted in changes to the practices of cataloguing archival materials within Kazerne Dossin. Using the example of the Lewkowicz family collection, this article focuses on the revolution Kazerne Dossin went through while standardising descriptions, and on the tools EHRI provided to optimise the workflow for collection-holding institutions.
Authors
Dorien Styven
Marius Caragea
Veerle Vanden Daelen
Read more →
From Digital Culture to Digital Heritage
With the evolution of the digital world, the term ‘digital culture’ has emerged. How does digital culture tie into the idea of heritage, and how does digital heritage emerge? This video lecture discusses the meaning of ‘culture’ in a historical and digital context, offering an introduction to ‘digital culture’ and how this is intertwined with digital heritage.
Authors
Johanna Enqvist
Read more →
Using Spatial Data in Tableau
Tableau is a powerful digital tool for analysing data that can help with mapping and interrogating data. In this short guide we will focus on an aspect of data analysis using mapping that has particular application for Holocaust and refugee studies.
Authors
Rachel Pistol
Read more →
Use of vocabularies for metadata curation and quality assessment in Social Sciences and Humanities
This event, organised in the framework of the TRIPLE project, provided insights into “topical vocabularies” and their use in metadata curation and quality assessment in the Social Sciences and Humanities (in the EOSC context). The sessions help learners better understand the interoperability challenges faced within and by the SSH branch of the EOSC, and become familiar with some initiatives related to metadata curation and enrichment in the SSH.
Authors
Laure Barbot
Marco Raciti
Matej Ďurčo
Read more →
Entity Matching
EHRI (European Holocaust Research Infrastructure) supports the use of digital tools that can assist in the research of Holocaust- and refugee-related topics. In a continued effort to make these tools as accessible as possible, so that researchers who have no experience with digital tools will consider trying new ways of using their data, this GitHub-based lesson showcases the use of entity matching tools when dealing with geographic data.
Authors
Rachel Pistol
Read more →
Data Journalism and AI: New frontiers in investigation and storytelling
Data is now an indispensable part of investigative work and storytelling for journalists and newsrooms. Computational methods and artificial intelligence are making their way into newsrooms more than ever before, and promise to open up new opportunities for journalists, as well as new challenges. This talk provides an overview of how data and artificial intelligence can be used in the journalism workflow, investigative reporting and storytelling.
Authors
Bahareh Heravi
Read more →
What Can I Do With This Messy Spreadsheet? Converting from Excel Sheets to Fully Compliant EAD-XML files
Many Galleries, Libraries, Archives, and Museums (GLAMs) face difficulties sharing their collections metadata in standardised and sustainable ways, meaning that staff rely on more familiar general-purpose office programs such as spreadsheets. However, while these tools offer a simple approach to data registration and digitisation, they don’t allow for more advanced uses. This blogpost from EHRI explains a procedure for producing EAD (Encoded Archival Description) files from an Excel spreadsheet using OpenRefine.
Authors
Herminio Garcia González
Read more →
More Watching, Less Searching: Repurposing Fortunoff Archive Metadata for Visual Searching
The Fortunoff Visual Search is a tool for both data visualisation and collection discovery from the Fortunoff Video Archive for Holocaust Testimonies. This blogpost demonstrates the Visual Search tool in the Fortunoff Video Archive, including the search and filtering interface, and shows how to interpret the resulting visualisations.
Authors
Stephen Naron
Jake Kara
Read more →
How to Learn and Love Digital Text in Four Easy Steps
Is ChatGPT unsettling you? Are you annoyed at always landing on the same web portal when googling for a specific book? Do you hate it when just the one page you need to consult is nowhere to be found on the internet? This presentation by Anne Baillot is for you!
Authors
Anne Baillot
Read more →
Using Named Entity Recognition to Enhance Access to a Museum Catalog
This blog post discusses the applicability of services such as automatic metadata generation and semantic annotation for automatic extraction of person names and locations from large datasets. This is demonstrated using oral history transcripts provided by the United States Holocaust Memorial Museum (USHMM).
Authors
Ivelina Nikolova
Michael Levy
Read more →
