Skip to main content

Resources

  • Introduction to Programming for NLP with Python

    The aim of this virtual course is to offer basic knowledge and skills in programming in Python. Target audiences are undergraduate and graduate students in the Humanities and Social Sciences who want to acquire hands-on knowledge and skills in working with textual data or quantitative data in language and humanities research.
  • Word Embeddings

    Natural language processing is one of the most powerful concepts in modern linguistics and computer science, bridging the understanding of language from human to machine, and in turn programming machines so they can perform complex linguistic tasks on their own. This short video introduces learners to the key concepts of word embeddings and how they can be used in digital humanities projects.
  • The Learning Curve in Sharing Data with the EHRI Project

    A partnership between Kazerne Dossin and EHRI was established to enable sharing of metadata with a broader audience. This partnership resulted in changes to the practices of cataloguing archival materials within Kazerne Dossin. Using the example of the Lewkowicz family collection, this article focuses on the revolution Kazerne Dossin went through while standardising descriptions, and on the tools EHRI provided to optimise the workflow for collection holding institutes.
    Authors
    • Dorien Styven
    • Marius Caragea
    • Veerle Vanden Daelen
    Read more
  • From Digital Culture to Digital Heritage

    With the evolution of the digital world, the term ‘digital culture’ has emerged. How does digital culture tie into the idea of heritage, and how does digital heritage emerge? This video lecture discusses the meaning of 'culture' in a historical and digital context, offering an introduction to 'digital culture' and how this is intertwined with digital heritage.
  • Using Spatial Data in Tableau

    Tableau is a powerful digital tool for analysing data that can help with mapping and interrogating data. In this short guide we will focus on an aspect of data analysis using mapping that has particular application for Holocaust and refugee studies.
  • Use of vocabularies for metadata curation and quality assessment in Social Sciences and Humanities

    This event, organised in the framework of the TRIPLE project, provided insights into the use of “topical vocabularies” and their use in metadata curation and quality assessment in the Social Sciences and Humanities (in the EOSC context). The sessions introduces learners to have a better understanding of the interoperability challenges faced within/by the SSH branch of the EOSC, and be familiar with some initiatives related to metadata curation and enrichment in the SSH.
  • Entity Matching

    EHRI (European Holocaust Research Infrastructure) supports the use of digital tools that can assist in the research of Holocaust and refugee related topics. In a continued effort to make these tools as accessible as possible so that researchers who have no experience with digital tools will consider trying new ways of using their data, this GitHub-based lesson showcases the use of entity match tools when dealing with geographic data.
  • Data Journalism and AI: New frontiers in investigation and storytelling

    Data is now an indispensable part of investigative work and storytelling for journalists and newsrooms. Computational methods and artificial intelligence are making their way to newsrooms more than ever before, and promise to open up new opportunities for journalists, as well as new challenges. This talk provides an overview of how data and Artificial Intelligence can be used in the journalism workflow, investigative reporting and storytelling.
  • What Can I Do With This Messy Spreadsheet? Converting from Excel Sheets to Fully Compliant EAD-XML files

    Many Galleries, Libraries, Archives, and Museums (GLAMs) face difficulties sharing their collections metadata in standardised and sustainable ways, meaning that staff rely on more familiar general purpose office programs such as spreadsheets. However, while these tools offer a simple approach to data registration and digitisation they don’t allow for more advanced uses. This blogpost from EHRI explains a procedure for producing EAD (Encoded Archival Description) files from an Excel spreadsheet using OpenRefine.
  • How to Learn and Love Digital Text in Four Easy Steps

    Is ChatGPT unsettling you? Are you annoyed to always land on the same webportal when googling for a specific book? Do you hate it when just the one page you need to consult is nowhere to be found on the internet? This presentation by Anne Baillot is for you!
  • Using Named Entity Recognition to Enhance Access to a Museum Catalog

    This blog discusses the applicability of services such as automatic metadata generation and semantic annotation for automatic extraction of person names and locations from large datasets. This is demonstrated using Oral History Transcripts provided by the United States Holocaust Memorial Museum (USHMM).