XML

XML (eXtensible Markup Language) is a markup language for storing and transporting data in both human-readable and machine-readable format, making it interoperable across digital platforms

The Learning Curve in Sharing Data with the EHRI Project
A partnership between Kazerne Dossin and EHRI was established to enable sharing of metadata with a broader audience. This partnership resulted in changes to the practices of cataloguing archival materials within Kazerne Dossin. Using the example of the Lewkowicz family collection, this article focuses on the revolution Kazerne Dossin went through while standardising descriptions, and on the tools EHRI provided to optimise the workflow for collection holding institutes.
Authors
Dorien Styven
Marius Caragea
Veerle Vanden Daelen
Read more →
What Can I Do With This Messy Spreadsheet? Converting from Excel Sheets to Fully Compliant EAD-XML files
Many Galleries, Libraries, Archives, and Museums (GLAMs) face difficulties sharing their collections metadata in standardised and sustainable ways, meaning that staff rely on more familiar general purpose office programs such as spreadsheets. However, while these tools offer a simple approach to data registration and digitisation they don’t allow for more advanced uses. This blogpost from EHRI explains a procedure for producing EAD (Encoded Archival Description) files from an Excel spreadsheet using OpenRefine.
Authors
Herminio Garcia González
Read more →
Digitizing the Materiality of the Premodern Book
The foundational skills at the intersection of digitization, bibliography, and the Digital Humanities are crucial for many scholars, yet instruction frequently only covers one maybe two of these intersecting aspects. For example, use of the Text Encoding Initiative XML standard is increasingly the norm in digital scholarly editing, but many individuals working with textual materials do not have access to relevant scholarly training in DH. Conversely, many DH departments, lack rare book specialists. The goal of this video class is to teach the necessary skills for understanding how the materiality of pre-modern books can be digitized and provide a foundation for putting those skills into practice. After completing this course, students will understand the fundamentals of digitization and how books and manuscripts are described in the TEI, including the msdescription and transcription modules. This class is part of the project Digitizing the Materiality of the Premodern Book and licensed Creative Commons BY NC SA. This project (2022-2023) is funded by CLARIAH-AT with the support of BMBWF. The videos were produced by Moving Stills.
Authors
Sarah A. Lang
Sean Winslow
Erich Renhart
Read more →
Extracting Lexical Data: XPath for Dictionary Nerds
XPath (XML Path Language) is a standard query language for selecting nodes from XML documents. In this step-by-step tutorial, you will learn how to write XPath expressions in order to navigate around our XML-encoded dictionaries and select only those bits of data that you are interested in.
Authors
Toma Tasovac
Read more →
Text Encoding and the Text Encoding Initiative
This course introduces the theory and practice of text encoding using the Guidelines of the Text Encoding Initiative.
Authors
Susan Schreibman
Roman Bleier
Read more →
Digital Scholarly Editions: Manuscripts, Texts and TEI Encoding
This course will introduce you to the creation of digital scholarly editions, for manuscripts or printed texts, with the help of the TEI and other related technologies.
Authors
Marjorie Burghart
Elena Pierazzo
Read more →
Digitising Dictionaries
This course is an introduction to the theories, practices, and methods of digitizing legacy dictionaries for research, preservation and online distribution. It focuses on a particular technique of modeling and describing lexical data using eXtensible Markup Language (XML) in accordance with the Guidelines of the Text Encoding Initiative, a de-facto standard for text encoding among humanities researchers.
Authors
Toma Tasovac
Read more →