Skip to main content

XML

XML (eXtensible Markup Language) is a markup language for storing and transporting data in both human-readable and machine-readable format, making it interoperable across digital platforms

Resources

  • The Learning Curve in Sharing Data with the EHRI Project

    A partnership between Kazerne Dossin and EHRI was established to enable sharing of metadata with a broader audience. This partnership resulted in changes to the practices of cataloguing archival materials within Kazerne Dossin. Using the example of the Lewkowicz family collection, this article focuses on the revolution Kazerne Dossin went through while standardising descriptions, and on the tools EHRI provided to optimise the workflow for collection holding institutes.
    Authors
    • Dorien Styven
    • Marius Caragea
    • Veerle Vanden Daelen
    Read more
  • What Can I Do With This Messy Spreadsheet? Converting from Excel Sheets to Fully Compliant EAD-XML files

    Many Galleries, Libraries, Archives, and Museums (GLAMs) face difficulties sharing their collections metadata in standardised and sustainable ways, meaning that staff rely on more familiar general purpose office programs such as spreadsheets. However, while these tools offer a simple approach to data registration and digitisation they don’t allow for more advanced uses. This blogpost from EHRI explains a procedure for producing EAD (Encoded Archival Description) files from an Excel spreadsheet using OpenRefine.
  • Digitizing the Materiality of the Premodern Book

    The foundational skills at the intersection of digitization, bibliography, and the Digital Humanities are crucial for many scholars, yet instruction frequently only covers one maybe two of these intersecting aspects. For example, use of the Text Encoding Initiative XML standard is increasingly the norm in digital scholarly editing, but many individuals working with textual materials do not have access to relevant scholarly training in DH. Conversely, many DH departments, lack rare book specialists. The goal of this video class is to teach the necessary skills for understanding how the materiality of pre-modern books can be digitized and provide a foundation for putting those skills into practice. After completing this course, students will understand the fundamentals of digitization and how books and manuscripts are described in the TEI, including the msdescription and transcription modules. This class is part of the project Digitizing the Materiality of the Premodern Book and licensed Creative Commons BY NC SA. This project (2022-2023) is funded by CLARIAH-AT with the support of BMBWF. The videos were produced by Moving Stills.
  • Extracting Lexical Data: XPath for Dictionary Nerds

    XPath (XML Path Language) is a standard query language for selecting nodes from XML documents. In this step-by-step tutorial, you will learn how to write XPath expressions in order to navigate around our XML-encoded dictionaries and select only those bits of data that you are interested in.
  • Digitising Dictionaries

    This course is an introduction to the theories, practices, and methods of digitizing legacy dictionaries for research, preservation and online distribution. It focuses on a particular technique of modeling and describing lexical data using eXtensible Markup Language (XML) in accordance with the Guidelines of the Text Encoding Initiative, a de-facto standard for text encoding among humanities researchers.