DARIAH is a pan-European infrastructure for arts and humanities scholars working with computational methods. It supports digital research as well as the teaching of digital research methods.
The course builds upon Extracting Lexical Data: XPath for Dictionary Nerds and introduces the basics of XSL Transformations (XSLT), a standard language for transforming XML documents.
This course will help users learn how to use oXygen XML, a versatile, professional-grade XML editor to edit, validate, query and transform lexicographic data.
This course will introduce the concept and the ELEXIS implementation of the dictionary matrix, a universal repository of linked senses, and other types of lexical information found in existing lexicographic resources.
This course explores the principles of open access, open data, FAIR principles and open science as they apply to lexicography including the specific challenges posed by intellectual property rights and copyright issues in the context of lexicographic work.
This course will present an overview of resources available from CLARIN that may be useful for the lexicographer; we refer to lexical datasets but also to textual resources such as corpora, as well as tools.
The course will explore how software tools for dictionary production (so-called dictionary writing systems, or DWS) can be used to streamline and facilitate the structural coherence and quality assurance in a dictionary project by focusing on Lexonomy, a dictionary-writing system developed as part of ELEXIS.
The course will introduce corpus tools available in ELEXIS and describe various ways in which they can be exploited in lexicographic research and for compiling dictionaries.
This course will introduce the theories, practices, and methods of digitizing legacy dictionaries for research, preservation and online distribution by focusing on the process of converting paper-based dictionaries to electronic format through image capture, text capture, data modeling and data enrichment.
This course will explore the notion of lexicographic evidence and the limitation of subjective views on language by tracing the changes in lexicographic practice from the extensive use of manually selected citations to the employment of large language corpora.