Alignments / Crosswalks: (Web View)
Alignments/Crosswalks (Web View)
1. Crosswalks from MARC to Music Ontology and Other Schemas
All of the MARC fields and subfields and their matching Music Ontology (MO) elements that correspond to the nice common groups (introduced in this page [link]) are managed in an OWL file using Protege tool. This unified crosswalk also consisted the elements from other metadata vocabularies (e.g., Dublin Core and FOAF) that were useful. The OWL file was exported into OWL DOC format for human-readable views.
2. Archival Description and Linked Data: A Preliminary Study of Opportunities and Implementation Challenges
This project investigates how archives can connect their collections to related data sources through the implementation of linked data. Questions explored included: a) What types of data currently available in archival surrogates such as Encoded Archival Description (EAD) finding aids and Machine Readable Cataloging (MARC) records may be useful if converted to linked data? b) For those potentially useful data points identified in archival surrogates, how might one align data structures found in those surrogates to the data structures of other relevant internal or external information sources? c) What features of current standards and data structures present impediments or challenges that must be overcome in order to achieve interoperability among disparate data sources?
The research team identified metadata elements of potential use as linked data in archival surrogates, as well as metadata element sets and vocabularies of data sets that could serve as pathways to relevant external data sources. Data sets chosen for the study included DBpedia and schema.org; metadata element sets examined included Friend of a Friend (FOAF), GeoNames, and Linking Open Description of Events (LODE). The team then aligned tags found in the EAD encoding standard to related classes and properties found in these linked data sources and metadata element sets. To investigate the third question about impediments to incorporating linked data in archival descriptions, the researcher analyzed the locations and frequencies at which controlled and uncontrolled access points (personal and family name, corporate name, geographic name, and genre/form entities) appeared in a sample of MARC and EAD archival descriptive records by using a combination of hand counts and the natural language processing (NLP) tool, OpenCalais. The results of the location and frequency analysis, combined with the results of the alignment process, helped the research team identify several critical challenges currently impeding interoperability among archival information systems and relevant linked data sources, including differences in granularity between archival and other data source vocabularies, and inadequacies of current encoding standards to support semantic tagging of potential access points embedded in free text areas of archival surrogates.
This work was supported by a grant from IMLS. It is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License. reserved.