Publikace:
From XML to MARC: RDF Behind the Scenes

UPOZORNĚNÍ: Pokud video nejde přehrát, přihlašte se jednotným přihlášením NTK do repozitáře. Poté klikněte pravým tlačítkem na odkaz na soubor, který chcete přehrát a zvolte "Uložit odkaz jako" a soubor uložte.

Datum
2018
Název časopisu
ISSN časopisu
Název svazku
Vydavatel
Výzkumné projekty
Organizační jednotky
Číslo časopisu
Abstrakt

We collect heterogeneous metadata packages from various publishers. Although all of them are in XML, they vary a lot in terms of vocabulary, structure, granularity, precision, and accuracy. It is quite a challenge to cope with this jungle and recycling it to meet the needs of the Sudoc, the French academic union cataloguing system. How to integrate and enrich these metadata? How to integrate them in order to process them in a regular way, not through ad hoc processes? How to integrate them with specific or generic controlled vocabularies ? How to enrich them with author identifiers, for instance? RDF looks like the ideal solution for integration and enrichment. Metadata are stored in the Virtuoso RDF database and processed through a workflow steered by the Oracle DB. We will illustrate this generic solution with Oxford UP metadata: ONIX records for printed books and KBART package description for ebooks. So. A relational database as glue and pipeline engine… RDF as internal model… MARC as output …. Quite weird… Was this abstract written by an ELAG-specific random text generator?

Popis
Klíčová slova
formáty popisu, metadata, nakladatelství, katalogy, konverze, description formats, metadata, publishing houses, catalogues, conversion
Citace
DOI
ISBN
ISSN
Název akce
ELAG 2018
Datum a místo konání
2018-06-04
Praha (CZ)