Publikace:
The datahub: de/blending museum data

Datum
2018-06-06
Název časopisu
ISSN časopisu
Název svazku
Vydavatel
Výzkumné projekty
Organizační jednotky
Číslo časopisu
Abstrakt

The Flemish Museums for Fine and Contemporary Art offer an overview of the art production in the Southern Netherlands and Belgium form the Middle Ages to the Twenty-First Century. The Flemish Art Collection is a non-profit organisation tasked with promoting the collection to an international and diverse audience. Delivery of knowledge and expertise curated by the museums is a big challenge. Blending cultural object records stored accross various databases and commercial registration systems is non-trivial and prevents opening up the collections across the walls of the museums. In 2015, the Flemish Art Collection started the Datahub Project. Over the past years, a modern metadata aggregation platform was built, leveraging open source technologies and open standards. This presentation will highlight the architecture of this platform, and the design process. The Datahub platform is a service oriented architecture and consists of three major components. The core is a home grown, reusable metadata aggregator called The Datahub. This web-application is build with the Symfony framework. Metadata records are ingested via a RESTful API, stored in a MongoDB database and disseminated via an OAI-PMH endpoint. User friendly discovery of metadata is covered via Project Blacklight and geared towards museal workers as well as the general public. Finally, we repurposed the Catmandu framework for flexible and extensible setup of ETL pipelines between the registration systems of the data providers, the Datahub and the discovery interface. Since we are exchanging information about cultural heritage objects, we use the LIDO XML exchange format designed and developed by ICOM. The project taught us several valuable lessons. What are the benefits of looking across the borders of your own domain? What are key success factors? How do you identify pitfalls? But it also raises a set of new questions. How do we go from here? What’s next? The tools and the codebase are freely available under a GPLv3 license and are actively documented and maintained on Github.

Popis
Klíčová slova
Citace