Semantic enrichment for enhancing LAM data and supporting digital humanities. Review article



Keywords:

Semantic enrichment; Libraries, archives, and museums; LAMs; Digital humanities; DH; Smart data; Metadata; Structured data; Semi-structured data; Unstructured data; Knowledge discovery; Entity-centric modeling and information access


With the rapid development of the digital humanities (DH) field, demand for historical and cultural heritage data has generated deep interest in the data provided by libraries, archives, and museums (LAMs). To enhance the quality and discoverability of LAM data while enabling a self-sustaining ecosystem, "semantic enrichment" has become a strategy increasingly used by LAMs in recent years. This article introduces a number of semantic enrichment methods and efforts that can be applied to LAM data at various levels, aiming to support deeper and wider exploration and use of LAM data in DH research. The real cases, research projects, experiments, and pilot studies shared in this article demonstrate the considerable potential of LAM data, whether structured, semi-structured, or unstructured, and regardless of what types of original artifacts carry the data. Following their roadmaps would encourage more effective initiatives and strengthen the effort to maximize the discoverability, usability, and reusability of LAM data, as well as its value in the mainstream of DH and the Semantic Web.







How to cite

Zeng, M. L. (2019). Semantic enrichment for enhancing LAM data and supporting digital humanities. Review article. Profesional de la información / Information Professional, 28(1).



Review articles