Semantic enrichment for enhancing LAM data and supporting digital humanities. Review article
DOI:
https://doi.org/10.3145/epi.2019.ene.03Palabras clave:
Semantic enrichment, Libraries, archives, and museums, LAMs, Digital humanities, DH, Smart data, Metadata, Structured data, Semi-structured data, Unstructured data, Knowledge discovery, Entity-centric modeling and information access,Resumen
With the rapid development of the digital humanities (DH) field, demands for historical and cultural heritage data have generated deep interest the data provided by libraries, archives, and museums (LAMs). In order to enhance LAM data´s quality and discoverability while enabling a self-sustaining ecosystem, "semantic enrichment" becomes a strategy increasingly used by LAMs during recent years. This article introduces a number of semantic enrichment methods and efforts that can be applied to LAM data at various levels, aiming to support deeper and wider exploration and use of LAM data in DH research. The real cases, research projects, experiments, and pilot studies shared in this article demonstrate endless potential for LAM data, whether they are structured, semi-structured, or unstructured, regardless of what types of original artifacts carry the data. Following their roadmaps would encourage more effective initiatives and strengthen this effort to maximize LAM data´s discoverability, use- and reuse-ability, and their value in the mainstream of DH and Semantic Web.
Descargas
Citas
Albritton, Benjamin (2013). Digital manuscript interoperability: Shared canvas and IIIF in practice. https://slideplayer.com/slide/5840185
Alemu, Getaneh; Brett, Stevens; Ross, Penny; Chandler, Jane (2012). "Linked data for libraries: Benefits of a conceptual shift from library-specific record structures to RDF-based data models". New library world, v. 113, n. 11/12, pp. 549-570. https://doi.org/10.1108/03074801211282920
Allen, Robert B. (2017). "Rich semantics and direct representation for digital collections". In: ACM/IEEE Joint conference on digital libraries (JDCL), pp. 348-349. https://doi.org/10.1109/JCDL.2017.7991623
Appleby, Michael; Crane, Tom; Sanderson, Robert; Stroop, Jon; Warnet, Simeon (2012a). "IIIF Image API 2.1.1". IIIF. https://iiif.io/api/image/2.1
Appleby, Michael; Crane, Tom; Sanderson, Robert; Stroop, Jon; Warnet, Simeon (2012b). "IIIF Presentation API 2.1.1". IIIF. https://iiif.io/api/presentation/2.1
Bainbridge, David; Hinze, Annika; Cunningham, Sally-Jo; Downie, J. Stephen (2016). "Low-cost semantic enhancement to digital library metadata and indexing: Simple yet effective strategies". In: 2016 ACM/IEEE Joint conference on digital libraries (JDCL), pp. 93-102. https://core.ac.uk/download/pdf/44290466.pdf
Baker, Thomas; Bermès, Emmanuelle; Coyle, Karen; Dunsire, Gordon; Isaac, Antoine; Murray, Peter; Panzer, Michael; Schneider, Jodi; Singer, Ross; Summers, Ed; Waites, William; Young, Jeff; Zeng, Marcia Lei (2011). Library linked data Incubator Group Final Report. W3C Incubator Group Report 25. http://www.w3.org/2005/Incubator/lld/XGR-lld-20111025
Bensmann, Felix; Zapilko, Benjamin; Mayr, Philipp (2017). "Interlinking large-scale library data with authority records". Frontiers in digital humanities, n. 4, p. 5. https://doi.org/10.3389/fdigh.2017.00005
Borgman, Christine L. (2015). Big data, little data, no data: Scholarship in the networked world. Cambridge, MA: MIT Press. ISBN: 978 0 262529914
Burdick, Anne; Drucker, Johanna; Lunenfeld, Peter; Presner, Todd; Schnapp, Jeffrey (2012). Digital_Humanities. Cambridge, MA: MIT Press. ISBN: 978 0 262528863
Clarke, David (2015). "Deep image annotation: Making a difference in knowledge organization". Fourth ISKO-UK Biennial conference of the International Society for Knowledge Organization. http://docplayer.net/13812285-Deep-image-annotation-making-a-difference-in-knowledge-organization.html
Consultative Committee for Space Data Systems (2012). Reference model for an Open Archival Information System. Washington DC: CCSDS. https://public.ccsds.org/Pubs/650x0m2.pdf
Damjanovic, Violeta; Kurz, Thomas; Westenthaler, Rupert; Behrendt, Wernher; Gruber, Andreas; Schaffert, Sebastian (2011). "Semantic enhancement: The key to massive and heterogeneous data pools". In: Proceedings of the 20th intl IEEE ERK (Electrotechnical and Computer Science) conference, pp. 413-416. https://www.researchgate.net/publication/266603290_Semantic_Enhancement_The_Key_to_Massive_and_Heterogeneous_Data_Pools
Dunsire, Gordon; Willer, Mirna (2011). "Standard library metadata models and structures for the semantic web". Library hi tech news, v. 28, n. 3, pp. 1-12. https://doi.org/10.1108/07419051111145118
Farias-Lóscio, Bernadette; Burle, Caroline; Calegari, Newton (2017). Data on the web best practices. W3C Recommendation 31 January 2017. http://www.w3.org/TR/dwbp
Floridi, Luciano (2010). Information: A very short introduction. Oxford: Oxford University Press. ISBN: 978 0 199551378
Gardner, Dan (2012). "An ocean of data [Introduction]". In: Smolan, Rick; Erwitt, Jennifer (eds.). The human face of big data. Sausalito, CA: Against All Odds Productions, pp. 14-17. ISBN: 978 1 454908272
Gracy, Karen; Davidson, Sammy (2014). "Helping users find the "˜good stuff´: Using the semantic analysis method (SAM) tool to identify and extract potential access points from archival finding aids". In: SAA Research Forum, Society of American Archivists. http://files.archivists.org/pubs/proceedings/ResearchForum/2014/posters/GracyDavidson-ResearchForumPoster2014.pdf
Gracy, Karen; Zeng, Marcia Lei (2015). "Creating linked data within archival description: Tools for extracting, validating, and encoding access points for finding aids". Digital humanities conference of the Alliance of Digital Humanities Organizations (ADHO).
Gracy, Karen; Zeng, Marcia Lei; Skirvin, Laurence (2013). "Exploring methods to improve access to music resources by aligning library data with linked data: A report of methodologies and preliminary findings". Journal of the American Society for Information Science and Technology (JASIS&T), v. 64, n. 10, pp. 2078-2099. https://doi.org/10.1002/asi.22914
Gruber, Ethan (2017). "Final report to the NEH for online coins of the Roman Empire". Day of archaeology, July 28. http://www.dayofarchaeology.com/final-report-to-the-neh-for-online-coins-of-the-roman-empire
Hinze, Annika; Taube-Schock, Craig; Bainbridge, David; Matamua, Rangi; Downie, J. Stephen (2015). "Improving access to large-scale digital libraries through semantic-enhanced search and disambiguation". In: Proceedings of the 15th ACM/IEEE-CS Joint conference on digital libraries. Association for Computational Linguistics, pp. 147-156. https://doi.org/10.1145/2756406.2756920
Hyví¶nen, Eero (2016). "Cultural heritage linked data on the semantic web: Three case studies using the sampo model". VIII Encounter of documentation centres of contemporary art: open linked data and integral management of information in cultural centres. Artium, Vitoria-Gasteiz, Spain, October 19-20. https://seco.cs.aalto.fi/publications/submitted/hyvonen-vitoria-2017.pdf
Hyví¶nen, Eero; Heino, Erkki; Leskinen, Petri; Ikkala, Esko; Koho, Mikko; Tamper, Minna; Tuominen, Jouni; Mí¤kelí¤, Eetu (2016). "Publishing Second World War history as linked data events on the semantic web". In: Proceedings of the digital humanities conference, pp. 571-573. https://seco.cs.aalto.fi/publications/2016/hyvonen-et-al-warsa-dh2016.pdf
Hyví¶nen, Eero; Leskinen, Petri; Tamper, Minna; Rantala, Heikki; Tuominen, Jouni; Keravuori, Kirsi (2018). "Demonstrating BiographySampo in solving digital humanities research problems in biography and prosopography" [Submitted paper]. https://seco.cs.aalto.fi/publications/submitted/hyvonen-et-al-bs-2019.pdf
Ikkala, Esko; Tuominen, Jouni; Raunamaa, Jaakko; Aalto, Tiina; Ainiala, Terhi; Uusitalo, Heliní¤; Hyví¶nen, Eero (2018). "NameSampo: A linked open data infrastructure and workbench for toponomastic research". In: GeoHumanities 18, Proceedings of the 2nd ACM SIG Spatial workshop on geospatial humanities, Seattle, WA, USA, November 06-09, pp. 2:1-2:9, ACM. https://doi.org/10.1145/3282933.3282936
IMLS (2018). Transforming communities: IMLS strategic plan (2018-2022). Washington DC: Institute of Museum and Library Services. https://www.imls.gov/sites/default/files/publications/documents/imls-strategic-plan-2018-2022.pdf
Isaac, Antoine; Manguinhas, Hugo; Stiller, Juliane; Charles, Valentine (2015). Report on enrichment and evaluation. The Hague, Netherlands: Europeana Task Force on Enrichment and Evaluation. http://pro.europeana.eu/files/Europeana_Professional/EuropeanaTech/EuropeanaTech_taskforces/Enrichment_Evaluation/FinalReport_EnrichmentEvaluation_102015.pdf
Kaplan, Frederic (2015). "A map for big data research in digital humanities". Frontiers in digital humanities, n. 2. https://doi.org/10.3389/fdigh.2015.00001
KBpedia (2018). KBpedia is now open source, October 23. http://kbpedia.org/resources/news/kbpedia-is-open-source
Kobielus, James (2016). "The evolution of big data to smart data". In: Smart data online, July 13 [PowerPoint slides]. http://smartdata2016.dataversity.net
Lin, Yuri; Michel, Jean-Baptiste; Lieberman-Aiden, Erez; Orwant, Jon; Brockman, Will; Petrov, Slav (2012). "Syntactic annotations for the Google Books Ngram corpus". In: Proceedings of the ACL 2012 System demonstrations. Association for Computational Linguistics, pp. 169-174. http://aclweb.org/anthology/P12-3029
Manguinhas, Hugo (ed.) (2016). Europeana semantic enrichment framework. Version 17, Nov. http://shorturl.at/pEIQ5
Mayer, Daniel (2011). Mainstream semantic enrichment [YouTube video]. December 26. http://www.youtube.com/watch?v=YVxvQ7UpqI0
Mayer-Schí¶nberger, Viktor; Cukier, Kenneth (2013). Big data: A revolution that will transform how we live, work, and think. New York, NY: Eamon Dolan/Mariner Books. ISBN: 978 0 544227750
Mukerjee, Prithwis (2014). "Introduction to data science" [PowerPoint slides], January 12. http://www.slideshare.net/prithwis/01-intro2-datascienceyantrajaalblog
Mutuvi, Stephen; Doucet, Antoine; Odeo, Moses; Jatowt, Adam (2018). "Evaluating the impact of OCR errors on topic modeling". In: Maturity and innovation in digital libraries. 20th Intl conf on Asia-Pacific digital libraries, ICADL 2018, Hamilton, New Zealand, November 19-22, Proceedings, pp. 3-14. ISBN: 978 3 030 04257 8
National Archives (2016). "Finding aid type". The national archives catalog. https://www.archives.gov/research/catalog/lcdrg/elements/findingtype.html
Nguyen, Thi-Tuyet-Hai; Coustaty, Mickael; Doucet, Antoine; Jatowt, Adam; Nguyen, Nhu-Van (2018). "Adaptive edit-distance and regression approach for post-OCR text correction". In: Maturity and innovation in digital libraries. 20th Intl conf on Asia-Pacific digital libraries, ICADL 2018, Hamilton, New Zealand, November 19-22, Proceedings, pp. 278-289. ISBN: 978 3 030 04257 8
O´Neill, Ed; Mixter, Jeff (2013). "Maximizing the usage of value vocabularies in the linked data ecosystem". In: 76th Annual meeting of the American Society for Information Science and Technology (ASIS&T), Montreal, Canada, November. http://nkos.slis.kent.edu/ASIST2013/ONeill-Mixter.pptx
Pattuelli, M. Cristina (2012). "Personal name vocabularies as linked open data: A case study of jazz artist names". Journal of information science, v. 38, n. 6, pp. 558-565. https://doi.org/10.1177/0165551512455989
Pattuelli, M. Cristina; Hwang, Karen; Miller, Matthew (2016). "Accidental discovery, intentional inquiry: Leveraging linked data to uncover the women of jazz". Digital scholarship in the humanities, v. 32, n. 4, pp. 918-924. https://doi.org/10.1093/llc/fqw0
Prasad, A. R. D.; Giunchiglia, Fausto; Devika, P. Madalli (2017). "DERA: from document centric to entity centric knowledge modelling". In: Proceedings of the International UDC seminar 2017. Faceted classification today. London, September, pp. 169-179. http://seminar.udcc.org
Riva, Pat; LeBoeuf, Patrick; Žumer, Maja (2017). IFLA library reference model: A conceptual model for bibliographic information. Netherlands: IFLA. https://www.ifla.org/files/assets/cataloguing/frbr-lrm/ifla-lrm-august-2017.pdf
Schí¶ch, Christof (2013). "Big? Smart? Clean? Messy? Data in the Humanities". Journal of digital humanities, v. 2, n. 3, pp. 2-13. http://journalofdigitalhumanities.org/2-3/big-smart-clean-messy-data-in-the-humanities
Smith-Yoshimura, Karen (2018). "The rise of Wikidata as a linked data source". In: Hanging together. The OCLC research blog, August 6. http://hangingtogether.org/?p=6775
Stiller, Juliane; Petras, Vivien; Gí¤de, Maria; Isaac, Antoine (2014). "Automatic enrichments with controlled vocabularies in Europeana: Challenges and consequences." In: Euro-Mediterranean conf., pp. 238-247. Springer, Cham. https://doi.org/10.1007/978-3-319-13695-0_23
Svensson, Patrik (2010). "The landscape of digital humanities". Digital humanities quarterly, v. 4, n. 1. http://digitalhumanities.org/dhq/vol/4/1/000080/000080.html
Thorsen, Hilary K.; Pattuelli, M. Cristina (2016). "Linked open data and the cultural heritage landscape". In: Jones, Ed; Seikel, Michele (eds.). Linked data for cultural heritage. Chicago, IL: Alcts Publishing. ISBN: 978 1 783301621
TiECON East (2014). Data is the new oil. https://tieconeast.wordpress.com/page/2
Reinhard, Andrew; Van-Alfen, Peter; Bransbourg, Gilles; Gruber, Ethan; (2017). "Wishes granted: the ANS and the NEH". In: National Endowment for the Humanities. Announces. New grant recipients. http://numismatics.org/pocketchange/wp-content/uploads/sites/3/NEH-Article-ANS-Magazine.pdf
Van-Ruyskensvelde, Sarah (2014). "Towards a history of e-ducation? Exploring the possibilities of digital humanities for the history of education". Paedagogica historica, v. 50, n. 6, pp. 861-870. https://doi.org/10.1080/00309230.2014.955511
Varner, Stewart; Hswe, Patricia (2016). Special report: Digital humanities in libraries. American Libraries. https://americanlibrariesmagazine.org/2016/01/04/special-report-digital-humanities-libraries
W3C (2011). Library Linked Data Incubator Group Final Report https://www.w3.org/2005/Incubator/lld/XGR-lld-20111025
W3C (2017). Data on the Web best practices. https://www.w3.org/TR/dwbp
Wagner, Elisabeth; Matsumoto, Mallory; Kiel, Nikolai; Gronemeyer, Sven (2014). A checklist of museums with Maya Art. http://mayawoerterbuch.de/museumscollections
Wang, Xiaoguang; Liu, Xuemei; Xia, Shengping (2017). "Design and implementation of deep semantic indexing on digital cultural heritage images". Journal of library and information science, v. 43, n. 1, pp. 98-121. http://jlis.glis.ntnu.edu.tw/ojs/index.php/jlis/article/view/716
Weitz, Jay; Toves, Jenny; Vizine-Goetz, Diane; Naught, Nannette; Bremer, Robert (2016). "Mining MARC´s hidden treasures: Initial investigations into how notes of the past might shape our future". Journal of library metadata, v. 16, n. 3-4, pp. 166-180. https://doi.org/10.1080/19386389.2016.1262653
Zeng, Marcia Lei (2017). "Smart data for digital humanities". Journal of data and information science, v. 2, n. 1, pp. 1-12. https://doi.org/10.1515/jdis-2017-0001
Zeng, Marcia Lei; Gracy, Karen; Skirvin, Laurence (2013). "Navigating the intersection of library bibliographic data and linked music information sources: A study in the identification of useful metadata elements for interlinking". Journal of library metadata, v. 13, n. 2-3, pp. 254-278. https://doi.org/10.1080/19386389.2013.827513
Zeng, Marcia Lei; Gracy, Karen F.; Žumer, Maja (2014). "Using a semantic analysis tool to generate subject access points: A study using Panofsky´s theory and two research samples". Knowledge organization, v. 41, n. 6, pp. 440-451. https://pdfs.semanticscholar.org/bbeb/42b931fd32520a03167770d2b5de694128e6.pdf
Zeng, Marcia Lei; Mayr, Philipp (2018). "Knowledge organization systems (KOS) in the semantic web". International journal on digital libraries. https://doi.org/10.1007/s00799-018-0241-2
Žumer, Maja (2018). "IFLA library reference model (IFLA LRM): Harmonisation of the FRBR family". Knowledge organization, v. 45, n. 4, pp. 310-318.
Also available in Hjí¸rland, Birger (ed.). ISKO Encyclopedia of knowledge organization. http://www.isko.org/cyclo/lrm
Žumer, Maja; Riva, Pat (2017). "IFLA LRM-Finally here". In: Intl conf on Dublin Core and metadata applications, Washington, D.C., USA, 26-29 October, pp. 13-23. http://dcpapers.dublincore.org/pubs/article/download/3852/2037
Descargas
Publicado
Cómo citar
Número
Sección
Licencia
Condiciones de difusión de los artículos una vez son publicados
Los autores pueden publicitar libremente sus artículos en webs, redes sociales y repositorios
Deberán respetarse sin embargo, las siguientes condiciones:
- Solo deberá hacerse pública la versión editorial. Rogamos que no se publiquen preprints, postprints o pruebas de imprenta.
- Junto con esa copia ha de incluirse una mención específica de la publicación en la que ha aparecido el texto, añadiendo además un enlace clicable a la URL: http://revista.profesionaldelainformacion.com
La revista Profesional de la información ofrece los artículos en acceso abierto con una licencia Creative Commons BY.