Exploring Computational Descriptions for Metadata Creation for E-Books at the Library of Congress, United States of America
Lopullinen julkaistu versio - 229.55 KB
https://creativecommons.org/licenses/by/4.0/
Pysyvä osoite
Kuvaus
© The Author(s) 2026. This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made. The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.
The Library of Congress’ (LoC) project “Exploring Computational Description” (ECD) is investigating the use of machine learning (ML) to create metadata for e-books that have not yet been catalogued. In-house the LC Labs carried out this initiative with the U.S. Programs, Law, and Literature Division and an external vendor. An initial budget of $250,000 from the National Digital Trust Fund was allocated for this experimental AI endeavour, prompted by a massive backlog of e-books. During the first project phase, five ML models were evaluated, and in the second project phase, human-in-the-loop prototypes that offer machine-generated terms to librarians were introduced. The integration of AI at the LoC has the potential to enhance cataloguing efficiency by automating repetitive tasks, thereby allowing librarians to focus more on intellectual tasks. At the same time, the project faced several challenges, including ensuring the reliability of AI-generated records, copyright concerns, and managing potentially harmful language in older texts used for training the models. Improving the accuracy of these models remains essential and depends on access to extensive digital data. However, human expertise remains crucial for ensuring high quality, and librarians need to develop a foundational understanding of ML to leverage these technologies effectively. The aim of the project is to develop innovative approaches that contribute to improving library practices.
Emojulkaisu
AI Innovations in Public Services: The Case of National Libraries
ISBN
978-3-032-01344-6
ISSN
Aihealue
OKM-julkaisutyyppi
A3 Kirjan tai muun kokoomateoksen osa (vertaisarvioitu)
