Digital Libraries that Demonstrate High Levels of Mutual Complementarity in Collection-level Metadata Give a Richer Representation of their Content and Improve Subject Access for Users
AbstractA Review of:
Zavalina, O. L. (2013). Complementarity in subject metadata in large-scale digital libraries: A comparative analysis. Cataloging & Classification Quarterly, 52(1), 77-89. http://dx.doi.org/10.1080/01639374.2013.848316
Objective – To determine how well digital library content is represented through free-text and subject headings. Specifically to examine whether a combination of free-text description data and controlled vocabulary is more comprehensive than free-text description data alone in describing digital collections.
Design – Qualitative content analysis and complementarity comparison.
Setting – Three large scale cultural heritage digital libraries: one in Europe and two in the United States of America.
Methods – The researcher retrieved XML files of complete metadata records for two of the digital libraries, while the third library openly exposed its full metadata. The systematic samples obtained for all three libraries enabled qualitative content analysis to uncover how metadata values relate to each other at the collection level. The researcher retrieved 99 collection-level metadata records in total for analysis. The breakdown was 39, 33, and 27 records per digital library. When comparing metadata in the free-text Description metadata element with data in four controlled vocabulary elements, Subject, Geographic Coverage, Temporal Coverage and Object Type, the researcher observed three types of complementarity: one-way, two-way and multiple-complementarity. The author refers to complementarity as “describing a collection’s subject matter with mutually complementary data values in controlled vocabulary and free-text subject metadata elements” (Zavalina, 2013, p. 77). For example, within a Temporal Coverage metadata element the term “19th century” would complement a Description metadata element “1850–1899” in the same record.
Main Results – The researcher found a high level of one-way complementarity in the metadata of all three digital libraries. This was mostly demonstrated by free-text data in the Description element complemented by data in the controlled vocabulary elements of Subject, Geographic Coverage, Temporal Coverage, and Object Type. Only one library demonstrated a significant proportion (19%) of redundancy between free-text and controlled vocabulary metadata. An example of redundancy found included a repetition of geographic information in both a Description and Geographic Coverage metadata elements.
Conclusion – The author reports high levels of mutual complementarity in the three cultural heritage digital libraries studied. The findings demonstrate that collection-level metadata which includes both free-text and controlled vocabulary is more representative of the intellectual content of the collections and improves subject access for users. The author maintains that there is no standard for collection-level metadata descriptions, and that this research may contribute to best practice guidelines in this area. It is unclear whether the digital libraries studied had written policies in place on how to describe collections and if those policies were adhered to in practice. The author expresses a need for further research to be conducted on collection-level metadata in other domains, such as science and interdisciplinary digital libraries, and on other scales (e.g., regional or state collections) and geographic regions beyond Europe and the United States.
The Creative Commons-Attribution-Noncommercial-Share Alike License 4.0 International applies to all works published by Evidence Based Library and Information Practice. Authors will retain copyright of the work.