Machine Learning Offers Opportunities to Advance Library Services
DOI:
https://doi.org/10.18438/eblip30527Abstract
A Review of:
Wang, Y. (2022). Using machine learning and natural language processing to analyze library chat reference transcripts. Information Technology and Libraries, 41(3). https://doi.org/10.6017/ital.v41i3.14967
Objective – The study sought to develop a model to predict if library chat questions are reference or non-reference.
Design – Supervised machine learning and natural language processing.
Setting – College of New Jersey academic library.
Subjects – 8,000 Springshare LibChat transactions collected from 2014 to 2021.
Methods – The chat logs were downloaded into Excel, cleaned, and individual questions were labelled reference or non-reference by hand. Labelled data were preprocessed to remove nonmeaningful and stop words, and reformatted to lowercase. Data were then stemmed to group words with similar meaning. The feature of question length was then added and data were transformed from text to numeric for text vectorization. Data were then divided into training and testing sets. The Python packages Natural Language Toolkit (NLTK) and scikit-learn were used for analysis, building random forest and gradient boosting models which were evaluated via confusion matrix.
Main Results – Both models performed very well in precision, recall and accuracy, with the random forest model having better overall results than the gradient boosting model, as well as a more efficient fit time, though slightly longer prediction time.
Conclusion – High volume library chat services could benefit from utilizing machine learning to develop models that inform plugins or chat enhancements to filter chat queries quickly.
Downloads
References
Al-Zaiti, S. S., Alghwiri, A. A., Hu, X., Clermont, G., Peace, A., Macfarlane, P., & Bond, R. (2022). A clinician's guide to understanding and critically appraising machine learning studies: a checklist for Ruling Out Bias Using Standard Tools in Machine Learning (ROBUST-ML). European Heart Journal. Digital Health, 3(2), 125–140. https://doi.org/10.1093/ehjdh/ztac016 DOI: https://doi.org/10.1093/ehjdh/ztac016
Wang, Y. (2022). Using machine learning and natural language processing to analyze library chat reference transcripts. Information Technology and Libraries, 41(3), https://doi.org/10.6017/ital.v41i3.14967 DOI: https://doi.org/10.6017/ital.v41i3.14967
Published
How to Cite
Issue
Section
License
Copyright (c) 2024 Samantha Kaplan
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
The Creative Commons-Attribution-Noncommercial-Share Alike License 4.0 International applies to all works published by Evidence Based Library and Information Practice. Authors will retain copyright of the work.