Golub, K.: Automated subject indexing : an overview (2021)
0.04
0.040487964 = product of:
0.14170787 = sum of:
0.059409913 = weight(_text_:subject in 718) [ClassicSimilarity], result of:
0.059409913 = score(doc=718,freq=8.0), product of:
0.10738805 = queryWeight, product of:
3.576596 = idf(docFreq=3361, maxDocs=44218)
0.03002521 = queryNorm
0.5532265 = fieldWeight in 718, product of:
2.828427 = tf(freq=8.0), with freq of:
8.0 = termFreq=8.0
3.576596 = idf(docFreq=3361, maxDocs=44218)
0.0546875 = fieldNorm(doc=718)
0.023552012 = weight(_text_:classification in 718) [ClassicSimilarity], result of:
0.023552012 = score(doc=718,freq=2.0), product of:
0.09562149 = queryWeight, product of:
3.1847067 = idf(docFreq=4974, maxDocs=44218)
0.03002521 = queryNorm
0.24630459 = fieldWeight in 718, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.1847067 = idf(docFreq=4974, maxDocs=44218)
0.0546875 = fieldNorm(doc=718)
0.035193928 = weight(_text_:bibliographic in 718) [ClassicSimilarity], result of:
0.035193928 = score(doc=718,freq=2.0), product of:
0.11688946 = queryWeight, product of:
3.893044 = idf(docFreq=2449, maxDocs=44218)
0.03002521 = queryNorm
0.30108726 = fieldWeight in 718, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.893044 = idf(docFreq=2449, maxDocs=44218)
0.0546875 = fieldNorm(doc=718)
0.023552012 = weight(_text_:classification in 718) [ClassicSimilarity], result of:
0.023552012 = score(doc=718,freq=2.0), product of:
0.09562149 = queryWeight, product of:
3.1847067 = idf(docFreq=4974, maxDocs=44218)
0.03002521 = queryNorm
0.24630459 = fieldWeight in 718, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.1847067 = idf(docFreq=4974, maxDocs=44218)
0.0546875 = fieldNorm(doc=718)
0.2857143 = coord(4/14)
- Abstract
- In the face of the ever-increasing document volume, libraries around the globe are more and more exploring (semi-) automated approaches to subject indexing. This helps sustain bibliographic objectives, enrich metadata, and establish more connections across documents from various collections, effectively leading to improved information retrieval and access. However, generally accepted automated approaches that are functional in operative systems are lacking. This article aims to provide an overview of basic principles used for automated subject indexing, major approaches in relation to their possible application in actual library systems, existing working examples, as well as related challenges calling for further research.
- Footnote
- Teil eines Themenheftes: Artificial intelligence (AI) and automated processes for subject sccess
- Source
- Cataloging and classification quarterly. 59(2021) no.8, p.702-719
Golub, K.; Tyrkkö, J.; Hansson, J.; Ahlström, I.: Subject indexing in humanities : a comparison between a local university repository and an international bibliographic service (2020)
0.01
0.013652044 = product of:
0.095564306 = sum of:
0.060013074 = weight(_text_:subject in 5982) [ClassicSimilarity], result of:
0.060013074 = score(doc=5982,freq=16.0), product of:
0.10738805 = queryWeight, product of:
3.576596 = idf(docFreq=3361, maxDocs=44218)
0.03002521 = queryNorm
0.55884314 = fieldWeight in 5982, product of:
4.0 = tf(freq=16.0), with freq of:
16.0 = termFreq=16.0
3.576596 = idf(docFreq=3361, maxDocs=44218)
0.0390625 = fieldNorm(doc=5982)
0.035551235 = weight(_text_:bibliographic in 5982) [ClassicSimilarity], result of:
0.035551235 = score(doc=5982,freq=4.0), product of:
0.11688946 = queryWeight, product of:
3.893044 = idf(docFreq=2449, maxDocs=44218)
0.03002521 = queryNorm
0.30414405 = fieldWeight in 5982, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
3.893044 = idf(docFreq=2449, maxDocs=44218)
0.0390625 = fieldNorm(doc=5982)
0.14285715 = coord(2/14)
- Abstract
- As the humanities develop in the realm of increasingly more pronounced digital scholarship, it is important to provide quality subject access to a vast range of heterogeneous information objects in digital services. The study aims to paint a representative picture of the current state of affairs of the use of subject index terms in humanities journal articles with particular reference to the well-established subject access needs of humanities researchers, with the purpose of identifying which improvements are needed in this context. Design/methodology/approach The comparison of subject metadata on a sample of 649 peer-reviewed journal articles from across the humanities is conducted in a university repository, against Scopus, the former reflecting local and national policies and the latter being the most comprehensive international abstract and citation database of research output. Findings The study shows that established bibliographic objectives to ensure subject access for humanities journal articles are not supported in either the world's largest commercial abstract and citation database Scopus or the local repository of a public university in Sweden. The indexing policies in the two services do not seem to address the needs of humanities scholars for highly granular subject index terms with appropriate facets; no controlled vocabularies for any humanities discipline are used whatsoever. Originality/value In all, not much has changed since 1990s when indexing for the humanities was shown to lag behind the sciences. The community of researchers and information professionals, today working together on digital humanities projects, as well as interdisciplinary research teams, should demand that their subject access needs be fulfilled, especially in commercial services like Scopus and discovery services.