Beall, J.; Kafadar, K.: Measuring typographical errors' impact on retrieval in bibliographic databases (2007)
0.03
0.025120806 = product of:
0.06698882 = sum of:
0.035423465 = weight(_text_:retrieval in 261) [ClassicSimilarity], result of:
0.035423465 = score(doc=261,freq=4.0), product of:
0.124912694 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.041294612 = queryNorm
0.2835858 = fieldWeight in 261, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.046875 = fieldNorm(doc=261)
0.022201622 = weight(_text_:of in 261) [ClassicSimilarity], result of:
0.022201622 = score(doc=261,freq=22.0), product of:
0.06457475 = queryWeight, product of:
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.041294612 = queryNorm
0.34381276 = fieldWeight in 261, product of:
4.690416 = tf(freq=22.0), with freq of:
22.0 = termFreq=22.0
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.046875 = fieldNorm(doc=261)
0.009363732 = product of:
0.018727465 = sum of:
0.018727465 = weight(_text_:on in 261) [ClassicSimilarity], result of:
0.018727465 = score(doc=261,freq=4.0), product of:
0.090823986 = queryWeight, product of:
2.199415 = idf(docFreq=13325, maxDocs=44218)
0.041294612 = queryNorm
0.20619515 = fieldWeight in 261, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
2.199415 = idf(docFreq=13325, maxDocs=44218)
0.046875 = fieldNorm(doc=261)
0.5 = coord(1/2)
0.375 = coord(3/8)
- Abstract
- Typographical errors can block access to records in online catalogs; but, when a word contains a typo and is also spelled correctly elsewhere in the same record, access may not be blocked. To quantify the effect of typographical errors in records on information retrieval, we conducted a study to measure the proportion of records that contain a typographical error but that do not also contain a correct spelling of the same word. This article presents the experimental design, results of the study, and a statistical analysis of the results.We find that the average proportion of records that are blocked by the presence of a typo (that is, records in which a correct spelling of the word does not also occur) ranges from 35% to 99%, depending upon the frequency of the word being searched and the likelihood of the word being misspelled.
- Footnote
- Simultaneously published as Cataloger, Editor, and Scholar: Essays in Honor of Ruth C. Carter