Beall, J.; Kafadar, K.: Measuring typographical errors' impact on retrieval in bibliographic databases (2007)
0.02
0.016649358 = product of:
  0.049948074 = sum of:
    0.049948074 = product of:
      0.07492211 = sum of:
        0.031153653 = weight(_text_:online in 261) [ClassicSimilarity], result of:
          0.031153653 = score(doc=261,freq=2.0), product of:
            0.1548489 = queryWeight, product of:
              3.0349014 = idf(docFreq=5778, maxDocs=44218)
              0.051022716 = queryNorm
            0.20118743 = fieldWeight in 261, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.0349014 = idf(docFreq=5778, maxDocs=44218)
              0.046875 = fieldNorm(doc=261)
        0.043768454 = weight(_text_:retrieval in 261) [ClassicSimilarity], result of:
          0.043768454 = score(doc=261,freq=4.0), product of:
            0.15433937 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.051022716 = queryNorm
            0.2835858 = fieldWeight in 261, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.046875 = fieldNorm(doc=261)
      0.6666667 = coord(2/3)
  0.33333334 = coord(1/3)
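The score breakdown above can be reproduced by hand. The following is a minimal sketch, not the search engine's own code: it assumes tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which are the formulas that match the numbers shown, and it copies queryNorm and fieldNorm directly from the output because they depend on the full query and on index-time field length. The function and constant names are illustrative.

```python
import math

# Values copied from the explain output (not derived here).
MAX_DOCS = 44218
QUERY_NORM = 0.051022716   # depends on the whole query; taken as given
FIELD_NORM = 0.046875      # index-time length norm for doc 261; taken as given


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency; assumed form that matches the output."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq):
    """Score of one term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * FIELD_NORM
    return query_weight * field_weight


online = term_score(freq=2.0, doc_freq=5778)
retrieval = term_score(freq=4.0, doc_freq=5836)

# Inner clause: two of three terms matched, so coord(2/3).
inner = (online + retrieval) * (2 / 3)
# Outer query: one of three clauses matched, so coord(1/3).
total = inner * (1 / 3)

print(f"online={online:.6f} retrieval={retrieval:.6f} total={total:.6f}")
```

The result agrees with the listed 0.016649358 to within floating-point rounding, since the original values were computed in single precision.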
- Abstract
- Typographical errors can block access to records in online catalogs; however, when a word contains a typo and is also spelled correctly elsewhere in the same record, access may not be blocked. To quantify the effect of typographical errors in records on information retrieval, we conducted a study to measure the proportion of records that contain a typographical error but that do not also contain a correct spelling of the same word. This article presents the experimental design, results of the study, and a statistical analysis of the results. We find that the average proportion of records that are blocked by the presence of a typo (that is, records in which a correct spelling of the word does not also occur) ranges from 35% to 99%, depending upon the frequency of the word being searched and the likelihood of the word being misspelled.
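The quantity measured in the abstract, the proportion of typo-bearing records that do not also contain the correct spelling, can be illustrated with a short sketch. This is only an illustration under simple assumptions (whole-word matching on lowercased text); it is not the authors' experimental design, and the sample records and the function name `blocked_proportion` are invented for the example.

```python
import re


def tokens(record):
    """Lowercased set of alphabetic words in a record."""
    return set(re.findall(r"[a-z]+", record.lower()))


def blocked_proportion(records, typo, correct):
    """Share of records containing `typo` that lack `correct`.

    Returns None when no record contains the typo.
    """
    with_typo = [r for r in records if typo in tokens(r)]
    if not with_typo:
        return None
    blocked = [r for r in with_typo if correct not in tokens(r)]
    return len(blocked) / len(with_typo)


# Hypothetical records, made up for illustration only.
sample_records = [
    "Goverment documents in online catalogs",
    "Goverment publications: a guide to government information",
    "Public libraries and local goverment",
    "Government information retrieval",
]

proportion = blocked_proportion(sample_records, "goverment", "government")
print(proportion)
```

In this made-up sample, three records contain the typo and one of them also contains the correct spelling, so two of the three would be blocked for a searcher using the correct word.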