Document (#1500)

Author
Ballard, T.
Lifshin, A.
Title
Prediction of OPAC spelling errors through a keyword inventory
Source
Information technology and libraries. 11(1992), p.139-145
Year
1992
Abstract
In order to find and correct spelling errors in the online public access catalog at Adelphi University, a visual inspection was performed of the 117,000 keywords indexed in the system. More than 1,000 errors were found. Certain long but common words such as administration, education, and commercial were found to generate many different misspellings. Most of the records were derived from bibliographic utilities, so the findings can be generalized to other OPACs. The same misspellings were also found in substantial numbers in CD-ROM databases. Misspellings were analyzed by the machine-readable catalog (MARC) field in which they were found, part of speech, and type of mistake. Lists of commonly misspelled root words and specific mistakes are included.
Theme
OPAC

Similar documents (author)

  1. Ballard, P.I.: Bound withs versus an online catalog : a practical solution (1992) 5.60
    5.5967755 = sum of:
      5.5967755 = weight(author_txt:ballard in 2968) [ClassicSimilarity], result of:
        5.5967755 = fieldWeight in 2968, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.954841 = idf(docFreq=14, maxDocs=42740)
          0.625 = fieldNorm(doc=2968)
    
  2. Ballard, T.: OCLC's EPIC : report from the field (1991) 5.60
    5.5967755 = sum of:
      5.5967755 = weight(author_txt:ballard in 4859) [ClassicSimilarity], result of:
        5.5967755 = fieldWeight in 4859, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.954841 = idf(docFreq=14, maxDocs=42740)
          0.625 = fieldNorm(doc=4859)
    
  3. Ballard, T.: Using FirstSearch in a bibliographic construction (1993) 5.60
    5.5967755 = sum of:
      5.5967755 = weight(author_txt:ballard in 7309) [ClassicSimilarity], result of:
        5.5967755 = fieldWeight in 7309, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.954841 = idf(docFreq=14, maxDocs=42740)
          0.625 = fieldNorm(doc=7309)
    
  4. Ballard, T.: Comparative searching styles of patrons and staff (1994) 5.60
    5.5967755 = sum of:
      5.5967755 = weight(author_txt:ballard in 501) [ClassicSimilarity], result of:
        5.5967755 = fieldWeight in 501, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.954841 = idf(docFreq=14, maxDocs=42740)
          0.625 = fieldNorm(doc=501)
    
  5. Ballard, T.: Library systems : transaction log fever; analyzing patron searches can reveal solutions to increase search success (1996) 5.60
    5.5967755 = sum of:
      5.5967755 = weight(author_txt:ballard in 5830) [ClassicSimilarity], result of:
        5.5967755 = fieldWeight in 5830, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.954841 = idf(docFreq=14, maxDocs=42740)
          0.625 = fieldNorm(doc=5830)
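
The author-match scores above all reduce to one product: tf x idf x fieldNorm. The following is a minimal sketch of that arithmetic, assuming the standard ClassicSimilarity formulas (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))). The function names are hypothetical helpers for illustration, not Lucene API calls, and Lucene itself works in 32-bit floats, so the last digits may differ slightly.

```python
import math

def classic_tf(freq):
    # Term-frequency factor: square root of the raw term frequency.
    return math.sqrt(freq)

def classic_idf(doc_freq, max_docs):
    # Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def field_weight(freq, doc_freq, max_docs, field_norm):
    # fieldWeight as shown in the explain tree: tf * idf * fieldNorm.
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm

# Values taken from the "author_txt:ballard" entries listed above.
author_idf = classic_idf(doc_freq=14, max_docs=42740)
author_weight = field_weight(freq=1.0, doc_freq=14, max_docs=42740, field_norm=0.625)
print(round(author_idf, 4), round(author_weight, 4))
```

Because every author hit has termFreq=1.0 and the same fieldNorm, all five entries receive the same score; only the document number differs.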
    

Similar documents (content)

  1. Drabenstott, K.M.; Weller, M.S.: Handling spelling errors in online catalog searches (1996) 0.41
    0.41082752 = sum of:
      0.41082752 = product of:
        1.4672412 = sum of:
          0.18367362 = weight(abstract_txt:misspelled in 889) [ClassicSimilarity], result of:
            0.18367362 = score(doc=889,freq=2.0), product of:
              0.21683528 = queryWeight, product of:
                1.5704886 = boost
                9.583449 = idf(docFreq=7, maxDocs=42740)
                0.014406992 = queryNorm
              0.8470652 = fieldWeight in 889, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.583449 = idf(docFreq=7, maxDocs=42740)
                0.0625 = fieldNorm(doc=889)
          0.06421357 = weight(abstract_txt:words in 889) [ClassicSimilarity], result of:
            0.06421357 = score(doc=889,freq=2.0), product of:
              0.13557926 = queryWeight, product of:
                1.7562301 = boost
                5.358442 = idf(docFreq=546, maxDocs=42740)
                0.014406992 = queryNorm
              0.4736238 = fieldWeight in 889, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.358442 = idf(docFreq=546, maxDocs=42740)
                0.0625 = fieldNorm(doc=889)
          0.29195687 = weight(abstract_txt:spelling in 889) [ClassicSimilarity], result of:
            0.29195687 = score(doc=889,freq=5.0), product of:
              0.27416238 = queryWeight, product of:
                2.4974036 = boost
                7.619839 = idf(docFreq=56, maxDocs=42740)
                0.014406992 = queryNorm
              1.0649049 = fieldWeight in 889, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.619839 = idf(docFreq=56, maxDocs=42740)
                0.0625 = fieldNorm(doc=889)
          0.054110717 = weight(abstract_txt:found in 889) [ClassicSimilarity], result of:
            0.054110717 = score(doc=889,freq=1.0), product of:
              0.19200723 = queryWeight, product of:
                2.955688 = boost
                4.5090566 = idf(docFreq=1278, maxDocs=42740)
                0.014406992 = queryNorm
              0.28181604 = fieldWeight in 889, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5090566 = idf(docFreq=1278, maxDocs=42740)
                0.0625 = fieldNorm(doc=889)
          0.24686179 = weight(abstract_txt:errors in 889) [ClassicSimilarity], result of:
            0.24686179 = score(doc=889,freq=4.0), product of:
              0.30229554 = queryWeight, product of:
                3.211784 = boost
                6.532992 = idf(docFreq=168, maxDocs=42740)
                0.014406992 = queryNorm
              0.816624 = fieldWeight in 889, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.532992 = idf(docFreq=168, maxDocs=42740)
                0.0625 = fieldNorm(doc=889)
          0.06301886 = weight(abstract_txt:were in 889) [ClassicSimilarity], result of:
            0.06301886 = score(doc=889,freq=2.0), product of:
              0.19310619 = queryWeight, product of:
                3.6303084 = boost
                3.69215 = idf(docFreq=2894, maxDocs=42740)
                0.014406992 = queryNorm
              0.32634303 = fieldWeight in 889, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.69215 = idf(docFreq=2894, maxDocs=42740)
                0.0625 = fieldNorm(doc=889)
          0.5634058 = weight(abstract_txt:misspellings in 889) [ClassicSimilarity], result of:
            0.5634058 = score(doc=889,freq=3.0), product of:
              0.57675266 = queryWeight, product of:
                4.436344 = boost
                9.023833 = idf(docFreq=13, maxDocs=42740)
                0.014406992 = queryNorm
              0.9768586 = fieldWeight in 889, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.023833 = idf(docFreq=13, maxDocs=42740)
                0.0625 = fieldNorm(doc=889)
        0.28 = coord(7/25)
    
  2. Randall, N.B.: Spelling errors in the database : shadow or substance? (1999) 0.25
    0.25334355 = sum of:
      0.25334355 = product of:
        1.2667177 = sum of:
          0.05582716 = weight(abstract_txt:correct in 1232) [ClassicSimilarity], result of:
            0.05582716 = score(doc=1232,freq=1.0), product of:
              0.10643041 = queryWeight, product of:
                1.1002786 = boost
                6.7141304 = idf(docFreq=140, maxDocs=42740)
                0.014406992 = queryNorm
              0.52454144 = fieldWeight in 1232, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7141304 = idf(docFreq=140, maxDocs=42740)
                0.078125 = fieldNorm(doc=1232)
          0.23081218 = weight(abstract_txt:spelling in 1232) [ClassicSimilarity], result of:
            0.23081218 = score(doc=1232,freq=2.0), product of:
              0.27416238 = queryWeight, product of:
                2.4974036 = boost
                7.619839 = idf(docFreq=56, maxDocs=42740)
                0.014406992 = queryNorm
              0.8418813 = fieldWeight in 1232, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.619839 = idf(docFreq=56, maxDocs=42740)
                0.078125 = fieldNorm(doc=1232)
          0.30857724 = weight(abstract_txt:errors in 1232) [ClassicSimilarity], result of:
            0.30857724 = score(doc=1232,freq=4.0), product of:
              0.30229554 = queryWeight, product of:
                3.211784 = boost
                6.532992 = idf(docFreq=168, maxDocs=42740)
                0.014406992 = queryNorm
              1.02078 = fieldWeight in 1232, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.532992 = idf(docFreq=168, maxDocs=42740)
                0.078125 = fieldNorm(doc=1232)
          0.09647753 = weight(abstract_txt:were in 1232) [ClassicSimilarity], result of:
            0.09647753 = score(doc=1232,freq=3.0), product of:
              0.19310619 = queryWeight, product of:
                3.6303084 = boost
                3.69215 = idf(docFreq=2894, maxDocs=42740)
                0.014406992 = queryNorm
              0.4996087 = fieldWeight in 1232, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.69215 = idf(docFreq=2894, maxDocs=42740)
                0.078125 = fieldNorm(doc=1232)
          0.57502365 = weight(abstract_txt:misspellings in 1232) [ClassicSimilarity], result of:
            0.57502365 = score(doc=1232,freq=2.0), product of:
              0.57675266 = queryWeight, product of:
                4.436344 = boost
                9.023833 = idf(docFreq=13, maxDocs=42740)
                0.014406992 = queryNorm
              0.9970021 = fieldWeight in 1232, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.023833 = idf(docFreq=13, maxDocs=42740)
                0.078125 = fieldNorm(doc=1232)
        0.2 = coord(5/25)
    
  3. Tüür-Fröhlich, T.: The non-trivial effects of trivial errors in scientific communication and evaluation (2016) 0.25
    0.24509043 = sum of:
      0.24509043 = product of:
        0.87532294 = sum of:
          0.04149068 = weight(abstract_txt:indexed in 5138) [ClassicSimilarity], result of:
            0.04149068 = score(doc=5138,freq=2.0), product of:
              0.08791448 = queryWeight, product of:
                6.102209 = idf(docFreq=259, maxDocs=42740)
                0.014406992 = queryNorm
              0.47194365 = fieldWeight in 5138, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.102209 = idf(docFreq=259, maxDocs=42740)
                0.0546875 = fieldNorm(doc=5138)
          0.039079014 = weight(abstract_txt:correct in 5138) [ClassicSimilarity], result of:
            0.039079014 = score(doc=5138,freq=1.0), product of:
              0.10643041 = queryWeight, product of:
                1.1002786 = boost
                6.7141304 = idf(docFreq=140, maxDocs=42740)
                0.014406992 = queryNorm
              0.367179 = fieldWeight in 5138, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7141304 = idf(docFreq=140, maxDocs=42740)
                0.0546875 = fieldNorm(doc=5138)
          0.10950344 = weight(abstract_txt:mistake in 5138) [ClassicSimilarity], result of:
            0.10950344 = score(doc=5138,freq=1.0), product of:
              0.21153808 = queryWeight, product of:
                1.5511867 = boost
                9.465666 = idf(docFreq=8, maxDocs=42740)
                0.014406992 = queryNorm
              0.5176536 = fieldWeight in 5138, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.465666 = idf(docFreq=8, maxDocs=42740)
                0.0546875 = fieldNorm(doc=5138)
          0.04734688 = weight(abstract_txt:found in 5138) [ClassicSimilarity], result of:
            0.04734688 = score(doc=5138,freq=1.0), product of:
              0.19200723 = queryWeight, product of:
                2.955688 = boost
                4.5090566 = idf(docFreq=1278, maxDocs=42740)
                0.014406992 = queryNorm
              0.24658903 = fieldWeight in 5138, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5090566 = idf(docFreq=1278, maxDocs=42740)
                0.0546875 = fieldNorm(doc=5138)
          0.2857465 = weight(abstract_txt:errors in 5138) [ClassicSimilarity], result of:
            0.2857465 = score(doc=5138,freq=7.0), product of:
              0.30229554 = queryWeight, product of:
                3.211784 = boost
                6.532992 = idf(docFreq=168, maxDocs=42740)
                0.014406992 = queryNorm
              0.9452555 = fieldWeight in 5138, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.532992 = idf(docFreq=168, maxDocs=42740)
                0.0546875 = fieldNorm(doc=5138)
          0.06753427 = weight(abstract_txt:were in 5138) [ClassicSimilarity], result of:
            0.06753427 = score(doc=5138,freq=3.0), product of:
              0.19310619 = queryWeight, product of:
                3.6303084 = boost
                3.69215 = idf(docFreq=2894, maxDocs=42740)
                0.014406992 = queryNorm
              0.34972608 = fieldWeight in 5138, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.69215 = idf(docFreq=2894, maxDocs=42740)
                0.0546875 = fieldNorm(doc=5138)
          0.28462216 = weight(abstract_txt:misspellings in 5138) [ClassicSimilarity], result of:
            0.28462216 = score(doc=5138,freq=1.0), product of:
              0.57675266 = queryWeight, product of:
                4.436344 = boost
                9.023833 = idf(docFreq=13, maxDocs=42740)
                0.014406992 = queryNorm
              0.49349087 = fieldWeight in 5138, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.023833 = idf(docFreq=13, maxDocs=42740)
                0.0546875 = fieldNorm(doc=5138)
        0.28 = coord(7/25)
    
  4. Ballard, T.: Spelling and typographical errors in library databases (1992) 0.21
    0.2137329 = sum of:
      0.2137329 = product of:
        1.3358307 = sum of:
          0.3025355 = weight(abstract_txt:adelphi in 887) [ClassicSimilarity], result of:
            0.3025355 = score(doc=887,freq=1.0), product of:
              0.20685513 = queryWeight, product of:
                1.5339209 = boost
                9.360306 = idf(docFreq=9, maxDocs=42740)
                0.014406992 = queryNorm
              1.4625478 = fieldWeight in 887, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.360306 = idf(docFreq=9, maxDocs=42740)
                0.15625 = fieldNorm(doc=887)
          0.46162435 = weight(abstract_txt:spelling in 887) [ClassicSimilarity], result of:
            0.46162435 = score(doc=887,freq=2.0), product of:
              0.27416238 = queryWeight, product of:
                2.4974036 = boost
                7.619839 = idf(docFreq=56, maxDocs=42740)
                0.014406992 = queryNorm
              1.6837626 = fieldWeight in 887, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.619839 = idf(docFreq=56, maxDocs=42740)
                0.15625 = fieldNorm(doc=887)
          0.1352768 = weight(abstract_txt:found in 887) [ClassicSimilarity], result of:
            0.1352768 = score(doc=887,freq=1.0), product of:
              0.19200723 = queryWeight, product of:
                2.955688 = boost
                4.5090566 = idf(docFreq=1278, maxDocs=42740)
                0.014406992 = queryNorm
              0.7045401 = fieldWeight in 887, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5090566 = idf(docFreq=1278, maxDocs=42740)
                0.15625 = fieldNorm(doc=887)
          0.43639407 = weight(abstract_txt:errors in 887) [ClassicSimilarity], result of:
            0.43639407 = score(doc=887,freq=2.0), product of:
              0.30229554 = queryWeight, product of:
                3.211784 = boost
                6.532992 = idf(docFreq=168, maxDocs=42740)
                0.014406992 = queryNorm
              1.4436008 = fieldWeight in 887, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.532992 = idf(docFreq=168, maxDocs=42740)
                0.15625 = fieldNorm(doc=887)
        0.16 = coord(4/25)
    
  5. Berget, G.; Sandnes, F.E.: Do autocomplete functions reduce the impact of dyslexia on information-searching behavior? : the case of Google (2016) 0.16
    0.16253212 = sum of:
      0.16253212 = product of:
        1.0158257 = sum of:
          0.23081218 = weight(abstract_txt:spelling in 5113) [ClassicSimilarity], result of:
            0.23081218 = score(doc=5113,freq=2.0), product of:
              0.27416238 = queryWeight, product of:
                2.4974036 = boost
                7.619839 = idf(docFreq=56, maxDocs=42740)
                0.014406992 = queryNorm
              0.8418813 = fieldWeight in 5113, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.619839 = idf(docFreq=56, maxDocs=42740)
                0.078125 = fieldNorm(doc=5113)
          0.15428862 = weight(abstract_txt:errors in 5113) [ClassicSimilarity], result of:
            0.15428862 = score(doc=5113,freq=1.0), product of:
              0.30229554 = queryWeight, product of:
                3.211784 = boost
                6.532992 = idf(docFreq=168, maxDocs=42740)
                0.014406992 = queryNorm
              0.51039 = fieldWeight in 5113, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.532992 = idf(docFreq=168, maxDocs=42740)
                0.078125 = fieldNorm(doc=5113)
          0.05570133 = weight(abstract_txt:were in 5113) [ClassicSimilarity], result of:
            0.05570133 = score(doc=5113,freq=1.0), product of:
              0.19310619 = queryWeight, product of:
                3.6303084 = boost
                3.69215 = idf(docFreq=2894, maxDocs=42740)
                0.014406992 = queryNorm
              0.28844923 = fieldWeight in 5113, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.69215 = idf(docFreq=2894, maxDocs=42740)
                0.078125 = fieldNorm(doc=5113)
          0.57502365 = weight(abstract_txt:misspellings in 5113) [ClassicSimilarity], result of:
            0.57502365 = score(doc=5113,freq=2.0), product of:
              0.57675266 = queryWeight, product of:
                4.436344 = boost
                9.023833 = idf(docFreq=13, maxDocs=42740)
                0.014406992 = queryNorm
              0.9970021 = fieldWeight in 5113, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.023833 = idf(docFreq=13, maxDocs=42740)
                0.078125 = fieldNorm(doc=5113)
        0.16 = coord(4/25)
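
The content-match scores above combine per-term scores (queryWeight x fieldWeight) into a sum and then scale it by the coord factor, the share of query terms that matched. The sketch below recomputes the score of entry 4 (doc=887) from the values printed in its explain tree. It assumes the ClassicSimilarity structure shown there; `term_score` and `coord` are hypothetical helper names, and the result is only expected to agree with the listing up to float rounding.

```python
def term_score(boost, idf, query_norm, tf, field_norm):
    # queryWeight = boost * idf * queryNorm
    query_weight = boost * idf * query_norm
    # fieldWeight = tf * idf * fieldNorm
    field_weight = tf * idf * field_norm
    return query_weight * field_weight

def coord(matched_terms, query_terms):
    # Fraction of query terms found in the document.
    return matched_terms / query_terms

QUERY_NORM = 0.014406992   # queryNorm from the explain output
FIELD_NORM = 0.15625       # fieldNorm(doc=887)

# (boost, idf, tf) per matching term, copied from the doc=887 tree.
terms_887 = {
    "adelphi":  (1.5339209, 9.360306, 1.0),
    "spelling": (2.4974036, 7.619839, 1.4142135),
    "found":    (2.955688, 4.5090566, 1.0),
    "errors":   (3.211784, 6.532992, 1.4142135),
}

term_sum = sum(
    term_score(boost, idf, QUERY_NORM, tf, FIELD_NORM)
    for boost, idf, tf in terms_887.values()
)
final_score = term_sum * coord(4, 25)
print(round(term_sum, 4), round(final_score, 4))
```

The coord factor explains why entry 4 ranks below entry 1 even though its term sum is comparable: it matches only 4 of the 25 query terms, against 7 of 25 for entry 1.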