Document (#1500)

Author
Ballard, T.
Lifshin, A.
Title
Prediction of OPAC spelling errors through a keyword inventory
Source
Information technology and libraries. 11(1992), S.139-145
Year
1992
Abstract
In order to find and correct spelling errors in the online public access catalog at Adelphi University, a visual inspection was performed of the 117.000 keywords indexed in the system. More than 1.000 errors were found. Certain long but common words such as administration, education, and commercial were found to generate many different misspellings. Most of the records were derived from bibliographic utilities, so the findings can be generalized to other OPACs. The same misspellings were also found in substantial numbers in CD-ROM databases. Misspellings were analyzed by the machine-readable catalog (MARC) field in which they were found, part of speech, and type of mistake. Lists of commonly misspelled root words and specific mistakes are included
Theme
OPAC

Similar documents (author)

  1. Ballard, P.I.: Bound withs versus an online catalog : a practical solution (1992) 5.61
    5.608596 = sum of:
      5.608596 = weight(author_txt:ballard in 2968) [ClassicSimilarity], result of:
        5.608596 = fieldWeight in 2968, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.973753 = idf(docFreq=14, maxDocs=43556)
          0.625 = fieldNorm(doc=2968)
    
  2. Ballard, T.: OCLC's EPIC : report from the field (1991) 5.61
    5.608596 = sum of:
      5.608596 = weight(author_txt:ballard in 4856) [ClassicSimilarity], result of:
        5.608596 = fieldWeight in 4856, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.973753 = idf(docFreq=14, maxDocs=43556)
          0.625 = fieldNorm(doc=4856)
    
  3. Ballard, T.: Using FirstSearch in a bibliographic construction (1993) 5.61
    5.608596 = sum of:
      5.608596 = weight(author_txt:ballard in 7306) [ClassicSimilarity], result of:
        5.608596 = fieldWeight in 7306, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.973753 = idf(docFreq=14, maxDocs=43556)
          0.625 = fieldNorm(doc=7306)
    
  4. Ballard, T.: Comparative searching styles of patrons and staff (1994) 5.61
    5.608596 = sum of:
      5.608596 = weight(author_txt:ballard in 498) [ClassicSimilarity], result of:
        5.608596 = fieldWeight in 498, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.973753 = idf(docFreq=14, maxDocs=43556)
          0.625 = fieldNorm(doc=498)
    
  5. Ballard, T.: Library systems : transaction log fever; analyzing patron searches can reveal solutions to increase search success (1996) 5.61
    5.608596 = sum of:
      5.608596 = weight(author_txt:ballard in 5827) [ClassicSimilarity], result of:
        5.608596 = fieldWeight in 5827, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.973753 = idf(docFreq=14, maxDocs=43556)
          0.625 = fieldNorm(doc=5827)
    

Similar documents (content)

  1. Drabenstott, K.M.; Weller, M.S.: Handling spelling errors in online catalog searches (1996) 0.41
    0.41190767 = sum of:
      0.41190767 = product of:
        1.4710988 = sum of:
          0.18437734 = weight(abstract_txt:misspelled in 971) [ClassicSimilarity], result of:
            0.18437734 = score(doc=971,freq=2.0), product of:
              0.21723734 = queryWeight, product of:
                1.5697104 = boost
                9.602362 = idf(docFreq=7, maxDocs=43556)
                0.014412419 = queryNorm
              0.8487369 = fieldWeight in 971, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.602362 = idf(docFreq=7, maxDocs=43556)
                0.0625 = fieldNorm(doc=971)
          0.063915454 = weight(abstract_txt:words in 971) [ClassicSimilarity], result of:
            0.063915454 = score(doc=971,freq=2.0), product of:
              0.13506517 = queryWeight, product of:
                1.7504067 = boost
                5.353866 = idf(docFreq=559, maxDocs=43556)
                0.014412419 = queryNorm
              0.47321936 = fieldWeight in 971, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.353866 = idf(docFreq=559, maxDocs=43556)
                0.0625 = fieldNorm(doc=971)
          0.2935219 = weight(abstract_txt:spelling in 971) [ClassicSimilarity], result of:
            0.2935219 = score(doc=971,freq=5.0), product of:
              0.2749496 = queryWeight, product of:
                2.497433 = boost
                7.6387515 = idf(docFreq=56, maxDocs=43556)
                0.014412419 = queryNorm
              1.067548 = fieldWeight in 971, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.6387515 = idf(docFreq=56, maxDocs=43556)
                0.0625 = fieldNorm(doc=971)
          0.053382993 = weight(abstract_txt:found in 971) [ClassicSimilarity], result of:
            0.053382993 = score(doc=971,freq=1.0), product of:
              0.19014929 = queryWeight, product of:
                2.9371738 = boost
                4.4918804 = idf(docFreq=1325, maxDocs=43556)
                0.014412419 = queryNorm
              0.28074253 = fieldWeight in 971, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4918804 = idf(docFreq=1325, maxDocs=43556)
                0.0625 = fieldNorm(doc=971)
          0.2478212 = weight(abstract_txt:errors in 971) [ClassicSimilarity], result of:
            0.2478212 = score(doc=971,freq=4.0), product of:
              0.30286714 = queryWeight, product of:
                3.210251 = boost
                6.5460043 = idf(docFreq=169, maxDocs=43556)
                0.014412419 = queryNorm
              0.81825054 = fieldWeight in 971, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.5460043 = idf(docFreq=169, maxDocs=43556)
                0.0625 = fieldNorm(doc=971)
          0.062308196 = weight(abstract_txt:were in 971) [ClassicSimilarity], result of:
            0.062308196 = score(doc=971,freq=2.0), product of:
              0.19151817 = queryWeight, product of:
                3.6102138 = boost
                3.6807828 = idf(docFreq=2983, maxDocs=43556)
                0.014412419 = queryNorm
              0.3253383 = fieldWeight in 971, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6807828 = idf(docFreq=2983, maxDocs=43556)
                0.0625 = fieldNorm(doc=971)
          0.5657717 = weight(abstract_txt:misspellings in 971) [ClassicSimilarity], result of:
            0.5657717 = score(doc=971,freq=3.0), product of:
              0.5779633 = queryWeight, product of:
                4.4346876 = boost
                9.042746 = idf(docFreq=13, maxDocs=43556)
                0.014412419 = queryNorm
              0.9789059 = fieldWeight in 971, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.042746 = idf(docFreq=13, maxDocs=43556)
                0.0625 = fieldNorm(doc=971)
        0.28 = coord(7/25)
    
  2. Randall, N.B.: Spelling errors in the database : shadow or substance? (1999) 0.25
    0.25406224 = sum of:
      0.25406224 = product of:
        1.2703111 = sum of:
          0.05565731 = weight(abstract_txt:correct in 1229) [ClassicSimilarity], result of:
            0.05565731 = score(doc=1229,freq=1.0), product of:
              0.10614044 = queryWeight, product of:
                1.0972176 = boost
                6.7119894 = idf(docFreq=143, maxDocs=43556)
                0.014412419 = queryNorm
              0.5243742 = fieldWeight in 1229, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7119894 = idf(docFreq=143, maxDocs=43556)
                0.078125 = fieldNorm(doc=1229)
          0.23204944 = weight(abstract_txt:spelling in 1229) [ClassicSimilarity], result of:
            0.23204944 = score(doc=1229,freq=2.0), product of:
              0.2749496 = queryWeight, product of:
                2.497433 = boost
                7.6387515 = idf(docFreq=56, maxDocs=43556)
                0.014412419 = queryNorm
              0.8439708 = fieldWeight in 1229, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.6387515 = idf(docFreq=56, maxDocs=43556)
                0.078125 = fieldNorm(doc=1229)
          0.3097765 = weight(abstract_txt:errors in 1229) [ClassicSimilarity], result of:
            0.3097765 = score(doc=1229,freq=4.0), product of:
              0.30286714 = queryWeight, product of:
                3.210251 = boost
                6.5460043 = idf(docFreq=169, maxDocs=43556)
                0.014412419 = queryNorm
              1.0228132 = fieldWeight in 1229, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.5460043 = idf(docFreq=169, maxDocs=43556)
                0.078125 = fieldNorm(doc=1229)
          0.09538956 = weight(abstract_txt:were in 1229) [ClassicSimilarity], result of:
            0.09538956 = score(doc=1229,freq=3.0), product of:
              0.19151817 = queryWeight, product of:
                3.6102138 = boost
                3.6807828 = idf(docFreq=2983, maxDocs=43556)
                0.014412419 = queryNorm
              0.49807054 = fieldWeight in 1229, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6807828 = idf(docFreq=2983, maxDocs=43556)
                0.078125 = fieldNorm(doc=1229)
          0.5774383 = weight(abstract_txt:misspellings in 1229) [ClassicSimilarity], result of:
            0.5774383 = score(doc=1229,freq=2.0), product of:
              0.5779633 = queryWeight, product of:
                4.4346876 = boost
                9.042746 = idf(docFreq=13, maxDocs=43556)
                0.014412419 = queryNorm
              0.9990916 = fieldWeight in 1229, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.042746 = idf(docFreq=13, maxDocs=43556)
                0.078125 = fieldNorm(doc=1229)
        0.2 = coord(5/25)
    
  3. Tüür-Fröhlich, T.: ¬The non-trivial effects of trivial errors in scientific communication and evaluation (2016) 0.25
    0.24549282 = sum of:
      0.24549282 = product of:
        0.87676007 = sum of:
          0.041711614 = weight(abstract_txt:indexed in 135) [ClassicSimilarity], result of:
            0.041711614 = score(doc=135,freq=2.0), product of:
              0.088164836 = queryWeight, product of:
                6.1172824 = idf(docFreq=260, maxDocs=43556)
                0.014412419 = queryNorm
              0.47310942 = fieldWeight in 135, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1172824 = idf(docFreq=260, maxDocs=43556)
                0.0546875 = fieldNorm(doc=135)
          0.038960114 = weight(abstract_txt:correct in 135) [ClassicSimilarity], result of:
            0.038960114 = score(doc=135,freq=1.0), product of:
              0.10614044 = queryWeight, product of:
                1.0972176 = boost
                6.7119894 = idf(docFreq=143, maxDocs=43556)
                0.014412419 = queryNorm
              0.3670619 = fieldWeight in 135, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7119894 = idf(docFreq=143, maxDocs=43556)
                0.0546875 = fieldNorm(doc=135)
          0.10993107 = weight(abstract_txt:mistake in 135) [ClassicSimilarity], result of:
            0.10993107 = score(doc=135,freq=1.0), product of:
              0.2119407 = queryWeight, product of:
                1.5504562 = boost
                9.484578 = idf(docFreq=8, maxDocs=43556)
                0.014412419 = queryNorm
              0.51868784 = fieldWeight in 135, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.484578 = idf(docFreq=8, maxDocs=43556)
                0.0546875 = fieldNorm(doc=135)
          0.04671012 = weight(abstract_txt:found in 135) [ClassicSimilarity], result of:
            0.04671012 = score(doc=135,freq=1.0), product of:
              0.19014929 = queryWeight, product of:
                2.9371738 = boost
                4.4918804 = idf(docFreq=1325, maxDocs=43556)
                0.014412419 = queryNorm
              0.24564971 = fieldWeight in 135, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4918804 = idf(docFreq=1325, maxDocs=43556)
                0.0546875 = fieldNorm(doc=135)
          0.28685707 = weight(abstract_txt:errors in 135) [ClassicSimilarity], result of:
            0.28685707 = score(doc=135,freq=7.0), product of:
              0.30286714 = queryWeight, product of:
                3.210251 = boost
                6.5460043 = idf(docFreq=169, maxDocs=43556)
                0.014412419 = queryNorm
              0.94713825 = fieldWeight in 135, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.5460043 = idf(docFreq=169, maxDocs=43556)
                0.0546875 = fieldNorm(doc=135)
          0.06677269 = weight(abstract_txt:were in 135) [ClassicSimilarity], result of:
            0.06677269 = score(doc=135,freq=3.0), product of:
              0.19151817 = queryWeight, product of:
                3.6102138 = boost
                3.6807828 = idf(docFreq=2983, maxDocs=43556)
                0.014412419 = queryNorm
              0.34864938 = fieldWeight in 135, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6807828 = idf(docFreq=2983, maxDocs=43556)
                0.0546875 = fieldNorm(doc=135)
          0.28581738 = weight(abstract_txt:misspellings in 135) [ClassicSimilarity], result of:
            0.28581738 = score(doc=135,freq=1.0), product of:
              0.5779633 = queryWeight, product of:
                4.4346876 = boost
                9.042746 = idf(docFreq=13, maxDocs=43556)
                0.014412419 = queryNorm
              0.49452513 = fieldWeight in 135, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.042746 = idf(docFreq=13, maxDocs=43556)
                0.0546875 = fieldNorm(doc=135)
        0.28 = coord(7/25)
    
  4. Ballard, T.: Spelling and typographical errors in library databases (1992) 0.21
    0.2143014 = sum of:
      0.2143014 = product of:
        1.3393838 = sum of:
          0.3037374 = weight(abstract_txt:adelphi in 969) [ClassicSimilarity], result of:
            0.3037374 = score(doc=969,freq=1.0), product of:
              0.20725815 = queryWeight, product of:
                1.5332328 = boost
                9.379218 = idf(docFreq=9, maxDocs=43556)
                0.014412419 = queryNorm
              1.4655029 = fieldWeight in 969, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.379218 = idf(docFreq=9, maxDocs=43556)
                0.15625 = fieldNorm(doc=969)
          0.46409887 = weight(abstract_txt:spelling in 969) [ClassicSimilarity], result of:
            0.46409887 = score(doc=969,freq=2.0), product of:
              0.2749496 = queryWeight, product of:
                2.497433 = boost
                7.6387515 = idf(docFreq=56, maxDocs=43556)
                0.014412419 = queryNorm
              1.6879416 = fieldWeight in 969, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.6387515 = idf(docFreq=56, maxDocs=43556)
                0.15625 = fieldNorm(doc=969)
          0.13345748 = weight(abstract_txt:found in 969) [ClassicSimilarity], result of:
            0.13345748 = score(doc=969,freq=1.0), product of:
              0.19014929 = queryWeight, product of:
                2.9371738 = boost
                4.4918804 = idf(docFreq=1325, maxDocs=43556)
                0.014412419 = queryNorm
              0.7018563 = fieldWeight in 969, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4918804 = idf(docFreq=1325, maxDocs=43556)
                0.15625 = fieldNorm(doc=969)
          0.43809012 = weight(abstract_txt:errors in 969) [ClassicSimilarity], result of:
            0.43809012 = score(doc=969,freq=2.0), product of:
              0.30286714 = queryWeight, product of:
                3.210251 = boost
                6.5460043 = idf(docFreq=169, maxDocs=43556)
                0.014412419 = queryNorm
              1.4464762 = fieldWeight in 969, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5460043 = idf(docFreq=169, maxDocs=43556)
                0.15625 = fieldNorm(doc=969)
        0.16 = coord(4/25)
    
  5. Berget, G.; Sandnes, F.E.: Do autocomplete functions reduce the impact of dyslexia on information-searching behavior? : the case of Google (2016) 0.16
    0.16311188 = sum of:
      0.16311188 = product of:
        1.0194492 = sum of:
          0.23204944 = weight(abstract_txt:spelling in 110) [ClassicSimilarity], result of:
            0.23204944 = score(doc=110,freq=2.0), product of:
              0.2749496 = queryWeight, product of:
                2.497433 = boost
                7.6387515 = idf(docFreq=56, maxDocs=43556)
                0.014412419 = queryNorm
              0.8439708 = fieldWeight in 110, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.6387515 = idf(docFreq=56, maxDocs=43556)
                0.078125 = fieldNorm(doc=110)
          0.15488826 = weight(abstract_txt:errors in 110) [ClassicSimilarity], result of:
            0.15488826 = score(doc=110,freq=1.0), product of:
              0.30286714 = queryWeight, product of:
                3.210251 = boost
                6.5460043 = idf(docFreq=169, maxDocs=43556)
                0.014412419 = queryNorm
              0.5114066 = fieldWeight in 110, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5460043 = idf(docFreq=169, maxDocs=43556)
                0.078125 = fieldNorm(doc=110)
          0.055073187 = weight(abstract_txt:were in 110) [ClassicSimilarity], result of:
            0.055073187 = score(doc=110,freq=1.0), product of:
              0.19151817 = queryWeight, product of:
                3.6102138 = boost
                3.6807828 = idf(docFreq=2983, maxDocs=43556)
                0.014412419 = queryNorm
              0.28756115 = fieldWeight in 110, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6807828 = idf(docFreq=2983, maxDocs=43556)
                0.078125 = fieldNorm(doc=110)
          0.5774383 = weight(abstract_txt:misspellings in 110) [ClassicSimilarity], result of:
            0.5774383 = score(doc=110,freq=2.0), product of:
              0.5779633 = queryWeight, product of:
                4.4346876 = boost
                9.042746 = idf(docFreq=13, maxDocs=43556)
                0.014412419 = queryNorm
              0.9990916 = fieldWeight in 110, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.042746 = idf(docFreq=13, maxDocs=43556)
                0.078125 = fieldNorm(doc=110)
        0.16 = coord(4/25)