Document (#14792)

Author
Hosono, K.
Title
Information retrieval functions in digital libraries
Source
Pharmaceutical library bulletin [=Yakugaku Toshokan]. 41(1996) no.2, S.91-99
Year
1996
Abstract
Information retrieval functions in digital libraries have a different context from those which apply to searching commercial databases or OPACs. Different methods of browsing in this context are described, but the retrieval function should also include ordinary Boolean searching. Conversion of printed materials to electronic format using OCR can result in errors, which may cause problems for keyword searching. The n-gram method of approximate or fuzzy matching to reduce this problem is described
Footnote
[In japanisch]

Similar documents (content)

  1. Longshu, L.; Xia, Z.: On an aproximate fuzzy information retrieval agent (1998) 0.26
    0.25607982 = sum of:
      0.25607982 = product of:
        1.6004989 = sum of:
          0.22859117 = weight(abstract_txt:matching in 3294) [ClassicSimilarity], result of:
            0.22859117 = score(doc=3294,freq=2.0), product of:
              0.17104836 = queryWeight, product of:
                1.0840905 = boost
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.026088424 = queryNorm
              1.3364125 = fieldWeight in 3294, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.15625 = fieldNorm(doc=3294)
          0.5239695 = weight(abstract_txt:fuzzy in 3294) [ClassicSimilarity], result of:
            0.5239695 = score(doc=3294,freq=5.0), product of:
              0.21909708 = queryWeight, product of:
                1.2269429 = boost
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.026088424 = queryNorm
              2.3914945 = fieldWeight in 3294, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.15625 = fieldNorm(doc=3294)
          0.71783733 = weight(abstract_txt:approximate in 3294) [ClassicSimilarity], result of:
            0.71783733 = score(doc=3294,freq=4.0), product of:
              0.29112977 = queryWeight, product of:
                1.4143255 = boost
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.026088424 = queryNorm
              2.4656954 = fieldWeight in 3294, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.15625 = fieldNorm(doc=3294)
          0.13010103 = weight(abstract_txt:retrieval in 3294) [ClassicSimilarity], result of:
            0.13010103 = score(doc=3294,freq=2.0), product of:
              0.16942345 = queryWeight, product of:
                1.8687596 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.026088424 = queryNorm
              0.7679045 = fieldWeight in 3294, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.15625 = fieldNorm(doc=3294)
        0.16 = coord(4/25)
    
  2. Alexander, M.: Retrieving digital data with fuzzy matching (1996) 0.22
    0.21577066 = sum of:
      0.21577066 = product of:
        0.7706095 = sum of:
          0.025382696 = weight(abstract_txt:which in 6961) [ClassicSimilarity], result of:
            0.025382696 = score(doc=6961,freq=1.0), product of:
              0.07956567 = queryWeight, product of:
                1.0456442 = boost
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.026088424 = queryNorm
              0.31901568 = fieldWeight in 6961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.109375 = fieldNorm(doc=6961)
          0.12514032 = weight(abstract_txt:conversion in 6961) [ClassicSimilarity], result of:
            0.12514032 = score(doc=6961,freq=1.0), product of:
              0.1829316 = queryWeight, product of:
                1.1211157 = boost
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.026088424 = queryNorm
              0.68408257 = fieldWeight in 6961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.109375 = fieldNorm(doc=6961)
          0.12726788 = weight(abstract_txt:reduce in 6961) [ClassicSimilarity], result of:
            0.12726788 = score(doc=6961,freq=1.0), product of:
              0.18499917 = queryWeight, product of:
                1.1274335 = boost
                6.2897153 = idf(docFreq=222, maxDocs=44218)
                0.026088424 = queryNorm
              0.6879376 = fieldWeight in 6961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2897153 = idf(docFreq=222, maxDocs=44218)
                0.109375 = fieldNorm(doc=6961)
          0.14369081 = weight(abstract_txt:errors in 6961) [ClassicSimilarity], result of:
            0.14369081 = score(doc=6961,freq=1.0), product of:
              0.20059028 = queryWeight, product of:
                1.1739808 = boost
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.026088424 = queryNorm
              0.7163398 = fieldWeight in 6961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.109375 = fieldNorm(doc=6961)
          0.16402839 = weight(abstract_txt:fuzzy in 6961) [ClassicSimilarity], result of:
            0.16402839 = score(doc=6961,freq=1.0), product of:
              0.21909708 = queryWeight, product of:
                1.2269429 = boost
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.026088424 = queryNorm
              0.7486562 = fieldWeight in 6961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.109375 = fieldNorm(doc=6961)
          0.06439673 = weight(abstract_txt:retrieval in 6961) [ClassicSimilarity], result of:
            0.06439673 = score(doc=6961,freq=1.0), product of:
              0.16942345 = queryWeight, product of:
                1.8687596 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.026088424 = queryNorm
              0.38009337 = fieldWeight in 6961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.109375 = fieldNorm(doc=6961)
          0.120702595 = weight(abstract_txt:searching in 6961) [ClassicSimilarity], result of:
            0.120702595 = score(doc=6961,freq=1.0), product of:
              0.2575582 = queryWeight, product of:
                2.3041162 = boost
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.026088424 = queryNorm
              0.46864203 = fieldWeight in 6961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.109375 = fieldNorm(doc=6961)
        0.28 = coord(7/25)
    
  3. Ensor, P.: User characteristics of keyword searching in an OPAC (1992) 0.19
    0.18616483 = sum of:
      0.18616483 = product of:
        0.7756868 = sum of:
          0.04102463 = weight(abstract_txt:which in 2278) [ClassicSimilarity], result of:
            0.04102463 = score(doc=2278,freq=2.0), product of:
              0.07956567 = queryWeight, product of:
                1.0456442 = boost
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.026088424 = queryNorm
              0.5156072 = fieldWeight in 2278, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.125 = fieldNorm(doc=2278)
          0.11936749 = weight(abstract_txt:opacs in 2278) [ClassicSimilarity], result of:
            0.11936749 = score(doc=2278,freq=1.0), product of:
              0.16216357 = queryWeight, product of:
                1.0555595 = boost
                5.888745 = idf(docFreq=332, maxDocs=44218)
                0.026088424 = queryNorm
              0.7360931 = fieldWeight in 2278, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.888745 = idf(docFreq=332, maxDocs=44218)
                0.125 = fieldNorm(doc=2278)
          0.1819214 = weight(abstract_txt:keyword in 2278) [ClassicSimilarity], result of:
            0.1819214 = score(doc=2278,freq=2.0), product of:
              0.17045449 = queryWeight, product of:
                1.0822068 = boost
                6.037405 = idf(docFreq=286, maxDocs=44218)
                0.026088424 = queryNorm
              1.0672725 = fieldWeight in 2278, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.037405 = idf(docFreq=286, maxDocs=44218)
                0.125 = fieldNorm(doc=2278)
          0.1427214 = weight(abstract_txt:boolean in 2278) [ClassicSimilarity], result of:
            0.1427214 = score(doc=2278,freq=1.0), product of:
              0.18267901 = queryWeight, product of:
                1.1203414 = boost
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.026088424 = queryNorm
              0.7812687 = fieldWeight in 2278, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.125 = fieldNorm(doc=2278)
          0.09556696 = weight(abstract_txt:context in 2278) [ClassicSimilarity], result of:
            0.09556696 = score(doc=2278,freq=1.0), product of:
              0.17616154 = queryWeight, product of:
                1.5558819 = boost
                4.339969 = idf(docFreq=1566, maxDocs=44218)
                0.026088424 = queryNorm
              0.54249614 = fieldWeight in 2278, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.339969 = idf(docFreq=1566, maxDocs=44218)
                0.125 = fieldNorm(doc=2278)
          0.19508485 = weight(abstract_txt:searching in 2278) [ClassicSimilarity], result of:
            0.19508485 = score(doc=2278,freq=2.0), product of:
              0.2575582 = queryWeight, product of:
                2.3041162 = boost
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.026088424 = queryNorm
              0.7574399 = fieldWeight in 2278, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.125 = fieldNorm(doc=2278)
        0.24 = coord(6/25)
    
  4. Borgman, C.L.; Hirsh, S.G.; Walter, V.A.; Gallagher, A.L.: Childrens searching behavior on browsing and keyword online catalogs : the Science Library Catalog project (1995) 0.16
    0.16178332 = sum of:
      0.16178332 = product of:
        0.5055729 = sum of:
          0.044403378 = weight(abstract_txt:browsing in 2591) [ClassicSimilarity], result of:
            0.044403378 = score(doc=2591,freq=1.0), product of:
              0.14554185 = queryWeight, product of:
                5.57879 = idf(docFreq=453, maxDocs=44218)
                0.026088424 = queryNorm
              0.3050901 = fieldWeight in 2591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.57879 = idf(docFreq=453, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2591)
          0.09045336 = weight(abstract_txt:opacs in 2591) [ClassicSimilarity], result of:
            0.09045336 = score(doc=2591,freq=3.0), product of:
              0.16216357 = queryWeight, product of:
                1.0555595 = boost
                5.888745 = idf(docFreq=332, maxDocs=44218)
                0.026088424 = queryNorm
              0.5577909 = fieldWeight in 2591, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.888745 = idf(docFreq=332, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2591)
          0.13785498 = weight(abstract_txt:keyword in 2591) [ClassicSimilarity], result of:
            0.13785498 = score(doc=2591,freq=6.0), product of:
              0.17045449 = queryWeight, product of:
                1.0822068 = boost
                6.037405 = idf(docFreq=286, maxDocs=44218)
                0.026088424 = queryNorm
              0.8087495 = fieldWeight in 2591, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.037405 = idf(docFreq=286, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2591)
          0.06244061 = weight(abstract_txt:boolean in 2591) [ClassicSimilarity], result of:
            0.06244061 = score(doc=2591,freq=1.0), product of:
              0.18267901 = queryWeight, product of:
                1.1203414 = boost
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.026088424 = queryNorm
              0.34180507 = fieldWeight in 2591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2591)
          0.025189886 = weight(abstract_txt:different in 2591) [ClassicSimilarity], result of:
            0.025189886 = score(doc=2591,freq=1.0), product of:
              0.12566221 = queryWeight, product of:
                1.3140849 = boost
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.026088424 = queryNorm
              0.20045713 = fieldWeight in 2591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2591)
          0.02768275 = weight(abstract_txt:libraries in 2591) [ClassicSimilarity], result of:
            0.02768275 = score(doc=2591,freq=1.0), product of:
              0.13382176 = queryWeight, product of:
                1.3560772 = boost
                3.782635 = idf(docFreq=2735, maxDocs=44218)
                0.026088424 = queryNorm
              0.20686285 = fieldWeight in 2591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.782635 = idf(docFreq=2735, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2591)
          0.032198366 = weight(abstract_txt:retrieval in 2591) [ClassicSimilarity], result of:
            0.032198366 = score(doc=2591,freq=1.0), product of:
              0.16942345 = queryWeight, product of:
                1.8687596 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.026088424 = queryNorm
              0.19004668 = fieldWeight in 2591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2591)
          0.08534962 = weight(abstract_txt:searching in 2591) [ClassicSimilarity], result of:
            0.08534962 = score(doc=2591,freq=2.0), product of:
              0.2575582 = queryWeight, product of:
                2.3041162 = boost
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.026088424 = queryNorm
              0.33137995 = fieldWeight in 2591, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2591)
        0.32 = coord(8/25)
    
  5. Tenopir, C.: Common end user errors (1997) 0.16
    0.16161074 = sum of:
      0.16161074 = product of:
        0.6733781 = sum of:
          0.08445098 = weight(abstract_txt:commercial in 410) [ClassicSimilarity], result of:
            0.08445098 = score(doc=410,freq=1.0), product of:
              0.15597616 = queryWeight, product of:
                1.035226 = boost
                5.7753086 = idf(docFreq=372, maxDocs=44218)
                0.026088424 = queryNorm
              0.5414352 = fieldWeight in 410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7753086 = idf(docFreq=372, maxDocs=44218)
                0.09375 = fieldNorm(doc=410)
          0.021756595 = weight(abstract_txt:which in 410) [ClassicSimilarity], result of:
            0.021756595 = score(doc=410,freq=1.0), product of:
              0.07956567 = queryWeight, product of:
                1.0456442 = boost
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.026088424 = queryNorm
              0.273442 = fieldWeight in 410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.09375 = fieldNorm(doc=410)
          0.10704105 = weight(abstract_txt:boolean in 410) [ClassicSimilarity], result of:
            0.10704105 = score(doc=410,freq=1.0), product of:
              0.18267901 = queryWeight, product of:
                1.1203414 = boost
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.026088424 = queryNorm
              0.58595157 = fieldWeight in 410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.09375 = fieldNorm(doc=410)
          0.36949065 = weight(abstract_txt:errors in 410) [ClassicSimilarity], result of:
            0.36949065 = score(doc=410,freq=9.0), product of:
              0.20059028 = queryWeight, product of:
                1.1739808 = boost
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.026088424 = queryNorm
              1.8420167 = fieldWeight in 410, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.09375 = fieldNorm(doc=410)
          0.043182664 = weight(abstract_txt:different in 410) [ClassicSimilarity], result of:
            0.043182664 = score(doc=410,freq=1.0), product of:
              0.12566221 = queryWeight, product of:
                1.3140849 = boost
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.026088424 = queryNorm
              0.3436408 = fieldWeight in 410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.09375 = fieldNorm(doc=410)
          0.047456145 = weight(abstract_txt:libraries in 410) [ClassicSimilarity], result of:
            0.047456145 = score(doc=410,freq=1.0), product of:
              0.13382176 = queryWeight, product of:
                1.3560772 = boost
                3.782635 = idf(docFreq=2735, maxDocs=44218)
                0.026088424 = queryNorm
              0.35462204 = fieldWeight in 410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.782635 = idf(docFreq=2735, maxDocs=44218)
                0.09375 = fieldNorm(doc=410)
        0.24 = coord(6/25)