Document (#14792)

Author
Hosono, K.
Title
Information retrieval functions in digital libraries
Source
Pharmaceutical library bulletin [=Yakugaku Toshokan]. 41(1996) no.2, S.91-99
Year
1996
Abstract
Information retrieval functions in digital libraries have a different context from those which apply to searching commercial databases or OPACs. Different methods of browsing in this context are described, but the retrieval function should also include ordinary Boolean searching. Conversion of printed materials to electronic format using OCR can result in errors, which may cause problems for keyword searching. The n-gram method of approximate or fuzzy matching to reduce this problem is described
Footnote
[In japanisch]

Similar documents (content)

  1. Longshu, L.; Xia, Z.: On an aproximate fuzzy information retrieval agent (1998) 0.25
    0.25397223 = sum of:
      0.25397223 = product of:
        1.5873264 = sum of:
          0.22935596 = weight(abstract_txt:matching in 5295) [ClassicSimilarity], result of:
            0.22935596 = score(doc=5295,freq=2.0), product of:
              0.17133278 = queryWeight, product of:
                1.0832253 = boost
                6.058074 = idf(docFreq=274, maxDocs=43254)
                0.02610881 = queryNorm
              1.3386579 = fieldWeight in 5295, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.058074 = idf(docFreq=274, maxDocs=43254)
                0.15625 = fieldNorm(doc=5295)
          0.51804453 = weight(abstract_txt:fuzzy in 5295) [ClassicSimilarity], result of:
            0.51804453 = score(doc=5295,freq=5.0), product of:
              0.21731938 = queryWeight, product of:
                1.2199662 = boost
                6.822815 = idf(docFreq=127, maxDocs=43254)
                0.02610881 = queryNorm
              2.3837934 = fieldWeight in 5295, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.822815 = idf(docFreq=127, maxDocs=43254)
                0.15625 = fieldNorm(doc=5295)
          0.7106318 = weight(abstract_txt:approximate in 5295) [ClassicSimilarity], result of:
            0.7106318 = score(doc=5295,freq=4.0), product of:
              0.28901488 = queryWeight, product of:
                1.4068851 = boost
                7.8681827 = idf(docFreq=44, maxDocs=43254)
                0.02610881 = queryNorm
              2.458807 = fieldWeight in 5295, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.8681827 = idf(docFreq=44, maxDocs=43254)
                0.15625 = fieldNorm(doc=5295)
          0.12929401 = weight(abstract_txt:retrieval in 5295) [ClassicSimilarity], result of:
            0.12929401 = score(doc=5295,freq=2.0), product of:
              0.16862674 = queryWeight, product of:
                1.8613259 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.02610881 = queryNorm
              0.76674676 = fieldWeight in 5295, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.15625 = fieldNorm(doc=5295)
        0.16 = coord(4/25)
    
  2. Alexander, M.: Retrieving digital data with fuzzy matching (1996) 0.21
    0.21475683 = sum of:
      0.21475683 = product of:
        0.7669887 = sum of:
          0.02562164 = weight(abstract_txt:which in 31) [ClassicSimilarity], result of:
            0.02562164 = score(doc=31,freq=1.0), product of:
              0.080018975 = queryWeight, product of:
                1.0469117 = boost
                2.9274929 = idf(docFreq=6293, maxDocs=43254)
                0.02610881 = queryNorm
              0.32019454 = fieldWeight in 31, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9274929 = idf(docFreq=6293, maxDocs=43254)
                0.109375 = fieldNorm(doc=31)
          0.12439153 = weight(abstract_txt:conversion in 31) [ClassicSimilarity], result of:
            0.12439153 = score(doc=31,freq=1.0), product of:
              0.18209818 = queryWeight, product of:
                1.1167382 = boost
                6.245499 = idf(docFreq=227, maxDocs=43254)
                0.02610881 = queryNorm
              0.6831015 = fieldWeight in 31, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.245499 = idf(docFreq=227, maxDocs=43254)
                0.109375 = fieldNorm(doc=31)
          0.12793246 = weight(abstract_txt:reduce in 31) [ClassicSimilarity], result of:
            0.12793246 = score(doc=31,freq=1.0), product of:
              0.18553773 = queryWeight, product of:
                1.1272355 = boost
                6.304207 = idf(docFreq=214, maxDocs=43254)
                0.02610881 = queryNorm
              0.6895226 = fieldWeight in 31, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.304207 = idf(docFreq=214, maxDocs=43254)
                0.109375 = fieldNorm(doc=31)
          0.14315538 = weight(abstract_txt:errors in 31) [ClassicSimilarity], result of:
            0.14315538 = score(doc=31,freq=1.0), product of:
              0.19997859 = queryWeight, product of:
                1.1702814 = boost
                6.544946 = idf(docFreq=168, maxDocs=43254)
                0.02610881 = queryNorm
              0.7158535 = fieldWeight in 31, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.544946 = idf(docFreq=168, maxDocs=43254)
                0.109375 = fieldNorm(doc=31)
          0.16217358 = weight(abstract_txt:fuzzy in 31) [ClassicSimilarity], result of:
            0.16217358 = score(doc=31,freq=1.0), product of:
              0.21731938 = queryWeight, product of:
                1.2199662 = boost
                6.822815 = idf(docFreq=127, maxDocs=43254)
                0.02610881 = queryNorm
              0.7462454 = fieldWeight in 31, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.822815 = idf(docFreq=127, maxDocs=43254)
                0.109375 = fieldNorm(doc=31)
          0.06399727 = weight(abstract_txt:retrieval in 31) [ClassicSimilarity], result of:
            0.06399727 = score(doc=31,freq=1.0), product of:
              0.16862674 = queryWeight, product of:
                1.8613259 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.02610881 = queryNorm
              0.3795203 = fieldWeight in 31, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.109375 = fieldNorm(doc=31)
          0.11971682 = weight(abstract_txt:searching in 31) [ClassicSimilarity], result of:
            0.11971682 = score(doc=31,freq=1.0), product of:
              0.25600922 = queryWeight, product of:
                2.293438 = boost
                4.275447 = idf(docFreq=1634, maxDocs=43254)
                0.02610881 = queryNorm
              0.467627 = fieldWeight in 31, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.275447 = idf(docFreq=1634, maxDocs=43254)
                0.109375 = fieldNorm(doc=31)
        0.28 = coord(7/25)
    
  3. Ensor, P.: User characteristics of keyword searching in an OPAC (1992) 0.19
    0.18547101 = sum of:
      0.18547101 = product of:
        0.7727959 = sum of:
          0.04141082 = weight(abstract_txt:which in 2278) [ClassicSimilarity], result of:
            0.04141082 = score(doc=2278,freq=2.0), product of:
              0.080018975 = queryWeight, product of:
                1.0469117 = boost
                2.9274929 = idf(docFreq=6293, maxDocs=43254)
                0.02610881 = queryNorm
              0.5175125 = fieldWeight in 2278, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.9274929 = idf(docFreq=6293, maxDocs=43254)
                0.125 = fieldNorm(doc=2278)
          0.118195415 = weight(abstract_txt:opacs in 2278) [ClassicSimilarity], result of:
            0.118195415 = score(doc=2278,freq=1.0), product of:
              0.16100925 = queryWeight, product of:
                1.0500839 = boost
                5.8727264 = idf(docFreq=330, maxDocs=43254)
                0.02610881 = queryNorm
              0.7340908 = fieldWeight in 2278, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8727264 = idf(docFreq=330, maxDocs=43254)
                0.125 = fieldNorm(doc=2278)
          0.1818524 = weight(abstract_txt:keyword in 2278) [ClassicSimilarity], result of:
            0.1818524 = score(doc=2278,freq=2.0), product of:
              0.17031509 = queryWeight, product of:
                1.0800034 = boost
                6.0400553 = idf(docFreq=279, maxDocs=43254)
                0.02610881 = queryNorm
              1.067741 = fieldWeight in 2278, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.0400553 = idf(docFreq=279, maxDocs=43254)
                0.125 = fieldNorm(doc=2278)
          0.14127098 = weight(abstract_txt:boolean in 2278) [ClassicSimilarity], result of:
            0.14127098 = score(doc=2278,freq=1.0), product of:
              0.18133672 = queryWeight, product of:
                1.1144009 = boost
                6.232427 = idf(docFreq=230, maxDocs=43254)
                0.02610881 = queryNorm
              0.7790534 = fieldWeight in 2278, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.232427 = idf(docFreq=230, maxDocs=43254)
                0.125 = fieldNorm(doc=2278)
          0.09657466 = weight(abstract_txt:context in 2278) [ClassicSimilarity], result of:
            0.09657466 = score(doc=2278,freq=1.0), product of:
              0.17729747 = queryWeight, product of:
                1.5583494 = boost
                4.3576326 = idf(docFreq=1505, maxDocs=43254)
                0.02610881 = queryNorm
              0.5447041 = fieldWeight in 2278, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3576326 = idf(docFreq=1505, maxDocs=43254)
                0.125 = fieldNorm(doc=2278)
          0.19349161 = weight(abstract_txt:searching in 2278) [ClassicSimilarity], result of:
            0.19349161 = score(doc=2278,freq=2.0), product of:
              0.25600922 = queryWeight, product of:
                2.293438 = boost
                4.275447 = idf(docFreq=1634, maxDocs=43254)
                0.02610881 = queryNorm
              0.75579935 = fieldWeight in 2278, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.275447 = idf(docFreq=1634, maxDocs=43254)
                0.125 = fieldNorm(doc=2278)
        0.24 = coord(6/25)
    
  4. Tenopir, C.: Common end user errors (1997) 0.16
    0.16101073 = sum of:
      0.16101073 = product of:
        0.67087805 = sum of:
          0.08393416 = weight(abstract_txt:commercial in 2411) [ClassicSimilarity], result of:
            0.08393416 = score(doc=2411,freq=1.0), product of:
              0.15525135 = queryWeight, product of:
                1.0311368 = boost
                5.7667623 = idf(docFreq=367, maxDocs=43254)
                0.02610881 = queryNorm
              0.540634 = fieldWeight in 2411, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7667623 = idf(docFreq=367, maxDocs=43254)
                0.09375 = fieldNorm(doc=2411)
          0.021961404 = weight(abstract_txt:which in 2411) [ClassicSimilarity], result of:
            0.021961404 = score(doc=2411,freq=1.0), product of:
              0.080018975 = queryWeight, product of:
                1.0469117 = boost
                2.9274929 = idf(docFreq=6293, maxDocs=43254)
                0.02610881 = queryNorm
              0.27445245 = fieldWeight in 2411, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9274929 = idf(docFreq=6293, maxDocs=43254)
                0.09375 = fieldNorm(doc=2411)
          0.10595323 = weight(abstract_txt:boolean in 2411) [ClassicSimilarity], result of:
            0.10595323 = score(doc=2411,freq=1.0), product of:
              0.18133672 = queryWeight, product of:
                1.1144009 = boost
                6.232427 = idf(docFreq=230, maxDocs=43254)
                0.02610881 = queryNorm
              0.58429 = fieldWeight in 2411, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.232427 = idf(docFreq=230, maxDocs=43254)
                0.09375 = fieldNorm(doc=2411)
          0.36811382 = weight(abstract_txt:errors in 2411) [ClassicSimilarity], result of:
            0.36811382 = score(doc=2411,freq=9.0), product of:
              0.19997859 = queryWeight, product of:
                1.1702814 = boost
                6.544946 = idf(docFreq=168, maxDocs=43254)
                0.02610881 = queryNorm
              1.8407661 = fieldWeight in 2411, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                6.544946 = idf(docFreq=168, maxDocs=43254)
                0.09375 = fieldNorm(doc=2411)
          0.0437027 = weight(abstract_txt:different in 2411) [ClassicSimilarity], result of:
            0.0437027 = score(doc=2411,freq=1.0), product of:
              0.1265975 = queryWeight, product of:
                1.3168191 = boost
                3.6822383 = idf(docFreq=2958, maxDocs=43254)
                0.02610881 = queryNorm
              0.34520984 = fieldWeight in 2411, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6822383 = idf(docFreq=2958, maxDocs=43254)
                0.09375 = fieldNorm(doc=2411)
          0.04721274 = weight(abstract_txt:libraries in 2411) [ClassicSimilarity], result of:
            0.04721274 = score(doc=2411,freq=1.0), product of:
              0.1332884 = queryWeight, product of:
                1.3511692 = boost
                3.7782922 = idf(docFreq=2687, maxDocs=43254)
                0.02610881 = queryNorm
              0.3542149 = fieldWeight in 2411, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7782922 = idf(docFreq=2687, maxDocs=43254)
                0.09375 = fieldNorm(doc=2411)
        0.24 = coord(6/25)
    
  5. Albertson, D.; Meadows III, C.: Situated topic complexity in interactive video retrieval (2011) 0.16
    0.15830024 = sum of:
      0.15830024 = product of:
        0.49468824 = sum of:
          0.0609993 = weight(abstract_txt:apply in 1219) [ClassicSimilarity], result of:
            0.0609993 = score(doc=1219,freq=1.0), product of:
              0.16444486 = queryWeight, product of:
                1.061228 = boost
                5.935052 = idf(docFreq=310, maxDocs=43254)
                0.02610881 = queryNorm
              0.37094074 = fieldWeight in 1219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.935052 = idf(docFreq=310, maxDocs=43254)
                0.0625 = fieldNorm(doc=1219)
          0.0909262 = weight(abstract_txt:keyword in 1219) [ClassicSimilarity], result of:
            0.0909262 = score(doc=1219,freq=2.0), product of:
              0.17031509 = queryWeight, product of:
                1.0800034 = boost
                6.0400553 = idf(docFreq=279, maxDocs=43254)
                0.02610881 = queryNorm
              0.5338705 = fieldWeight in 1219, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.0400553 = idf(docFreq=279, maxDocs=43254)
                0.0625 = fieldNorm(doc=1219)
          0.0412033 = weight(abstract_txt:different in 1219) [ClassicSimilarity], result of:
            0.0412033 = score(doc=1219,freq=2.0), product of:
              0.1265975 = queryWeight, product of:
                1.3168191 = boost
                3.6822383 = idf(docFreq=2958, maxDocs=43254)
                0.02610881 = queryNorm
              0.32546696 = fieldWeight in 1219, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6822383 = idf(docFreq=2958, maxDocs=43254)
                0.0625 = fieldNorm(doc=1219)
          0.031475157 = weight(abstract_txt:libraries in 1219) [ClassicSimilarity], result of:
            0.031475157 = score(doc=1219,freq=1.0), product of:
              0.1332884 = queryWeight, product of:
                1.3511692 = boost
                3.7782922 = idf(docFreq=2687, maxDocs=43254)
                0.02610881 = queryNorm
              0.23614326 = fieldWeight in 1219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7782922 = idf(docFreq=2687, maxDocs=43254)
                0.0625 = fieldNorm(doc=1219)
          0.06850784 = weight(abstract_txt:digital in 1219) [ClassicSimilarity], result of:
            0.06850784 = score(doc=1219,freq=2.0), product of:
              0.17767675 = queryWeight, product of:
                1.5600153 = boost
                4.3622913 = idf(docFreq=1498, maxDocs=43254)
                0.02610881 = queryNorm
              0.3855757 = fieldWeight in 1219, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3622913 = idf(docFreq=1498, maxDocs=43254)
                0.0625 = fieldNorm(doc=1219)
          0.03656987 = weight(abstract_txt:retrieval in 1219) [ClassicSimilarity], result of:
            0.03656987 = score(doc=1219,freq=1.0), product of:
              0.16862674 = queryWeight, product of:
                1.8613259 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.02610881 = queryNorm
              0.21686874 = fieldWeight in 1219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.0625 = fieldNorm(doc=1219)
          0.09659696 = weight(abstract_txt:functions in 1219) [ClassicSimilarity], result of:
            0.09659696 = score(doc=1219,freq=1.0), product of:
              0.28148553 = queryWeight, product of:
                1.9635484 = boost
                5.490696 = idf(docFreq=484, maxDocs=43254)
                0.02610881 = queryNorm
              0.3431685 = fieldWeight in 1219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.490696 = idf(docFreq=484, maxDocs=43254)
                0.0625 = fieldNorm(doc=1219)
          0.068409614 = weight(abstract_txt:searching in 1219) [ClassicSimilarity], result of:
            0.068409614 = score(doc=1219,freq=1.0), product of:
              0.25600922 = queryWeight, product of:
                2.293438 = boost
                4.275447 = idf(docFreq=1634, maxDocs=43254)
                0.02610881 = queryNorm
              0.26721543 = fieldWeight in 1219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.275447 = idf(docFreq=1634, maxDocs=43254)
                0.0625 = fieldNorm(doc=1219)
        0.32 = coord(8/25)