Document (#14193)

Author
Joss, M.W.
Wszola, S.
Title
¬The engines that can : text search and retrieval software, their strategies, and vendors
Source
CD-ROM professional. 9(1996) no.6, S.30+(14 S.)
Year
1996
Abstract
Traces the development of text searching and retrieval software designed to cope with the increasing demands made by the storage and handling of large amounts of data, recorded on high data storage media, from CD-ROM to multi gigabyte storage media and online information services, with particular reference to the need to cope with graphics as well as conventional ASCII text. Includes details of: Boolean searching, fuzzy searching and matching; relevance ranking; proximity searching and improved strategies for dealing with text searching in very large databases. Concludes that the best searching tools for CD-ROM publishers are those optimized for searching and retrieval on CD-ROM. CD-ROM drives have relatively lower random seek times than hard discs and so the software most appropriate to the medium is that which can effectively arrange the indexes and text on the CD-ROM to avoid continuous random access searching. Lists and reviews a selection of software packages designed to achieve the sort of results required for rapid CD-ROM searching
Theme
Retrievalalgorithmen

Similar documents (content)

  1. Moffat, A.; Bell, T.A.H.: In situ generation of compressed inverted files (1995) 0.23
    0.23114 = sum of:
      0.23114 = product of:
        0.7223125 = sum of:
          0.012691594 = weight(abstract_txt:that in 3717) [ClassicSimilarity], result of:
            0.012691594 = score(doc=3717,freq=2.0), product of:
              0.04815562 = queryWeight, product of:
                1.0247371 = boost
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.019700186 = queryNorm
              0.26355374 = fieldWeight in 3717, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.078125 = fieldNorm(doc=3717)
          0.08874227 = weight(abstract_txt:sort in 3717) [ClassicSimilarity], result of:
            0.08874227 = score(doc=3717,freq=1.0), product of:
              0.15382472 = queryWeight, product of:
                1.0574051 = boost
                7.3843856 = idf(docFreq=72, maxDocs=43254)
                0.019700186 = queryNorm
              0.57690513 = fieldWeight in 3717, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3843856 = idf(docFreq=72, maxDocs=43254)
                0.078125 = fieldNorm(doc=3717)
          0.06804897 = weight(abstract_txt:large in 3717) [ClassicSimilarity], result of:
            0.06804897 = score(doc=3717,freq=3.0), product of:
              0.11257874 = queryWeight, product of:
                1.2792975 = boost
                4.466985 = idf(docFreq=1349, maxDocs=43254)
                0.019700186 = queryNorm
              0.60445666 = fieldWeight in 3717, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.466985 = idf(docFreq=1349, maxDocs=43254)
                0.078125 = fieldNorm(doc=3717)
          0.17105085 = weight(abstract_txt:gigabyte in 3717) [ClassicSimilarity], result of:
            0.17105085 = score(doc=3717,freq=1.0), product of:
              0.23824434 = queryWeight, product of:
                1.3159508 = boost
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.019700186 = queryNorm
              0.71796393 = fieldWeight in 3717, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.078125 = fieldNorm(doc=3717)
          0.05326073 = weight(abstract_txt:designed in 3717) [ClassicSimilarity], result of:
            0.05326073 = score(doc=3717,freq=1.0), product of:
              0.13789669 = queryWeight, product of:
                1.4158598 = boost
                4.9438267 = idf(docFreq=837, maxDocs=43254)
                0.019700186 = queryNorm
              0.38623646 = fieldWeight in 3717, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9438267 = idf(docFreq=837, maxDocs=43254)
                0.078125 = fieldNorm(doc=3717)
          0.12324256 = weight(abstract_txt:random in 3717) [ClassicSimilarity], result of:
            0.12324256 = score(doc=3717,freq=1.0), product of:
              0.24124385 = queryWeight, product of:
                1.872714 = boost
                6.539047 = idf(docFreq=169, maxDocs=43254)
                0.019700186 = queryNorm
              0.510863 = fieldWeight in 3717, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.539047 = idf(docFreq=169, maxDocs=43254)
                0.078125 = fieldNorm(doc=3717)
          0.13208781 = weight(abstract_txt:storage in 3717) [ClassicSimilarity], result of:
            0.13208781 = score(doc=3717,freq=1.0), product of:
              0.28921536 = queryWeight, product of:
                2.511306 = boost
                5.8458996 = idf(docFreq=339, maxDocs=43254)
                0.019700186 = queryNorm
              0.4567109 = fieldWeight in 3717, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8458996 = idf(docFreq=339, maxDocs=43254)
                0.078125 = fieldNorm(doc=3717)
          0.07318771 = weight(abstract_txt:text in 3717) [ClassicSimilarity], result of:
            0.07318771 = score(doc=3717,freq=1.0), product of:
              0.23132427 = queryWeight, product of:
                2.8995056 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.019700186 = queryNorm
              0.31638578 = fieldWeight in 3717, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.078125 = fieldNorm(doc=3717)
        0.32 = coord(8/25)
    
  2. Tenopir, C.: Full-text retrieval : systems and files (1994) 0.22
    0.22042553 = sum of:
      0.22042553 = product of:
        0.91843975 = sum of:
          0.012564038 = weight(abstract_txt:that in 3493) [ClassicSimilarity], result of:
            0.012564038 = score(doc=3493,freq=1.0), product of:
              0.04815562 = queryWeight, product of:
                1.0247371 = boost
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.019700186 = queryNorm
              0.2609049 = fieldWeight in 3493, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.109375 = fieldNorm(doc=3493)
          0.16877592 = weight(abstract_txt:discs in 3493) [ClassicSimilarity], result of:
            0.16877592 = score(doc=3493,freq=1.0), product of:
              0.18868066 = queryWeight, product of:
                1.1710948 = boost
                8.178337 = idf(docFreq=32, maxDocs=43254)
                0.019700186 = queryNorm
              0.8945056 = fieldWeight in 3493, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.178337 = idf(docFreq=32, maxDocs=43254)
                0.109375 = fieldNorm(doc=3493)
          0.038671188 = weight(abstract_txt:retrieval in 3493) [ClassicSimilarity], result of:
            0.038671188 = score(doc=3493,freq=1.0), product of:
              0.101894915 = queryWeight, product of:
                1.4906142 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.019700186 = queryNorm
              0.3795203 = fieldWeight in 3493, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.109375 = fieldNorm(doc=3493)
          0.10170552 = weight(abstract_txt:software in 3493) [ClassicSimilarity], result of:
            0.10170552 = score(doc=3493,freq=1.0), product of:
              0.21368305 = queryWeight, product of:
                2.4925473 = boost
                4.351674 = idf(docFreq=1514, maxDocs=43254)
                0.019700186 = queryNorm
              0.47596437 = fieldWeight in 3493, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.351674 = idf(docFreq=1514, maxDocs=43254)
                0.109375 = fieldNorm(doc=3493)
          0.28980854 = weight(abstract_txt:text in 3493) [ClassicSimilarity], result of:
            0.28980854 = score(doc=3493,freq=8.0), product of:
              0.23132427 = queryWeight, product of:
                2.8995056 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.019700186 = queryNorm
              1.2528237 = fieldWeight in 3493, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.109375 = fieldNorm(doc=3493)
          0.30691457 = weight(abstract_txt:searching in 3493) [ClassicSimilarity], result of:
            0.30691457 = score(doc=3493,freq=2.0), product of:
              0.46409076 = queryWeight, product of:
                5.5099936 = boost
                4.275447 = idf(docFreq=1634, maxDocs=43254)
                0.019700186 = queryNorm
              0.66132444 = fieldWeight in 3493, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.275447 = idf(docFreq=1634, maxDocs=43254)
                0.109375 = fieldNorm(doc=3493)
        0.24 = coord(6/25)
    
  3. Flanders, B.: On-line books : an advanced technology electronic library system (1992) 0.18
    0.17635806 = sum of:
      0.17635806 = product of:
        0.73482525 = sum of:
          0.008974313 = weight(abstract_txt:that in 2661) [ClassicSimilarity], result of:
            0.008974313 = score(doc=2661,freq=1.0), product of:
              0.04815562 = queryWeight, product of:
                1.0247371 = boost
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.019700186 = queryNorm
              0.18636064 = fieldWeight in 2661, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.078125 = fieldNorm(doc=2661)
          0.12055422 = weight(abstract_txt:discs in 2661) [ClassicSimilarity], result of:
            0.12055422 = score(doc=2661,freq=1.0), product of:
              0.18868066 = queryWeight, product of:
                1.1710948 = boost
                8.178337 = idf(docFreq=32, maxDocs=43254)
                0.019700186 = queryNorm
              0.6389326 = fieldWeight in 2661, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.178337 = idf(docFreq=32, maxDocs=43254)
                0.078125 = fieldNorm(doc=2661)
          0.027622277 = weight(abstract_txt:retrieval in 2661) [ClassicSimilarity], result of:
            0.027622277 = score(doc=2661,freq=1.0), product of:
              0.101894915 = queryWeight, product of:
                1.4906142 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.019700186 = queryNorm
              0.27108592 = fieldWeight in 2661, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.078125 = fieldNorm(doc=2661)
          0.34947145 = weight(abstract_txt:storage in 2661) [ClassicSimilarity], result of:
            0.34947145 = score(doc=2661,freq=7.0), product of:
              0.28921536 = queryWeight, product of:
                2.511306 = boost
                5.8458996 = idf(docFreq=339, maxDocs=43254)
                0.019700186 = queryNorm
              1.2083434 = fieldWeight in 2661, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.8458996 = idf(docFreq=339, maxDocs=43254)
                0.078125 = fieldNorm(doc=2661)
          0.07318771 = weight(abstract_txt:text in 2661) [ClassicSimilarity], result of:
            0.07318771 = score(doc=2661,freq=1.0), product of:
              0.23132427 = queryWeight, product of:
                2.8995056 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.019700186 = queryNorm
              0.31638578 = fieldWeight in 2661, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.078125 = fieldNorm(doc=2661)
          0.15501527 = weight(abstract_txt:searching in 2661) [ClassicSimilarity], result of:
            0.15501527 = score(doc=2661,freq=1.0), product of:
              0.46409076 = queryWeight, product of:
                5.5099936 = boost
                4.275447 = idf(docFreq=1634, maxDocs=43254)
                0.019700186 = queryNorm
              0.3340193 = fieldWeight in 2661, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.275447 = idf(docFreq=1634, maxDocs=43254)
                0.078125 = fieldNorm(doc=2661)
        0.24 = coord(6/25)
    
  4. Casale, M.: Full text retrieval for the Web (1996) 0.17
    0.17053074 = sum of:
      0.17053074 = product of:
        0.60903835 = sum of:
          0.09507549 = weight(abstract_txt:vendors in 826) [ClassicSimilarity], result of:
            0.09507549 = score(doc=826,freq=1.0), product of:
              0.14262554 = queryWeight, product of:
                1.0181857 = boost
                7.110497 = idf(docFreq=95, maxDocs=43254)
                0.019700186 = queryNorm
              0.6666091 = fieldWeight in 826, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.110497 = idf(docFreq=95, maxDocs=43254)
                0.09375 = fieldNorm(doc=826)
          0.015229913 = weight(abstract_txt:that in 826) [ClassicSimilarity], result of:
            0.015229913 = score(doc=826,freq=2.0), product of:
              0.04815562 = queryWeight, product of:
                1.0247371 = boost
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.019700186 = queryNorm
              0.3162645 = fieldWeight in 826, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.09375 = fieldNorm(doc=826)
          0.016741287 = weight(abstract_txt:with in 826) [ClassicSimilarity], result of:
            0.016741287 = score(doc=826,freq=1.0), product of:
              0.0711264 = queryWeight, product of:
                1.4380493 = boost
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.019700186 = queryNorm
              0.23537374 = fieldWeight in 826, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.09375 = fieldNorm(doc=826)
          0.03314673 = weight(abstract_txt:retrieval in 826) [ClassicSimilarity], result of:
            0.03314673 = score(doc=826,freq=1.0), product of:
              0.101894915 = queryWeight, product of:
                1.4906142 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.019700186 = queryNorm
              0.3253031 = fieldWeight in 826, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.09375 = fieldNorm(doc=826)
          0.08717616 = weight(abstract_txt:software in 826) [ClassicSimilarity], result of:
            0.08717616 = score(doc=826,freq=1.0), product of:
              0.21368305 = queryWeight, product of:
                2.4925473 = boost
                4.351674 = idf(docFreq=1514, maxDocs=43254)
                0.019700186 = queryNorm
              0.40796944 = fieldWeight in 826, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.351674 = idf(docFreq=1514, maxDocs=43254)
                0.09375 = fieldNorm(doc=826)
          0.1756505 = weight(abstract_txt:text in 826) [ClassicSimilarity], result of:
            0.1756505 = score(doc=826,freq=4.0), product of:
              0.23132427 = queryWeight, product of:
                2.8995056 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.019700186 = queryNorm
              0.75932586 = fieldWeight in 826, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.09375 = fieldNorm(doc=826)
          0.18601832 = weight(abstract_txt:searching in 826) [ClassicSimilarity], result of:
            0.18601832 = score(doc=826,freq=1.0), product of:
              0.46409076 = queryWeight, product of:
                5.5099936 = boost
                4.275447 = idf(docFreq=1634, maxDocs=43254)
                0.019700186 = queryNorm
              0.40082315 = fieldWeight in 826, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.275447 = idf(docFreq=1634, maxDocs=43254)
                0.09375 = fieldNorm(doc=826)
        0.28 = coord(7/25)
    
  5. Schmidt, J.: Full-text searching : as seen from a non-bibliographic searcher's point of view (1989) 0.16
    0.16145036 = sum of:
      0.16145036 = product of:
        0.6727098 = sum of:
          0.024080606 = weight(abstract_txt:that in 3945) [ClassicSimilarity], result of:
            0.024080606 = score(doc=3945,freq=5.0), product of:
              0.04815562 = queryWeight, product of:
                1.0247371 = boost
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.019700186 = queryNorm
              0.50005805 = fieldWeight in 3945, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.09375 = fieldNorm(doc=3945)
          0.11070035 = weight(abstract_txt:designed in 3945) [ClassicSimilarity], result of:
            0.11070035 = score(doc=3945,freq=3.0), product of:
              0.13789669 = queryWeight, product of:
                1.4158598 = boost
                4.9438267 = idf(docFreq=837, maxDocs=43254)
                0.019700186 = queryNorm
              0.8027774 = fieldWeight in 3945, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.9438267 = idf(docFreq=837, maxDocs=43254)
                0.09375 = fieldNorm(doc=3945)
          0.016741287 = weight(abstract_txt:with in 3945) [ClassicSimilarity], result of:
            0.016741287 = score(doc=3945,freq=1.0), product of:
              0.0711264 = queryWeight, product of:
                1.4380493 = boost
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.019700186 = queryNorm
              0.23537374 = fieldWeight in 3945, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.09375 = fieldNorm(doc=3945)
          0.04687656 = weight(abstract_txt:retrieval in 3945) [ClassicSimilarity], result of:
            0.04687656 = score(doc=3945,freq=2.0), product of:
              0.101894915 = queryWeight, product of:
                1.4906142 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.019700186 = queryNorm
              0.46004808 = fieldWeight in 3945, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.09375 = fieldNorm(doc=3945)
          0.1521178 = weight(abstract_txt:text in 3945) [ClassicSimilarity], result of:
            0.1521178 = score(doc=3945,freq=3.0), product of:
              0.23132427 = queryWeight, product of:
                2.8995056 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.019700186 = queryNorm
              0.6575955 = fieldWeight in 3945, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.09375 = fieldNorm(doc=3945)
          0.32219318 = weight(abstract_txt:searching in 3945) [ClassicSimilarity], result of:
            0.32219318 = score(doc=3945,freq=3.0), product of:
              0.46409076 = queryWeight, product of:
                5.5099936 = boost
                4.275447 = idf(docFreq=1634, maxDocs=43254)
                0.019700186 = queryNorm
              0.69424605 = fieldWeight in 3945, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.275447 = idf(docFreq=1634, maxDocs=43254)
                0.09375 = fieldNorm(doc=3945)
        0.24 = coord(6/25)