Document (#14193)

Author
Joss, M.W.
Wszola, S.
Title
¬The engines that can : text search and retrieval software, their strategies, and vendors
Source
CD-ROM professional. 9(1996) no.6, S.30+(14 S.)
Year
1996
Abstract
Traces the development of text searching and retrieval software designed to cope with the increasing demands made by the storage and handling of large amounts of data, recorded on high data storage media, from CD-ROM to multi gigabyte storage media and online information services, with particular reference to the need to cope with graphics as well as conventional ASCII text. Includes details of: Boolean searching, fuzzy searching and matching; relevance ranking; proximity searching and improved strategies for dealing with text searching in very large databases. Concludes that the best searching tools for CD-ROM publishers are those optimized for searching and retrieval on CD-ROM. CD-ROM drives have relatively lower random seek times than hard discs and so the software most appropriate to the medium is that which can effectively arrange the indexes and text on the CD-ROM to avoid continuous random access searching. Lists and reviews a selection of software packages designed to achieve the sort of results required for rapid CD-ROM searching
Theme
Retrievalalgorithmen

Similar documents (content)

  1. Moffat, A.; Bell, T.A.H.: In situ generation of compressed inverted files (1995) 0.23
    0.22963195 = sum of:
      0.22963195 = product of:
        0.71759987 = sum of:
          0.012447947 = weight(abstract_txt:that in 2648) [ClassicSimilarity], result of:
            0.012447947 = score(doc=2648,freq=2.0), product of:
              0.04754891 = queryWeight, product of:
                1.0146863 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.019776827 = queryNorm
              0.26179248 = fieldWeight in 2648, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=2648)
          0.08862741 = weight(abstract_txt:sort in 2648) [ClassicSimilarity], result of:
            0.08862741 = score(doc=2648,freq=1.0), product of:
              0.15372944 = queryWeight, product of:
                1.0533663 = boost
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.019776827 = queryNorm
              0.57651556 = fieldWeight in 2648, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.078125 = fieldNorm(doc=2648)
          0.067510664 = weight(abstract_txt:large in 2648) [ClassicSimilarity], result of:
            0.067510664 = score(doc=2648,freq=3.0), product of:
              0.11201155 = queryWeight, product of:
                1.2715906 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.019776827 = queryNorm
              0.6027116 = fieldWeight in 2648, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.078125 = fieldNorm(doc=2648)
          0.16795538 = weight(abstract_txt:gigabyte in 2648) [ClassicSimilarity], result of:
            0.16795538 = score(doc=2648,freq=1.0), product of:
              0.23541869 = queryWeight, product of:
                1.303531 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.019776827 = queryNorm
              0.71343267 = fieldWeight in 2648, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.078125 = fieldNorm(doc=2648)
          0.0532875 = weight(abstract_txt:designed in 2648) [ClassicSimilarity], result of:
            0.0532875 = score(doc=2648,freq=1.0), product of:
              0.13797653 = queryWeight, product of:
                1.4112973 = boost
                4.9434495 = idf(docFreq=856, maxDocs=44218)
                0.019776827 = queryNorm
              0.38620698 = fieldWeight in 2648, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9434495 = idf(docFreq=856, maxDocs=44218)
                0.078125 = fieldNorm(doc=2648)
          0.12294017 = weight(abstract_txt:random in 2648) [ClassicSimilarity], result of:
            0.12294017 = score(doc=2648,freq=1.0), product of:
              0.24090779 = queryWeight, product of:
                1.8648388 = boost
                6.532101 = idf(docFreq=174, maxDocs=44218)
                0.019776827 = queryNorm
              0.5103204 = fieldWeight in 2648, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.532101 = idf(docFreq=174, maxDocs=44218)
                0.078125 = fieldNorm(doc=2648)
          0.13190761 = weight(abstract_txt:storage in 2648) [ClassicSimilarity], result of:
            0.13190761 = score(doc=2648,freq=1.0), product of:
              0.2890227 = queryWeight, product of:
                2.5016556 = boost
                5.8418155 = idf(docFreq=348, maxDocs=44218)
                0.019776827 = queryNorm
              0.45639184 = fieldWeight in 2648, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8418155 = idf(docFreq=348, maxDocs=44218)
                0.078125 = fieldNorm(doc=2648)
          0.072923176 = weight(abstract_txt:text in 2648) [ClassicSimilarity], result of:
            0.072923176 = score(doc=2648,freq=1.0), product of:
              0.2308228 = queryWeight, product of:
                2.8861923 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.019776827 = queryNorm
              0.3159271 = fieldWeight in 2648, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=2648)
        0.32 = coord(8/25)
    
  2. Tenopir, C.: Full-text retrieval : systems and files (1994) 0.22
    0.22115004 = sum of:
      0.22115004 = product of:
        0.92145854 = sum of:
          0.012322839 = weight(abstract_txt:that in 2424) [ClassicSimilarity], result of:
            0.012322839 = score(doc=2424,freq=1.0), product of:
              0.04754891 = queryWeight, product of:
                1.0146863 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.019776827 = queryNorm
              0.25916135 = fieldWeight in 2424, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.109375 = fieldNorm(doc=2424)
          0.17026873 = weight(abstract_txt:discs in 2424) [ClassicSimilarity], result of:
            0.17026873 = score(doc=2424,freq=1.0), product of:
              0.18983789 = queryWeight, product of:
                1.1705564 = boost
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.019776827 = queryNorm
              0.8969165 = fieldWeight in 2424, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.109375 = fieldNorm(doc=2424)
          0.038875055 = weight(abstract_txt:retrieval in 2424) [ClassicSimilarity], result of:
            0.038875055 = score(doc=2424,freq=1.0), product of:
              0.102277644 = queryWeight, product of:
                1.4881678 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.019776827 = queryNorm
              0.38009337 = fieldWeight in 2424, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.109375 = fieldNorm(doc=2424)
          0.102087386 = weight(abstract_txt:software in 2424) [ClassicSimilarity], result of:
            0.102087386 = score(doc=2424,freq=1.0), product of:
              0.21426983 = queryWeight, product of:
                2.487204 = boost
                4.3560514 = idf(docFreq=1541, maxDocs=44218)
                0.019776827 = queryNorm
              0.4764431 = fieldWeight in 2424, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3560514 = idf(docFreq=1541, maxDocs=44218)
                0.109375 = fieldNorm(doc=2424)
          0.28876105 = weight(abstract_txt:text in 2424) [ClassicSimilarity], result of:
            0.28876105 = score(doc=2424,freq=8.0), product of:
              0.2308228 = queryWeight, product of:
                2.8861923 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.019776827 = queryNorm
              1.2510074 = fieldWeight in 2424, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.109375 = fieldNorm(doc=2424)
          0.30914348 = weight(abstract_txt:searching in 2424) [ClassicSimilarity], result of:
            0.30914348 = score(doc=2424,freq=2.0), product of:
              0.4664487 = queryWeight, product of:
                5.504579 = boost
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.019776827 = queryNorm
              0.6627599 = fieldWeight in 2424, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.109375 = fieldNorm(doc=2424)
        0.24 = coord(6/25)
    
  3. Flanders, B.: On-line books : an advanced technology electronic library system (1992) 0.18
    0.17669985 = sum of:
      0.17669985 = product of:
        0.7362494 = sum of:
          0.008802028 = weight(abstract_txt:that in 2661) [ClassicSimilarity], result of:
            0.008802028 = score(doc=2661,freq=1.0), product of:
              0.04754891 = queryWeight, product of:
                1.0146863 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.019776827 = queryNorm
              0.18511525 = fieldWeight in 2661, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=2661)
          0.12162052 = weight(abstract_txt:discs in 2661) [ClassicSimilarity], result of:
            0.12162052 = score(doc=2661,freq=1.0), product of:
              0.18983789 = queryWeight, product of:
                1.1705564 = boost
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.019776827 = queryNorm
              0.6406546 = fieldWeight in 2661, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.078125 = fieldNorm(doc=2661)
          0.027767895 = weight(abstract_txt:retrieval in 2661) [ClassicSimilarity], result of:
            0.027767895 = score(doc=2661,freq=1.0), product of:
              0.102277644 = queryWeight, product of:
                1.4881678 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.019776827 = queryNorm
              0.27149525 = fieldWeight in 2661, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=2661)
          0.3489947 = weight(abstract_txt:storage in 2661) [ClassicSimilarity], result of:
            0.3489947 = score(doc=2661,freq=7.0), product of:
              0.2890227 = queryWeight, product of:
                2.5016556 = boost
                5.8418155 = idf(docFreq=348, maxDocs=44218)
                0.019776827 = queryNorm
              1.2074993 = fieldWeight in 2661, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.8418155 = idf(docFreq=348, maxDocs=44218)
                0.078125 = fieldNorm(doc=2661)
          0.072923176 = weight(abstract_txt:text in 2661) [ClassicSimilarity], result of:
            0.072923176 = score(doc=2661,freq=1.0), product of:
              0.2308228 = queryWeight, product of:
                2.8861923 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.019776827 = queryNorm
              0.3159271 = fieldWeight in 2661, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=2661)
          0.15614104 = weight(abstract_txt:searching in 2661) [ClassicSimilarity], result of:
            0.15614104 = score(doc=2661,freq=1.0), product of:
              0.4664487 = queryWeight, product of:
                5.504579 = boost
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.019776827 = queryNorm
              0.3347443 = fieldWeight in 2661, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.078125 = fieldNorm(doc=2661)
        0.24 = coord(6/25)
    
  4. Casale, M.: Full text retrieval for the Web (1996) 0.17
    0.17100044 = sum of:
      0.17100044 = product of:
        0.61071587 = sum of:
          0.014937537 = weight(abstract_txt:that in 6757) [ClassicSimilarity], result of:
            0.014937537 = score(doc=6757,freq=2.0), product of:
              0.04754891 = queryWeight, product of:
                1.0146863 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.019776827 = queryNorm
              0.314151 = fieldWeight in 6757, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.09375 = fieldNorm(doc=6757)
          0.09603261 = weight(abstract_txt:vendors in 6757) [ClassicSimilarity], result of:
            0.09603261 = score(doc=6757,freq=1.0), product of:
              0.14361615 = queryWeight, product of:
                1.0181284 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.019776827 = queryNorm
              0.66867554 = fieldWeight in 6757, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.09375 = fieldNorm(doc=6757)
          0.016535884 = weight(abstract_txt:with in 6757) [ClassicSimilarity], result of:
            0.016535884 = score(doc=6757,freq=1.0), product of:
              0.07056063 = queryWeight, product of:
                1.4272896 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.019776827 = queryNorm
              0.23435001 = fieldWeight in 6757, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.09375 = fieldNorm(doc=6757)
          0.033321474 = weight(abstract_txt:retrieval in 6757) [ClassicSimilarity], result of:
            0.033321474 = score(doc=6757,freq=1.0), product of:
              0.102277644 = queryWeight, product of:
                1.4881678 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.019776827 = queryNorm
              0.3257943 = fieldWeight in 6757, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=6757)
          0.08750348 = weight(abstract_txt:software in 6757) [ClassicSimilarity], result of:
            0.08750348 = score(doc=6757,freq=1.0), product of:
              0.21426983 = queryWeight, product of:
                2.487204 = boost
                4.3560514 = idf(docFreq=1541, maxDocs=44218)
                0.019776827 = queryNorm
              0.40837982 = fieldWeight in 6757, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3560514 = idf(docFreq=1541, maxDocs=44218)
                0.09375 = fieldNorm(doc=6757)
          0.17501561 = weight(abstract_txt:text in 6757) [ClassicSimilarity], result of:
            0.17501561 = score(doc=6757,freq=4.0), product of:
              0.2308228 = queryWeight, product of:
                2.8861923 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.019776827 = queryNorm
              0.75822496 = fieldWeight in 6757, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.09375 = fieldNorm(doc=6757)
          0.18736926 = weight(abstract_txt:searching in 6757) [ClassicSimilarity], result of:
            0.18736926 = score(doc=6757,freq=1.0), product of:
              0.4664487 = queryWeight, product of:
                5.504579 = boost
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.019776827 = queryNorm
              0.40169317 = fieldWeight in 6757, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.09375 = fieldNorm(doc=6757)
        0.28 = coord(7/25)
    
  5. Schmidt, J.: Full-text searching : as seen from a non-bibliographic searcher's point of view (1989) 0.16
    0.16179237 = sum of:
      0.16179237 = product of:
        0.6741349 = sum of:
          0.023618318 = weight(abstract_txt:that in 2876) [ClassicSimilarity], result of:
            0.023618318 = score(doc=2876,freq=5.0), product of:
              0.04754891 = queryWeight, product of:
                1.0146863 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.019776827 = queryNorm
              0.49671632 = fieldWeight in 2876, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.09375 = fieldNorm(doc=2876)
          0.11075599 = weight(abstract_txt:designed in 2876) [ClassicSimilarity], result of:
            0.11075599 = score(doc=2876,freq=3.0), product of:
              0.13797653 = queryWeight, product of:
                1.4112973 = boost
                4.9434495 = idf(docFreq=856, maxDocs=44218)
                0.019776827 = queryNorm
              0.80271614 = fieldWeight in 2876, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.9434495 = idf(docFreq=856, maxDocs=44218)
                0.09375 = fieldNorm(doc=2876)
          0.016535884 = weight(abstract_txt:with in 2876) [ClassicSimilarity], result of:
            0.016535884 = score(doc=2876,freq=1.0), product of:
              0.07056063 = queryWeight, product of:
                1.4272896 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.019776827 = queryNorm
              0.23435001 = fieldWeight in 2876, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.09375 = fieldNorm(doc=2876)
          0.047123678 = weight(abstract_txt:retrieval in 2876) [ClassicSimilarity], result of:
            0.047123678 = score(doc=2876,freq=2.0), product of:
              0.102277644 = queryWeight, product of:
                1.4881678 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.019776827 = queryNorm
              0.4607427 = fieldWeight in 2876, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=2876)
          0.15156797 = weight(abstract_txt:text in 2876) [ClassicSimilarity], result of:
            0.15156797 = score(doc=2876,freq=3.0), product of:
              0.2308228 = queryWeight, product of:
                2.8861923 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.019776827 = queryNorm
              0.6566421 = fieldWeight in 2876, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.09375 = fieldNorm(doc=2876)
          0.32453308 = weight(abstract_txt:searching in 2876) [ClassicSimilarity], result of:
            0.32453308 = score(doc=2876,freq=3.0), product of:
              0.4664487 = queryWeight, product of:
                5.504579 = boost
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.019776827 = queryNorm
              0.695753 = fieldWeight in 2876, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.09375 = fieldNorm(doc=2876)
        0.24 = coord(6/25)