Document (#7658)

Author
Couvreur, T.R.
Benzel, R.N.
Miller, S.F.
Zeitler, D.N.
Lee, D.L.
Singhal, M.
Shivaratri, N.
Wong, W.Y.P.
Title
¬An analysis of performance and cost factors in searching large text databases using parallel search systems
Source
Journal of the American Society for Information Science. 45(1994) no.7, S.443-464
Year
1994
Abstract
The results of modelling the performance of searching large text databases (>10 GBytes) via various parallel hardware architectures and search algorithms are discussed. The performance under load and the cost of each configuration are compared. Strengths, weaknesses, performance sensitivities, and search features supported for each configuration are also addressed. In addition, a common search workload used in the modelling is described. The search workload is derived from a set of searches run against the Chemical Abstracts file of bibliographic and abstract text available on STN International. This common workload is applied to all configurations modelled to provide a common basis of comparison
Theme
Retrievalalgorithmen
Volltextretrieval

Similar documents (author)

  1. Singhal, A.: Document length normalization (1996) 1.47
    1.4706984 = sum of:
      1.4706984 = product of:
        4.412095 = sum of:
          4.412095 = weight(author_txt:singhal in 6699) [ClassicSimilarity], result of:
            4.412095 = score(doc=6699,freq=1.0), product of:
              0.7374045 = queryWeight, product of:
                1.3465632 = boost
                9.573242 = idf(docFreq=7, maxDocs=42306)
                0.05720316 = queryNorm
              5.9832764 = fieldWeight in 6699, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.573242 = idf(docFreq=7, maxDocs=42306)
                0.625 = fieldNorm(doc=6699)
        0.33333334 = coord(1/3)
    
  2. Wong, S.K.M.: On modelling information retrieval with probabilistic inference (1995) 0.92
    0.9198406 = sum of:
      0.9198406 = product of:
        2.7595217 = sum of:
          2.7595217 = weight(author_txt:wong in 2007) [ClassicSimilarity], result of:
            2.7595217 = score(doc=2007,freq=1.0), product of:
              0.5393017 = queryWeight, product of:
                1.1515684 = boost
                8.186948 = idf(docFreq=31, maxDocs=42306)
                0.05720316 = queryNorm
              5.1168423 = fieldWeight in 2007, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.186948 = idf(docFreq=31, maxDocs=42306)
                0.625 = fieldNorm(doc=2007)
        0.33333334 = coord(1/3)
    
  3. Wong, K.: Frühe Spuren des menschlichen Geistes (2005) 0.92
    0.9198406 = sum of:
      0.9198406 = product of:
        2.7595217 = sum of:
          2.7595217 = weight(author_txt:wong in 1984) [ClassicSimilarity], result of:
            2.7595217 = score(doc=1984,freq=1.0), product of:
              0.5393017 = queryWeight, product of:
                1.1515684 = boost
                8.186948 = idf(docFreq=31, maxDocs=42306)
                0.05720316 = queryNorm
              5.1168423 = fieldWeight in 1984, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.186948 = idf(docFreq=31, maxDocs=42306)
                0.625 = fieldNorm(doc=1984)
        0.33333334 = coord(1/3)
    
  4. Salton, G.; Allan, J.; Singhal, A.: Automatic text decomposition and structuring (1996) 0.88
    0.88241905 = sum of:
      0.88241905 = product of:
        2.647257 = sum of:
          2.647257 = weight(author_txt:singhal in 4136) [ClassicSimilarity], result of:
            2.647257 = score(doc=4136,freq=1.0), product of:
              0.7374045 = queryWeight, product of:
                1.3465632 = boost
                9.573242 = idf(docFreq=7, maxDocs=42306)
                0.05720316 = queryNorm
              3.5899658 = fieldWeight in 4136, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.573242 = idf(docFreq=7, maxDocs=42306)
                0.375 = fieldNorm(doc=4136)
        0.33333334 = coord(1/3)
    
  5. Singhal, A.; Buckley, C.; Mitra, M.: Using query zoning and correlation with SMART : TREC 5 (1997) 0.88
    0.88241905 = sum of:
      0.88241905 = product of:
        2.647257 = sum of:
          2.647257 = weight(author_txt:singhal in 4091) [ClassicSimilarity], result of:
            2.647257 = score(doc=4091,freq=1.0), product of:
              0.7374045 = queryWeight, product of:
                1.3465632 = boost
                9.573242 = idf(docFreq=7, maxDocs=42306)
                0.05720316 = queryNorm
              3.5899658 = fieldWeight in 4091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.573242 = idf(docFreq=7, maxDocs=42306)
                0.375 = fieldNorm(doc=4091)
        0.33333334 = coord(1/3)
    

Similar documents (content)

  1. Stanfill, C.: Parallel information retrieval algorithms (1992) 0.28
    0.28024277 = sum of:
      0.28024277 = product of:
        1.1676782 = sum of:
          0.04480936 = weight(abstract_txt:searching in 4516) [ClassicSimilarity], result of:
            0.04480936 = score(doc=4516,freq=1.0), product of:
              0.09601631 = queryWeight, product of:
                1.4328833 = boost
                4.2668333 = idf(docFreq=1612, maxDocs=42306)
                0.01570466 = queryNorm
              0.46668488 = fieldWeight in 4516, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2668333 = idf(docFreq=1612, maxDocs=42306)
                0.109375 = fieldNorm(doc=4516)
          0.069053374 = weight(abstract_txt:databases in 4516) [ClassicSimilarity], result of:
            0.069053374 = score(doc=4516,freq=2.0), product of:
              0.1016746 = queryWeight, product of:
                1.4744992 = boost
                4.390757 = idf(docFreq=1424, maxDocs=42306)
                0.01570466 = queryNorm
              0.67916054 = fieldWeight in 4516, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.390757 = idf(docFreq=1424, maxDocs=42306)
                0.109375 = fieldNorm(doc=4516)
          0.081394695 = weight(abstract_txt:text in 4516) [ClassicSimilarity], result of:
            0.081394695 = score(doc=4516,freq=2.0), product of:
              0.12987244 = queryWeight, product of:
                2.0409973 = boost
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.01570466 = queryNorm
              0.626728 = fieldWeight in 4516, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.109375 = fieldNorm(doc=4516)
          0.21494699 = weight(abstract_txt:parallel in 4516) [ClassicSimilarity], result of:
            0.21494699 = score(doc=4516,freq=2.0), product of:
              0.21675882 = queryWeight, product of:
                2.152914 = boost
                6.4109373 = idf(docFreq=188, maxDocs=42306)
                0.01570466 = queryNorm
              0.9916412 = fieldWeight in 4516, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.4109373 = idf(docFreq=188, maxDocs=42306)
                0.109375 = fieldNorm(doc=4516)
          0.16390574 = weight(abstract_txt:performance in 4516) [ClassicSimilarity], result of:
            0.16390574 = score(doc=4516,freq=2.0), product of:
              0.22794424 = queryWeight, product of:
                3.1222496 = boost
                4.6487103 = idf(docFreq=1100, maxDocs=42306)
                0.01570466 = queryNorm
              0.71906066 = fieldWeight in 4516, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6487103 = idf(docFreq=1100, maxDocs=42306)
                0.109375 = fieldNorm(doc=4516)
          0.5935681 = weight(abstract_txt:workload in 4516) [ClassicSimilarity], result of:
            0.5935681 = score(doc=4516,freq=1.0), product of:
              0.61533266 = queryWeight, product of:
                4.4426174 = boost
                8.81947 = idf(docFreq=16, maxDocs=42306)
                0.01570466 = queryNorm
              0.9646296 = fieldWeight in 4516, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.81947 = idf(docFreq=16, maxDocs=42306)
                0.109375 = fieldNorm(doc=4516)
        0.24 = coord(6/25)
    
  2. Lu, Z.; McKinley, K.S.: ¬The effect of collection organization and query locality on information retrieval system performance (2000) 0.17
    0.16688609 = sum of:
      0.16688609 = product of:
        0.69535875 = sum of:
          0.069356985 = weight(abstract_txt:architectures in 1034) [ClassicSimilarity], result of:
            0.069356985 = score(doc=1034,freq=1.0), product of:
              0.14808396 = queryWeight, product of:
                1.25828 = boost
                7.493801 = idf(docFreq=63, maxDocs=42306)
                0.01570466 = queryNorm
              0.46836257 = fieldWeight in 1034, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.493801 = idf(docFreq=63, maxDocs=42306)
                0.0625 = fieldNorm(doc=1034)
          0.12001213 = weight(abstract_txt:configurations in 1034) [ClassicSimilarity], result of:
            0.12001213 = score(doc=1034,freq=2.0), product of:
              0.16940309 = queryWeight, product of:
                1.3458107 = boost
                8.015098 = idf(docFreq=37, maxDocs=42306)
                0.01570466 = queryNorm
              0.7084412 = fieldWeight in 1034, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.015098 = idf(docFreq=37, maxDocs=42306)
                0.0625 = fieldNorm(doc=1034)
          0.032888424 = weight(abstract_txt:text in 1034) [ClassicSimilarity], result of:
            0.032888424 = score(doc=1034,freq=1.0), product of:
              0.12987244 = queryWeight, product of:
                2.0409973 = boost
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.01570466 = queryNorm
              0.25323635 = fieldWeight in 1034, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.0625 = fieldNorm(doc=1034)
          0.040259015 = weight(abstract_txt:search in 1034) [ClassicSimilarity], result of:
            0.040259015 = score(doc=1034,freq=1.0), product of:
              0.17620301 = queryWeight, product of:
                3.0691278 = boost
                3.6556938 = idf(docFreq=2971, maxDocs=42306)
                0.01570466 = queryNorm
              0.22848086 = fieldWeight in 1034, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6556938 = idf(docFreq=2971, maxDocs=42306)
                0.0625 = fieldNorm(doc=1034)
          0.09366042 = weight(abstract_txt:performance in 1034) [ClassicSimilarity], result of:
            0.09366042 = score(doc=1034,freq=2.0), product of:
              0.22794424 = queryWeight, product of:
                3.1222496 = boost
                4.6487103 = idf(docFreq=1100, maxDocs=42306)
                0.01570466 = queryNorm
              0.4108918 = fieldWeight in 1034, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6487103 = idf(docFreq=1100, maxDocs=42306)
                0.0625 = fieldNorm(doc=1034)
          0.33918175 = weight(abstract_txt:workload in 1034) [ClassicSimilarity], result of:
            0.33918175 = score(doc=1034,freq=1.0), product of:
              0.61533266 = queryWeight, product of:
                4.4426174 = boost
                8.81947 = idf(docFreq=16, maxDocs=42306)
                0.01570466 = queryNorm
              0.5512169 = fieldWeight in 1034, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.81947 = idf(docFreq=16, maxDocs=42306)
                0.0625 = fieldNorm(doc=1034)
        0.24 = coord(6/25)
    
  3. Tedd, L.A.: ¬The changing face of CD-ROM (1995) 0.12
    0.11720511 = sum of:
      0.11720511 = product of:
        0.48835465 = sum of:
          0.07897401 = weight(abstract_txt:hardware in 3761) [ClassicSimilarity], result of:
            0.07897401 = score(doc=3761,freq=1.0), product of:
              0.11119331 = queryWeight, product of:
                1.0903417 = boost
                6.493629 = idf(docFreq=173, maxDocs=42306)
                0.01570466 = queryNorm
              0.71024066 = fieldWeight in 3761, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.493629 = idf(docFreq=173, maxDocs=42306)
                0.109375 = fieldNorm(doc=3761)
          0.04480936 = weight(abstract_txt:searching in 3761) [ClassicSimilarity], result of:
            0.04480936 = score(doc=3761,freq=1.0), product of:
              0.09601631 = queryWeight, product of:
                1.4328833 = boost
                4.2668333 = idf(docFreq=1612, maxDocs=42306)
                0.01570466 = queryNorm
              0.46668488 = fieldWeight in 3761, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2668333 = idf(docFreq=1612, maxDocs=42306)
                0.109375 = fieldNorm(doc=3761)
          0.08457278 = weight(abstract_txt:databases in 3761) [ClassicSimilarity], result of:
            0.08457278 = score(doc=3761,freq=3.0), product of:
              0.1016746 = queryWeight, product of:
                1.4744992 = boost
                4.390757 = idf(docFreq=1424, maxDocs=42306)
                0.01570466 = queryNorm
              0.83179843 = fieldWeight in 3761, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.390757 = idf(docFreq=1424, maxDocs=42306)
                0.109375 = fieldNorm(doc=3761)
          0.057554744 = weight(abstract_txt:text in 3761) [ClassicSimilarity], result of:
            0.057554744 = score(doc=3761,freq=1.0), product of:
              0.12987244 = queryWeight, product of:
                2.0409973 = boost
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.01570466 = queryNorm
              0.44316363 = fieldWeight in 3761, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.109375 = fieldNorm(doc=3761)
          0.15199047 = weight(abstract_txt:parallel in 3761) [ClassicSimilarity], result of:
            0.15199047 = score(doc=3761,freq=1.0), product of:
              0.21675882 = queryWeight, product of:
                2.152914 = boost
                6.4109373 = idf(docFreq=188, maxDocs=42306)
                0.01570466 = queryNorm
              0.70119625 = fieldWeight in 3761, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4109373 = idf(docFreq=188, maxDocs=42306)
                0.109375 = fieldNorm(doc=3761)
          0.07045328 = weight(abstract_txt:search in 3761) [ClassicSimilarity], result of:
            0.07045328 = score(doc=3761,freq=1.0), product of:
              0.17620301 = queryWeight, product of:
                3.0691278 = boost
                3.6556938 = idf(docFreq=2971, maxDocs=42306)
                0.01570466 = queryNorm
              0.39984152 = fieldWeight in 3761, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6556938 = idf(docFreq=2971, maxDocs=42306)
                0.109375 = fieldNorm(doc=3761)
        0.24 = coord(6/25)
    
  4. Allen, B.: Individual differences and the conundrums of user-centered design : two experiments (2000) 0.11
    0.106147036 = sum of:
      0.106147036 = product of:
        0.663419 = sum of:
          0.2121535 = weight(abstract_txt:configurations in 5602) [ClassicSimilarity], result of:
            0.2121535 = score(doc=5602,freq=4.0), product of:
              0.16940309 = queryWeight, product of:
                1.3458107 = boost
                8.015098 = idf(docFreq=37, maxDocs=42306)
                0.01570466 = queryNorm
              1.252359 = fieldWeight in 5602, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.015098 = idf(docFreq=37, maxDocs=42306)
                0.078125 = fieldNorm(doc=5602)
          0.2838662 = weight(abstract_txt:configuration in 5602) [ClassicSimilarity], result of:
            0.2838662 = score(doc=5602,freq=2.0), product of:
              0.32652542 = queryWeight, product of:
                2.6423893 = boost
                7.8684945 = idf(docFreq=43, maxDocs=42306)
                0.01570466 = queryNorm
              0.869354 = fieldWeight in 5602, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.8684945 = idf(docFreq=43, maxDocs=42306)
                0.078125 = fieldNorm(doc=5602)
          0.05032377 = weight(abstract_txt:search in 5602) [ClassicSimilarity], result of:
            0.05032377 = score(doc=5602,freq=1.0), product of:
              0.17620301 = queryWeight, product of:
                3.0691278 = boost
                3.6556938 = idf(docFreq=2971, maxDocs=42306)
                0.01570466 = queryNorm
              0.28560108 = fieldWeight in 5602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6556938 = idf(docFreq=2971, maxDocs=42306)
                0.078125 = fieldNorm(doc=5602)
          0.11707553 = weight(abstract_txt:performance in 5602) [ClassicSimilarity], result of:
            0.11707553 = score(doc=5602,freq=2.0), product of:
              0.22794424 = queryWeight, product of:
                3.1222496 = boost
                4.6487103 = idf(docFreq=1100, maxDocs=42306)
                0.01570466 = queryNorm
              0.5136148 = fieldWeight in 5602, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6487103 = idf(docFreq=1100, maxDocs=42306)
                0.078125 = fieldNorm(doc=5602)
        0.16 = coord(4/25)
    
  5. Beiser, K.: CD-ROM in 1994 - the year ahead (1994) 0.10
    0.10414306 = sum of:
      0.10414306 = product of:
        0.65089417 = sum of:
          0.09025601 = weight(abstract_txt:hardware in 7051) [ClassicSimilarity], result of:
            0.09025601 = score(doc=7051,freq=1.0), product of:
              0.11119331 = queryWeight, product of:
                1.0903417 = boost
                6.493629 = idf(docFreq=173, maxDocs=42306)
                0.01570466 = queryNorm
              0.8117036 = fieldWeight in 7051, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.493629 = idf(docFreq=173, maxDocs=42306)
                0.125 = fieldNorm(doc=7051)
          0.06577685 = weight(abstract_txt:text in 7051) [ClassicSimilarity], result of:
            0.06577685 = score(doc=7051,freq=1.0), product of:
              0.12987244 = queryWeight, product of:
                2.0409973 = boost
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.01570466 = queryNorm
              0.5064727 = fieldWeight in 7051, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.125 = fieldNorm(doc=7051)
          0.1737034 = weight(abstract_txt:parallel in 7051) [ClassicSimilarity], result of:
            0.1737034 = score(doc=7051,freq=1.0), product of:
              0.21675882 = queryWeight, product of:
                2.152914 = boost
                6.4109373 = idf(docFreq=188, maxDocs=42306)
                0.01570466 = queryNorm
              0.80136716 = fieldWeight in 7051, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4109373 = idf(docFreq=188, maxDocs=42306)
                0.125 = fieldNorm(doc=7051)
          0.32115793 = weight(abstract_txt:configuration in 7051) [ClassicSimilarity], result of:
            0.32115793 = score(doc=7051,freq=1.0), product of:
              0.32652542 = queryWeight, product of:
                2.6423893 = boost
                7.8684945 = idf(docFreq=43, maxDocs=42306)
                0.01570466 = queryNorm
              0.9835618 = fieldWeight in 7051, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8684945 = idf(docFreq=43, maxDocs=42306)
                0.125 = fieldNorm(doc=7051)
        0.16 = coord(4/25)