Document (#14178)

Author
MacFarlane, A.
Robertson, S.E.
McCann, J.A.
Title
Parallel computing for passage retrieval
Source
Aslib proceedings. 56(2004) no.4, S.201-211
Year
2004
Abstract
In this paper methods for both speeding up passage processing and examining more passages using parallel computers are explored. The number of passages processed are varied in order to examine the effect on retrieval effectiveness and efficiency. The particular algorithm applied has previously been used to good effect in Okapi experiments at TREC. This algorithm and the mechanism for applying parallel computing to speed up processing are described.
Theme
Retrievalalgorithmen
Object
Okapi
TREC

Similar documents (author)

  1. MacFarlane, A.; Robertson, S.E.; McCann, J.A.: Parallel computing in information retrieval : an updated review (1997) 4.26
    4.259507 = sum of:
      4.259507 = sum of:
        1.5542746 = weight(author_txt:robertson in 520) [ClassicSimilarity], result of:
          1.5542746 = score(doc=520,freq=1.0), product of:
            0.5685451 = queryWeight, product of:
              7.2900677 = idf(docFreq=78, maxDocs=42596)
              0.077989005 = queryNorm
            2.7337754 = fieldWeight in 520, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.2900677 = idf(docFreq=78, maxDocs=42596)
              0.375 = fieldNorm(doc=520)
        2.7052329 = weight(author_txt:macfarlane in 520) [ClassicSimilarity], result of:
          2.7052329 = score(doc=520,freq=1.0), product of:
            0.82265204 = queryWeight, product of:
              1.2028892 = boost
              8.769144 = idf(docFreq=17, maxDocs=42596)
              0.077989005 = queryNorm
            3.288429 = fieldWeight in 520, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.769144 = idf(docFreq=17, maxDocs=42596)
              0.375 = fieldNorm(doc=520)
    
  2. MacFarlane, A.; McCann, J.A.; Robertson, S.E.: Parallel methods for the generation of partitioned inverted files (2005) 4.26
    4.259507 = sum of:
      4.259507 = sum of:
        1.5542746 = weight(author_txt:robertson in 956) [ClassicSimilarity], result of:
          1.5542746 = score(doc=956,freq=1.0), product of:
            0.5685451 = queryWeight, product of:
              7.2900677 = idf(docFreq=78, maxDocs=42596)
              0.077989005 = queryNorm
            2.7337754 = fieldWeight in 956, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.2900677 = idf(docFreq=78, maxDocs=42596)
              0.375 = fieldNorm(doc=956)
        2.7052329 = weight(author_txt:macfarlane in 956) [ClassicSimilarity], result of:
          2.7052329 = score(doc=956,freq=1.0), product of:
            0.82265204 = queryWeight, product of:
              1.2028892 = boost
              8.769144 = idf(docFreq=17, maxDocs=42596)
              0.077989005 = queryNorm
            3.288429 = fieldWeight in 956, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.769144 = idf(docFreq=17, maxDocs=42596)
              0.375 = fieldNorm(doc=956)
    
  3. MacFarlane, A.; McCann, J.A.; Robertson, S.E.: Parallel methods for the update of partitioned inverted files (2007) 4.26
    4.259507 = sum of:
      4.259507 = sum of:
        1.5542746 = weight(author_txt:robertson in 1999) [ClassicSimilarity], result of:
          1.5542746 = score(doc=1999,freq=1.0), product of:
            0.5685451 = queryWeight, product of:
              7.2900677 = idf(docFreq=78, maxDocs=42596)
              0.077989005 = queryNorm
            2.7337754 = fieldWeight in 1999, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.2900677 = idf(docFreq=78, maxDocs=42596)
              0.375 = fieldNorm(doc=1999)
        2.7052329 = weight(author_txt:macfarlane in 1999) [ClassicSimilarity], result of:
          2.7052329 = score(doc=1999,freq=1.0), product of:
            0.82265204 = queryWeight, product of:
              1.2028892 = boost
              8.769144 = idf(docFreq=17, maxDocs=42596)
              0.077989005 = queryNorm
            3.288429 = fieldWeight in 1999, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.769144 = idf(docFreq=17, maxDocs=42596)
              0.375 = fieldNorm(doc=1999)
    
  4. MacFarlane, A.: On open source IR (2003) 2.25
    2.2543607 = sum of:
      2.2543607 = product of:
        4.5087214 = sum of:
          4.5087214 = weight(author_txt:macfarlane in 3011) [ClassicSimilarity], result of:
            4.5087214 = score(doc=3011,freq=1.0), product of:
              0.82265204 = queryWeight, product of:
                1.2028892 = boost
                8.769144 = idf(docFreq=17, maxDocs=42596)
                0.077989005 = queryNorm
              5.480715 = fieldWeight in 3011, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.769144 = idf(docFreq=17, maxDocs=42596)
                0.625 = fieldNorm(doc=3011)
        0.5 = coord(1/2)
    
  5. MacFarlane, A.: Evaluation of web search for the information practitioner (2007) 2.25
    2.2543607 = sum of:
      2.2543607 = product of:
        4.5087214 = sum of:
          4.5087214 = weight(author_txt:macfarlane in 1997) [ClassicSimilarity], result of:
            4.5087214 = score(doc=1997,freq=1.0), product of:
              0.82265204 = queryWeight, product of:
                1.2028892 = boost
                8.769144 = idf(docFreq=17, maxDocs=42596)
                0.077989005 = queryNorm
              5.480715 = fieldWeight in 1997, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.769144 = idf(docFreq=17, maxDocs=42596)
                0.625 = fieldNorm(doc=1997)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Melucci, M.: Passage retrieval : a probabilistic technique (1998) 0.23
    0.22884864 = sum of:
      0.22884864 = product of:
        0.95353603 = sum of:
          0.032483548 = weight(abstract_txt:effectiveness in 2151) [ClassicSimilarity], result of:
            0.032483548 = score(doc=2151,freq=1.0), product of:
              0.08125579 = queryWeight, product of:
                5.1170435 = idf(docFreq=693, maxDocs=42596)
                0.015879441 = queryNorm
              0.399769 = fieldWeight in 2151, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1170435 = idf(docFreq=693, maxDocs=42596)
                0.078125 = fieldNorm(doc=2151)
          0.037461188 = weight(abstract_txt:experiments in 2151) [ClassicSimilarity], result of:
            0.037461188 = score(doc=2151,freq=1.0), product of:
              0.08935792 = queryWeight, product of:
                1.0486712 = boost
                5.3660965 = idf(docFreq=540, maxDocs=42596)
                0.015879441 = queryNorm
              0.4192263 = fieldWeight in 2151, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3660965 = idf(docFreq=540, maxDocs=42596)
                0.078125 = fieldNorm(doc=2151)
          0.020133512 = weight(abstract_txt:retrieval in 2151) [ClassicSimilarity], result of:
            0.020133512 = score(doc=2151,freq=1.0), product of:
              0.07442206 = queryWeight, product of:
                1.353439 = boost
                3.4628031 = idf(docFreq=3628, maxDocs=42596)
                0.015879441 = queryNorm
              0.2705315 = fieldWeight in 2151, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4628031 = idf(docFreq=3628, maxDocs=42596)
                0.078125 = fieldNorm(doc=2151)
          0.090839565 = weight(abstract_txt:algorithm in 2151) [ClassicSimilarity], result of:
            0.090839565 = score(doc=2151,freq=1.0), product of:
              0.20320703 = queryWeight, product of:
                2.2364397 = boost
                5.7219796 = idf(docFreq=378, maxDocs=42596)
                0.015879441 = queryNorm
              0.44702965 = fieldWeight in 2151, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7219796 = idf(docFreq=378, maxDocs=42596)
                0.078125 = fieldNorm(doc=2151)
          0.3816301 = weight(abstract_txt:passage in 2151) [ClassicSimilarity], result of:
            0.3816301 = score(doc=2151,freq=2.0), product of:
              0.41992697 = queryWeight, product of:
                3.2149537 = boost
                8.225529 = idf(docFreq=30, maxDocs=42596)
                0.015879441 = queryNorm
              0.9088011 = fieldWeight in 2151, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.225529 = idf(docFreq=30, maxDocs=42596)
                0.078125 = fieldNorm(doc=2151)
          0.39098814 = weight(abstract_txt:passages in 2151) [ClassicSimilarity], result of:
            0.39098814 = score(doc=2151,freq=2.0), product of:
              0.42676398 = queryWeight, product of:
                3.24102 = boost
                8.29222 = idf(docFreq=28, maxDocs=42596)
                0.015879441 = queryNorm
              0.9161695 = fieldWeight in 2151, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.29222 = idf(docFreq=28, maxDocs=42596)
                0.078125 = fieldNorm(doc=2151)
        0.24 = coord(6/25)
    
  2. Kaszkiel, M.; Zobel, J.: Effective ranking with arbitrary passages (2001) 0.18
    0.1787176 = sum of:
      0.1787176 = product of:
        0.893588 = sum of:
          0.02598684 = weight(abstract_txt:effectiveness in 6765) [ClassicSimilarity], result of:
            0.02598684 = score(doc=6765,freq=1.0), product of:
              0.08125579 = queryWeight, product of:
                5.1170435 = idf(docFreq=693, maxDocs=42596)
                0.015879441 = queryNorm
              0.31981522 = fieldWeight in 6765, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1170435 = idf(docFreq=693, maxDocs=42596)
                0.0625 = fieldNorm(doc=6765)
          0.02996895 = weight(abstract_txt:experiments in 6765) [ClassicSimilarity], result of:
            0.02996895 = score(doc=6765,freq=1.0), product of:
              0.08935792 = queryWeight, product of:
                1.0486712 = boost
                5.3660965 = idf(docFreq=540, maxDocs=42596)
                0.015879441 = queryNorm
              0.33538103 = fieldWeight in 6765, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3660965 = idf(docFreq=540, maxDocs=42596)
                0.0625 = fieldNorm(doc=6765)
          0.022778466 = weight(abstract_txt:retrieval in 6765) [ClassicSimilarity], result of:
            0.022778466 = score(doc=6765,freq=2.0), product of:
              0.07442206 = queryWeight, product of:
                1.353439 = boost
                3.4628031 = idf(docFreq=3628, maxDocs=42596)
                0.015879441 = queryNorm
              0.30607143 = fieldWeight in 6765, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4628031 = idf(docFreq=3628, maxDocs=42596)
                0.0625 = fieldNorm(doc=6765)
          0.43176517 = weight(abstract_txt:passage in 6765) [ClassicSimilarity], result of:
            0.43176517 = score(doc=6765,freq=4.0), product of:
              0.41992697 = queryWeight, product of:
                3.2149537 = boost
                8.225529 = idf(docFreq=30, maxDocs=42596)
                0.015879441 = queryNorm
              1.0281911 = fieldWeight in 6765, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.225529 = idf(docFreq=30, maxDocs=42596)
                0.0625 = fieldNorm(doc=6765)
          0.3830886 = weight(abstract_txt:passages in 6765) [ClassicSimilarity], result of:
            0.3830886 = score(doc=6765,freq=3.0), product of:
              0.42676398 = queryWeight, product of:
                3.24102 = boost
                8.29222 = idf(docFreq=28, maxDocs=42596)
                0.015879441 = queryNorm
              0.8976591 = fieldWeight in 6765, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.29222 = idf(docFreq=28, maxDocs=42596)
                0.0625 = fieldNorm(doc=6765)
        0.2 = coord(5/25)
    
  3. MacFarlane, A.; McCann, J.A.; Robertson, S.E.: Parallel methods for the generation of partitioned inverted files (2005) 0.18
    0.17579408 = sum of:
      0.17579408 = product of:
        0.627836 = sum of:
          0.041486535 = weight(abstract_txt:examine in 956) [ClassicSimilarity], result of:
            0.041486535 = score(doc=956,freq=2.0), product of:
              0.0880941 = queryWeight, product of:
                1.041229 = boost
                5.328014 = idf(docFreq=561, maxDocs=42596)
                0.015879441 = queryNorm
              0.47093433 = fieldWeight in 956, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.328014 = idf(docFreq=561, maxDocs=42596)
                0.0625 = fieldNorm(doc=956)
          0.02996895 = weight(abstract_txt:experiments in 956) [ClassicSimilarity], result of:
            0.02996895 = score(doc=956,freq=1.0), product of:
              0.08935792 = queryWeight, product of:
                1.0486712 = boost
                5.3660965 = idf(docFreq=540, maxDocs=42596)
                0.015879441 = queryNorm
              0.33538103 = fieldWeight in 956, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3660965 = idf(docFreq=540, maxDocs=42596)
                0.0625 = fieldNorm(doc=956)
          0.04441997 = weight(abstract_txt:efficiency in 956) [ClassicSimilarity], result of:
            0.04441997 = score(doc=956,freq=1.0), product of:
              0.11616381 = queryWeight, product of:
                1.1956615 = boost
                6.1182523 = idf(docFreq=254, maxDocs=42596)
                0.015879441 = queryNorm
              0.38239077 = fieldWeight in 956, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1182523 = idf(docFreq=254, maxDocs=42596)
                0.0625 = fieldNorm(doc=956)
          0.07807549 = weight(abstract_txt:speed in 956) [ClassicSimilarity], result of:
            0.07807549 = score(doc=956,freq=2.0), product of:
              0.13428222 = queryWeight, product of:
                1.2855296 = boost
                6.578111 = idf(docFreq=160, maxDocs=42596)
                0.015879441 = queryNorm
              0.58142835 = fieldWeight in 956, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.578111 = idf(docFreq=160, maxDocs=42596)
                0.0625 = fieldNorm(doc=956)
          0.022778466 = weight(abstract_txt:retrieval in 956) [ClassicSimilarity], result of:
            0.022778466 = score(doc=956,freq=2.0), product of:
              0.07442206 = queryWeight, product of:
                1.353439 = boost
                3.4628031 = idf(docFreq=3628, maxDocs=42596)
                0.015879441 = queryNorm
              0.30607143 = fieldWeight in 956, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4628031 = idf(docFreq=3628, maxDocs=42596)
                0.0625 = fieldNorm(doc=956)
          0.14666533 = weight(abstract_txt:computing in 956) [ClassicSimilarity], result of:
            0.14666533 = score(doc=956,freq=3.0), product of:
              0.22501247 = queryWeight, product of:
                2.353375 = boost
                6.021161 = idf(docFreq=280, maxDocs=42596)
                0.015879441 = queryNorm
              0.6518098 = fieldWeight in 956, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.021161 = idf(docFreq=280, maxDocs=42596)
                0.0625 = fieldNorm(doc=956)
          0.26444128 = weight(abstract_txt:parallel in 956) [ClassicSimilarity], result of:
            0.26444128 = score(doc=956,freq=3.0), product of:
              0.38156763 = queryWeight, product of:
                3.7533514 = boost
                6.4020205 = idf(docFreq=191, maxDocs=42596)
                0.015879441 = queryNorm
              0.69303906 = fieldWeight in 956, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.4020205 = idf(docFreq=191, maxDocs=42596)
                0.0625 = fieldNorm(doc=956)
        0.28 = coord(7/25)
    
  4. Mengle, S.; Goharian, N.: Passage detection using text classification (2009) 0.18
    0.17549077 = sum of:
      0.17549077 = product of:
        1.4624231 = sum of:
          0.03151393 = weight(abstract_txt:retrieval in 3945) [ClassicSimilarity], result of:
            0.03151393 = score(doc=3945,freq=5.0), product of:
              0.07442206 = queryWeight, product of:
                1.353439 = boost
                3.4628031 = idf(docFreq=3628, maxDocs=42596)
                0.015879441 = queryNorm
              0.42344877 = fieldWeight in 3945, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4628031 = idf(docFreq=3628, maxDocs=42596)
                0.0546875 = fieldNorm(doc=3945)
          0.70678884 = weight(abstract_txt:passage in 3945) [ClassicSimilarity], result of:
            0.70678884 = score(doc=3945,freq=14.0), product of:
              0.41992697 = queryWeight, product of:
                3.2149537 = boost
                8.225529 = idf(docFreq=30, maxDocs=42596)
                0.015879441 = queryNorm
              1.6831232 = fieldWeight in 3945, product of:
                3.7416575 = tf(freq=14.0), with freq of:
                  14.0 = termFreq=14.0
                8.225529 = idf(docFreq=30, maxDocs=42596)
                0.0546875 = fieldNorm(doc=3945)
          0.72412026 = weight(abstract_txt:passages in 3945) [ClassicSimilarity], result of:
            0.72412026 = score(doc=3945,freq=14.0), product of:
              0.42676398 = queryWeight, product of:
                3.24102 = boost
                8.29222 = idf(docFreq=28, maxDocs=42596)
                0.015879441 = queryNorm
              1.6967698 = fieldWeight in 3945, product of:
                3.7416575 = tf(freq=14.0), with freq of:
                  14.0 = termFreq=14.0
                8.29222 = idf(docFreq=28, maxDocs=42596)
                0.0546875 = fieldNorm(doc=3945)
        0.12 = coord(3/25)
    
  5. MacFarlane, A.; Robertson, S.E.; McCann, J.A.: Parallel computing in information retrieval : an updated review (1997) 0.17
    0.17155068 = sum of:
      0.17155068 = product of:
        1.0721917 = sum of:
          0.05579562 = weight(abstract_txt:retrieval in 520) [ClassicSimilarity], result of:
            0.05579562 = score(doc=520,freq=3.0), product of:
              0.07442206 = queryWeight, product of:
                1.353439 = boost
                3.4628031 = idf(docFreq=3628, maxDocs=42596)
                0.015879441 = queryNorm
              0.74971884 = fieldWeight in 520, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4628031 = idf(docFreq=3628, maxDocs=42596)
                0.125 = fieldNorm(doc=520)
          0.09410819 = weight(abstract_txt:processing in 520) [ClassicSimilarity], result of:
            0.09410819 = score(doc=520,freq=1.0), product of:
              0.15208754 = queryWeight, product of:
                1.9347936 = boost
                4.9502115 = idf(docFreq=819, maxDocs=42596)
                0.015879441 = queryNorm
              0.61877644 = fieldWeight in 520, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9502115 = idf(docFreq=819, maxDocs=42596)
                0.125 = fieldNorm(doc=520)
          0.23950347 = weight(abstract_txt:computing in 520) [ClassicSimilarity], result of:
            0.23950347 = score(doc=520,freq=2.0), product of:
              0.22501247 = queryWeight, product of:
                2.353375 = boost
                6.021161 = idf(docFreq=280, maxDocs=42596)
                0.015879441 = queryNorm
              1.0644009 = fieldWeight in 520, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.021161 = idf(docFreq=280, maxDocs=42596)
                0.125 = fieldNorm(doc=520)
          0.68278444 = weight(abstract_txt:parallel in 520) [ClassicSimilarity], result of:
            0.68278444 = score(doc=520,freq=5.0), product of:
              0.38156763 = queryWeight, product of:
                3.7533514 = boost
                6.4020205 = idf(docFreq=191, maxDocs=42596)
                0.015879441 = queryNorm
              1.7894192 = fieldWeight in 520, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.4020205 = idf(docFreq=191, maxDocs=42596)
                0.125 = fieldNorm(doc=520)
        0.16 = coord(4/25)