Document (#14178)

Author
MacFarlane, A.
Robertson, S.E.
McCann, J.A.
Title
Parallel computing for passage retrieval
Source
Aslib proceedings. 56(2004) no.4, S.201-211
Year
2004
Abstract
In this paper methods for both speeding up passage processing and examining more passages using parallel computers are explored. The number of passages processed are varied in order to examine the effect on retrieval effectiveness and efficiency. The particular algorithm applied has previously been used to good effect in Okapi experiments at TREC. This algorithm and the mechanism for applying parallel computing to speed up processing are described.
Theme
Retrievalalgorithmen
Object
Okapi
TREC

Similar documents (author)

  1. MacFarlane, A.; Robertson, S.E.; McCann, J.A.: Parallel computing in information retrieval : an updated review (1997) 4.22
    4.2223167 = sum of:
      4.2223167 = sum of:
        1.6064181 = weight(author_txt:robertson in 7450) [ClassicSimilarity], result of:
          1.6064181 = score(doc=7450,freq=1.0), product of:
            0.58562726 = queryWeight, product of:
              7.314861 = idf(docFreq=79, maxDocs=44218)
              0.08005993 = queryNorm
            2.7430727 = fieldWeight in 7450, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.314861 = idf(docFreq=79, maxDocs=44218)
              0.375 = fieldNorm(doc=7450)
        2.6158984 = weight(author_txt:macfarlane in 7450) [ClassicSimilarity], result of:
          2.6158984 = score(doc=7450,freq=1.0), product of:
            0.81058043 = queryWeight, product of:
              1.1764878 = boost
              8.6058445 = idf(docFreq=21, maxDocs=44218)
              0.08005993 = queryNorm
            3.2271917 = fieldWeight in 7450, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.6058445 = idf(docFreq=21, maxDocs=44218)
              0.375 = fieldNorm(doc=7450)
    
  2. MacFarlane, A.; McCann, J.A.; Robertson, S.E.: Parallel methods for the generation of partitioned inverted files (2005) 4.22
    4.2223167 = sum of:
      4.2223167 = sum of:
        1.6064181 = weight(author_txt:robertson in 651) [ClassicSimilarity], result of:
          1.6064181 = score(doc=651,freq=1.0), product of:
            0.58562726 = queryWeight, product of:
              7.314861 = idf(docFreq=79, maxDocs=44218)
              0.08005993 = queryNorm
            2.7430727 = fieldWeight in 651, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.314861 = idf(docFreq=79, maxDocs=44218)
              0.375 = fieldNorm(doc=651)
        2.6158984 = weight(author_txt:macfarlane in 651) [ClassicSimilarity], result of:
          2.6158984 = score(doc=651,freq=1.0), product of:
            0.81058043 = queryWeight, product of:
              1.1764878 = boost
              8.6058445 = idf(docFreq=21, maxDocs=44218)
              0.08005993 = queryNorm
            3.2271917 = fieldWeight in 651, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.6058445 = idf(docFreq=21, maxDocs=44218)
              0.375 = fieldNorm(doc=651)
    
  3. MacFarlane, A.; McCann, J.A.; Robertson, S.E.: Parallel methods for the update of partitioned inverted files (2007) 4.22
    4.2223167 = sum of:
      4.2223167 = sum of:
        1.6064181 = weight(author_txt:robertson in 819) [ClassicSimilarity], result of:
          1.6064181 = score(doc=819,freq=1.0), product of:
            0.58562726 = queryWeight, product of:
              7.314861 = idf(docFreq=79, maxDocs=44218)
              0.08005993 = queryNorm
            2.7430727 = fieldWeight in 819, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.314861 = idf(docFreq=79, maxDocs=44218)
              0.375 = fieldNorm(doc=819)
        2.6158984 = weight(author_txt:macfarlane in 819) [ClassicSimilarity], result of:
          2.6158984 = score(doc=819,freq=1.0), product of:
            0.81058043 = queryWeight, product of:
              1.1764878 = boost
              8.6058445 = idf(docFreq=21, maxDocs=44218)
              0.08005993 = queryNorm
            3.2271917 = fieldWeight in 819, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.6058445 = idf(docFreq=21, maxDocs=44218)
              0.375 = fieldNorm(doc=819)
    
  4. MacFarlane, A.: On open source IR (2003) 2.18
    2.1799152 = sum of:
      2.1799152 = product of:
        4.3598304 = sum of:
          4.3598304 = weight(author_txt:macfarlane in 2010) [ClassicSimilarity], result of:
            4.3598304 = score(doc=2010,freq=1.0), product of:
              0.81058043 = queryWeight, product of:
                1.1764878 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.08005993 = queryNorm
              5.3786526 = fieldWeight in 2010, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.625 = fieldNorm(doc=2010)
        0.5 = coord(1/2)
    
  5. MacFarlane, A.: Evaluation of web search for the information practitioner (2007) 2.18
    2.1799152 = sum of:
      2.1799152 = product of:
        4.3598304 = sum of:
          4.3598304 = weight(author_txt:macfarlane in 817) [ClassicSimilarity], result of:
            4.3598304 = score(doc=817,freq=1.0), product of:
              0.81058043 = queryWeight, product of:
                1.1764878 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.08005993 = queryNorm
              5.3786526 = fieldWeight in 817, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.625 = fieldNorm(doc=817)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Melucci, M.: Passage retrieval : a probabilistic technique (1998) 0.23
    0.2296204 = sum of:
      0.2296204 = product of:
        0.9567517 = sum of:
          0.032101396 = weight(abstract_txt:effectiveness in 1150) [ClassicSimilarity], result of:
            0.032101396 = score(doc=1150,freq=1.0), product of:
              0.08059384 = queryWeight, product of:
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.01580774 = queryNorm
              0.39831078 = fieldWeight in 1150, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.078125 = fieldNorm(doc=1150)
          0.03668692 = weight(abstract_txt:experiments in 1150) [ClassicSimilarity], result of:
            0.03668692 = score(doc=1150,freq=1.0), product of:
              0.08809678 = queryWeight, product of:
                1.0455122 = boost
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.01580774 = queryNorm
              0.41643882 = fieldWeight in 1150, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.078125 = fieldNorm(doc=1150)
          0.020331737 = weight(abstract_txt:retrieval in 1150) [ClassicSimilarity], result of:
            0.020331737 = score(doc=1150,freq=1.0), product of:
              0.074888 = queryWeight, product of:
                1.3632333 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.01580774 = queryNorm
              0.27149525 = fieldWeight in 1150, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=1150)
          0.08997489 = weight(abstract_txt:algorithm in 1150) [ClassicSimilarity], result of:
            0.08997489 = score(doc=1150,freq=1.0), product of:
              0.20185682 = queryWeight, product of:
                2.2381325 = boost
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.01580774 = queryNorm
              0.44573617 = fieldWeight in 1150, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.078125 = fieldNorm(doc=1150)
          0.38651854 = weight(abstract_txt:passage in 1150) [ClassicSimilarity], result of:
            0.38651854 = score(doc=1150,freq=2.0), product of:
              0.42338237 = queryWeight, product of:
                3.2413838 = boost
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.01580774 = queryNorm
              0.91293013 = fieldWeight in 1150, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.078125 = fieldNorm(doc=1150)
          0.3911382 = weight(abstract_txt:passages in 1150) [ClassicSimilarity], result of:
            0.3911382 = score(doc=1150,freq=2.0), product of:
              0.4267492 = queryWeight, product of:
                3.2542465 = boost
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.01580774 = queryNorm
              0.91655284 = fieldWeight in 1150, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.078125 = fieldNorm(doc=1150)
        0.24 = coord(6/25)
    
  2. Kaszkiel, M.; Zobel, J.: Effective ranking with arbitrary passages (2001) 0.18
    0.17971297 = sum of:
      0.17971297 = product of:
        0.8985648 = sum of:
          0.025681118 = weight(abstract_txt:effectiveness in 5764) [ClassicSimilarity], result of:
            0.025681118 = score(doc=5764,freq=1.0), product of:
              0.08059384 = queryWeight, product of:
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.01580774 = queryNorm
              0.31864864 = fieldWeight in 5764, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.0625 = fieldNorm(doc=5764)
          0.029349536 = weight(abstract_txt:experiments in 5764) [ClassicSimilarity], result of:
            0.029349536 = score(doc=5764,freq=1.0), product of:
              0.08809678 = queryWeight, product of:
                1.0455122 = boost
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.01580774 = queryNorm
              0.33315104 = fieldWeight in 5764, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.0625 = fieldNorm(doc=5764)
          0.023002733 = weight(abstract_txt:retrieval in 5764) [ClassicSimilarity], result of:
            0.023002733 = score(doc=5764,freq=2.0), product of:
              0.074888 = queryWeight, product of:
                1.3632333 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.01580774 = queryNorm
              0.3071618 = fieldWeight in 5764, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=5764)
          0.4372958 = weight(abstract_txt:passage in 5764) [ClassicSimilarity], result of:
            0.4372958 = score(doc=5764,freq=4.0), product of:
              0.42338237 = queryWeight, product of:
                3.2413838 = boost
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.01580774 = queryNorm
              1.0328625 = fieldWeight in 5764, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.0625 = fieldNorm(doc=5764)
          0.3832356 = weight(abstract_txt:passages in 5764) [ClassicSimilarity], result of:
            0.3832356 = score(doc=5764,freq=3.0), product of:
              0.4267492 = queryWeight, product of:
                3.2542465 = boost
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.01580774 = queryNorm
              0.89803475 = fieldWeight in 5764, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.0625 = fieldNorm(doc=5764)
        0.2 = coord(5/25)
    
  3. Mengle, S.; Goharian, N.: Passage detection using text classification (2009) 0.18
    0.17664775 = sum of:
      0.17664775 = product of:
        1.4720646 = sum of:
          0.0318242 = weight(abstract_txt:retrieval in 2765) [ClassicSimilarity], result of:
            0.0318242 = score(doc=2765,freq=5.0), product of:
              0.074888 = queryWeight, product of:
                1.3632333 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.01580774 = queryNorm
              0.4249573 = fieldWeight in 2765, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2765)
          0.7158423 = weight(abstract_txt:passage in 2765) [ClassicSimilarity], result of:
            0.7158423 = score(doc=2765,freq=14.0), product of:
              0.42338237 = queryWeight, product of:
                3.2413838 = boost
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.01580774 = queryNorm
              1.6907703 = fieldWeight in 2765, product of:
                3.7416575 = tf(freq=14.0), with freq of:
                  14.0 = termFreq=14.0
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2765)
          0.72439814 = weight(abstract_txt:passages in 2765) [ClassicSimilarity], result of:
            0.72439814 = score(doc=2765,freq=14.0), product of:
              0.4267492 = queryWeight, product of:
                3.2542465 = boost
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.01580774 = queryNorm
              1.6974797 = fieldWeight in 2765, product of:
                3.7416575 = tf(freq=14.0), with freq of:
                  14.0 = termFreq=14.0
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2765)
        0.12 = coord(3/25)
    
  4. MacFarlane, A.; McCann, J.A.; Robertson, S.E.: Parallel methods for the generation of partitioned inverted files (2005) 0.18
    0.1754856 = sum of:
      0.1754856 = product of:
        0.62673426 = sum of:
          0.0400833 = weight(abstract_txt:examine in 651) [ClassicSimilarity], result of:
            0.0400833 = score(doc=651,freq=2.0), product of:
              0.086071275 = queryWeight, product of:
                1.0334232 = boost
                5.268782 = idf(docFreq=618, maxDocs=44218)
                0.01580774 = queryNorm
              0.46569893 = fieldWeight in 651, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.268782 = idf(docFreq=618, maxDocs=44218)
                0.0625 = fieldNorm(doc=651)
          0.029349536 = weight(abstract_txt:experiments in 651) [ClassicSimilarity], result of:
            0.029349536 = score(doc=651,freq=1.0), product of:
              0.08809678 = queryWeight, product of:
                1.0455122 = boost
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.01580774 = queryNorm
              0.33315104 = fieldWeight in 651, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.0625 = fieldNorm(doc=651)
          0.043713596 = weight(abstract_txt:efficiency in 651) [ClassicSimilarity], result of:
            0.043713596 = score(doc=651,freq=1.0), product of:
              0.11489565 = queryWeight, product of:
                1.1939905 = boost
                6.087415 = idf(docFreq=272, maxDocs=44218)
                0.01580774 = queryNorm
              0.38046345 = fieldWeight in 651, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.087415 = idf(docFreq=272, maxDocs=44218)
                0.0625 = fieldNorm(doc=651)
          0.07846486 = weight(abstract_txt:speed in 651) [ClassicSimilarity], result of:
            0.07846486 = score(doc=651,freq=2.0), product of:
              0.13468918 = queryWeight, product of:
                1.2927526 = boost
                6.590942 = idf(docFreq=164, maxDocs=44218)
                0.01580774 = queryNorm
              0.58256245 = fieldWeight in 651, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.590942 = idf(docFreq=164, maxDocs=44218)
                0.0625 = fieldNorm(doc=651)
          0.023002733 = weight(abstract_txt:retrieval in 651) [ClassicSimilarity], result of:
            0.023002733 = score(doc=651,freq=2.0), product of:
              0.074888 = queryWeight, product of:
                1.3632333 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.01580774 = queryNorm
              0.3071618 = fieldWeight in 651, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=651)
          0.1464626 = weight(abstract_txt:computing in 651) [ClassicSimilarity], result of:
            0.1464626 = score(doc=651,freq=3.0), product of:
              0.2247398 = queryWeight, product of:
                2.3615878 = boost
                6.0201335 = idf(docFreq=291, maxDocs=44218)
                0.01580774 = queryNorm
              0.6516985 = fieldWeight in 651, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.0201335 = idf(docFreq=291, maxDocs=44218)
                0.0625 = fieldNorm(doc=651)
          0.2656576 = weight(abstract_txt:parallel in 651) [ClassicSimilarity], result of:
            0.2656576 = score(doc=651,freq=3.0), product of:
              0.38262564 = queryWeight, product of:
                3.773955 = boost
                6.4136834 = idf(docFreq=196, maxDocs=44218)
                0.01580774 = queryNorm
              0.6943016 = fieldWeight in 651, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.4136834 = idf(docFreq=196, maxDocs=44218)
                0.0625 = fieldNorm(doc=651)
        0.28 = coord(7/25)
    
  5. MacFarlane, A.; Robertson, S.E.; McCann, J.A.: Parallel computing in information retrieval : an updated review (1997) 0.17
    0.17190817 = sum of:
      0.17190817 = product of:
        1.074426 = sum of:
          0.05634496 = weight(abstract_txt:retrieval in 7450) [ClassicSimilarity], result of:
            0.05634496 = score(doc=7450,freq=3.0), product of:
              0.074888 = queryWeight, product of:
                1.3632333 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.01580774 = queryNorm
              0.7523897 = fieldWeight in 7450, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.125 = fieldNorm(doc=7450)
          0.09298369 = weight(abstract_txt:processing in 7450) [ClassicSimilarity], result of:
            0.09298369 = score(doc=7450,freq=1.0), product of:
              0.15082978 = queryWeight, product of:
                1.9346733 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.01580774 = queryNorm
              0.616481 = fieldWeight in 7450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.125 = fieldNorm(doc=7450)
          0.23917243 = weight(abstract_txt:computing in 7450) [ClassicSimilarity], result of:
            0.23917243 = score(doc=7450,freq=2.0), product of:
              0.2247398 = queryWeight, product of:
                2.3615878 = boost
                6.0201335 = idf(docFreq=291, maxDocs=44218)
                0.01580774 = queryNorm
              1.0642192 = fieldWeight in 7450, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.0201335 = idf(docFreq=291, maxDocs=44218)
                0.125 = fieldNorm(doc=7450)
          0.68592495 = weight(abstract_txt:parallel in 7450) [ClassicSimilarity], result of:
            0.68592495 = score(doc=7450,freq=5.0), product of:
              0.38262564 = queryWeight, product of:
                3.773955 = boost
                6.4136834 = idf(docFreq=196, maxDocs=44218)
                0.01580774 = queryNorm
              1.7926791 = fieldWeight in 7450, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.4136834 = idf(docFreq=196, maxDocs=44218)
                0.125 = fieldNorm(doc=7450)
        0.16 = coord(4/25)