Document (#14178)

Author
MacFarlane, A.
Robertson, S.E.
McCann, J.A.
Title
Parallel computing for passage retrieval
Source
Aslib proceedings. 56(2004) no.4, pp. 201-211
Year
2004
Abstract
In this paper, methods for both speeding up passage processing and examining more passages using parallel computers are explored. The number of passages processed is varied in order to examine the effect on retrieval effectiveness and efficiency. The particular algorithm applied has previously been used to good effect in Okapi experiments at TREC. This algorithm and the mechanism for applying parallel computing to speed up processing are described.
Theme
Retrieval algorithms
Object
Okapi
TREC

Similar documents (author)

  1. MacFarlane, A.; Robertson, S.E.; McCann, J.A.: Parallel computing in information retrieval : an updated review (1997) 4.26
    4.2558875 = sum of:
      4.2558875 = sum of:
        1.5524857 = weight(author_txt:robertson in 520) [ClassicSimilarity], result of:
          1.5524857 = score(doc=520,freq=1.0), product of:
            0.5684234 = queryWeight, product of:
              7.2832365 = idf(docFreq=78, maxDocs=42306)
              0.078045435 = queryNorm
            2.7312136 = fieldWeight in 520, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.2832365 = idf(docFreq=78, maxDocs=42306)
              0.375 = fieldNorm(doc=520)
        2.7034018 = weight(author_txt:macfarlane in 520) [ClassicSimilarity], result of:
          2.7034018 = score(doc=520,freq=1.0), product of:
            0.8227362 = queryWeight, product of:
              1.2030796 = boost
              8.762313 = idf(docFreq=17, maxDocs=42306)
              0.078045435 = queryNorm
            3.2858672 = fieldWeight in 520, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.762313 = idf(docFreq=17, maxDocs=42306)
              0.375 = fieldNorm(doc=520)
    
  2. MacFarlane, A.; McCann, J.A.; Robertson, S.E.: Parallel methods for the generation of partitioned inverted files (2005) 4.26
    4.2558875 = sum of:
      4.2558875 = sum of:
        1.5524857 = weight(author_txt:robertson in 1777) [ClassicSimilarity], result of:
          1.5524857 = score(doc=1777,freq=1.0), product of:
            0.5684234 = queryWeight, product of:
              7.2832365 = idf(docFreq=78, maxDocs=42306)
              0.078045435 = queryNorm
            2.7312136 = fieldWeight in 1777, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.2832365 = idf(docFreq=78, maxDocs=42306)
              0.375 = fieldNorm(doc=1777)
        2.7034018 = weight(author_txt:macfarlane in 1777) [ClassicSimilarity], result of:
          2.7034018 = score(doc=1777,freq=1.0), product of:
            0.8227362 = queryWeight, product of:
              1.2030796 = boost
              8.762313 = idf(docFreq=17, maxDocs=42306)
              0.078045435 = queryNorm
            3.2858672 = fieldWeight in 1777, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.762313 = idf(docFreq=17, maxDocs=42306)
              0.375 = fieldNorm(doc=1777)
    
  3. MacFarlane, A.; McCann, J.A.; Robertson, S.E.: Parallel methods for the update of partitioned inverted files (2007) 4.26
    4.2558875 = sum of:
      4.2558875 = sum of:
        1.5524857 = weight(author_txt:robertson in 2820) [ClassicSimilarity], result of:
          1.5524857 = score(doc=2820,freq=1.0), product of:
            0.5684234 = queryWeight, product of:
              7.2832365 = idf(docFreq=78, maxDocs=42306)
              0.078045435 = queryNorm
            2.7312136 = fieldWeight in 2820, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.2832365 = idf(docFreq=78, maxDocs=42306)
              0.375 = fieldNorm(doc=2820)
        2.7034018 = weight(author_txt:macfarlane in 2820) [ClassicSimilarity], result of:
          2.7034018 = score(doc=2820,freq=1.0), product of:
            0.8227362 = queryWeight, product of:
              1.2030796 = boost
              8.762313 = idf(docFreq=17, maxDocs=42306)
              0.078045435 = queryNorm
            3.2858672 = fieldWeight in 2820, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.762313 = idf(docFreq=17, maxDocs=42306)
              0.375 = fieldNorm(doc=2820)
    
  4. MacFarlane, A.: On open source IR (2003) 2.25
    2.252835 = sum of:
      2.252835 = product of:
        4.50567 = sum of:
          4.50567 = weight(author_txt:macfarlane in 3011) [ClassicSimilarity], result of:
            4.50567 = score(doc=3011,freq=1.0), product of:
              0.8227362 = queryWeight, product of:
                1.2030796 = boost
                8.762313 = idf(docFreq=17, maxDocs=42306)
                0.078045435 = queryNorm
              5.4764457 = fieldWeight in 3011, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.762313 = idf(docFreq=17, maxDocs=42306)
                0.625 = fieldNorm(doc=3011)
        0.5 = coord(1/2)
    
  5. MacFarlane, A.: Evaluation of web search for the information practitioner (2007) 2.25
    2.252835 = sum of:
      2.252835 = product of:
        4.50567 = sum of:
          4.50567 = weight(author_txt:macfarlane in 2818) [ClassicSimilarity], result of:
            4.50567 = score(doc=2818,freq=1.0), product of:
              0.8227362 = queryWeight, product of:
                1.2030796 = boost
                8.762313 = idf(docFreq=17, maxDocs=42306)
                0.078045435 = queryNorm
              5.4764457 = fieldWeight in 2818, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.762313 = idf(docFreq=17, maxDocs=42306)
                0.625 = fieldNorm(doc=2818)
        0.5 = coord(1/2)
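
The arithmetic in the author listing above can be checked with a short sketch. It assumes that tf is the square root of the term frequency and that idf is 1 + ln(maxDocs / (docFreq + 1)); both forms are inferred here because they reproduce the tf and idf values printed in the listing, not taken from the catalog's configuration. The function and constant names are illustrative only. The listing reports 4.2558875 for entry 1 and 2.252835 for entry 4.

```python
import math


def idf(doc_freq: int, max_docs: int) -> float:
    # Assumed form: it reproduces the idf values printed in the listing
    # (e.g. docFreq=17, maxDocs=42306 gives roughly 8.7623).
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    # score = queryWeight * fieldWeight, as in each "product of" node.
    i = idf(doc_freq, max_docs)
    query_weight = boost * i * query_norm            # boost * idf * queryNorm
    field_weight = math.sqrt(freq) * i * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight


# Values copied from the listing.
MAX_DOCS = 42306
QUERY_NORM = 0.078045435
MACFARLANE_BOOST = 1.2030796

# Entry 1 (doc 520): both author terms match, fieldNorm = 0.375.
entry_1 = (
    term_score(1.0, 78, MAX_DOCS, QUERY_NORM, 0.375)  # robertson
    + term_score(1.0, 17, MAX_DOCS, QUERY_NORM, 0.375, boost=MACFARLANE_BOOST)
)

# Entry 4 (doc 3011): only "macfarlane" matches, so the sum is scaled by coord(1/2).
entry_4 = term_score(
    1.0, 17, MAX_DOCS, QUERY_NORM, 0.625, boost=MACFARLANE_BOOST
) * (1 / 2)

print(round(entry_1, 4), round(entry_4, 4))
```

The same two steps (sum the per-term products, then multiply by the coord factor where one is shown) apply to the content-based listing, where coord is the number of matching query terms out of 25.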
    

Similar documents (content)

  1. Melucci, M.: Passage retrieval : a probabilistic technique (1998) 0.23
    0.22849682 = sum of:
      0.22849682 = product of:
        0.9520701 = sum of:
          0.03254495 = weight(abstract_txt:effectiveness in 2151) [ClassicSimilarity], result of:
            0.03254495 = score(doc=2151,freq=1.0), product of:
              0.08135681 = queryWeight, product of:
                5.12035 = idf(docFreq=686, maxDocs=42306)
                0.015888916 = queryNorm
              0.40002733 = fieldWeight in 2151, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.12035 = idf(docFreq=686, maxDocs=42306)
                0.078125 = fieldNorm(doc=2151)
          0.03758916 = weight(abstract_txt:experiments in 2151) [ClassicSimilarity], result of:
            0.03758916 = score(doc=2151,freq=1.0), product of:
              0.08955983 = queryWeight, product of:
                1.0492034 = boost
                5.372288 = idf(docFreq=533, maxDocs=42306)
                0.015888916 = queryNorm
              0.41971 = fieldWeight in 2151, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.372288 = idf(docFreq=533, maxDocs=42306)
                0.078125 = fieldNorm(doc=2151)
          0.0201291 = weight(abstract_txt:retrieval in 2151) [ClassicSimilarity], result of:
            0.0201291 = score(doc=2151,freq=1.0), product of:
              0.07440996 = queryWeight, product of:
                1.3524885 = boost
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.015888916 = queryNorm
              0.2705162 = fieldWeight in 2151, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.078125 = fieldNorm(doc=2151)
          0.09114254 = weight(abstract_txt:algorithm in 2151) [ClassicSimilarity], result of:
            0.09114254 = score(doc=2151,freq=1.0), product of:
              0.20365526 = queryWeight, product of:
                2.2375145 = boost
                5.7284284 = idf(docFreq=373, maxDocs=42306)
                0.015888916 = queryNorm
              0.44753346 = fieldWeight in 2151, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7284284 = idf(docFreq=373, maxDocs=42306)
                0.078125 = fieldNorm(doc=2151)
          0.3806611 = weight(abstract_txt:passage in 2151) [ClassicSimilarity], result of:
            0.3806611 = score(doc=2151,freq=2.0), product of:
              0.41920894 = queryWeight, product of:
                3.210209 = boost
                8.218697 = idf(docFreq=30, maxDocs=42306)
                0.015888916 = queryNorm
              0.90804625 = fieldWeight in 2151, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.218697 = idf(docFreq=30, maxDocs=42306)
                0.078125 = fieldNorm(doc=2151)
          0.39000323 = weight(abstract_txt:passages in 2151) [ClassicSimilarity], result of:
            0.39000323 = score(doc=2151,freq=2.0), product of:
              0.42603996 = queryWeight, product of:
                3.2362585 = boost
                8.285388 = idf(docFreq=28, maxDocs=42306)
                0.015888916 = queryNorm
              0.9154147 = fieldWeight in 2151, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.285388 = idf(docFreq=28, maxDocs=42306)
                0.078125 = fieldNorm(doc=2151)
        0.24 = coord(6/25)
    
  2. Kaszkiel, M.; Zobel, J.: Effective ranking with arbitrary passages (2001) 0.18
    0.17833464 = sum of:
      0.17833464 = product of:
        0.8916732 = sum of:
          0.026035957 = weight(abstract_txt:effectiveness in 765) [ClassicSimilarity], result of:
            0.026035957 = score(doc=765,freq=1.0), product of:
              0.08135681 = queryWeight, product of:
                5.12035 = idf(docFreq=686, maxDocs=42306)
                0.015888916 = queryNorm
              0.32002187 = fieldWeight in 765, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.12035 = idf(docFreq=686, maxDocs=42306)
                0.0625 = fieldNorm(doc=765)
          0.030071326 = weight(abstract_txt:experiments in 765) [ClassicSimilarity], result of:
            0.030071326 = score(doc=765,freq=1.0), product of:
              0.08955983 = queryWeight, product of:
                1.0492034 = boost
                5.372288 = idf(docFreq=533, maxDocs=42306)
                0.015888916 = queryNorm
              0.335768 = fieldWeight in 765, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.372288 = idf(docFreq=533, maxDocs=42306)
                0.0625 = fieldNorm(doc=765)
          0.022773474 = weight(abstract_txt:retrieval in 765) [ClassicSimilarity], result of:
            0.022773474 = score(doc=765,freq=2.0), product of:
              0.07440996 = queryWeight, product of:
                1.3524885 = boost
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.015888916 = queryNorm
              0.30605412 = fieldWeight in 765, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.0625 = fieldNorm(doc=765)
          0.4306689 = weight(abstract_txt:passage in 765) [ClassicSimilarity], result of:
            0.4306689 = score(doc=765,freq=4.0), product of:
              0.41920894 = queryWeight, product of:
                3.210209 = boost
                8.218697 = idf(docFreq=30, maxDocs=42306)
                0.015888916 = queryNorm
              1.0273371 = fieldWeight in 765, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.218697 = idf(docFreq=30, maxDocs=42306)
                0.0625 = fieldNorm(doc=765)
          0.38212356 = weight(abstract_txt:passages in 765) [ClassicSimilarity], result of:
            0.38212356 = score(doc=765,freq=3.0), product of:
              0.42603996 = queryWeight, product of:
                3.2362585 = boost
                8.285388 = idf(docFreq=28, maxDocs=42306)
                0.015888916 = queryNorm
              0.89691955 = fieldWeight in 765, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.285388 = idf(docFreq=28, maxDocs=42306)
                0.0625 = fieldNorm(doc=765)
        0.2 = coord(5/25)
    
  3. MacFarlane, A.; McCann, J.A.; Robertson, S.E.: Parallel methods for the generation of partitioned inverted files (2005) 0.18
    0.17618808 = sum of:
      0.17618808 = product of:
        0.62924314 = sum of:
          0.042175114 = weight(abstract_txt:examine in 1777) [ClassicSimilarity], result of:
            0.042175114 = score(doc=1777,freq=2.0), product of:
              0.089064725 = queryWeight, product of:
                1.0462992 = boost
                5.357418 = idf(docFreq=541, maxDocs=42306)
                0.015888916 = queryNorm
              0.47353333 = fieldWeight in 1777, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.357418 = idf(docFreq=541, maxDocs=42306)
                0.0625 = fieldNorm(doc=1777)
          0.030071326 = weight(abstract_txt:experiments in 1777) [ClassicSimilarity], result of:
            0.030071326 = score(doc=1777,freq=1.0), product of:
              0.08955983 = queryWeight, product of:
                1.0492034 = boost
                5.372288 = idf(docFreq=533, maxDocs=42306)
                0.015888916 = queryNorm
              0.335768 = fieldWeight in 1777, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.372288 = idf(docFreq=533, maxDocs=42306)
                0.0625 = fieldNorm(doc=1777)
          0.044440478 = weight(abstract_txt:efficiency in 1777) [ClassicSimilarity], result of:
            0.044440478 = score(doc=1777,freq=1.0), product of:
              0.11619765 = queryWeight, product of:
                1.195093 = boost
                6.1192946 = idf(docFreq=252, maxDocs=42306)
                0.015888916 = queryNorm
              0.38245592 = fieldWeight in 1777, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1192946 = idf(docFreq=252, maxDocs=42306)
                0.0625 = fieldNorm(doc=1777)
          0.07782866 = weight(abstract_txt:speed in 1777) [ClassicSimilarity], result of:
            0.07782866 = score(doc=1777,freq=2.0), product of:
              0.13399684 = queryWeight, product of:
                1.2833654 = boost
                6.57128 = idf(docFreq=160, maxDocs=42306)
                0.015888916 = queryNorm
              0.58082455 = fieldWeight in 1777, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.57128 = idf(docFreq=160, maxDocs=42306)
                0.0625 = fieldNorm(doc=1777)
          0.022773474 = weight(abstract_txt:retrieval in 1777) [ClassicSimilarity], result of:
            0.022773474 = score(doc=1777,freq=2.0), product of:
              0.07440996 = queryWeight, product of:
                1.3524885 = boost
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.015888916 = queryNorm
              0.30605412 = fieldWeight in 1777, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.0625 = fieldNorm(doc=1777)
          0.14641953 = weight(abstract_txt:computing in 1777) [ClassicSimilarity], result of:
            0.14641953 = score(doc=1777,freq=3.0), product of:
              0.2247573 = queryWeight, product of:
                2.3505795 = boost
                6.0178947 = idf(docFreq=279, maxDocs=42306)
                0.015888916 = queryNorm
              0.6514562 = fieldWeight in 1777, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.0178947 = idf(docFreq=279, maxDocs=42306)
                0.0625 = fieldNorm(doc=1777)
          0.26553458 = weight(abstract_txt:parallel in 1777) [ClassicSimilarity], result of:
            0.26553458 = score(doc=1777,freq=3.0), product of:
              0.38261232 = queryWeight, product of:
                3.7561517 = boost
                6.4109373 = idf(docFreq=188, maxDocs=42306)
                0.015888916 = queryNorm
              0.6940043 = fieldWeight in 1777, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.4109373 = idf(docFreq=188, maxDocs=42306)
                0.0625 = fieldNorm(doc=1777)
        0.28 = coord(7/25)
    
  4. Mengle, S.; Goharian, N.: Passage detection using text classification (2009) 0.18
    0.1750557 = sum of:
      0.1750557 = product of:
        1.4587975 = sum of:
          0.031507023 = weight(abstract_txt:retrieval in 585) [ClassicSimilarity], result of:
            0.031507023 = score(doc=585,freq=5.0), product of:
              0.07440996 = queryWeight, product of:
                1.3524885 = boost
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.015888916 = queryNorm
              0.4234248 = fieldWeight in 585, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.0546875 = fieldNorm(doc=585)
          0.70499426 = weight(abstract_txt:passage in 585) [ClassicSimilarity], result of:
            0.70499426 = score(doc=585,freq=14.0), product of:
              0.41920894 = queryWeight, product of:
                3.210209 = boost
                8.218697 = idf(docFreq=30, maxDocs=42306)
                0.015888916 = queryNorm
              1.6817253 = fieldWeight in 585, product of:
                3.7416575 = tf(freq=14.0), with freq of:
                  14.0 = termFreq=14.0
                8.218697 = idf(docFreq=30, maxDocs=42306)
                0.0546875 = fieldNorm(doc=585)
          0.7222961 = weight(abstract_txt:passages in 585) [ClassicSimilarity], result of:
            0.7222961 = score(doc=585,freq=14.0), product of:
              0.42603996 = queryWeight, product of:
                3.2362585 = boost
                8.285388 = idf(docFreq=28, maxDocs=42306)
                0.015888916 = queryNorm
              1.6953717 = fieldWeight in 585, product of:
                3.7416575 = tf(freq=14.0), with freq of:
                  14.0 = termFreq=14.0
                8.285388 = idf(docFreq=28, maxDocs=42306)
                0.0546875 = fieldNorm(doc=585)
        0.12 = coord(3/25)
    
  5. MacFarlane, A.; Robertson, S.E.; McCann, J.A.: Parallel computing in information retrieval : an updated review (1997) 0.17
    0.17191772 = sum of:
      0.17191772 = product of:
        1.0744858 = sum of:
          0.05578339 = weight(abstract_txt:retrieval in 520) [ClassicSimilarity], result of:
            0.05578339 = score(doc=520,freq=3.0), product of:
              0.07440996 = queryWeight, product of:
                1.3524885 = boost
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.015888916 = queryNorm
              0.7496764 = fieldWeight in 520, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.125 = fieldNorm(doc=520)
          0.09399284 = weight(abstract_txt:processing in 520) [ClassicSimilarity], result of:
            0.09399284 = score(doc=520,freq=1.0), product of:
              0.15196073 = queryWeight, product of:
                1.9327859 = boost
                4.94827 = idf(docFreq=815, maxDocs=42306)
                0.015888916 = queryNorm
              0.61853373 = fieldWeight in 520, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.94827 = idf(docFreq=815, maxDocs=42306)
                0.125 = fieldNorm(doc=520)
          0.23910211 = weight(abstract_txt:computing in 520) [ClassicSimilarity], result of:
            0.23910211 = score(doc=520,freq=2.0), product of:
              0.2247573 = queryWeight, product of:
                2.3505795 = boost
                6.0178947 = idf(docFreq=279, maxDocs=42306)
                0.015888916 = queryNorm
              1.0638236 = fieldWeight in 520, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.0178947 = idf(docFreq=279, maxDocs=42306)
                0.125 = fieldNorm(doc=520)
          0.68560743 = weight(abstract_txt:parallel in 520) [ClassicSimilarity], result of:
            0.68560743 = score(doc=520,freq=5.0), product of:
              0.38261232 = queryWeight, product of:
                3.7561517 = boost
                6.4109373 = idf(docFreq=188, maxDocs=42306)
                0.015888916 = queryNorm
              1.7919115 = fieldWeight in 520, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.4109373 = idf(docFreq=188, maxDocs=42306)
                0.125 = fieldNorm(doc=520)
        0.16 = coord(4/25)