Search (1 result, page 1 of 1)

  • author_ss:"Heinz, S."
  • theme_ss:"Retrievalalgorithmen"
  • type_ss:"a"
  1. Heinz, S.; Zobel, J.: Efficient single-pass index construction for text databases (2003) 0.00
    0.0049275304 = product of:
      0.024637653 = sum of:
        0.024637653 = product of:
          0.049275305 = sum of:
            0.049275305 = weight(_text_:data in 1678) [ClassicSimilarity], result of:
              0.049275305 = score(doc=1678,freq=4.0), product of:
                0.14247625 = queryWeight, product of:
                  3.1620505 = idf(docFreq=5088, maxDocs=44218)
                  0.04505818 = queryNorm
                0.34584928 = fieldWeight in 1678, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.1620505 = idf(docFreq=5088, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1678)
          0.5 = coord(1/2)
      0.2 = coord(1/5)
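
The score tree above can be checked by hand. The following is a minimal sketch, assuming the tf and idf formulas of Lucene's ClassicSimilarity in the form that reproduces the printed values (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))). The queryNorm, fieldNorm, and coord factors are copied from the explanation rather than derived, and the function names are illustrative only.

```python
import math


def classic_tf(freq: float) -> float:
    # Term-frequency factor: square root of the raw term frequency.
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    # Inverse document frequency in the form that matches the values shown above.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


# Values copied from the explanation (not derived here).
QUERY_NORM = 0.04505818
FIELD_NORM = 0.0546875
COORD_INNER = 1 / 2  # coord(1/2)
COORD_OUTER = 1 / 5  # coord(1/5)

tf = classic_tf(4.0)
idf = classic_idf(doc_freq=5088, max_docs=44218)

query_weight = idf * QUERY_NORM        # idf * queryNorm
field_weight = tf * idf * FIELD_NORM   # tf * idf * fieldNorm
term_score = query_weight * field_weight
final_score = term_score * COORD_INNER * COORD_OUTER

print(f"tf={tf:.1f} idf={idf:.6f}")
print(f"queryWeight={query_weight:.6f} fieldWeight={field_weight:.6f}")
print(f"term score={term_score:.6f} final score={final_score:.7f}")
```

The final product is about 0.0049, which the result list rounds to 0.00.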
    
    Abstract
    Efficient construction of inverted indexes is essential to provision of search over large collections of text data. In this article, we review the principal approaches to inversion, analyze their theoretical cost, and present experimental results. We identify the drawbacks of existing inversion approaches and propose a single-pass inversion method that, in contrast to previous approaches, does not require the complete vocabulary of the indexed collection in main memory, can operate within limited resources, and does not sacrifice speed with high temporary storage requirements. We show that the performance of the single-pass approach can be improved by constructing inverted files in segments, reducing the cost of disk accesses during inversion of large volumes of data.
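
The general idea of building an inverted index in one pass under a memory budget can be illustrated as follows. This is a simplified sketch and not the authors' algorithm: the function names, the `max_postings` budget, the whitespace tokenizer, and the JSON-lines run format are all assumptions made for illustration. Postings are accumulated in memory, flushed to a sorted temporary run whenever the budget is reached, and the runs are merged at the end. The merged index is returned as a dictionary only to keep the example small; a real implementation would stream it to disk.

```python
import heapq
import json
import tempfile
from collections import defaultdict


def flush_run(postings):
    """Write one in-memory run to a temporary file, terms in sorted order."""
    run = tempfile.TemporaryFile(mode="w+", encoding="utf-8")
    for term in sorted(postings):
        run.write(json.dumps([term, postings[term]]) + "\n")
    run.seek(0)
    return run


def read_run(run):
    """Yield (term, postings_list) pairs from a run file, one per line."""
    for line in run:
        term, plist = json.loads(line)
        yield term, plist


def build_index(docs, max_postings=1000):
    """Build term -> [[doc_id, freq], ...] in a single pass over docs."""
    runs = []
    postings = defaultdict(list)
    count = 0
    for doc_id, text in enumerate(docs):
        freqs = defaultdict(int)
        for token in text.lower().split():  # hypothetical tokenizer
            freqs[token] += 1
        for term, freq in freqs.items():
            postings[term].append([doc_id, freq])
            count += 1
        if count >= max_postings:  # memory budget reached: flush a run
            runs.append(flush_run(postings))
            postings = defaultdict(list)
            count = 0
    if postings:
        runs.append(flush_run(postings))

    # heapq.merge is stable, so postings for a term stay in doc_id order.
    index = {}
    streams = [read_run(r) for r in runs]
    for term, plist in heapq.merge(*streams, key=lambda pair: pair[0]):
        index.setdefault(term, []).extend(plist)
    for r in runs:
        r.close()
    return index


docs = ["data index data", "index text", "text data"]
print(build_index(docs, max_postings=2))
```

With `max_postings=2` each of the three documents produces its own run, so the example exercises the merge step; a larger budget yields the same index from a single run.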