Search (2 results, page 1 of 1)

  • × author_ss:"Zobel, J."
  • × theme_ss:"Retrievalalgorithmen"
  1. Kaszkiel, M.; Zobel, J.: Effective ranking with arbitrary passages (2001) 0.00
    0.004880361 = product of:
      0.014641082 = sum of:
        0.014641082 = product of:
          0.029282164 = sum of:
            0.029282164 = weight(_text_:of in 5764) [ClassicSimilarity], result of:
              0.029282164 = score(doc=5764,freq=34.0), product of:
                0.06850986 = queryWeight, product of:
                  1.5637573 = idf(docFreq=25162, maxDocs=44218)
                  0.043811057 = queryNorm
                0.4274153 = fieldWeight in 5764, product of:
                  5.8309517 = tf(freq=34.0), with freq of:
                    34.0 = termFreq=34.0
                  1.5637573 = idf(docFreq=25162, maxDocs=44218)
                  0.046875 = fieldNorm(doc=5764)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Abstract
    Text retrieval systems store a great variety of documents, from abstracts, newspaper articles, and Web pages to journal articles, books, court transcripts, and legislation. Collections of diverse types of documents expose shortcomings in current approaches to ranking. Use of short fragments of documents, called passages, instead of whole documents can overcome these shortcomings: passage ranking provides convenient units of text to return to the user, can avoid the difficulties of comparing documents of different length, and enables identification of short blocks of relevant material among otherwise irrelevant text. In this article, we compare several kinds of passage in an extensive series of experiments. We introduce a new type of passage, overlapping fragments of either fixed or variable length. We show that ranking with these arbitrary passages gives substantial improvements in retrieval effectiveness over traditional document ranking schemes, particularly for queries on collections of long documents. Ranking with arbitrary passages shows consistent improvements compared to ranking with whole documents, and to ranking with previous passage types that depend on document structure or topic shifts in documents
    Source
    Journal of the American Society for Information Science and technology. 52(2001) no.4, S.344-364
  2. Heinz, S.; Zobel, J.: Efficient single-pass index construction for text databases (2003) 0.00
    0.00436691 = product of:
      0.01310073 = sum of:
        0.01310073 = product of:
          0.02620146 = sum of:
            0.02620146 = weight(_text_:of in 1678) [ClassicSimilarity], result of:
              0.02620146 = score(doc=1678,freq=20.0), product of:
                0.06850986 = queryWeight, product of:
                  1.5637573 = idf(docFreq=25162, maxDocs=44218)
                  0.043811057 = queryNorm
                0.38244802 = fieldWeight in 1678, product of:
                  4.472136 = tf(freq=20.0), with freq of:
                    20.0 = termFreq=20.0
                  1.5637573 = idf(docFreq=25162, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1678)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Abstract
    Efficient construction of inverted indexes is essential to provision of search over large collections of text data. In this article, we review the principal approaches to inversion, analyze their theoretical cost, and present experimental results. We identify the drawbacks of existing inversion approaches and propose a single-pass inversion method that, in contrast to previous approaches, does not require the complete vocabulary of the indexed collection in main memory, can operate within limited resources, and does not sacrifice speed with high temporary storage requirements. We show that the performance of the single-pass approach can be improved by constructing inverted files in segments, reducing the cost of disk accesses during inversion of large volumes of data.
    Source
    Journal of the American Society for Information Science and technology. 54(2003) no.8, S.713-729

Authors