Search (1 results, page 1 of 1)

  • × author_ss:"Can, F."
  1. Can, F.: On the efficiency of best-match cluster searches (1994) 0.04
    0.042939495 = product of:
      0.12881848 = sum of:
        0.09847082 = weight(_text_:storage in 7294) [ClassicSimilarity], result of:
          0.09847082 = score(doc=7294,freq=2.0), product of:
            0.23366846 = queryWeight, product of:
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.04288404 = queryNorm
            0.42141256 = fieldWeight in 7294, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.0546875 = fieldNorm(doc=7294)
        0.030347655 = weight(_text_:retrieval in 7294) [ClassicSimilarity], result of:
          0.030347655 = score(doc=7294,freq=2.0), product of:
            0.12972058 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.04288404 = queryNorm
            0.23394634 = fieldWeight in 7294, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0546875 = fieldNorm(doc=7294)
      0.33333334 = coord(2/6)
    
    Abstract
    The efficiency of various cluster-based retrieval (CBR) strategies is analyzed. The possibility of combining CBR and inverted index search (IIS) is investigated. A method for combining the two approaches is proposed and shown to be cost effective in terms of paging and CPU time. In the new method, the selection of documents from the best-matching clusters is done using the inverted index for all documents. Although this is counterintuitive to the concept of best-match CBR, the observations prove that it is much more efficient than conventional approaches. In the experiments, the effects of the number of selected clusters, page size, centroid length, and matching functions are considered. The experiments show that the storage overhead of the new method would be moderately higher than that of IIS