Search (2 results, page 1 of 1)

  • × author_ss:"Lee, D.L."
  1. Lee, D.L.; Ren, L.: Document ranking on weight-partitioned signature files (1996) 0.08
    0.08007109 = product of:
      0.16014218 = sum of:
        0.09847082 = weight(_text_:storage in 2417) [ClassicSimilarity], result of:
          0.09847082 = score(doc=2417,freq=2.0), product of:
            0.23366846 = queryWeight, product of:
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.04288404 = queryNorm
            0.42141256 = fieldWeight in 2417, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2417)
        0.030347655 = weight(_text_:retrieval in 2417) [ClassicSimilarity], result of:
          0.030347655 = score(doc=2417,freq=2.0), product of:
            0.12972058 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.04288404 = queryNorm
            0.23394634 = fieldWeight in 2417, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2417)
        0.031323705 = weight(_text_:systems in 2417) [ClassicSimilarity], result of:
          0.031323705 = score(doc=2417,freq=2.0), product of:
            0.13179013 = queryWeight, product of:
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.04288404 = queryNorm
            0.23767869 = fieldWeight in 2417, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2417)
      0.5 = coord(3/6)
    
    Abstract
    Proposes the weight partitioned signature file, a signature file organization for supporting document ranking. It uses multiple signature files each corresponding to one term frequency to represent terms with different term frequencies. Words with the same term frequency in a document are grouped together and hased into the signature file corresponding to that term frequency. Investigates the effect of false drops on retrieval effectiveness. Analyses the performance of the weight partitioned signature file under different search strategies and configurations. Obtains an optimal formula for storage allocation to minimise the effect of false drops on document ranks. Analytical results are supported by experiments on document collections
    Source
    ACM transactions on information systems. 14(1996) no.2, S.109-137
  2. Lee, D.L.: Massive parallelism on the hybrid text-retrieval machine (1995) 0.06
    0.06461197 = product of:
      0.19383591 = sum of:
        0.15915288 = weight(_text_:storage in 4075) [ClassicSimilarity], result of:
          0.15915288 = score(doc=4075,freq=4.0), product of:
            0.23366846 = queryWeight, product of:
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.04288404 = queryNorm
            0.68110555 = fieldWeight in 4075, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.0625 = fieldNorm(doc=4075)
        0.034683034 = weight(_text_:retrieval in 4075) [ClassicSimilarity], result of:
          0.034683034 = score(doc=4075,freq=2.0), product of:
            0.12972058 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.04288404 = queryNorm
            0.26736724 = fieldWeight in 4075, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0625 = fieldNorm(doc=4075)
      0.33333334 = coord(2/6)
    
    Abstract
    Discusses the design of a high-performance, cost effective, machine for retrieving textual data, HYTREM. High performance and cost effectiveness are achieved by a combination of low cost hard discs, software filtering techniques, and a large amount of main memory. Focuses on the signature processor, which is based on the partitioned signature file technique, and the mass storage system, which is based on a disc array. Presents a performance evaluation on the individual system components, i.e. the signature processor and the mass storage system, as well as the entire system