Search (2 results, page 1 of 1)

Crouch, C.J.: ¬An approach to the automatic construction of global thesauri (1990) 0.01

0.010502836 = product of:
  0.04901323 = sum of:
    0.009988253 = weight(_text_:information in 4042) [ClassicSimilarity], result of:
      0.009988253 = score(doc=4042,freq=4.0), product of:
        0.052020688 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.029633347 = queryNorm
        0.1920054 = fieldWeight in 4042, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0546875 = fieldNorm(doc=4042)
    0.029656855 = weight(_text_:retrieval in 4042) [ClassicSimilarity], result of:
      0.029656855 = score(doc=4042,freq=4.0), product of:
        0.08963835 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.029633347 = queryNorm
        0.33085006 = fieldWeight in 4042, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.0546875 = fieldNorm(doc=4042)
    0.009368123 = product of:
      0.028104367 = sum of:
        0.028104367 = weight(_text_:22 in 4042) [ClassicSimilarity], result of:
          0.028104367 = score(doc=4042,freq=2.0), product of:
            0.103770934 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.029633347 = queryNorm
            0.2708308 = fieldWeight in 4042, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=4042)
      0.33333334 = coord(1/3)
  0.21428572 = coord(3/14)
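
The explain tree above can be recomputed by hand. The following is a minimal sketch, assuming the classic Lucene TF-IDF formulas that the tree itself displays: tf is the square root of the term frequency, idf is 1 + ln(maxDocs / (docFreq + 1)), and each term score is queryWeight times fieldWeight. The function names are hypothetical. The constants (queryNorm, fieldNorm, docFreq, maxDocs, and the coord factors) are copied from the listing rather than derived.

```python
import math

# Values copied from the explain output; not derived here.
MAX_DOCS = 44218
QUERY_NORM = 0.029633347


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency as shown in the tree: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, field_norm, query_norm=QUERY_NORM):
    """Score of one term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = term_idf * query_norm                    # idf * queryNorm
    field_weight = math.sqrt(freq) * term_idf * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight


# Document 4042: three matching terms, all with fieldNorm 0.0546875.
FIELD_NORM_4042 = 0.0546875
information = term_score(4.0, 20772, FIELD_NORM_4042)
retrieval = term_score(4.0, 5836, FIELD_NORM_4042)
# The "22" term sits in a nested clause that carries its own coord(1/3).
term_22 = term_score(2.0, 3622, FIELD_NORM_4042) * (1 / 3)

# The outer coord(3/14) scales the sum: 3 of 14 query clauses matched.
doc_4042 = (information + retrieval + term_22) * (3 / 14)
print(doc_4042)
```

The result agrees with the listed 0.010502836 up to the single-precision rounding used in the explain output. The retrieval term contributes the most because its idf is higher than that of information at the same term frequency.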

Abstract: The benefits of a well-constructed thesaurus to an information retrieval system have long been recognised by both researchers and practitioners in the field. Examines both early and current approaches to automatic thesaurus construction and describes an approach to the automatic generation of global thesauri based on the term discrimination value model of Salton, Yang, and Yu and on an appropriate clustering algorithm. This method has been implemented and applied to 2 document collections. Preliminary results indicate that this method, which produces improvements in retrieval performance in excess of 10 and 15% in the test collections, is viable and worthy of continued investigation.
Date: 22. 4.1996 3:39:53
Source: Information processing and management. 26(1990), no.5, S.629-640

Crouch, C.J.; Crouch, D.B.; Chen, Q.; Holtz, S.J.: Improving the retrieval effectiveness of very short queries (2002) 0.01

0.009356454 = product of:
  0.04366345 = sum of:
    0.017435152 = weight(_text_:web in 2572) [ClassicSimilarity], result of:
      0.017435152 = score(doc=2572,freq=2.0), product of:
        0.09670874 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.029633347 = queryNorm
        0.18028519 = fieldWeight in 2572, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2572)
    0.0050448296 = weight(_text_:information in 2572) [ClassicSimilarity], result of:
      0.0050448296 = score(doc=2572,freq=2.0), product of:
        0.052020688 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.029633347 = queryNorm
        0.09697737 = fieldWeight in 2572, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2572)
    0.021183468 = weight(_text_:retrieval in 2572) [ClassicSimilarity], result of:
      0.021183468 = score(doc=2572,freq=4.0), product of:
        0.08963835 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.029633347 = queryNorm
        0.23632148 = fieldWeight in 2572, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2572)
  0.21428572 = coord(3/14)

Abstract: This paper describes an automatic approach designed to improve the retrieval effectiveness of very short queries such as those used in web searching. The method is based on the observation that stemming, which is designed to maximize recall, often results in depressed precision. Our approach is based on pseudo-feedback and attempts to increase the number of relevant documents in the pseudo-relevant set by reranking those documents based on the presence of unstemmed query terms in the document text. The original experiments underlying this work were carried out using Smart 11.0 and the lnc.ltc weighting scheme on three sets of documents from the TREC collection with corresponding TREC (title only) topics as queries. (The average length of these queries after stoplisting ranges from 2.4 to 4.5 terms.) Results, evaluated in terms of P@20 and non-interpolated average precision, showed clearly that pseudo-feedback (PF) based on this approach was effective in increasing the number of relevant documents in the top ranks. Subsequent experiments, performed on the same data sets using Smart 13.0 and the improved Lnu.ltu weighting scheme, indicate that these results hold up even over the much higher baseline provided by the new weights. Query drift analysis presents a more detailed picture of the improvements produced by this process.
Source: Information processing and management. 38(2002), no.1, S.1-36
