Search (12 results, page 1 of 1)

  • × year_i:[2000 TO 2010}
  • × theme_ss:"Literaturübersicht"
  1. Liu, X.; Croft, W.B.: Statistical language modeling for information retrieval (2004) 0.03
    0.034903605 = product of:
      0.17451802 = sum of:
        0.17451802 = weight(_text_:grams in 4277) [ClassicSimilarity], result of:
          0.17451802 = score(doc=4277,freq=2.0), product of:
            0.39198354 = queryWeight, product of:
              8.059301 = idf(docFreq=37, maxDocs=44218)
              0.04863741 = queryNorm
            0.44521773 = fieldWeight in 4277, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              8.059301 = idf(docFreq=37, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4277)
      0.2 = coord(1/5)
    
    Abstract
    This chapter reviews research and applications in statistical language modeling for information retrieval (IR), which has emerged within the past several years as a new probabilistic framework for describing information retrieval processes. Generally speaking, statistical language modeling, or more simply language modeling (LM), involves estimating a probability distribution that captures statistical regularities of natural language use. Applied to information retrieval, language modeling refers to the problem of estimating the likelihood that a query and a document could have been generated by the same language model, given the language model of the document either with or without a language model of the query. The roots of statistical language modeling date to the beginning of the twentieth century when Markov tried to model letter sequences in works of Russian literature (Manning & Schütze, 1999). Zipf (1929, 1932, 1949, 1965) studied the statistical properties of text and discovered that the frequency of works decays as a Power function of each works rank. However, it was Shannon's (1951) work that inspired later research in this area. In 1951, eager to explore the applications of his newly founded information theory to human language, Shannon used a prediction game involving n-grams to investigate the information content of English text. He evaluated n-gram models' performance by comparing their crossentropy an texts with the true entropy estimated using predictions made by human subjects. For many years, statistical language models have been used primarily for automatic speech recognition. Since 1980, when the first significant language model was proposed (Rosenfeld, 2000), statistical language modeling has become a fundamental component of speech recognition, machine translation, and spelling correction.
  2. Enser, P.G.B.: Visual image retrieval (2008) 0.02
    0.021087032 = product of:
      0.105435155 = sum of:
        0.105435155 = weight(_text_:22 in 3281) [ClassicSimilarity], result of:
          0.105435155 = score(doc=3281,freq=2.0), product of:
            0.17031991 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04863741 = queryNorm
            0.61904186 = fieldWeight in 3281, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.125 = fieldNorm(doc=3281)
      0.2 = coord(1/5)
    
    Date
    22. 1.2012 13:01:26
  3. Morris, S.A.: Mapping research specialties (2008) 0.02
    0.021087032 = product of:
      0.105435155 = sum of:
        0.105435155 = weight(_text_:22 in 3962) [ClassicSimilarity], result of:
          0.105435155 = score(doc=3962,freq=2.0), product of:
            0.17031991 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04863741 = queryNorm
            0.61904186 = fieldWeight in 3962, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.125 = fieldNorm(doc=3962)
      0.2 = coord(1/5)
    
    Date
    13. 7.2008 9:30:22
  4. Fallis, D.: Social epistemology and information science (2006) 0.02
    0.021087032 = product of:
      0.105435155 = sum of:
        0.105435155 = weight(_text_:22 in 4368) [ClassicSimilarity], result of:
          0.105435155 = score(doc=4368,freq=2.0), product of:
            0.17031991 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04863741 = queryNorm
            0.61904186 = fieldWeight in 4368, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.125 = fieldNorm(doc=4368)
      0.2 = coord(1/5)
    
    Date
    13. 7.2008 19:22:28
  5. Nicolaisen, J.: Citation analysis (2007) 0.02
    0.021087032 = product of:
      0.105435155 = sum of:
        0.105435155 = weight(_text_:22 in 6091) [ClassicSimilarity], result of:
          0.105435155 = score(doc=6091,freq=2.0), product of:
            0.17031991 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04863741 = queryNorm
            0.61904186 = fieldWeight in 6091, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.125 = fieldNorm(doc=6091)
      0.2 = coord(1/5)
    
    Date
    13. 7.2008 19:53:22
  6. Kim, K.-S.: Recent work in cataloging and classification, 2000-2002 (2003) 0.01
    0.010543516 = product of:
      0.052717578 = sum of:
        0.052717578 = weight(_text_:22 in 152) [ClassicSimilarity], result of:
          0.052717578 = score(doc=152,freq=2.0), product of:
            0.17031991 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04863741 = queryNorm
            0.30952093 = fieldWeight in 152, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=152)
      0.2 = coord(1/5)
    
    Date
    10. 9.2000 17:38:22
  7. El-Sherbini, M.A.: Cataloging and classification : review of the literature 2005-06 (2008) 0.01
    0.010543516 = product of:
      0.052717578 = sum of:
        0.052717578 = weight(_text_:22 in 249) [ClassicSimilarity], result of:
          0.052717578 = score(doc=249,freq=2.0), product of:
            0.17031991 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04863741 = queryNorm
            0.30952093 = fieldWeight in 249, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=249)
      0.2 = coord(1/5)
    
    Date
    10. 9.2000 17:38:22
  8. Miksa, S.D.: ¬The challenges of change : a review of cataloging and classification literature, 2003-2004 (2007) 0.01
    0.010543516 = product of:
      0.052717578 = sum of:
        0.052717578 = weight(_text_:22 in 266) [ClassicSimilarity], result of:
          0.052717578 = score(doc=266,freq=2.0), product of:
            0.17031991 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04863741 = queryNorm
            0.30952093 = fieldWeight in 266, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=266)
      0.2 = coord(1/5)
    
    Date
    10. 9.2000 17:38:22
  9. Nielsen, M.L.: Thesaurus construction : key issues and selected readings (2004) 0.01
    0.009225576 = product of:
      0.04612788 = sum of:
        0.04612788 = weight(_text_:22 in 5006) [ClassicSimilarity], result of:
          0.04612788 = score(doc=5006,freq=2.0), product of:
            0.17031991 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04863741 = queryNorm
            0.2708308 = fieldWeight in 5006, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5006)
      0.2 = coord(1/5)
    
    Date
    18. 5.2006 20:06:22
  10. Weiss, A.K.; Carstens, T.V.: ¬The year's work in cataloging, 1999 (2001) 0.01
    0.009225576 = product of:
      0.04612788 = sum of:
        0.04612788 = weight(_text_:22 in 6084) [ClassicSimilarity], result of:
          0.04612788 = score(doc=6084,freq=2.0), product of:
            0.17031991 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04863741 = queryNorm
            0.2708308 = fieldWeight in 6084, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=6084)
      0.2 = coord(1/5)
    
    Date
    10. 9.2000 17:38:22
  11. Genereux, C.: Building connections : a review of the serials literature 2004 through 2005 (2007) 0.01
    0.007907636 = product of:
      0.039538182 = sum of:
        0.039538182 = weight(_text_:22 in 2548) [ClassicSimilarity], result of:
          0.039538182 = score(doc=2548,freq=2.0), product of:
            0.17031991 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04863741 = queryNorm
            0.23214069 = fieldWeight in 2548, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=2548)
      0.2 = coord(1/5)
    
    Date
    10. 9.2000 17:38:22
  12. Corbett, L.E.: Serials: review of the literature 2000-2003 (2006) 0.01
    0.006589697 = product of:
      0.032948487 = sum of:
        0.032948487 = weight(_text_:22 in 1088) [ClassicSimilarity], result of:
          0.032948487 = score(doc=1088,freq=2.0), product of:
            0.17031991 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04863741 = queryNorm
            0.19345059 = fieldWeight in 1088, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1088)
      0.2 = coord(1/5)
    
    Date
    10. 9.2000 17:38:22