Document (#29278)

Author
Liu, X.
Croft, W.B.
Title
Statistical language modeling for information retrieval
Source
Annual review of information science and technology. 39(2005), S.3-32
Year
2004
Abstract
This chapter reviews research and applications in statistical language modeling for information retrieval (IR), which has emerged within the past several years as a new probabilistic framework for describing information retrieval processes. Generally speaking, statistical language modeling, or more simply language modeling (LM), involves estimating a probability distribution that captures statistical regularities of natural language use. Applied to information retrieval, language modeling refers to the problem of estimating the likelihood that a query and a document could have been generated by the same language model, given the language model of the document either with or without a language model of the query. The roots of statistical language modeling date to the beginning of the twentieth century when Markov tried to model letter sequences in works of Russian literature (Manning & Schütze, 1999). Zipf (1929, 1932, 1949, 1965) studied the statistical properties of text and discovered that the frequency of works decays as a Power function of each works rank. However, it was Shannon's (1951) work that inspired later research in this area. In 1951, eager to explore the applications of his newly founded information theory to human language, Shannon used a prediction game involving n-grams to investigate the information content of English text. He evaluated n-gram models' performance by comparing their crossentropy an texts with the true entropy estimated using predictions made by human subjects. For many years, statistical language models have been used primarily for automatic speech recognition. Since 1980, when the first significant language model was proposed (Rosenfeld, 2000), statistical language modeling has become a fundamental component of speech recognition, machine translation, and spelling correction.
Theme
Literaturübersicht
Computerlinguistik

Similar documents (author)

  1. Croft, W.B.: Approaches to intelligent information retrieval (1987) 5.02
    5.020828 = sum of:
      5.020828 = weight(author_txt:croft in 1094) [ClassicSimilarity], result of:
        5.020828 = score(doc=1094,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.033325 = idf(docFreq=38, maxDocs=44218)
            0.12448145 = queryNorm
          5.0208282 = fieldWeight in 1094, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.033325 = idf(docFreq=38, maxDocs=44218)
            0.625 = fieldNorm(doc=1094)
    
  2. Croft, W.B.: Clustering large files of documents using the single link method (1977) 5.02
    5.020828 = sum of:
      5.020828 = weight(author_txt:croft in 5489) [ClassicSimilarity], result of:
        5.020828 = score(doc=5489,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.033325 = idf(docFreq=38, maxDocs=44218)
            0.12448145 = queryNorm
          5.0208282 = fieldWeight in 5489, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.033325 = idf(docFreq=38, maxDocs=44218)
            0.625 = fieldNorm(doc=5489)
    
  3. Croft, W.B.: Knowledge-based and statistical approaches to text retrieval (1993) 5.02
    5.020828 = sum of:
      5.020828 = weight(author_txt:croft in 7863) [ClassicSimilarity], result of:
        5.020828 = score(doc=7863,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.033325 = idf(docFreq=38, maxDocs=44218)
            0.12448145 = queryNorm
          5.0208282 = fieldWeight in 7863, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.033325 = idf(docFreq=38, maxDocs=44218)
            0.625 = fieldNorm(doc=7863)
    
  4. Croft, W.B.: Hypertext and information retrieval : what are the fundamental concepts? (1990) 5.02
    5.020828 = sum of:
      5.020828 = weight(author_txt:croft in 8003) [ClassicSimilarity], result of:
        5.020828 = score(doc=8003,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.033325 = idf(docFreq=38, maxDocs=44218)
            0.12448145 = queryNorm
          5.0208282 = fieldWeight in 8003, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.033325 = idf(docFreq=38, maxDocs=44218)
            0.625 = fieldNorm(doc=8003)
    
  5. Croft, W.B.: What do people want from information retrieval? : the top 10 research issues for companies that use and sell IR systems (1995) 5.02
    5.020828 = sum of:
      5.020828 = weight(author_txt:croft in 3402) [ClassicSimilarity], result of:
        5.020828 = score(doc=3402,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.033325 = idf(docFreq=38, maxDocs=44218)
            0.12448145 = queryNorm
          5.0208282 = fieldWeight in 3402, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.033325 = idf(docFreq=38, maxDocs=44218)
            0.625 = fieldNorm(doc=3402)
    

Similar documents (content)

  1. Multilingual information management : current levels and future abilities. A report Commissioned by the US National Science Foundation and also delivered to the European Commission's Language Engineering Office and the US Defense Advanced Research Projects Agency, April 1999 (1999) 0.36
    0.3587855 = sum of:
      0.3587855 = product of:
        0.81542164 = sum of:
          0.019403195 = weight(abstract_txt:models in 6068) [ClassicSimilarity], result of:
            0.019403195 = score(doc=6068,freq=1.0), product of:
              0.07643986 = queryWeight, product of:
                1.0812947 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.015230372 = queryNorm
              0.2538361 = fieldWeight in 6068, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.0546875 = fieldNorm(doc=6068)
          0.028344432 = weight(abstract_txt:years in 6068) [ClassicSimilarity], result of:
            0.028344432 = score(doc=6068,freq=2.0), product of:
              0.07810993 = queryWeight, product of:
                1.093043 = boost
                4.692005 = idf(docFreq=1101, maxDocs=44218)
                0.015230372 = queryNorm
              0.36287874 = fieldWeight in 6068, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.692005 = idf(docFreq=1101, maxDocs=44218)
                0.0546875 = fieldNorm(doc=6068)
          0.00730094 = weight(abstract_txt:that in 6068) [ClassicSimilarity], result of:
            0.00730094 = score(doc=6068,freq=2.0), product of:
              0.03984039 = queryWeight, product of:
                1.1039792 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.015230372 = queryNorm
              0.18325473 = fieldWeight in 6068, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0546875 = fieldNorm(doc=6068)
          0.03584189 = weight(abstract_txt:applications in 6068) [ClassicSimilarity], result of:
            0.03584189 = score(doc=6068,freq=3.0), product of:
              0.079791725 = queryWeight, product of:
                1.1047475 = boost
                4.7422485 = idf(docFreq=1047, maxDocs=44218)
                0.015230372 = queryNorm
              0.44919306 = fieldWeight in 6068, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7422485 = idf(docFreq=1047, maxDocs=44218)
                0.0546875 = fieldNorm(doc=6068)
          0.077071175 = weight(abstract_txt:recognition in 6068) [ClassicSimilarity], result of:
            0.077071175 = score(doc=6068,freq=3.0), product of:
              0.1329307 = queryWeight, product of:
                1.4259253 = boost
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.015230372 = queryNorm
              0.57978463 = fieldWeight in 6068, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.0546875 = fieldNorm(doc=6068)
          0.10890243 = weight(abstract_txt:speech in 6068) [ClassicSimilarity], result of:
            0.10890243 = score(doc=6068,freq=3.0), product of:
              0.16738725 = queryWeight, product of:
                1.6000932 = boost
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.015230372 = queryNorm
              0.65060174 = fieldWeight in 6068, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.0546875 = fieldNorm(doc=6068)
          0.036417406 = weight(abstract_txt:retrieval in 6068) [ClassicSimilarity], result of:
            0.036417406 = score(doc=6068,freq=5.0), product of:
              0.08569662 = queryWeight, product of:
                1.6191272 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.015230372 = queryNorm
              0.4249573 = fieldWeight in 6068, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0546875 = fieldNorm(doc=6068)
          0.023361411 = weight(abstract_txt:information in 6068) [ClassicSimilarity], result of:
            0.023361411 = score(doc=6068,freq=8.0), product of:
              0.062385093 = queryWeight, product of:
                1.6919409 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.015230372 = queryNorm
              0.37447104 = fieldWeight in 6068, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0546875 = fieldNorm(doc=6068)
          0.14766775 = weight(abstract_txt:modeling in 6068) [ClassicSimilarity], result of:
            0.14766775 = score(doc=6068,freq=1.0), product of:
              0.4490391 = queryWeight, product of:
                4.9029813 = boost
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.015230372 = queryNorm
              0.32885277 = fieldWeight in 6068, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.0546875 = fieldNorm(doc=6068)
          0.13241701 = weight(abstract_txt:statistical in 6068) [ClassicSimilarity], result of:
            0.13241701 = score(doc=6068,freq=1.0), product of:
              0.43656963 = queryWeight, product of:
                5.1682186 = boost
                5.5462847 = idf(docFreq=468, maxDocs=44218)
                0.015230372 = queryNorm
              0.30331245 = fieldWeight in 6068, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5462847 = idf(docFreq=468, maxDocs=44218)
                0.0546875 = fieldNorm(doc=6068)
          0.19869398 = weight(abstract_txt:language in 6068) [ClassicSimilarity], result of:
            0.19869398 = score(doc=6068,freq=4.0), product of:
              0.4343837 = queryWeight, product of:
                6.8197727 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.015230372 = queryNorm
              0.45741582 = fieldWeight in 6068, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.0546875 = fieldNorm(doc=6068)
        0.44 = coord(11/25)
    
  2. Collins-Thompson, K.; Callan, J.: Predicting reading difficulty with statistical language models (2005) 0.31
    0.30733857 = sum of:
      0.30733857 = product of:
        0.69849676 = sum of:
          0.017540138 = weight(abstract_txt:document in 4579) [ClassicSimilarity], result of:
            0.017540138 = score(doc=4579,freq=1.0), product of:
              0.06537802 = queryWeight, product of:
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.015230372 = queryNorm
              0.26828802 = fieldWeight in 4579, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=4579)
          0.022175081 = weight(abstract_txt:models in 4579) [ClassicSimilarity], result of:
            0.022175081 = score(doc=4579,freq=1.0), product of:
              0.07643986 = queryWeight, product of:
                1.0812947 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.015230372 = queryNorm
              0.2900984 = fieldWeight in 4579, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.0625 = fieldNorm(doc=4579)
          0.013192914 = weight(abstract_txt:that in 4579) [ClassicSimilarity], result of:
            0.013192914 = score(doc=4579,freq=5.0), product of:
              0.03984039 = queryWeight, product of:
                1.1039792 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.015230372 = queryNorm
              0.3311442 = fieldWeight in 4579, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=4579)
          0.023822222 = weight(abstract_txt:query in 4579) [ClassicSimilarity], result of:
            0.023822222 = score(doc=4579,freq=1.0), product of:
              0.08017973 = queryWeight, product of:
                1.1074303 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.015230372 = queryNorm
              0.2971103 = fieldWeight in 4579, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.0625 = fieldNorm(doc=4579)
          0.01861298 = weight(abstract_txt:retrieval in 4579) [ClassicSimilarity], result of:
            0.01861298 = score(doc=4579,freq=1.0), product of:
              0.08569662 = queryWeight, product of:
                1.6191272 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.015230372 = queryNorm
              0.21719621 = fieldWeight in 4579, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=4579)
          0.009439435 = weight(abstract_txt:information in 4579) [ClassicSimilarity], result of:
            0.009439435 = score(doc=4579,freq=1.0), product of:
              0.062385093 = queryWeight, product of:
                1.6919409 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.015230372 = queryNorm
              0.15130915 = fieldWeight in 4579, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=4579)
          0.04773258 = weight(abstract_txt:works in 4579) [ClassicSimilarity], result of:
            0.04773258 = score(doc=4579,freq=1.0), product of:
              0.14587586 = queryWeight, product of:
                1.829454 = boost
                5.2354193 = idf(docFreq=639, maxDocs=44218)
                0.015230372 = queryNorm
              0.3272137 = fieldWeight in 4579, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2354193 = idf(docFreq=639, maxDocs=44218)
                0.0625 = fieldNorm(doc=4579)
          0.04966067 = weight(abstract_txt:model in 4579) [ClassicSimilarity], result of:
            0.04966067 = score(doc=4579,freq=2.0), product of:
              0.14094667 = queryWeight, product of:
                2.3215687 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.015230372 = queryNorm
              0.35233662 = fieldWeight in 4579, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.0625 = fieldNorm(doc=4579)
          0.16876315 = weight(abstract_txt:modeling in 4579) [ClassicSimilarity], result of:
            0.16876315 = score(doc=4579,freq=1.0), product of:
              0.4490391 = queryWeight, product of:
                4.9029813 = boost
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.015230372 = queryNorm
              0.37583172 = fieldWeight in 4579, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.0625 = fieldNorm(doc=4579)
          0.21401818 = weight(abstract_txt:statistical in 4579) [ClassicSimilarity], result of:
            0.21401818 = score(doc=4579,freq=2.0), product of:
              0.43656963 = queryWeight, product of:
                5.1682186 = boost
                5.5462847 = idf(docFreq=468, maxDocs=44218)
                0.015230372 = queryNorm
              0.49022692 = fieldWeight in 4579, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5462847 = idf(docFreq=468, maxDocs=44218)
                0.0625 = fieldNorm(doc=4579)
          0.11353941 = weight(abstract_txt:language in 4579) [ClassicSimilarity], result of:
            0.11353941 = score(doc=4579,freq=1.0), product of:
              0.4343837 = queryWeight, product of:
                6.8197727 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.015230372 = queryNorm
              0.26138046 = fieldWeight in 4579, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.0625 = fieldNorm(doc=4579)
        0.44 = coord(11/25)
    
  3. Dang, E.K.F.; Luk, R.W.P.; Allan, J.: ¬A context-dependent relevance model (2016) 0.30
    0.29944402 = sum of:
      0.29944402 = product of:
        0.74861 = sum of:
          0.024805503 = weight(abstract_txt:document in 2778) [ClassicSimilarity], result of:
            0.024805503 = score(doc=2778,freq=2.0), product of:
              0.06537802 = queryWeight, product of:
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.015230372 = queryNorm
              0.37941656 = fieldWeight in 2778, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=2778)
          0.0313603 = weight(abstract_txt:models in 2778) [ClassicSimilarity], result of:
            0.0313603 = score(doc=2778,freq=2.0), product of:
              0.07643986 = queryWeight, product of:
                1.0812947 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.015230372 = queryNorm
              0.4102611 = fieldWeight in 2778, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.0625 = fieldNorm(doc=2778)
          0.008343931 = weight(abstract_txt:that in 2778) [ClassicSimilarity], result of:
            0.008343931 = score(doc=2778,freq=2.0), product of:
              0.03984039 = queryWeight, product of:
                1.1039792 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.015230372 = queryNorm
              0.20943399 = fieldWeight in 2778, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=2778)
          0.0412613 = weight(abstract_txt:query in 2778) [ClassicSimilarity], result of:
            0.0412613 = score(doc=2778,freq=3.0), product of:
              0.08017973 = queryWeight, product of:
                1.1074303 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.015230372 = queryNorm
              0.5146101 = fieldWeight in 2778, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.0625 = fieldNorm(doc=2778)
          0.03722596 = weight(abstract_txt:retrieval in 2778) [ClassicSimilarity], result of:
            0.03722596 = score(doc=2778,freq=4.0), product of:
              0.08569662 = queryWeight, product of:
                1.6191272 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.015230372 = queryNorm
              0.43439242 = fieldWeight in 2778, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=2778)
          0.02110722 = weight(abstract_txt:information in 2778) [ClassicSimilarity], result of:
            0.02110722 = score(doc=2778,freq=5.0), product of:
              0.062385093 = queryWeight, product of:
                1.6919409 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.015230372 = queryNorm
              0.33833754 = fieldWeight in 2778, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=2778)
          0.11843301 = weight(abstract_txt:estimating in 2778) [ClassicSimilarity], result of:
            0.11843301 = score(doc=2778,freq=1.0), product of:
              0.2335563 = queryWeight, product of:
                1.8900788 = boost
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.015230372 = queryNorm
              0.5070855 = fieldWeight in 2778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.0625 = fieldNorm(doc=2778)
          0.070230804 = weight(abstract_txt:model in 2778) [ClassicSimilarity], result of:
            0.070230804 = score(doc=2778,freq=4.0), product of:
              0.14094667 = queryWeight, product of:
                2.3215687 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.015230372 = queryNorm
              0.49827924 = fieldWeight in 2778, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.0625 = fieldNorm(doc=2778)
          0.16876315 = weight(abstract_txt:modeling in 2778) [ClassicSimilarity], result of:
            0.16876315 = score(doc=2778,freq=1.0), product of:
              0.4490391 = queryWeight, product of:
                4.9029813 = boost
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.015230372 = queryNorm
              0.37583172 = fieldWeight in 2778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.0625 = fieldNorm(doc=2778)
          0.22707883 = weight(abstract_txt:language in 2778) [ClassicSimilarity], result of:
            0.22707883 = score(doc=2778,freq=4.0), product of:
              0.4343837 = queryWeight, product of:
                6.8197727 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.015230372 = queryNorm
              0.5227609 = fieldWeight in 2778, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.0625 = fieldNorm(doc=2778)
        0.4 = coord(10/25)
    
  4. Wang, J.; Oard, D.W.: Matching meaning for cross-language information retrieval (2012) 0.28
    0.27939337 = sum of:
      0.27939337 = product of:
        0.8731043 = sum of:
          0.012515898 = weight(abstract_txt:that in 7430) [ClassicSimilarity], result of:
            0.012515898 = score(doc=7430,freq=2.0), product of:
              0.03984039 = queryWeight, product of:
                1.1039792 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.015230372 = queryNorm
              0.314151 = fieldWeight in 7430, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.09375 = fieldNorm(doc=7430)
          0.035474267 = weight(abstract_txt:applications in 7430) [ClassicSimilarity], result of:
            0.035474267 = score(doc=7430,freq=1.0), product of:
              0.079791725 = queryWeight, product of:
                1.1047475 = boost
                4.7422485 = idf(docFreq=1047, maxDocs=44218)
                0.015230372 = queryNorm
              0.4445858 = fieldWeight in 7430, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7422485 = idf(docFreq=1047, maxDocs=44218)
                0.09375 = fieldNorm(doc=7430)
          0.03573333 = weight(abstract_txt:query in 7430) [ClassicSimilarity], result of:
            0.03573333 = score(doc=7430,freq=1.0), product of:
              0.08017973 = queryWeight, product of:
                1.1074303 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.015230372 = queryNorm
              0.44566542 = fieldWeight in 7430, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.09375 = fieldNorm(doc=7430)
          0.048357945 = weight(abstract_txt:retrieval in 7430) [ClassicSimilarity], result of:
            0.048357945 = score(doc=7430,freq=3.0), product of:
              0.08569662 = queryWeight, product of:
                1.6191272 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.015230372 = queryNorm
              0.5642923 = fieldWeight in 7430, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=7430)
          0.020024067 = weight(abstract_txt:information in 7430) [ClassicSimilarity], result of:
            0.020024067 = score(doc=7430,freq=2.0), product of:
              0.062385093 = queryWeight, product of:
                1.6919409 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.015230372 = queryNorm
              0.32097518 = fieldWeight in 7430, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.09375 = fieldNorm(doc=7430)
          0.2531447 = weight(abstract_txt:modeling in 7430) [ClassicSimilarity], result of:
            0.2531447 = score(doc=7430,freq=1.0), product of:
              0.4490391 = queryWeight, product of:
                4.9029813 = boost
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.015230372 = queryNorm
              0.5637476 = fieldWeight in 7430, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.09375 = fieldNorm(doc=7430)
          0.2270006 = weight(abstract_txt:statistical in 7430) [ClassicSimilarity], result of:
            0.2270006 = score(doc=7430,freq=1.0), product of:
              0.43656963 = queryWeight, product of:
                5.1682186 = boost
                5.5462847 = idf(docFreq=468, maxDocs=44218)
                0.015230372 = queryNorm
              0.5199642 = fieldWeight in 7430, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5462847 = idf(docFreq=468, maxDocs=44218)
                0.09375 = fieldNorm(doc=7430)
          0.24085347 = weight(abstract_txt:language in 7430) [ClassicSimilarity], result of:
            0.24085347 = score(doc=7430,freq=2.0), product of:
              0.4343837 = queryWeight, product of:
                6.8197727 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.015230372 = queryNorm
              0.55447173 = fieldWeight in 7430, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.09375 = fieldNorm(doc=7430)
        0.32 = coord(8/25)
    
  5. Larkey, L.S.; Connell, M.E.: Structured queries, language modelling, and relevance modelling in cross-language information retrieval (2005) 0.28
    0.2778312 = sum of:
      0.2778312 = product of:
        0.8682225 = sum of:
          0.0313603 = weight(abstract_txt:models in 1022) [ClassicSimilarity], result of:
            0.0313603 = score(doc=1022,freq=2.0), product of:
              0.07643986 = queryWeight, product of:
                1.0812947 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.015230372 = queryNorm
              0.4102611 = fieldWeight in 1022, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.0625 = fieldNorm(doc=1022)
          0.0102191875 = weight(abstract_txt:that in 1022) [ClassicSimilarity], result of:
            0.0102191875 = score(doc=1022,freq=3.0), product of:
              0.03984039 = queryWeight, product of:
                1.1039792 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.015230372 = queryNorm
              0.2565032 = fieldWeight in 1022, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=1022)
          0.058352295 = weight(abstract_txt:query in 1022) [ClassicSimilarity], result of:
            0.058352295 = score(doc=1022,freq=6.0), product of:
              0.08017973 = queryWeight, product of:
                1.1074303 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.015230372 = queryNorm
              0.72776866 = fieldWeight in 1022, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.0625 = fieldNorm(doc=1022)
          0.03223863 = weight(abstract_txt:retrieval in 1022) [ClassicSimilarity], result of:
            0.03223863 = score(doc=1022,freq=3.0), product of:
              0.08569662 = queryWeight, product of:
                1.6191272 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.015230372 = queryNorm
              0.37619486 = fieldWeight in 1022, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=1022)
          0.009439435 = weight(abstract_txt:information in 1022) [ClassicSimilarity], result of:
            0.009439435 = score(doc=1022,freq=1.0), product of:
              0.062385093 = queryWeight, product of:
                1.6919409 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.015230372 = queryNorm
              0.15130915 = fieldWeight in 1022, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=1022)
          0.035115402 = weight(abstract_txt:model in 1022) [ClassicSimilarity], result of:
            0.035115402 = score(doc=1022,freq=1.0), product of:
              0.14094667 = queryWeight, product of:
                2.3215687 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.015230372 = queryNorm
              0.24913962 = fieldWeight in 1022, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.0625 = fieldNorm(doc=1022)
          0.4133836 = weight(abstract_txt:modeling in 1022) [ClassicSimilarity], result of:
            0.4133836 = score(doc=1022,freq=6.0), product of:
              0.4490391 = queryWeight, product of:
                4.9029813 = boost
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.015230372 = queryNorm
              0.920596 = fieldWeight in 1022, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.0625 = fieldNorm(doc=1022)
          0.27811363 = weight(abstract_txt:language in 1022) [ClassicSimilarity], result of:
            0.27811363 = score(doc=1022,freq=6.0), product of:
              0.4343837 = queryWeight, product of:
                6.8197727 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.015230372 = queryNorm
              0.6402488 = fieldWeight in 1022, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.0625 = fieldNorm(doc=1022)
        0.32 = coord(8/25)