Document (#34038)

Author
Savoy, J.
Title
Searching strategies for the Hungarian language
Source
Information processing and management. 44(2008) no.1, S.310-324
Year
2008
Abstract
This paper reports on the underlying IR problems encountered when dealing with the complex morphology and compound constructions found in the Hungarian language. It describes evaluations carried out on two general stemming strategies for this language, and also demonstrates that a light stemming approach could be quite effective. Based on searches done on the CLEF test collection, we find that a more aggressive suffix-stripping approach may produce better MAP. When compared to an IR scheme without stemming or one based on only a light stemmer, we find the differences to be statistically significant. When compared with probabilistic, vector-space and language models, we find that the Okapi model results in the best retrieval effectiveness. The resulting MAP is found to be about 35% better than the classical tf idf approach, particularly for very short requests. Finally, we demonstrate that applying an automatic decompounding procedure for both queries and documents significantly improves IR performance (+10%), compared to word-based indexing strategies.
Theme
Computerlinguistik

Similar documents (author)

  1. Savoy, J.: Stemming of French words based on grammatical categories (1993) 5.21
    5.2059946 = sum of:
      5.2059946 = weight(author_txt:savoy in 4650) [ClassicSimilarity], result of:
        5.2059946 = fieldWeight in 4650, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.329592 = idf(docFreq=28, maxDocs=44218)
          0.625 = fieldNorm(doc=4650)
    
  2. Savoy, J.: Effectiveness of information retrieval systems used in a hypertext environment (1993) 5.21
    5.2059946 = sum of:
      5.2059946 = weight(author_txt:savoy in 6511) [ClassicSimilarity], result of:
        5.2059946 = fieldWeight in 6511, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.329592 = idf(docFreq=28, maxDocs=44218)
          0.625 = fieldNorm(doc=6511)
    
  3. Savoy, J.: ¬A learning scheme for information retrieval in hypertext (1994) 5.21
    5.2059946 = sum of:
      5.2059946 = weight(author_txt:savoy in 7292) [ClassicSimilarity], result of:
        5.2059946 = fieldWeight in 7292, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.329592 = idf(docFreq=28, maxDocs=44218)
          0.625 = fieldNorm(doc=7292)
    
  4. Savoy, J.: Bayesian inference networks and spreading activation in hypertext systems (1992) 5.21
    5.2059946 = sum of:
      5.2059946 = weight(author_txt:savoy in 192) [ClassicSimilarity], result of:
        5.2059946 = fieldWeight in 192, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.329592 = idf(docFreq=28, maxDocs=44218)
          0.625 = fieldNorm(doc=192)
    
  5. Savoy, J.: Searching information in legal hypertext systems (1993/94) 5.21
    5.2059946 = sum of:
      5.2059946 = weight(author_txt:savoy in 757) [ClassicSimilarity], result of:
        5.2059946 = fieldWeight in 757, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.329592 = idf(docFreq=28, maxDocs=44218)
          0.625 = fieldNorm(doc=757)
    

Similar documents (content)

  1. Dolamic, L.; Savoy, J.: Indexing and searching strategies for the Russian language (2009) 0.87
    0.86560965 = sum of:
      0.86560965 = product of:
        1.5457315 = sum of:
          0.0499003 = weight(abstract_txt:probabilistic in 3301) [ClassicSimilarity], result of:
            0.0499003 = score(doc=3301,freq=1.0), product of:
              0.117812574 = queryWeight, product of:
                1.0084453 = boost
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.017238831 = queryNorm
              0.42355666 = fieldWeight in 3301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.0625 = fieldNorm(doc=3301)
          0.071977966 = weight(abstract_txt:statistically in 3301) [ClassicSimilarity], result of:
            0.071977966 = score(doc=3301,freq=2.0), product of:
              0.11937478 = queryWeight, product of:
                1.0151093 = boost
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.017238831 = queryNorm
              0.6029579 = fieldWeight in 3301, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.0625 = fieldNorm(doc=3301)
          0.11643864 = weight(abstract_txt:okapi in 3301) [ClassicSimilarity], result of:
            0.11643864 = score(doc=3301,freq=2.0), product of:
              0.16450444 = queryWeight, product of:
                1.1916406 = boost
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.017238831 = queryNorm
              0.7078146 = fieldWeight in 3301, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.0625 = fieldNorm(doc=3301)
          0.17724857 = weight(abstract_txt:stemmer in 3301) [ClassicSimilarity], result of:
            0.17724857 = score(doc=3301,freq=2.0), product of:
              0.2176881 = queryWeight, product of:
                1.3707992 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.017238831 = queryNorm
              0.81423175 = fieldWeight in 3301, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.0625 = fieldNorm(doc=3301)
          0.17724857 = weight(abstract_txt:aggressive in 3301) [ClassicSimilarity], result of:
            0.17724857 = score(doc=3301,freq=2.0), product of:
              0.2176881 = queryWeight, product of:
                1.3707992 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.017238831 = queryNorm
              0.81423175 = fieldWeight in 3301, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.0625 = fieldNorm(doc=3301)
          0.008531494 = weight(abstract_txt:that in 3301) [ClassicSimilarity], result of:
            0.008531494 = score(doc=3301,freq=1.0), product of:
              0.057609342 = queryWeight, product of:
                1.4103696 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.017238831 = queryNorm
              0.1480922 = fieldWeight in 3301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=3301)
          0.034637045 = weight(abstract_txt:better in 3301) [ClassicSimilarity], result of:
            0.034637045 = score(doc=3301,freq=1.0), product of:
              0.11636617 = queryWeight, product of:
                1.4173753 = boost
                4.76249 = idf(docFreq=1026, maxDocs=44218)
                0.017238831 = queryNorm
              0.2976556 = fieldWeight in 3301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.76249 = idf(docFreq=1026, maxDocs=44218)
                0.0625 = fieldNorm(doc=3301)
          0.04376834 = weight(abstract_txt:approach in 3301) [ClassicSimilarity], result of:
            0.04376834 = score(doc=3301,freq=3.0), product of:
              0.10795172 = queryWeight, product of:
                1.6719832 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.017238831 = queryNorm
              0.40544364 = fieldWeight in 3301, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.0625 = fieldNorm(doc=3301)
          0.11591401 = weight(abstract_txt:light in 3301) [ClassicSimilarity], result of:
            0.11591401 = score(doc=3301,freq=3.0), product of:
              0.1805163 = queryWeight, product of:
                1.7653455 = boost
                5.931696 = idf(docFreq=318, maxDocs=44218)
                0.017238831 = queryNorm
              0.6421249 = fieldWeight in 3301, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.931696 = idf(docFreq=318, maxDocs=44218)
                0.0625 = fieldNorm(doc=3301)
          0.034336038 = weight(abstract_txt:when in 3301) [ClassicSimilarity], result of:
            0.034336038 = score(doc=3301,freq=1.0), product of:
              0.13243316 = queryWeight, product of:
                1.8518914 = boost
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.017238831 = queryNorm
              0.2592707 = fieldWeight in 3301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.0625 = fieldNorm(doc=3301)
          0.056167223 = weight(abstract_txt:find in 3301) [ClassicSimilarity], result of:
            0.056167223 = score(doc=3301,freq=1.0), product of:
              0.18385915 = queryWeight, product of:
                2.1820252 = boost
                4.887848 = idf(docFreq=905, maxDocs=44218)
                0.017238831 = queryNorm
              0.3054905 = fieldWeight in 3301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.887848 = idf(docFreq=905, maxDocs=44218)
                0.0625 = fieldNorm(doc=3301)
          0.09146831 = weight(abstract_txt:strategies in 3301) [ClassicSimilarity], result of:
            0.09146831 = score(doc=3301,freq=2.0), product of:
              0.2019918 = queryWeight, product of:
                2.2870939 = boost
                5.123207 = idf(docFreq=715, maxDocs=44218)
                0.017238831 = queryNorm
              0.4528318 = fieldWeight in 3301, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.123207 = idf(docFreq=715, maxDocs=44218)
                0.0625 = fieldNorm(doc=3301)
          0.08124724 = weight(abstract_txt:language in 3301) [ClassicSimilarity], result of:
            0.08124724 = score(doc=3301,freq=3.0), product of:
              0.17946297 = queryWeight, product of:
                2.489281 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.017238831 = queryNorm
              0.45272425 = fieldWeight in 3301, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.0625 = fieldNorm(doc=3301)
          0.4868477 = weight(abstract_txt:stemming in 3301) [ClassicSimilarity], result of:
            0.4868477 = score(doc=3301,freq=6.0), product of:
              0.42694795 = queryWeight, product of:
                3.3250992 = boost
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.017238831 = queryNorm
              1.1402975 = fieldWeight in 3301, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.0625 = fieldNorm(doc=3301)
        0.56 = coord(14/25)
    
  2. Brychcín, T.; Konopík, M.: HPS: High precision stemmer (2015) 0.42
    0.42441612 = sum of:
      0.42441612 = product of:
        1.1789336 = sum of:
          0.12533367 = weight(abstract_txt:stemmer in 2686) [ClassicSimilarity], result of:
            0.12533367 = score(doc=2686,freq=1.0), product of:
              0.2176881 = queryWeight, product of:
                1.3707992 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.017238831 = queryNorm
              0.5757488 = fieldWeight in 2686, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.0625 = fieldNorm(doc=2686)
          0.0147769805 = weight(abstract_txt:that in 2686) [ClassicSimilarity], result of:
            0.0147769805 = score(doc=2686,freq=3.0), product of:
              0.057609342 = queryWeight, product of:
                1.4103696 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.017238831 = queryNorm
              0.2565032 = fieldWeight in 2686, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=2686)
          0.022037907 = weight(abstract_txt:based in 2686) [ClassicSimilarity], result of:
            0.022037907 = score(doc=2686,freq=2.0), product of:
              0.07821082 = queryWeight, product of:
                1.4231496 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.017238831 = queryNorm
              0.28177565 = fieldWeight in 2686, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=2686)
          0.04376834 = weight(abstract_txt:approach in 2686) [ClassicSimilarity], result of:
            0.04376834 = score(doc=2686,freq=3.0), product of:
              0.10795172 = queryWeight, product of:
                1.6719832 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.017238831 = queryNorm
              0.40544364 = fieldWeight in 2686, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.0625 = fieldNorm(doc=2686)
          0.048558492 = weight(abstract_txt:when in 2686) [ClassicSimilarity], result of:
            0.048558492 = score(doc=2686,freq=2.0), product of:
              0.13243316 = queryWeight, product of:
                1.8518914 = boost
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.017238831 = queryNorm
              0.36666414 = fieldWeight in 2686, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.0625 = fieldNorm(doc=2686)
          0.08584717 = weight(abstract_txt:compared in 2686) [ClassicSimilarity], result of:
            0.08584717 = score(doc=2686,freq=2.0), product of:
              0.19362909 = queryWeight, product of:
                2.2392492 = boost
                5.0160327 = idf(docFreq=796, maxDocs=44218)
                0.017238831 = queryNorm
              0.44335884 = fieldWeight in 2686, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0160327 = idf(docFreq=796, maxDocs=44218)
                0.0625 = fieldNorm(doc=2686)
          0.046908114 = weight(abstract_txt:language in 2686) [ClassicSimilarity], result of:
            0.046908114 = score(doc=2686,freq=1.0), product of:
              0.17946297 = queryWeight, product of:
                2.489281 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.017238831 = queryNorm
              0.26138046 = fieldWeight in 2686, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.0625 = fieldNorm(doc=2686)
          0.2658473 = weight(abstract_txt:hungarian in 2686) [ClassicSimilarity], result of:
            0.2658473 = score(doc=2686,freq=1.0), product of:
              0.4527805 = queryWeight, product of:
                2.7958596 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.017238831 = queryNorm
              0.5871439 = fieldWeight in 2686, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.0625 = fieldNorm(doc=2686)
          0.5258556 = weight(abstract_txt:stemming in 2686) [ClassicSimilarity], result of:
            0.5258556 = score(doc=2686,freq=7.0), product of:
              0.42694795 = queryWeight, product of:
                3.3250992 = boost
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.017238831 = queryNorm
              1.231662 = fieldWeight in 2686, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.0625 = fieldNorm(doc=2686)
        0.36 = coord(9/25)
    
  3. Nagy T., I.: Detecting multiword expressions and named entities in natural language texts (2014) 0.29
    0.29342914 = sum of:
      0.29342914 = product of:
        0.73357284 = sum of:
          0.042142063 = weight(abstract_txt:compound in 1536) [ClassicSimilarity], result of:
            0.042142063 = score(doc=1536,freq=1.0), product of:
              0.14399478 = queryWeight, product of:
                1.1148845 = boost
                7.4921947 = idf(docFreq=66, maxDocs=44218)
                0.017238831 = queryNorm
              0.29266384 = fieldWeight in 1536, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4921947 = idf(docFreq=66, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1536)
          0.13653865 = weight(abstract_txt:constructions in 1536) [ClassicSimilarity], result of:
            0.13653865 = score(doc=1536,freq=5.0), product of:
              0.18438119 = queryWeight, product of:
                1.26158 = boost
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.017238831 = queryNorm
              0.7405238 = fieldWeight in 1536, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1536)
          0.0130611295 = weight(abstract_txt:that in 1536) [ClassicSimilarity], result of:
            0.0130611295 = score(doc=1536,freq=6.0), product of:
              0.057609342 = queryWeight, product of:
                1.4103696 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.017238831 = queryNorm
              0.22671895 = fieldWeight in 1536, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1536)
          0.021648152 = weight(abstract_txt:better in 1536) [ClassicSimilarity], result of:
            0.021648152 = score(doc=1536,freq=1.0), product of:
              0.11636617 = queryWeight, product of:
                1.4173753 = boost
                4.76249 = idf(docFreq=1026, maxDocs=44218)
                0.017238831 = queryNorm
              0.18603475 = fieldWeight in 1536, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.76249 = idf(docFreq=1026, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1536)
          0.027547384 = weight(abstract_txt:based in 1536) [ClassicSimilarity], result of:
            0.027547384 = score(doc=1536,freq=8.0), product of:
              0.07821082 = queryWeight, product of:
                1.4231496 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.017238831 = queryNorm
              0.35221958 = fieldWeight in 1536, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1536)
          0.022335436 = weight(abstract_txt:approach in 1536) [ClassicSimilarity], result of:
            0.022335436 = score(doc=1536,freq=2.0), product of:
              0.10795172 = queryWeight, product of:
                1.6719832 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.017238831 = queryNorm
              0.20690209 = fieldWeight in 1536, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1536)
          0.09352772 = weight(abstract_txt:light in 1536) [ClassicSimilarity], result of:
            0.09352772 = score(doc=1536,freq=5.0), product of:
              0.1805163 = queryWeight, product of:
                1.7653455 = boost
                5.931696 = idf(docFreq=318, maxDocs=44218)
                0.017238831 = queryNorm
              0.5181123 = fieldWeight in 1536, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.931696 = idf(docFreq=318, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1536)
          0.030349057 = weight(abstract_txt:when in 1536) [ClassicSimilarity], result of:
            0.030349057 = score(doc=1536,freq=2.0), product of:
              0.13243316 = queryWeight, product of:
                1.8518914 = boost
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.017238831 = queryNorm
              0.22916509 = fieldWeight in 1536, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1536)
          0.05863514 = weight(abstract_txt:language in 1536) [ClassicSimilarity], result of:
            0.05863514 = score(doc=1536,freq=4.0), product of:
              0.17946297 = queryWeight, product of:
                2.489281 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.017238831 = queryNorm
              0.32672557 = fieldWeight in 1536, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1536)
          0.28778812 = weight(abstract_txt:hungarian in 1536) [ClassicSimilarity], result of:
            0.28778812 = score(doc=1536,freq=3.0), product of:
              0.4527805 = queryWeight, product of:
                2.7958596 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.017238831 = queryNorm
              0.6356019 = fieldWeight in 1536, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1536)
        0.4 = coord(10/25)
    
  4. Kettunen, K.; Kunttu, T.; Järvelin, K.: To stem or lemmatize a highly inflectional language in a probabilistic IR environment? (2005) 0.27
    0.26516363 = sum of:
      0.26516363 = product of:
        0.7365656 = sum of:
          0.06174847 = weight(abstract_txt:probabilistic in 4395) [ClassicSimilarity], result of:
            0.06174847 = score(doc=4395,freq=2.0), product of:
              0.117812574 = queryWeight, product of:
                1.0084453 = boost
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.017238831 = queryNorm
              0.5241246 = fieldWeight in 4395, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4395)
          0.0445341 = weight(abstract_txt:statistically in 4395) [ClassicSimilarity], result of:
            0.0445341 = score(doc=4395,freq=1.0), product of:
              0.11937478 = queryWeight, product of:
                1.0151093 = boost
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.017238831 = queryNorm
              0.37306118 = fieldWeight in 4395, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4395)
          0.058998894 = weight(abstract_txt:compound in 4395) [ClassicSimilarity], result of:
            0.058998894 = score(doc=4395,freq=1.0), product of:
              0.14399478 = queryWeight, product of:
                1.1148845 = boost
                7.4921947 = idf(docFreq=66, maxDocs=44218)
                0.017238831 = queryNorm
              0.4097294 = fieldWeight in 4395, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4921947 = idf(docFreq=66, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4395)
          0.1550925 = weight(abstract_txt:stemmer in 4395) [ClassicSimilarity], result of:
            0.1550925 = score(doc=4395,freq=2.0), product of:
              0.2176881 = queryWeight, product of:
                1.3707992 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.017238831 = queryNorm
              0.71245277 = fieldWeight in 4395, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4395)
          0.012929858 = weight(abstract_txt:that in 4395) [ClassicSimilarity], result of:
            0.012929858 = score(doc=4395,freq=3.0), product of:
              0.057609342 = queryWeight, product of:
                1.4103696 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.017238831 = queryNorm
              0.22444029 = fieldWeight in 4395, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4395)
          0.022110956 = weight(abstract_txt:approach in 4395) [ClassicSimilarity], result of:
            0.022110956 = score(doc=4395,freq=1.0), product of:
              0.10795172 = queryWeight, product of:
                1.6719832 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.017238831 = queryNorm
              0.20482263 = fieldWeight in 4395, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4395)
          0.053115223 = weight(abstract_txt:compared in 4395) [ClassicSimilarity], result of:
            0.053115223 = score(doc=4395,freq=1.0), product of:
              0.19362909 = queryWeight, product of:
                2.2392492 = boost
                5.0160327 = idf(docFreq=796, maxDocs=44218)
                0.017238831 = queryNorm
              0.27431428 = fieldWeight in 4395, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0160327 = idf(docFreq=796, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4395)
          0.0820892 = weight(abstract_txt:language in 4395) [ClassicSimilarity], result of:
            0.0820892 = score(doc=4395,freq=4.0), product of:
              0.17946297 = queryWeight, product of:
                2.489281 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.017238831 = queryNorm
              0.45741582 = fieldWeight in 4395, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4395)
          0.24594644 = weight(abstract_txt:stemming in 4395) [ClassicSimilarity], result of:
            0.24594644 = score(doc=4395,freq=2.0), product of:
              0.42694795 = queryWeight, product of:
                3.3250992 = boost
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.017238831 = queryNorm
              0.5760572 = fieldWeight in 4395, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4395)
        0.36 = coord(9/25)
    
  5. Fautsch, C.; Savoy, J.: Algorithmic stemmers or morphological analysis? : an evaluation (2009) 0.20
    0.20418826 = sum of:
      0.20418826 = product of:
        0.72924376 = sum of:
          0.06082137 = weight(abstract_txt:improves in 2950) [ClassicSimilarity], result of:
            0.06082137 = score(doc=2950,freq=1.0), product of:
              0.11584759 = queryWeight, product of:
                6.7201533 = idf(docFreq=144, maxDocs=44218)
                0.017238831 = queryNorm
              0.52501196 = fieldWeight in 2950, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7201533 = idf(docFreq=144, maxDocs=44218)
                0.078125 = fieldNorm(doc=2950)
          0.15666708 = weight(abstract_txt:stemmer in 2950) [ClassicSimilarity], result of:
            0.15666708 = score(doc=2950,freq=1.0), product of:
              0.2176881 = queryWeight, product of:
                1.3707992 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.017238831 = queryNorm
              0.71968603 = fieldWeight in 2950, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.078125 = fieldNorm(doc=2950)
          0.0150816925 = weight(abstract_txt:that in 2950) [ClassicSimilarity], result of:
            0.0150816925 = score(doc=2950,freq=2.0), product of:
              0.057609342 = queryWeight, product of:
                1.4103696 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.017238831 = queryNorm
              0.26179248 = fieldWeight in 2950, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=2950)
          0.019478941 = weight(abstract_txt:based in 2950) [ClassicSimilarity], result of:
            0.019478941 = score(doc=2950,freq=1.0), product of:
              0.07821082 = queryWeight, product of:
                1.4231496 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.017238831 = queryNorm
              0.24905685 = fieldWeight in 2950, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.078125 = fieldNorm(doc=2950)
          0.042920046 = weight(abstract_txt:when in 2950) [ClassicSimilarity], result of:
            0.042920046 = score(doc=2950,freq=1.0), product of:
              0.13243316 = queryWeight, product of:
                1.8518914 = boost
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.017238831 = queryNorm
              0.32408836 = fieldWeight in 2950, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.078125 = fieldNorm(doc=2950)
          0.082922615 = weight(abstract_txt:language in 2950) [ClassicSimilarity], result of:
            0.082922615 = score(doc=2950,freq=2.0), product of:
              0.17946297 = queryWeight, product of:
                2.489281 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.017238831 = queryNorm
              0.46205974 = fieldWeight in 2950, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.078125 = fieldNorm(doc=2950)
          0.35135204 = weight(abstract_txt:stemming in 2950) [ClassicSimilarity], result of:
            0.35135204 = score(doc=2950,freq=2.0), product of:
              0.42694795 = queryWeight, product of:
                3.3250992 = boost
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.017238831 = queryNorm
              0.8229388 = fieldWeight in 2950, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.078125 = fieldNorm(doc=2950)
        0.28 = coord(7/25)