Document (#34039)

Author
Savoy, J.
Title
Searching strategies for the Hungarian language
Source
Information processing and management. 44(2008) no.1, S.310-324
Year
2008
Abstract
This paper reports on the underlying IR problems encountered when dealing with the complex morphology and compound constructions found in the Hungarian language. It describes evaluations carried out on two general stemming strategies for this language, and also demonstrates that a light stemming approach could be quite effective. Based on searches done on the CLEF test collection, we find that a more aggressive suffix-stripping approach may produce better MAP. When compared to an IR scheme without stemming or one based on only a light stemmer, we find the differences to be statistically significant. When compared with probabilistic, vector-space and language models, we find that the Okapi model results in the best retrieval effectiveness. The resulting MAP is found to be about 35% better than the classical tf idf approach, particularly for very short requests. Finally, we demonstrate that applying an automatic decompounding procedure for both queries and documents significantly improves IR performance (+10%), compared to word-based indexing strategies.
Theme
Computerlinguistik

Similar documents (author)

  1. Savoy, J.: Stemming of French words based on grammatical categories (1993) 5.21
    5.2141504 = sum of:
      5.2141504 = weight(author_txt:savoy in 4650) [ClassicSimilarity], result of:
        5.2141504 = fieldWeight in 4650, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.342641 = idf(docFreq=27, maxDocs=43254)
          0.625 = fieldNorm(doc=4650)
    
  2. Savoy, J.: Effectiveness of information retrieval systems used in a hypertext environment (1993) 5.21
    5.2141504 = sum of:
      5.2141504 = weight(author_txt:savoy in 6511) [ClassicSimilarity], result of:
        5.2141504 = fieldWeight in 6511, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.342641 = idf(docFreq=27, maxDocs=43254)
          0.625 = fieldNorm(doc=6511)
    
  3. Savoy, J.: ¬A learning scheme for information retrieval in hypertext (1994) 5.21
    5.2141504 = sum of:
      5.2141504 = weight(author_txt:savoy in 292) [ClassicSimilarity], result of:
        5.2141504 = fieldWeight in 292, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.342641 = idf(docFreq=27, maxDocs=43254)
          0.625 = fieldNorm(doc=292)
    
  4. Savoy, J.: Bayesian inference networks and spreading activation in hypertext systems (1992) 5.21
    5.2141504 = sum of:
      5.2141504 = weight(author_txt:savoy in 1261) [ClassicSimilarity], result of:
        5.2141504 = fieldWeight in 1261, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.342641 = idf(docFreq=27, maxDocs=43254)
          0.625 = fieldNorm(doc=1261)
    
  5. Savoy, J.: Searching information in legal hypertext systems (1993/94) 5.21
    5.2141504 = sum of:
      5.2141504 = weight(author_txt:savoy in 1826) [ClassicSimilarity], result of:
        5.2141504 = fieldWeight in 1826, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.342641 = idf(docFreq=27, maxDocs=43254)
          0.625 = fieldNorm(doc=1826)
    

Similar documents (content)

  1. Dolamic, L.; Savoy, J.: Indexing and searching strategies for the Russian language (2009) 0.87
    0.86645174 = sum of:
      0.86645174 = product of:
        1.5472353 = sum of:
          0.04985507 = weight(abstract_txt:probabilistic in 302) [ClassicSimilarity], result of:
            0.04985507 = score(doc=302,freq=1.0), product of:
              0.11770407 = queryWeight, product of:
                1.0010983 = boost
                6.777005 = idf(docFreq=133, maxDocs=43254)
                0.0173491 = queryNorm
              0.42356282 = fieldWeight in 302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.777005 = idf(docFreq=133, maxDocs=43254)
                0.0625 = fieldNorm(doc=302)
          0.071699254 = weight(abstract_txt:statistically in 302) [ClassicSimilarity], result of:
            0.071699254 = score(doc=302,freq=2.0), product of:
              0.1190287 = queryWeight, product of:
                1.0067157 = boost
                6.8150325 = idf(docFreq=128, maxDocs=43254)
                0.0173491 = queryNorm
              0.6023694 = fieldWeight in 302, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8150325 = idf(docFreq=128, maxDocs=43254)
                0.0625 = fieldNorm(doc=302)
          0.11537007 = weight(abstract_txt:okapi in 302) [ClassicSimilarity], result of:
            0.11537007 = score(doc=302,freq=2.0), product of:
              0.16344465 = queryWeight, product of:
                1.1796857 = boost
                7.9859657 = idf(docFreq=39, maxDocs=43254)
                0.0173491 = queryNorm
              0.7058663 = fieldWeight in 302, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.9859657 = idf(docFreq=39, maxDocs=43254)
                0.0625 = fieldNorm(doc=302)
          0.17581205 = weight(abstract_txt:aggressive in 302) [ClassicSimilarity], result of:
            0.17581205 = score(doc=302,freq=2.0), product of:
              0.21644175 = queryWeight, product of:
                1.3575364 = boost
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.0173491 = queryNorm
              0.81228346 = fieldWeight in 302, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.0625 = fieldNorm(doc=302)
          0.17581205 = weight(abstract_txt:stemmer in 302) [ClassicSimilarity], result of:
            0.17581205 = score(doc=302,freq=2.0), product of:
              0.21644175 = queryWeight, product of:
                1.3575364 = boost
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.0173491 = queryNorm
              0.81228346 = fieldWeight in 302, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.0625 = fieldNorm(doc=302)
          0.0086965775 = weight(abstract_txt:that in 302) [ClassicSimilarity], result of:
            0.0086965775 = score(doc=302,freq=1.0), product of:
              0.05833164 = queryWeight, product of:
                1.4094934 = boost
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.0173491 = queryNorm
              0.14908852 = fieldWeight in 302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.0625 = fieldNorm(doc=302)
          0.035125144 = weight(abstract_txt:better in 302) [ClassicSimilarity], result of:
            0.035125144 = score(doc=302,freq=1.0), product of:
              0.11741962 = queryWeight, product of:
                1.414055 = boost
                4.7862725 = idf(docFreq=980, maxDocs=43254)
                0.0173491 = queryNorm
              0.29914203 = fieldWeight in 302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7862725 = idf(docFreq=980, maxDocs=43254)
                0.0625 = fieldNorm(doc=302)
          0.044096414 = weight(abstract_txt:approach in 302) [ClassicSimilarity], result of:
            0.044096414 = score(doc=302,freq=3.0), product of:
              0.108456135 = queryWeight, product of:
                1.664442 = boost
                3.7558525 = idf(docFreq=2748, maxDocs=43254)
                0.0173491 = queryNorm
              0.40658295 = fieldWeight in 302, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.7558525 = idf(docFreq=2748, maxDocs=43254)
                0.0625 = fieldNorm(doc=302)
          0.1175353 = weight(abstract_txt:light in 302) [ClassicSimilarity], result of:
            0.1175353 = score(doc=302,freq=3.0), product of:
              0.18213794 = queryWeight, product of:
                1.7611493 = boost
                5.961112 = idf(docFreq=302, maxDocs=43254)
                0.0173491 = queryNorm
              0.64530927 = fieldWeight in 302, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.961112 = idf(docFreq=302, maxDocs=43254)
                0.0625 = fieldNorm(doc=302)
          0.03459483 = weight(abstract_txt:when in 302) [ClassicSimilarity], result of:
            0.03459483 = score(doc=302,freq=1.0), product of:
              0.1330556 = queryWeight, product of:
                1.8435638 = boost
                4.160045 = idf(docFreq=1834, maxDocs=43254)
                0.0173491 = queryNorm
              0.26000282 = fieldWeight in 302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.160045 = idf(docFreq=1834, maxDocs=43254)
                0.0625 = fieldNorm(doc=302)
          0.056793485 = weight(abstract_txt:find in 302) [ClassicSimilarity], result of:
            0.056793485 = score(doc=302,freq=1.0), product of:
              0.18516463 = queryWeight, product of:
                2.1748066 = boost
                4.9075017 = idf(docFreq=868, maxDocs=43254)
                0.0173491 = queryNorm
              0.30671886 = fieldWeight in 302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9075017 = idf(docFreq=868, maxDocs=43254)
                0.0625 = fieldNorm(doc=302)
          0.09233985 = weight(abstract_txt:strategies in 302) [ClassicSimilarity], result of:
            0.09233985 = score(doc=302,freq=2.0), product of:
              0.20320848 = queryWeight, product of:
                2.2783084 = boost
                5.141056 = idf(docFreq=687, maxDocs=43254)
                0.0173491 = queryNorm
              0.45440945 = fieldWeight in 302, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.141056 = idf(docFreq=687, maxDocs=43254)
                0.0625 = fieldNorm(doc=302)
          0.081758134 = weight(abstract_txt:language in 302) [ClassicSimilarity], result of:
            0.081758134 = score(doc=302,freq=3.0), product of:
              0.18015742 = queryWeight, product of:
                2.477063 = boost
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.0173491 = queryNorm
              0.45381495 = fieldWeight in 302, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.0625 = fieldNorm(doc=302)
          0.48774707 = weight(abstract_txt:stemming in 302) [ClassicSimilarity], result of:
            0.48774707 = score(doc=302,freq=6.0), product of:
              0.42733818 = queryWeight, product of:
                3.3039043 = boost
                7.4553375 = idf(docFreq=67, maxDocs=43254)
                0.0173491 = queryNorm
              1.1413609 = fieldWeight in 302, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.4553375 = idf(docFreq=67, maxDocs=43254)
                0.0625 = fieldNorm(doc=302)
        0.56 = coord(14/25)
    
  2. Brychcín, T.; Konopík, M.: HPS: High precision stemmer (2015) 0.42
    0.4242277 = sum of:
      0.4242277 = product of:
        1.1784103 = sum of:
          0.1243179 = weight(abstract_txt:stemmer in 4151) [ClassicSimilarity], result of:
            0.1243179 = score(doc=4151,freq=1.0), product of:
              0.21644175 = queryWeight, product of:
                1.3575364 = boost
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.0173491 = queryNorm
              0.57437116 = fieldWeight in 4151, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.0625 = fieldNorm(doc=4151)
          0.015062913 = weight(abstract_txt:that in 4151) [ClassicSimilarity], result of:
            0.015062913 = score(doc=4151,freq=3.0), product of:
              0.05833164 = queryWeight, product of:
                1.4094934 = boost
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.0173491 = queryNorm
              0.25822887 = fieldWeight in 4151, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.0625 = fieldNorm(doc=4151)
          0.02231028 = weight(abstract_txt:based in 4151) [ClassicSimilarity], result of:
            0.02231028 = score(doc=4151,freq=2.0), product of:
              0.078828946 = queryWeight, product of:
                1.4190067 = boost
                3.2020218 = idf(docFreq=4782, maxDocs=43254)
                0.0173491 = queryNorm
              0.28302142 = fieldWeight in 4151, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.2020218 = idf(docFreq=4782, maxDocs=43254)
                0.0625 = fieldNorm(doc=4151)
          0.044096414 = weight(abstract_txt:approach in 4151) [ClassicSimilarity], result of:
            0.044096414 = score(doc=4151,freq=3.0), product of:
              0.108456135 = queryWeight, product of:
                1.664442 = boost
                3.7558525 = idf(docFreq=2748, maxDocs=43254)
                0.0173491 = queryNorm
              0.40658295 = fieldWeight in 4151, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.7558525 = idf(docFreq=2748, maxDocs=43254)
                0.0625 = fieldNorm(doc=4151)
          0.048924476 = weight(abstract_txt:when in 4151) [ClassicSimilarity], result of:
            0.048924476 = score(doc=4151,freq=2.0), product of:
              0.1330556 = queryWeight, product of:
                1.8435638 = boost
                4.160045 = idf(docFreq=1834, maxDocs=43254)
                0.0173491 = queryNorm
              0.3676995 = fieldWeight in 4151, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.160045 = idf(docFreq=1834, maxDocs=43254)
                0.0625 = fieldNorm(doc=4151)
          0.08593868 = weight(abstract_txt:compared in 4151) [ClassicSimilarity], result of:
            0.08593868 = score(doc=4151,freq=2.0), product of:
              0.19370528 = queryWeight, product of:
                2.2243972 = boost
                5.0194044 = idf(docFreq=776, maxDocs=43254)
                0.0173491 = queryNorm
              0.44365686 = fieldWeight in 4151, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0194044 = idf(docFreq=776, maxDocs=43254)
                0.0625 = fieldNorm(doc=4151)
          0.04720308 = weight(abstract_txt:language in 4151) [ClassicSimilarity], result of:
            0.04720308 = score(doc=4151,freq=1.0), product of:
              0.18015742 = queryWeight, product of:
                2.477063 = boost
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.0173491 = queryNorm
              0.2620102 = fieldWeight in 4151, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.0625 = fieldNorm(doc=4151)
          0.26372957 = weight(abstract_txt:hungarian in 4151) [ClassicSimilarity], result of:
            0.26372957 = score(doc=4151,freq=1.0), product of:
              0.45023006 = queryWeight, product of:
                2.7689378 = boost
                9.37226 = idf(docFreq=9, maxDocs=43254)
                0.0173491 = queryNorm
              0.58576626 = fieldWeight in 4151, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.37226 = idf(docFreq=9, maxDocs=43254)
                0.0625 = fieldNorm(doc=4151)
          0.526827 = weight(abstract_txt:stemming in 4151) [ClassicSimilarity], result of:
            0.526827 = score(doc=4151,freq=7.0), product of:
              0.42733818 = queryWeight, product of:
                3.3039043 = boost
                7.4553375 = idf(docFreq=67, maxDocs=43254)
                0.0173491 = queryNorm
              1.2328105 = fieldWeight in 4151, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.4553375 = idf(docFreq=67, maxDocs=43254)
                0.0625 = fieldNorm(doc=4151)
        0.36 = coord(9/25)
    
  3. Nagy T., I.: Detecting multiword expressions and named entities in natural language texts (2014) 0.29
    0.29488146 = sum of:
      0.29488146 = product of:
        0.73720366 = sum of:
          0.04224148 = weight(abstract_txt:compound in 3001) [ClassicSimilarity], result of:
            0.04224148 = score(doc=3001,freq=1.0), product of:
              0.14417545 = queryWeight, product of:
                1.1079665 = boost
                7.500458 = idf(docFreq=64, maxDocs=43254)
                0.0173491 = queryNorm
              0.29298663 = fieldWeight in 3001, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.500458 = idf(docFreq=64, maxDocs=43254)
                0.0390625 = fieldNorm(doc=3001)
          0.13939121 = weight(abstract_txt:constructions in 3001) [ClassicSimilarity], result of:
            0.13939121 = score(doc=3001,freq=5.0), product of:
              0.18688117 = queryWeight, product of:
                1.2614317 = boost
                8.5393505 = idf(docFreq=22, maxDocs=43254)
                0.0173491 = queryNorm
              0.74588156 = fieldWeight in 3001, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.5393505 = idf(docFreq=22, maxDocs=43254)
                0.0390625 = fieldNorm(doc=3001)
          0.013313861 = weight(abstract_txt:that in 3001) [ClassicSimilarity], result of:
            0.013313861 = score(doc=3001,freq=6.0), product of:
              0.05833164 = queryWeight, product of:
                1.4094934 = boost
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.0173491 = queryNorm
              0.22824425 = fieldWeight in 3001, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.0390625 = fieldNorm(doc=3001)
          0.021953216 = weight(abstract_txt:better in 3001) [ClassicSimilarity], result of:
            0.021953216 = score(doc=3001,freq=1.0), product of:
              0.11741962 = queryWeight, product of:
                1.414055 = boost
                4.7862725 = idf(docFreq=980, maxDocs=43254)
                0.0173491 = queryNorm
              0.18696377 = fieldWeight in 3001, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7862725 = idf(docFreq=980, maxDocs=43254)
                0.0390625 = fieldNorm(doc=3001)
          0.027887851 = weight(abstract_txt:based in 3001) [ClassicSimilarity], result of:
            0.027887851 = score(doc=3001,freq=8.0), product of:
              0.078828946 = queryWeight, product of:
                1.4190067 = boost
                3.2020218 = idf(docFreq=4782, maxDocs=43254)
                0.0173491 = queryNorm
              0.35377678 = fieldWeight in 3001, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                3.2020218 = idf(docFreq=4782, maxDocs=43254)
                0.0390625 = fieldNorm(doc=3001)
          0.022502856 = weight(abstract_txt:approach in 3001) [ClassicSimilarity], result of:
            0.022502856 = score(doc=3001,freq=2.0), product of:
              0.108456135 = queryWeight, product of:
                1.664442 = boost
                3.7558525 = idf(docFreq=2748, maxDocs=43254)
                0.0173491 = queryNorm
              0.20748349 = fieldWeight in 3001, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7558525 = idf(docFreq=2748, maxDocs=43254)
                0.0390625 = fieldNorm(doc=3001)
          0.094835885 = weight(abstract_txt:light in 3001) [ClassicSimilarity], result of:
            0.094835885 = score(doc=3001,freq=5.0), product of:
              0.18213794 = queryWeight, product of:
                1.7611493 = boost
                5.961112 = idf(docFreq=302, maxDocs=43254)
                0.0173491 = queryNorm
              0.5206817 = fieldWeight in 3001, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.961112 = idf(docFreq=302, maxDocs=43254)
                0.0390625 = fieldNorm(doc=3001)
          0.030577797 = weight(abstract_txt:when in 3001) [ClassicSimilarity], result of:
            0.030577797 = score(doc=3001,freq=2.0), product of:
              0.1330556 = queryWeight, product of:
                1.8435638 = boost
                4.160045 = idf(docFreq=1834, maxDocs=43254)
                0.0173491 = queryNorm
              0.22981219 = fieldWeight in 3001, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.160045 = idf(docFreq=1834, maxDocs=43254)
                0.0390625 = fieldNorm(doc=3001)
          0.059003852 = weight(abstract_txt:language in 3001) [ClassicSimilarity], result of:
            0.059003852 = score(doc=3001,freq=4.0), product of:
              0.18015742 = queryWeight, product of:
                2.477063 = boost
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.0173491 = queryNorm
              0.32751274 = fieldWeight in 3001, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.0390625 = fieldNorm(doc=3001)
          0.28549564 = weight(abstract_txt:hungarian in 3001) [ClassicSimilarity], result of:
            0.28549564 = score(doc=3001,freq=3.0), product of:
              0.45023006 = queryWeight, product of:
                2.7689378 = boost
                9.37226 = idf(docFreq=9, maxDocs=43254)
                0.0173491 = queryNorm
              0.63411057 = fieldWeight in 3001, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.37226 = idf(docFreq=9, maxDocs=43254)
                0.0390625 = fieldNorm(doc=3001)
        0.4 = coord(10/25)
    
  4. Kettunen, K.; Kunttu, T.; Järvelin, K.: To stem or lemmatize a highly inflectional language in a probabilistic IR environment? (2005) 0.27
    0.2651985 = sum of:
      0.2651985 = product of:
        0.7366625 = sum of:
          0.0616925 = weight(abstract_txt:probabilistic in 396) [ClassicSimilarity], result of:
            0.0616925 = score(doc=396,freq=2.0), product of:
              0.11770407 = queryWeight, product of:
                1.0010983 = boost
                6.777005 = idf(docFreq=133, maxDocs=43254)
                0.0173491 = queryNorm
              0.52413225 = fieldWeight in 396, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.777005 = idf(docFreq=133, maxDocs=43254)
                0.0546875 = fieldNorm(doc=396)
          0.04436165 = weight(abstract_txt:statistically in 396) [ClassicSimilarity], result of:
            0.04436165 = score(doc=396,freq=1.0), product of:
              0.1190287 = queryWeight, product of:
                1.0067157 = boost
                6.8150325 = idf(docFreq=128, maxDocs=43254)
                0.0173491 = queryNorm
              0.3726971 = fieldWeight in 396, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8150325 = idf(docFreq=128, maxDocs=43254)
                0.0546875 = fieldNorm(doc=396)
          0.059138075 = weight(abstract_txt:compound in 396) [ClassicSimilarity], result of:
            0.059138075 = score(doc=396,freq=1.0), product of:
              0.14417545 = queryWeight, product of:
                1.1079665 = boost
                7.500458 = idf(docFreq=64, maxDocs=43254)
                0.0173491 = queryNorm
              0.41018128 = fieldWeight in 396, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.500458 = idf(docFreq=64, maxDocs=43254)
                0.0546875 = fieldNorm(doc=396)
          0.15383555 = weight(abstract_txt:stemmer in 396) [ClassicSimilarity], result of:
            0.15383555 = score(doc=396,freq=2.0), product of:
              0.21644175 = queryWeight, product of:
                1.3575364 = boost
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.0173491 = queryNorm
              0.710748 = fieldWeight in 396, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.0546875 = fieldNorm(doc=396)
          0.013180048 = weight(abstract_txt:that in 396) [ClassicSimilarity], result of:
            0.013180048 = score(doc=396,freq=3.0), product of:
              0.05833164 = queryWeight, product of:
                1.4094934 = boost
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.0173491 = queryNorm
              0.22595026 = fieldWeight in 396, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.0546875 = fieldNorm(doc=396)
          0.022276694 = weight(abstract_txt:approach in 396) [ClassicSimilarity], result of:
            0.022276694 = score(doc=396,freq=1.0), product of:
              0.108456135 = queryWeight, product of:
                1.664442 = boost
                3.7558525 = idf(docFreq=2748, maxDocs=43254)
                0.0173491 = queryNorm
              0.20539819 = fieldWeight in 396, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7558525 = idf(docFreq=2748, maxDocs=43254)
                0.0546875 = fieldNorm(doc=396)
          0.05317184 = weight(abstract_txt:compared in 396) [ClassicSimilarity], result of:
            0.05317184 = score(doc=396,freq=1.0), product of:
              0.19370528 = queryWeight, product of:
                2.2243972 = boost
                5.0194044 = idf(docFreq=776, maxDocs=43254)
                0.0173491 = queryNorm
              0.27449867 = fieldWeight in 396, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0194044 = idf(docFreq=776, maxDocs=43254)
                0.0546875 = fieldNorm(doc=396)
          0.08260539 = weight(abstract_txt:language in 396) [ClassicSimilarity], result of:
            0.08260539 = score(doc=396,freq=4.0), product of:
              0.18015742 = queryWeight, product of:
                2.477063 = boost
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.0173491 = queryNorm
              0.45851782 = fieldWeight in 396, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.0546875 = fieldNorm(doc=396)
          0.24640076 = weight(abstract_txt:stemming in 396) [ClassicSimilarity], result of:
            0.24640076 = score(doc=396,freq=2.0), product of:
              0.42733818 = queryWeight, product of:
                3.3039043 = boost
                7.4553375 = idf(docFreq=67, maxDocs=43254)
                0.0173491 = queryNorm
              0.5765943 = fieldWeight in 396, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.4553375 = idf(docFreq=67, maxDocs=43254)
                0.0546875 = fieldNorm(doc=396)
        0.36 = coord(9/25)
    
  5. Fautsch, C.; Savoy, J.: Algorithmic stemmers or morphological analysis? : an evaluation (2009) 0.20
    0.2047621 = sum of:
      0.2047621 = product of:
        0.7312932 = sum of:
          0.062113956 = weight(abstract_txt:improves in 4951) [ClassicSimilarity], result of:
            0.062113956 = score(doc=4951,freq=1.0), product of:
              0.11744595 = queryWeight, product of:
                6.7695704 = idf(docFreq=134, maxDocs=43254)
                0.0173491 = queryNorm
              0.52887267 = fieldWeight in 4951, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7695704 = idf(docFreq=134, maxDocs=43254)
                0.078125 = fieldNorm(doc=4951)
          0.15539737 = weight(abstract_txt:stemmer in 4951) [ClassicSimilarity], result of:
            0.15539737 = score(doc=4951,freq=1.0), product of:
              0.21644175 = queryWeight, product of:
                1.3575364 = boost
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.0173491 = queryNorm
              0.71796393 = fieldWeight in 4951, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.078125 = fieldNorm(doc=4951)
          0.0153735215 = weight(abstract_txt:that in 4951) [ClassicSimilarity], result of:
            0.0153735215 = score(doc=4951,freq=2.0), product of:
              0.05833164 = queryWeight, product of:
                1.4094934 = boost
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.0173491 = queryNorm
              0.26355374 = fieldWeight in 4951, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.078125 = fieldNorm(doc=4951)
          0.019719688 = weight(abstract_txt:based in 4951) [ClassicSimilarity], result of:
            0.019719688 = score(doc=4951,freq=1.0), product of:
              0.078828946 = queryWeight, product of:
                1.4190067 = boost
                3.2020218 = idf(docFreq=4782, maxDocs=43254)
                0.0173491 = queryNorm
              0.25015795 = fieldWeight in 4951, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2020218 = idf(docFreq=4782, maxDocs=43254)
                0.078125 = fieldNorm(doc=4951)
          0.04324354 = weight(abstract_txt:when in 4951) [ClassicSimilarity], result of:
            0.04324354 = score(doc=4951,freq=1.0), product of:
              0.1330556 = queryWeight, product of:
                1.8435638 = boost
                4.160045 = idf(docFreq=1834, maxDocs=43254)
                0.0173491 = queryNorm
              0.32500353 = fieldWeight in 4951, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.160045 = idf(docFreq=1834, maxDocs=43254)
                0.078125 = fieldNorm(doc=4951)
          0.083444044 = weight(abstract_txt:language in 4951) [ClassicSimilarity], result of:
            0.083444044 = score(doc=4951,freq=2.0), product of:
              0.18015742 = queryWeight, product of:
                2.477063 = boost
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.0173491 = queryNorm
              0.46317294 = fieldWeight in 4951, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.078125 = fieldNorm(doc=4951)
          0.3520011 = weight(abstract_txt:stemming in 4951) [ClassicSimilarity], result of:
            0.3520011 = score(doc=4951,freq=2.0), product of:
              0.42733818 = queryWeight, product of:
                3.3039043 = boost
                7.4553375 = idf(docFreq=67, maxDocs=43254)
                0.0173491 = queryNorm
              0.82370615 = fieldWeight in 4951, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.4553375 = idf(docFreq=67, maxDocs=43254)
                0.078125 = fieldNorm(doc=4951)
        0.28 = coord(7/25)