Document (#38166)

Author
Zhang, W.
Yoshida, T.
Tang, X.
Title
¬A comparative study of TF*IDF, LSI and multi-words for text classification
Source
Expert systems with applications. 38(2011) no.3, pp.2758-2765
Year
2011
Abstract
One of the main themes in text mining is text representation, which is fundamental and indispensable for text-based intelligent information processing. Generally, text representation includes two tasks: indexing and weighting. This paper compares TF*IDF, LSI and multi-words for text representation. We used a Chinese and an English document collection to evaluate each of the three methods in information retrieval and text categorization. Experimental results demonstrate that in text categorization, LSI performs better than the other methods on both document collections. LSI also produced the best performance in retrieving English documents. This outcome shows that LSI has both favorable semantic and statistical quality, which contradicts the claim that LSI cannot produce discriminative power for indexing.
Content
Cf.: http://www.sciencedirect.com/science/article/pii/S0957417410008626.
Theme
Semantic context in indexing and retrieval
Retrieval algorithms
Object
Latent Semantic Indexing

Similar documents (author)

  1. Zhang, Q.; Xue, H.; Tang, H.: Knowledge domain and emerging trends in vulnerability assessment in the context of climate change : a bibliometric analysis (1991-2017) (2018) 3.82
    3.8222456 = sum of:
      3.8222456 = sum of:
        1.3092761 = weight(author_txt:zhang in 4534) [ClassicSimilarity], result of:
          1.3092761 = score(doc=4534,freq=1.0), product of:
            0.5435031 = queryWeight, product of:
              6.4238877 = idf(docFreq=194, maxDocs=44218)
              0.08460657 = queryNorm
            2.408958 = fieldWeight in 4534, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.4238877 = idf(docFreq=194, maxDocs=44218)
              0.375 = fieldNorm(doc=4534)
        2.5129695 = weight(author_txt:tang in 4534) [ClassicSimilarity], result of:
          2.5129695 = score(doc=4534,freq=1.0), product of:
            0.8394072 = queryWeight, product of:
              1.2427545 = boost
              7.983315 = idf(docFreq=40, maxDocs=44218)
              0.08460657 = queryNorm
            2.9937432 = fieldWeight in 4534, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.983315 = idf(docFreq=40, maxDocs=44218)
              0.375 = fieldNorm(doc=4534)
    
  2. Zhang, J.; An, L.; Tang, T.; Hong, Y.: Visual health subject directory analysis based on users' traversal activities (2009) 3.19
    3.185205 = sum of:
      3.185205 = sum of:
        1.0910634 = weight(author_txt:zhang in 3112) [ClassicSimilarity], result of:
          1.0910634 = score(doc=3112,freq=1.0), product of:
            0.5435031 = queryWeight, product of:
              6.4238877 = idf(docFreq=194, maxDocs=44218)
              0.08460657 = queryNorm
            2.007465 = fieldWeight in 3112, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.4238877 = idf(docFreq=194, maxDocs=44218)
              0.3125 = fieldNorm(doc=3112)
        2.0941415 = weight(author_txt:tang in 3112) [ClassicSimilarity], result of:
          2.0941415 = score(doc=3112,freq=1.0), product of:
            0.8394072 = queryWeight, product of:
              1.2427545 = boost
              7.983315 = idf(docFreq=40, maxDocs=44218)
              0.08460657 = queryNorm
            2.494786 = fieldWeight in 3112, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.983315 = idf(docFreq=40, maxDocs=44218)
              0.3125 = fieldNorm(doc=3112)
    
  3. Jiang, Y.; Zhang, X.; Tang, Y.; Nie, R.: Feature-based approaches to semantic similarity assessment of concepts using Wikipedia (2015) 3.19
    3.185205 = sum of:
      3.185205 = sum of:
        1.0910634 = weight(author_txt:zhang in 2682) [ClassicSimilarity], result of:
          1.0910634 = score(doc=2682,freq=1.0), product of:
            0.5435031 = queryWeight, product of:
              6.4238877 = idf(docFreq=194, maxDocs=44218)
              0.08460657 = queryNorm
            2.007465 = fieldWeight in 2682, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.4238877 = idf(docFreq=194, maxDocs=44218)
              0.3125 = fieldNorm(doc=2682)
        2.0941415 = weight(author_txt:tang in 2682) [ClassicSimilarity], result of:
          2.0941415 = score(doc=2682,freq=1.0), product of:
            0.8394072 = queryWeight, product of:
              1.2427545 = boost
              7.983315 = idf(docFreq=40, maxDocs=44218)
              0.08460657 = queryNorm
            2.494786 = fieldWeight in 2682, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.983315 = idf(docFreq=40, maxDocs=44218)
              0.3125 = fieldNorm(doc=2682)
    
  4. Zhang, X.; Wang, D.; Tang, Y.; Xiao, Q.: How question type influences knowledge withholding in social Q&A community (2023) 3.19
    3.185205 = sum of:
      3.185205 = sum of:
        1.0910634 = weight(author_txt:zhang in 1067) [ClassicSimilarity], result of:
          1.0910634 = score(doc=1067,freq=1.0), product of:
            0.5435031 = queryWeight, product of:
              6.4238877 = idf(docFreq=194, maxDocs=44218)
              0.08460657 = queryNorm
            2.007465 = fieldWeight in 1067, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.4238877 = idf(docFreq=194, maxDocs=44218)
              0.3125 = fieldNorm(doc=1067)
        2.0941415 = weight(author_txt:tang in 1067) [ClassicSimilarity], result of:
          2.0941415 = score(doc=1067,freq=1.0), product of:
            0.8394072 = queryWeight, product of:
              1.2427545 = boost
              7.983315 = idf(docFreq=40, maxDocs=44218)
              0.08460657 = queryNorm
            2.494786 = fieldWeight in 1067, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.983315 = idf(docFreq=40, maxDocs=44218)
              0.3125 = fieldNorm(doc=1067)
    
  5. Li, D.; Tang, J.; Ding, Y.; Shuai, X.; Chambers, T.; Sun, G.; Luo, Z.; Zhang, J.: Topic-level opinion influence model (TOIM) : an investigation using tencent microblogging (2015) 2.55
    2.5481637 = sum of:
      2.5481637 = sum of:
        0.8728507 = weight(author_txt:zhang in 2345) [ClassicSimilarity], result of:
          0.8728507 = score(doc=2345,freq=1.0), product of:
            0.5435031 = queryWeight, product of:
              6.4238877 = idf(docFreq=194, maxDocs=44218)
              0.08460657 = queryNorm
            1.6059719 = fieldWeight in 2345, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.4238877 = idf(docFreq=194, maxDocs=44218)
              0.25 = fieldNorm(doc=2345)
        1.675313 = weight(author_txt:tang in 2345) [ClassicSimilarity], result of:
          1.675313 = score(doc=2345,freq=1.0), product of:
            0.8394072 = queryWeight, product of:
              1.2427545 = boost
              7.983315 = idf(docFreq=40, maxDocs=44218)
              0.08460657 = queryNorm
            1.9958287 = fieldWeight in 2345, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.983315 = idf(docFreq=40, maxDocs=44218)
              0.25 = fieldNorm(doc=2345)
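
The explain trees above all follow the same pattern: each matching author term contributes queryWeight times fieldWeight, and the per-term contributions are summed. The following is a minimal Python sketch of that arithmetic, not the search engine's own code. The function names `classic_idf` and `classic_term_score` are hypothetical, and the idf formula `1 + ln(maxDocs / (docFreq + 1))` is assumed because it reproduces the idf values printed in the listing. The queryNorm, fieldNorm and boost values are copied from the listing rather than derived.

```python
import math


def classic_idf(doc_freq, max_docs):
    # Assumed idf form; it matches the listing, e.g. docFreq=194 -> about 6.4238877.
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))


def classic_term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    # One "weight(field:term in doc)" node of the explain tree.
    idf = classic_idf(doc_freq, max_docs)
    tf = math.sqrt(freq)                     # tf(freq) is the square root of termFreq
    query_weight = boost * idf * query_norm  # "queryWeight, product of:"
    field_weight = tf * idf * field_norm     # "fieldWeight in <doc>, product of:"
    return query_weight * field_weight


# Values taken from hit 1 of the author list (doc 4534).
zhang = classic_term_score(1.0, 194, 44218, query_norm=0.08460657, field_norm=0.375)
tang = classic_term_score(1.0, 40, 44218, query_norm=0.08460657, field_norm=0.375,
                          boost=1.2427545)
total = zhang + tang
print(zhang, tang, total)
```

Up to floating-point rounding, the three printed values should land close to the 1.3092761, 2.5129695 and 3.8222456 shown for doc 4534. The lower-ranked hits differ only in fieldNorm (0.3125 or 0.25), which shrinks as the author field gets longer.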
    

Similar documents (content)

  1. Lu, K.; Mao, J.; Li, G.: Toward effective automated weighted subject indexing : a comparison of different approaches in different environments (2018) 0.20
    0.20301533 = sum of:
      0.20301533 = product of:
        0.72505474 = sum of:
          0.1451405 = weight(abstract_txt:weighting in 4292) [ClassicSimilarity], result of:
            0.1451405 = score(doc=4292,freq=4.0), product of:
              0.16638856 = queryWeight, product of:
                1.1604909 = boost
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.020545967 = queryNorm
              0.87229854 = fieldWeight in 4292, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.0625 = fieldNorm(doc=4292)
          0.06809768 = weight(abstract_txt:methods in 4292) [ClassicSimilarity], result of:
            0.06809768 = score(doc=4292,freq=5.0), product of:
              0.11750578 = queryWeight, product of:
                1.3791916 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.020545967 = queryNorm
              0.5795262 = fieldWeight in 4292, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.0625 = fieldNorm(doc=4292)
          0.033782125 = weight(abstract_txt:document in 4292) [ClassicSimilarity], result of:
            0.033782125 = score(doc=4292,freq=1.0), product of:
              0.12591738 = queryWeight, product of:
                1.4277029 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.020545967 = queryNorm
              0.26828802 = fieldWeight in 4292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=4292)
          0.070290625 = weight(abstract_txt:indexing in 4292) [ClassicSimilarity], result of:
            0.070290625 = score(doc=4292,freq=4.0), product of:
              0.12928237 = queryWeight, product of:
                1.446654 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.020545967 = queryNorm
              0.54369843 = fieldWeight in 4292, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.0625 = fieldNorm(doc=4292)
          0.09481333 = weight(abstract_txt:performance in 4292) [ClassicSimilarity], result of:
            0.09481333 = score(doc=4292,freq=5.0), product of:
              0.14651564 = queryWeight, product of:
                1.5400577 = boost
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.020545967 = queryNorm
              0.6471209 = fieldWeight in 4292, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.0625 = fieldNorm(doc=4292)
          0.15316288 = weight(abstract_txt:representation in 4292) [ClassicSimilarity], result of:
            0.15316288 = score(doc=4292,freq=4.0), product of:
              0.24873705 = queryWeight, product of:
                2.4575982 = boost
                4.926098 = idf(docFreq=871, maxDocs=44218)
                0.020545967 = queryNorm
              0.61576223 = fieldWeight in 4292, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.926098 = idf(docFreq=871, maxDocs=44218)
                0.0625 = fieldNorm(doc=4292)
          0.15976758 = weight(abstract_txt:text in 4292) [ClassicSimilarity], result of:
            0.15976758 = score(doc=4292,freq=2.0), product of:
              0.44698897 = queryWeight, product of:
                5.37989 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.020545967 = queryNorm
              0.3574307 = fieldWeight in 4292, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=4292)
        0.28 = coord(7/25)
    
  2. Zhang, C.; Zeng, D.; Li, J.; Wang, F.-Y.; Zuo, W.: Sentiment analysis of Chinese documents : from sentence to document level (2009) 0.19
    0.18998645 = sum of:
      0.18998645 = product of:
        0.67852306 = sum of:
          0.0628643 = weight(abstract_txt:mining in 3296) [ClassicSimilarity], result of:
            0.0628643 = score(doc=3296,freq=1.0), product of:
              0.1303008 = queryWeight, product of:
                1.02696 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.020545967 = queryNorm
              0.4824552 = fieldWeight in 3296, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.078125 = fieldNorm(doc=3296)
          0.1157869 = weight(abstract_txt:chinese in 3296) [ClassicSimilarity], result of:
            0.1157869 = score(doc=3296,freq=3.0), product of:
              0.13575117 = queryWeight, product of:
                1.0482185 = boost
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.020545967 = queryNorm
              0.85293484 = fieldWeight in 3296, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.078125 = fieldNorm(doc=3296)
          0.038067758 = weight(abstract_txt:methods in 3296) [ClassicSimilarity], result of:
            0.038067758 = score(doc=3296,freq=1.0), product of:
              0.11750578 = queryWeight, product of:
                1.3791916 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.020545967 = queryNorm
              0.32396498 = fieldWeight in 3296, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.078125 = fieldNorm(doc=3296)
          0.042227652 = weight(abstract_txt:document in 3296) [ClassicSimilarity], result of:
            0.042227652 = score(doc=3296,freq=1.0), product of:
              0.12591738 = queryWeight, product of:
                1.4277029 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.020545967 = queryNorm
              0.33536002 = fieldWeight in 3296, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.078125 = fieldNorm(doc=3296)
          0.18588509 = weight(abstract_txt:favorable in 3296) [ClassicSimilarity], result of:
            0.18588509 = score(doc=3296,freq=1.0), product of:
              0.26843598 = queryWeight, product of:
                1.4740098 = boost
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.020545967 = queryNorm
              0.69247454 = fieldWeight in 3296, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.078125 = fieldNorm(doc=3296)
          0.0924754 = weight(abstract_txt:english in 3296) [ClassicSimilarity], result of:
            0.0924754 = score(doc=3296,freq=1.0), product of:
              0.21234328 = queryWeight, product of:
                1.8540193 = boost
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.020545967 = queryNorm
              0.43549955 = fieldWeight in 3296, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.078125 = fieldNorm(doc=3296)
          0.14121592 = weight(abstract_txt:text in 3296) [ClassicSimilarity], result of:
            0.14121592 = score(doc=3296,freq=1.0), product of:
              0.44698897 = queryWeight, product of:
                5.37989 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.020545967 = queryNorm
              0.3159271 = fieldWeight in 3296, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=3296)
        0.28 = coord(7/25)
    
  3. Oyarce, G.: Using the shape recovery method to evaluate indexing techniques (2008) 0.18
    0.18394488 = sum of:
      0.18394488 = product of:
        0.656946 = sum of:
          0.07257025 = weight(abstract_txt:weighting in 1966) [ClassicSimilarity], result of:
            0.07257025 = score(doc=1966,freq=1.0), product of:
              0.16638856 = queryWeight, product of:
                1.1604909 = boost
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.020545967 = queryNorm
              0.43614927 = fieldWeight in 1966, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.0625 = fieldNorm(doc=1966)
          0.008522576 = weight(abstract_txt:that in 1966) [ClassicSimilarity], result of:
            0.008522576 = score(doc=1966,freq=1.0), product of:
              0.057549123 = queryWeight, product of:
                1.1821157 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.020545967 = queryNorm
              0.1480922 = fieldWeight in 1966, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=1966)
          0.04777514 = weight(abstract_txt:document in 1966) [ClassicSimilarity], result of:
            0.04777514 = score(doc=1966,freq=2.0), product of:
              0.12591738 = queryWeight, product of:
                1.4277029 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.020545967 = queryNorm
              0.37941656 = fieldWeight in 1966, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=1966)
          0.049702976 = weight(abstract_txt:indexing in 1966) [ClassicSimilarity], result of:
            0.049702976 = score(doc=1966,freq=2.0), product of:
              0.12928237 = queryWeight, product of:
                1.446654 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.020545967 = queryNorm
              0.38445285 = fieldWeight in 1966, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.0625 = fieldNorm(doc=1966)
          0.21030496 = weight(abstract_txt:discriminative in 1966) [ClassicSimilarity], result of:
            0.21030496 = score(doc=1966,freq=2.0), product of:
              0.26843598 = queryWeight, product of:
                1.4740098 = boost
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.020545967 = queryNorm
              0.7834455 = fieldWeight in 1966, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.0625 = fieldNorm(doc=1966)
          0.10830251 = weight(abstract_txt:representation in 1966) [ClassicSimilarity], result of:
            0.10830251 = score(doc=1966,freq=2.0), product of:
              0.24873705 = queryWeight, product of:
                2.4575982 = boost
                4.926098 = idf(docFreq=871, maxDocs=44218)
                0.020545967 = queryNorm
              0.43540964 = fieldWeight in 1966, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.926098 = idf(docFreq=871, maxDocs=44218)
                0.0625 = fieldNorm(doc=1966)
          0.15976758 = weight(abstract_txt:text in 1966) [ClassicSimilarity], result of:
            0.15976758 = score(doc=1966,freq=2.0), product of:
              0.44698897 = queryWeight, product of:
                5.37989 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.020545967 = queryNorm
              0.3574307 = fieldWeight in 1966, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=1966)
        0.28 = coord(7/25)
    
  4. Aphinyanaphongs, Y.; Fu, L.D.; Li, Z.; Peskin, E.R.; Efstathiadis, E.; Aliferis, C.F.; Statnikov, A.: ¬A comprehensive empirical comparison of modern supervised classification and feature selection methods for text categorization (2014) 0.17
    0.1722629 = sum of:
      0.1722629 = product of:
        0.7177621 = sum of:
          0.01065322 = weight(abstract_txt:that in 1496) [ClassicSimilarity], result of:
            0.01065322 = score(doc=1496,freq=1.0), product of:
              0.057549123 = queryWeight, product of:
                1.1821157 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.020545967 = queryNorm
              0.18511525 = fieldWeight in 1496, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=1496)
          0.029562479 = weight(abstract_txt:both in 1496) [ClassicSimilarity], result of:
            0.029562479 = score(doc=1496,freq=1.0), product of:
              0.099276915 = queryWeight, product of:
                1.2677077 = boost
                3.811558 = idf(docFreq=2657, maxDocs=44218)
                0.020545967 = queryNorm
              0.29777798 = fieldWeight in 1496, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.811558 = idf(docFreq=2657, maxDocs=44218)
                0.078125 = fieldNorm(doc=1496)
          0.09324659 = weight(abstract_txt:methods in 1496) [ClassicSimilarity], result of:
            0.09324659 = score(doc=1496,freq=6.0), product of:
              0.11750578 = queryWeight, product of:
                1.3791916 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.020545967 = queryNorm
              0.79354894 = fieldWeight in 1496, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.078125 = fieldNorm(doc=1496)
          0.074956514 = weight(abstract_txt:performance in 1496) [ClassicSimilarity], result of:
            0.074956514 = score(doc=1496,freq=2.0), product of:
              0.14651564 = queryWeight, product of:
                1.5400577 = boost
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.020545967 = queryNorm
              0.51159394 = fieldWeight in 1496, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.078125 = fieldNorm(doc=1496)
          0.26475018 = weight(abstract_txt:categorization in 1496) [ClassicSimilarity], result of:
            0.26475018 = score(doc=1496,freq=3.0), product of:
              0.29685074 = queryWeight, product of:
                2.1921186 = boost
                6.590942 = idf(docFreq=164, maxDocs=44218)
                0.020545967 = queryNorm
              0.891863 = fieldWeight in 1496, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.590942 = idf(docFreq=164, maxDocs=44218)
                0.078125 = fieldNorm(doc=1496)
          0.24459314 = weight(abstract_txt:text in 1496) [ClassicSimilarity], result of:
            0.24459314 = score(doc=1496,freq=3.0), product of:
              0.44698897 = queryWeight, product of:
                5.37989 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.020545967 = queryNorm
              0.54720175 = fieldWeight in 1496, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=1496)
        0.24 = coord(6/25)
    
  5. Robertson, S.E.; Sparck Jones, K.: Simple, proven approaches to text retrieval (1997) 0.17
    0.16993318 = sum of:
      0.16993318 = product of:
        0.5310412 = sum of:
          0.056074083 = weight(abstract_txt:comparative in 4532) [ClassicSimilarity], result of:
            0.056074083 = score(doc=4532,freq=1.0), product of:
              0.14010678 = queryWeight, product of:
                1.0649018 = boost
                6.4035826 = idf(docFreq=198, maxDocs=44218)
                0.020545967 = queryNorm
              0.4002239 = fieldWeight in 4532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4035826 = idf(docFreq=198, maxDocs=44218)
                0.0625 = fieldNorm(doc=4532)
          0.07257025 = weight(abstract_txt:weighting in 4532) [ClassicSimilarity], result of:
            0.07257025 = score(doc=4532,freq=1.0), product of:
              0.16638856 = queryWeight, product of:
                1.1604909 = boost
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.020545967 = queryNorm
              0.43614927 = fieldWeight in 4532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.0625 = fieldNorm(doc=4532)
          0.014761534 = weight(abstract_txt:that in 4532) [ClassicSimilarity], result of:
            0.014761534 = score(doc=4532,freq=3.0), product of:
              0.057549123 = queryWeight, product of:
                1.1821157 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.020545967 = queryNorm
              0.2565032 = fieldWeight in 4532, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=4532)
          0.023649983 = weight(abstract_txt:both in 4532) [ClassicSimilarity], result of:
            0.023649983 = score(doc=4532,freq=1.0), product of:
              0.099276915 = queryWeight, product of:
                1.2677077 = boost
                3.811558 = idf(docFreq=2657, maxDocs=44218)
                0.020545967 = queryNorm
              0.23822238 = fieldWeight in 4532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.811558 = idf(docFreq=2657, maxDocs=44218)
                0.0625 = fieldNorm(doc=4532)
          0.04306875 = weight(abstract_txt:methods in 4532) [ClassicSimilarity], result of:
            0.04306875 = score(doc=4532,freq=2.0), product of:
              0.11750578 = queryWeight, product of:
                1.3791916 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.020545967 = queryNorm
              0.36652455 = fieldWeight in 4532, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.0625 = fieldNorm(doc=4532)
          0.07553913 = weight(abstract_txt:document in 4532) [ClassicSimilarity], result of:
            0.07553913 = score(doc=4532,freq=5.0), product of:
              0.12591738 = queryWeight, product of:
                1.4277029 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.020545967 = queryNorm
              0.59991026 = fieldWeight in 4532, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=4532)
          0.049702976 = weight(abstract_txt:indexing in 4532) [ClassicSimilarity], result of:
            0.049702976 = score(doc=4532,freq=2.0), product of:
              0.12928237 = queryWeight, product of:
                1.446654 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.020545967 = queryNorm
              0.38445285 = fieldWeight in 4532, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.0625 = fieldNorm(doc=4532)
          0.19567451 = weight(abstract_txt:text in 4532) [ClassicSimilarity], result of:
            0.19567451 = score(doc=4532,freq=3.0), product of:
              0.44698897 = queryWeight, product of:
                5.37989 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.020545967 = queryNorm
              0.4377614 = fieldWeight in 4532, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=4532)
        0.32 = coord(8/25)
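
For the content-based hits, the summed term scores are additionally multiplied by a coordination factor, coord(matched/total), which rewards documents that match more of the 25 query terms. The sketch below redoes that last step for hit 5 (doc 4532) using the per-term scores printed above. The helper name `coord_score` is hypothetical, and the term scores are copied from the listing rather than recomputed.

```python
def coord_score(term_scores, total_query_terms):
    # Sum the matching term scores, then scale by the fraction of query terms matched.
    coord = len(term_scores) / total_query_terms
    return sum(term_scores) * coord


# Per-term scores for doc 4532, copied from the explain tree above.
doc_4532_terms = {
    "comparative": 0.056074083,
    "weighting": 0.07257025,
    "that": 0.014761534,
    "both": 0.023649983,
    "methods": 0.04306875,
    "document": 0.07553913,
    "indexing": 0.049702976,
    "text": 0.19567451,
}

score = coord_score(list(doc_4532_terms.values()), total_query_terms=25)
print(score)
```

With 8 of 25 query terms matched, coord is 0.32, and the result should agree with the 0.16993318 reported for this hit up to rounding. The same helper applied to a hit with 7 matching terms would use coord 0.28, as in hits 1 to 3.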