Document (#13269)

Author
Yang, Y.
Wilbur, J.
Title
Using corpus statistics to remove redundant words in text categorization
Source
Journal of the American Society for Information Science. 47(1996) no.5, S.357-369
Year
1996
Abstract
This article studies aggressive word removal in text categorization to reduce the noice in free texts to enhance the computational efficiency of categorization. We use a novel stop word identification method to automatically generate domain specific stoplists which are much larger than a conventional domain-independent stoplist. In our tests with 3 categorization methods on text collections from different domains/applications, significant numbers of words were removed without sacrificing categorization effectiveness. In the test of the Expert Network method on CACM documents, for example, an 87% removal of unique qords reduced the vocabulary of documents from 8.002 distinct words to 1.045 words, which resulted in a 63% time savings and a 74% memory savings in the computation of category ranking, with a 10% precision improvement on average over not using word removal. It is evident in this study that automated word removal based on corpus statistics has a practical and significant impact on the computational tractability of categorization methods in large databases
Theme
Computerlinguistik

Similar documents (author)

  1. Wilbur, W.J.: Global term weights for document retrieval learned from TREC data (2001) 2.34
    2.3392131 = sum of:
      2.3392131 = product of:
        4.6784263 = sum of:
          4.6784263 = weight(author_txt:wilbur in 2647) [ClassicSimilarity], result of:
            4.6784263 = score(doc=2647,freq=1.0), product of:
              0.8359146 = queryWeight, product of:
                1.2341 = boost
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.07564038 = queryNorm
              5.5967755 = fieldWeight in 2647, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.625 = fieldNorm(doc=2647)
        0.5 = coord(1/2)
    
  2. Wilbur, W.J.: Human subjectivity and performance limits in document retrieval (1996) 2.34
    2.3392131 = sum of:
      2.3392131 = product of:
        4.6784263 = sum of:
          4.6784263 = weight(author_txt:wilbur in 6676) [ClassicSimilarity], result of:
            4.6784263 = score(doc=6676,freq=1.0), product of:
              0.8359146 = queryWeight, product of:
                1.2341 = boost
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.07564038 = queryNorm
              5.5967755 = fieldWeight in 6676, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.625 = fieldNorm(doc=6676)
        0.5 = coord(1/2)
    
  3. Wilbur, W.J.: ¬A comparison of group and individual performance among subject experts and untrained workers at the document retrieval task (1998) 2.34
    2.3392131 = sum of:
      2.3392131 = product of:
        4.6784263 = sum of:
          4.6784263 = weight(author_txt:wilbur in 4264) [ClassicSimilarity], result of:
            4.6784263 = score(doc=4264,freq=1.0), product of:
              0.8359146 = queryWeight, product of:
                1.2341 = boost
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.07564038 = queryNorm
              5.5967755 = fieldWeight in 4264, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.625 = fieldNorm(doc=4264)
        0.5 = coord(1/2)
    
  4. Wilbur, W.J.: Human subjectivity and performance limits in document retrieval (1999) 2.34
    2.3392131 = sum of:
      2.3392131 = product of:
        4.6784263 = sum of:
          4.6784263 = weight(author_txt:wilbur in 5540) [ClassicSimilarity], result of:
            4.6784263 = score(doc=5540,freq=1.0), product of:
              0.8359146 = queryWeight, product of:
                1.2341 = boost
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.07564038 = queryNorm
              5.5967755 = fieldWeight in 5540, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.625 = fieldNorm(doc=5540)
        0.5 = coord(1/2)
    
  5. Wilbur, W.J.: ¬A retrieval system based on automatic relevance weighting of search terms (1992) 2.34
    2.3392131 = sum of:
      2.3392131 = product of:
        4.6784263 = sum of:
          4.6784263 = weight(author_txt:wilbur in 185) [ClassicSimilarity], result of:
            4.6784263 = score(doc=185,freq=1.0), product of:
              0.8359146 = queryWeight, product of:
                1.2341 = boost
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.07564038 = queryNorm
              5.5967755 = fieldWeight in 185, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.625 = fieldNorm(doc=185)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Goren-Bar, D.; Kuflik, T.: Supporting user-subjective categorization with self-organizing maps and learning vector quantization (2005) 0.20
    0.19968624 = sum of:
      0.19968624 = product of:
        0.83202606 = sum of:
          0.009922507 = weight(abstract_txt:using in 4326) [ClassicSimilarity], result of:
            0.009922507 = score(doc=4326,freq=1.0), product of:
              0.04562737 = queryWeight, product of:
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.013113223 = queryNorm
              0.21746832 = fieldWeight in 4326, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.0625 = fieldNorm(doc=4326)
          0.028435886 = weight(abstract_txt:documents in 4326) [ClassicSimilarity], result of:
            0.028435886 = score(doc=4326,freq=3.0), product of:
              0.06382859 = queryWeight, product of:
                1.1827552 = boost
                4.115389 = idf(docFreq=1895, maxDocs=42740)
                0.013113223 = queryNorm
              0.44550392 = fieldWeight in 4326, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.115389 = idf(docFreq=1895, maxDocs=42740)
                0.0625 = fieldNorm(doc=4326)
          0.017108783 = weight(abstract_txt:methods in 4326) [ClassicSimilarity], result of:
            0.017108783 = score(doc=4326,freq=1.0), product of:
              0.06560807 = queryWeight, product of:
                1.199129 = boost
                4.172361 = idf(docFreq=1790, maxDocs=42740)
                0.013113223 = queryNorm
              0.26077256 = fieldWeight in 4326, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.172361 = idf(docFreq=1790, maxDocs=42740)
                0.0625 = fieldNorm(doc=4326)
          0.021775244 = weight(abstract_txt:method in 4326) [ClassicSimilarity], result of:
            0.021775244 = score(doc=4326,freq=1.0), product of:
              0.07705246 = queryWeight, product of:
                1.2995127 = boost
                4.5216455 = idf(docFreq=1262, maxDocs=42740)
                0.013113223 = queryNorm
              0.28260285 = fieldWeight in 4326, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5216455 = idf(docFreq=1262, maxDocs=42740)
                0.0625 = fieldNorm(doc=4326)
          0.03634436 = weight(abstract_txt:domain in 4326) [ClassicSimilarity], result of:
            0.03634436 = score(doc=4326,freq=2.0), product of:
              0.08605164 = queryWeight, product of:
                1.3733046 = boost
                4.7784038 = idf(docFreq=976, maxDocs=42740)
                0.013113223 = queryNorm
              0.4223552 = fieldWeight in 4326, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7784038 = idf(docFreq=976, maxDocs=42740)
                0.0625 = fieldNorm(doc=4326)
          0.7184393 = weight(abstract_txt:categorization in 4326) [ClassicSimilarity], result of:
            0.7184393 = score(doc=4326,freq=12.0), product of:
              0.4993264 = queryWeight, product of:
                5.7298093 = boost
                6.645611 = idf(docFreq=150, maxDocs=42740)
                0.013113223 = queryNorm
              1.4388169 = fieldWeight in 4326, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                6.645611 = idf(docFreq=150, maxDocs=42740)
                0.0625 = fieldNorm(doc=4326)
        0.24 = coord(6/25)
    
  2. Díaz, I.; Ranilla, J.; Montañes, E.; Fernández, J.; Combarro, E.F.: Improving performance of text categorization by combining filtering and support vector machines (2004) 0.17
    0.16531521 = sum of:
      0.16531521 = product of:
        0.5904115 = sum of:
          0.020521833 = weight(abstract_txt:documents in 3235) [ClassicSimilarity], result of:
            0.020521833 = score(doc=3235,freq=1.0), product of:
              0.06382859 = queryWeight, product of:
                1.1827552 = boost
                4.115389 = idf(docFreq=1895, maxDocs=42740)
                0.013113223 = queryNorm
              0.32151476 = fieldWeight in 3235, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.115389 = idf(docFreq=1895, maxDocs=42740)
                0.078125 = fieldNorm(doc=3235)
          0.027219053 = weight(abstract_txt:method in 3235) [ClassicSimilarity], result of:
            0.027219053 = score(doc=3235,freq=1.0), product of:
              0.07705246 = queryWeight, product of:
                1.2995127 = boost
                4.5216455 = idf(docFreq=1262, maxDocs=42740)
                0.013113223 = queryNorm
              0.35325354 = fieldWeight in 3235, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5216455 = idf(docFreq=1262, maxDocs=42740)
                0.078125 = fieldNorm(doc=3235)
          0.02933992 = weight(abstract_txt:text in 3235) [ClassicSimilarity], result of:
            0.02933992 = score(doc=3235,freq=1.0), product of:
              0.09272727 = queryWeight, product of:
                1.7459695 = boost
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.013113223 = queryNorm
              0.3164109 = fieldWeight in 3235, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.078125 = fieldNorm(doc=3235)
          0.06793606 = weight(abstract_txt:corpus in 3235) [ClassicSimilarity], result of:
            0.06793606 = score(doc=3235,freq=1.0), product of:
              0.14177665 = queryWeight, product of:
                1.7627456 = boost
                6.1334615 = idf(docFreq=251, maxDocs=42740)
                0.013113223 = queryNorm
              0.47917667 = fieldWeight in 3235, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1334615 = idf(docFreq=251, maxDocs=42740)
                0.078125 = fieldNorm(doc=3235)
          0.090600185 = weight(abstract_txt:words in 3235) [ClassicSimilarity], result of:
            0.090600185 = score(doc=3235,freq=1.0), product of:
              0.21642156 = queryWeight, product of:
                3.0800128 = boost
                5.358442 = idf(docFreq=546, maxDocs=42740)
                0.013113223 = queryNorm
              0.41862828 = fieldWeight in 3235, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.358442 = idf(docFreq=546, maxDocs=42740)
                0.078125 = fieldNorm(doc=3235)
          0.095549986 = weight(abstract_txt:word in 3235) [ClassicSimilarity], result of:
            0.095549986 = score(doc=3235,freq=1.0), product of:
              0.22423404 = queryWeight, product of:
                3.135112 = boost
                5.4543004 = idf(docFreq=496, maxDocs=42740)
                0.013113223 = queryNorm
              0.4261172 = fieldWeight in 3235, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4543004 = idf(docFreq=496, maxDocs=42740)
                0.078125 = fieldNorm(doc=3235)
          0.25924444 = weight(abstract_txt:categorization in 3235) [ClassicSimilarity], result of:
            0.25924444 = score(doc=3235,freq=1.0), product of:
              0.4993264 = queryWeight, product of:
                5.7298093 = boost
                6.645611 = idf(docFreq=150, maxDocs=42740)
                0.013113223 = queryNorm
              0.51918834 = fieldWeight in 3235, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.645611 = idf(docFreq=150, maxDocs=42740)
                0.078125 = fieldNorm(doc=3235)
        0.28 = coord(7/25)
    
  3. Kim, W.; Wilbur, W.J.: Corpus-based statistical screening for content-bearing terms (2001) 0.16
    0.15815015 = sum of:
      0.15815015 = product of:
        0.43930596 = sum of:
          0.0074418806 = weight(abstract_txt:using in 189) [ClassicSimilarity], result of:
            0.0074418806 = score(doc=189,freq=1.0), product of:
              0.04562737 = queryWeight, product of:
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.013113223 = queryNorm
              0.16310124 = fieldWeight in 189, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.046875 = fieldNorm(doc=189)
          0.053116683 = weight(abstract_txt:stop in 189) [ClassicSimilarity], result of:
            0.053116683 = score(doc=189,freq=2.0), product of:
              0.10655429 = queryWeight, product of:
                1.080582 = boost
                7.519756 = idf(docFreq=62, maxDocs=42740)
                0.013113223 = queryNorm
              0.4984941 = fieldWeight in 189, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.519756 = idf(docFreq=62, maxDocs=42740)
                0.046875 = fieldNorm(doc=189)
          0.04921876 = weight(abstract_txt:removed in 189) [ClassicSimilarity], result of:
            0.04921876 = score(doc=189,freq=1.0), product of:
              0.12759905 = queryWeight, product of:
                1.1824859 = boost
                8.228904 = idf(docFreq=30, maxDocs=42740)
                0.013113223 = queryNorm
              0.38572985 = fieldWeight in 189, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.228904 = idf(docFreq=30, maxDocs=42740)
                0.046875 = fieldNorm(doc=189)
          0.02753293 = weight(abstract_txt:documents in 189) [ClassicSimilarity], result of:
            0.02753293 = score(doc=189,freq=5.0), product of:
              0.06382859 = queryWeight, product of:
                1.1827552 = boost
                4.115389 = idf(docFreq=1895, maxDocs=42740)
                0.013113223 = queryNorm
              0.43135732 = fieldWeight in 189, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.115389 = idf(docFreq=1895, maxDocs=42740)
                0.046875 = fieldNorm(doc=189)
          0.02222496 = weight(abstract_txt:methods in 189) [ClassicSimilarity], result of:
            0.02222496 = score(doc=189,freq=3.0), product of:
              0.06560807 = queryWeight, product of:
                1.199129 = boost
                4.172361 = idf(docFreq=1790, maxDocs=42740)
                0.013113223 = queryNorm
              0.33875346 = fieldWeight in 189, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.172361 = idf(docFreq=1790, maxDocs=42740)
                0.046875 = fieldNorm(doc=189)
          0.016331432 = weight(abstract_txt:method in 189) [ClassicSimilarity], result of:
            0.016331432 = score(doc=189,freq=1.0), product of:
              0.07705246 = queryWeight, product of:
                1.2995127 = boost
                4.5216455 = idf(docFreq=1262, maxDocs=42740)
                0.013113223 = queryNorm
              0.21195213 = fieldWeight in 189, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5216455 = idf(docFreq=1262, maxDocs=42740)
                0.046875 = fieldNorm(doc=189)
          0.01760395 = weight(abstract_txt:text in 189) [ClassicSimilarity], result of:
            0.01760395 = score(doc=189,freq=1.0), product of:
              0.09272727 = queryWeight, product of:
                1.7459695 = boost
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.013113223 = queryNorm
              0.18984653 = fieldWeight in 189, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.046875 = fieldNorm(doc=189)
          0.09415447 = weight(abstract_txt:words in 189) [ClassicSimilarity], result of:
            0.09415447 = score(doc=189,freq=3.0), product of:
              0.21642156 = queryWeight, product of:
                3.0800128 = boost
                5.358442 = idf(docFreq=546, maxDocs=42740)
                0.013113223 = queryNorm
              0.43505126 = fieldWeight in 189, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.358442 = idf(docFreq=546, maxDocs=42740)
                0.046875 = fieldNorm(doc=189)
          0.1516809 = weight(abstract_txt:word in 189) [ClassicSimilarity], result of:
            0.1516809 = score(doc=189,freq=7.0), product of:
              0.22423404 = queryWeight, product of:
                3.135112 = boost
                5.4543004 = idf(docFreq=496, maxDocs=42740)
                0.013113223 = queryNorm
              0.6764401 = fieldWeight in 189, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.4543004 = idf(docFreq=496, maxDocs=42740)
                0.046875 = fieldNorm(doc=189)
        0.36 = coord(9/25)
    
  4. Tseng, Y.-H.: Automatic thesaurus generation for Chinese documents (2002) 0.15
    0.15117368 = sum of:
      0.15117368 = product of:
        0.47241774 = sum of:
          0.009922507 = weight(abstract_txt:using in 227) [ClassicSimilarity], result of:
            0.009922507 = score(doc=227,freq=1.0), product of:
              0.04562737 = queryWeight, product of:
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.013113223 = queryNorm
              0.21746832 = fieldWeight in 227, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.0625 = fieldNorm(doc=227)
          0.04088544 = weight(abstract_txt:reduced in 227) [ClassicSimilarity], result of:
            0.04088544 = score(doc=227,freq=1.0), product of:
              0.09307798 = queryWeight, product of:
                1.0099404 = boost
                7.0281615 = idf(docFreq=102, maxDocs=42740)
                0.013113223 = queryNorm
              0.4392601 = fieldWeight in 227, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0281615 = idf(docFreq=102, maxDocs=42740)
                0.0625 = fieldNorm(doc=227)
          0.05007889 = weight(abstract_txt:stop in 227) [ClassicSimilarity], result of:
            0.05007889 = score(doc=227,freq=1.0), product of:
              0.10655429 = queryWeight, product of:
                1.080582 = boost
                7.519756 = idf(docFreq=62, maxDocs=42740)
                0.013113223 = queryNorm
              0.46998474 = fieldWeight in 227, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.519756 = idf(docFreq=62, maxDocs=42740)
                0.0625 = fieldNorm(doc=227)
          0.053222217 = weight(abstract_txt:computation in 227) [ClassicSimilarity], result of:
            0.053222217 = score(doc=227,freq=1.0), product of:
              0.110967666 = queryWeight, product of:
                1.1027334 = boost
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.013113223 = queryNorm
              0.47961915 = fieldWeight in 227, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.0625 = fieldNorm(doc=227)
          0.016417466 = weight(abstract_txt:documents in 227) [ClassicSimilarity], result of:
            0.016417466 = score(doc=227,freq=1.0), product of:
              0.06382859 = queryWeight, product of:
                1.1827552 = boost
                4.115389 = idf(docFreq=1895, maxDocs=42740)
                0.013113223 = queryNorm
              0.2572118 = fieldWeight in 227, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.115389 = idf(docFreq=1895, maxDocs=42740)
                0.0625 = fieldNorm(doc=227)
          0.023471935 = weight(abstract_txt:text in 227) [ClassicSimilarity], result of:
            0.023471935 = score(doc=227,freq=1.0), product of:
              0.09272727 = queryWeight, product of:
                1.7459695 = boost
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.013113223 = queryNorm
              0.2531287 = fieldWeight in 227, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.0625 = fieldNorm(doc=227)
          0.1255393 = weight(abstract_txt:words in 227) [ClassicSimilarity], result of:
            0.1255393 = score(doc=227,freq=3.0), product of:
              0.21642156 = queryWeight, product of:
                3.0800128 = boost
                5.358442 = idf(docFreq=546, maxDocs=42740)
                0.013113223 = queryNorm
              0.58006835 = fieldWeight in 227, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.358442 = idf(docFreq=546, maxDocs=42740)
                0.0625 = fieldNorm(doc=227)
          0.15287998 = weight(abstract_txt:word in 227) [ClassicSimilarity], result of:
            0.15287998 = score(doc=227,freq=4.0), product of:
              0.22423404 = queryWeight, product of:
                3.135112 = boost
                5.4543004 = idf(docFreq=496, maxDocs=42740)
                0.013113223 = queryNorm
              0.68178755 = fieldWeight in 227, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.4543004 = idf(docFreq=496, maxDocs=42740)
                0.0625 = fieldNorm(doc=227)
        0.32 = coord(8/25)
    
  5. Schutze, H.; Pederson, J.O.: ¬A cooccurrence-based thesaurus and two applications to information retrieval (1997) 0.15
    0.14604914 = sum of:
      0.14604914 = product of:
        0.52160406 = sum of:
          0.017364388 = weight(abstract_txt:using in 1154) [ClassicSimilarity], result of:
            0.017364388 = score(doc=1154,freq=1.0), product of:
              0.04562737 = queryWeight, product of:
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.013113223 = queryNorm
              0.38056958 = fieldWeight in 1154, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.109375 = fieldNorm(doc=1154)
          0.02994037 = weight(abstract_txt:methods in 1154) [ClassicSimilarity], result of:
            0.02994037 = score(doc=1154,freq=1.0), product of:
              0.06560807 = queryWeight, product of:
                1.199129 = boost
                4.172361 = idf(docFreq=1790, maxDocs=42740)
                0.013113223 = queryNorm
              0.45635197 = fieldWeight in 1154, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.172361 = idf(docFreq=1790, maxDocs=42740)
                0.109375 = fieldNorm(doc=1154)
          0.03810668 = weight(abstract_txt:method in 1154) [ClassicSimilarity], result of:
            0.03810668 = score(doc=1154,freq=1.0), product of:
              0.07705246 = queryWeight, product of:
                1.2995127 = boost
                4.5216455 = idf(docFreq=1262, maxDocs=42740)
                0.013113223 = queryNorm
              0.494555 = fieldWeight in 1154, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5216455 = idf(docFreq=1262, maxDocs=42740)
                0.109375 = fieldNorm(doc=1154)
          0.041075885 = weight(abstract_txt:text in 1154) [ClassicSimilarity], result of:
            0.041075885 = score(doc=1154,freq=1.0), product of:
              0.09272727 = queryWeight, product of:
                1.7459695 = boost
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.013113223 = queryNorm
              0.44297522 = fieldWeight in 1154, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.109375 = fieldNorm(doc=1154)
          0.13450654 = weight(abstract_txt:corpus in 1154) [ClassicSimilarity], result of:
            0.13450654 = score(doc=1154,freq=2.0), product of:
              0.14177665 = queryWeight, product of:
                1.7627456 = boost
                6.1334615 = idf(docFreq=251, maxDocs=42740)
                0.013113223 = queryNorm
              0.9487214 = fieldWeight in 1154, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1334615 = idf(docFreq=251, maxDocs=42740)
                0.109375 = fieldNorm(doc=1154)
          0.12684026 = weight(abstract_txt:words in 1154) [ClassicSimilarity], result of:
            0.12684026 = score(doc=1154,freq=1.0), product of:
              0.21642156 = queryWeight, product of:
                3.0800128 = boost
                5.358442 = idf(docFreq=546, maxDocs=42740)
                0.013113223 = queryNorm
              0.5860796 = fieldWeight in 1154, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.358442 = idf(docFreq=546, maxDocs=42740)
                0.109375 = fieldNorm(doc=1154)
          0.13376999 = weight(abstract_txt:word in 1154) [ClassicSimilarity], result of:
            0.13376999 = score(doc=1154,freq=1.0), product of:
              0.22423404 = queryWeight, product of:
                3.135112 = boost
                5.4543004 = idf(docFreq=496, maxDocs=42740)
                0.013113223 = queryNorm
              0.5965641 = fieldWeight in 1154, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4543004 = idf(docFreq=496, maxDocs=42740)
                0.109375 = fieldNorm(doc=1154)
        0.28 = coord(7/25)