Document (#13269)

Author
Yang, Y.
Wilbur, J.
Title
Using corpus statistics to remove redundant words in text categorization
Source
Journal of the American Society for Information Science. 47(1996) no.5, S.357-369
Year
1996
Abstract
This article studies aggressive word removal in text categorization to reduce the noice in free texts to enhance the computational efficiency of categorization. We use a novel stop word identification method to automatically generate domain specific stoplists which are much larger than a conventional domain-independent stoplist. In our tests with 3 categorization methods on text collections from different domains/applications, significant numbers of words were removed without sacrificing categorization effectiveness. In the test of the Expert Network method on CACM documents, for example, an 87% removal of unique qords reduced the vocabulary of documents from 8.002 distinct words to 1.045 words, which resulted in a 63% time savings and a 74% memory savings in the computation of category ranking, with a 10% precision improvement on average over not using word removal. It is evident in this study that automated word removal based on corpus statistics has a practical and significant impact on the computational tractability of categorization methods in large databases
Theme
Computerlinguistik

Similar documents (author)

  1. Wilbur, W.J.: Global term weights for document retrieval learned from TREC data (2001) 2.36
    2.364801 = sum of:
      2.364801 = product of:
        4.729602 = sum of:
          4.729602 = weight(author_txt:wilbur in 2647) [ClassicSimilarity], result of:
            4.729602 = score(doc=2647,freq=1.0), product of:
              0.84186226 = queryWeight, product of:
                1.2489566 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.074987724 = queryNorm
              5.6180234 = fieldWeight in 2647, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.625 = fieldNorm(doc=2647)
        0.5 = coord(1/2)
    
  2. Wilbur, W.J.: Human subjectivity and performance limits in document retrieval (1996) 2.36
    2.364801 = sum of:
      2.364801 = product of:
        4.729602 = sum of:
          4.729602 = weight(author_txt:wilbur in 6607) [ClassicSimilarity], result of:
            4.729602 = score(doc=6607,freq=1.0), product of:
              0.84186226 = queryWeight, product of:
                1.2489566 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.074987724 = queryNorm
              5.6180234 = fieldWeight in 6607, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.625 = fieldNorm(doc=6607)
        0.5 = coord(1/2)
    
  3. Wilbur, W.J.: ¬A comparison of group and individual performance among subject experts and untrained workers at the document retrieval task (1998) 2.36
    2.364801 = sum of:
      2.364801 = product of:
        4.729602 = sum of:
          4.729602 = weight(author_txt:wilbur in 3263) [ClassicSimilarity], result of:
            4.729602 = score(doc=3263,freq=1.0), product of:
              0.84186226 = queryWeight, product of:
                1.2489566 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.074987724 = queryNorm
              5.6180234 = fieldWeight in 3263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.625 = fieldNorm(doc=3263)
        0.5 = coord(1/2)
    
  4. Wilbur, W.J.: Human subjectivity and performance limits in document retrieval (1999) 2.36
    2.364801 = sum of:
      2.364801 = product of:
        4.729602 = sum of:
          4.729602 = weight(author_txt:wilbur in 4539) [ClassicSimilarity], result of:
            4.729602 = score(doc=4539,freq=1.0), product of:
              0.84186226 = queryWeight, product of:
                1.2489566 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.074987724 = queryNorm
              5.6180234 = fieldWeight in 4539, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.625 = fieldNorm(doc=4539)
        0.5 = coord(1/2)
    
  5. Wilbur, W.J.: ¬A retrieval system based on automatic relevance weighting of search terms (1992) 2.36
    2.364801 = sum of:
      2.364801 = product of:
        4.729602 = sum of:
          4.729602 = weight(author_txt:wilbur in 5269) [ClassicSimilarity], result of:
            4.729602 = score(doc=5269,freq=1.0), product of:
              0.84186226 = queryWeight, product of:
                1.2489566 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.074987724 = queryNorm
              5.6180234 = fieldWeight in 5269, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.625 = fieldNorm(doc=5269)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Goren-Bar, D.; Kuflik, T.: Supporting user-subjective categorization with self-organizing maps and learning vector quantization (2005) 0.20
    0.19824693 = sum of:
      0.19824693 = product of:
        0.8260289 = sum of:
          0.009941747 = weight(abstract_txt:using in 3325) [ClassicSimilarity], result of:
            0.009941747 = score(doc=3325,freq=1.0), product of:
              0.045932 = queryWeight, product of:
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.01326319 = queryNorm
              0.21644491 = fieldWeight in 3325, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.0625 = fieldNorm(doc=3325)
          0.029021828 = weight(abstract_txt:documents in 3325) [ClassicSimilarity], result of:
            0.029021828 = score(doc=3325,freq=3.0), product of:
              0.06505035 = queryWeight, product of:
                1.1900553 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.01326319 = queryNorm
              0.44614407 = fieldWeight in 3325, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=3325)
          0.017068084 = weight(abstract_txt:methods in 3325) [ClassicSimilarity], result of:
            0.017068084 = score(doc=3325,freq=1.0), product of:
              0.0658562 = queryWeight, product of:
                1.1974039 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.01326319 = queryNorm
              0.259172 = fieldWeight in 3325, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.0625 = fieldNorm(doc=3325)
          0.021825952 = weight(abstract_txt:method in 3325) [ClassicSimilarity], result of:
            0.021825952 = score(doc=3325,freq=1.0), product of:
              0.07758701 = queryWeight, product of:
                1.2996812 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.01326319 = queryNorm
              0.28130937 = fieldWeight in 3325, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=3325)
          0.035949953 = weight(abstract_txt:domain in 3325) [ClassicSimilarity], result of:
            0.035949953 = score(doc=3325,freq=2.0), product of:
              0.08588733 = queryWeight, product of:
                1.3674356 = boost
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.01326319 = queryNorm
              0.41857108 = fieldWeight in 3325, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.0625 = fieldNorm(doc=3325)
          0.7122213 = weight(abstract_txt:categorization in 3325) [ClassicSimilarity], result of:
            0.7122213 = score(doc=3325,freq=12.0), product of:
              0.49911067 = queryWeight, product of:
                5.709543 = boost
                6.590942 = idf(docFreq=164, maxDocs=44218)
                0.01326319 = queryNorm
              1.4269807 = fieldWeight in 3325, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                6.590942 = idf(docFreq=164, maxDocs=44218)
                0.0625 = fieldNorm(doc=3325)
        0.24 = coord(6/25)
    
  2. Díaz, I.; Ranilla, J.; Montañes, E.; Fernández, J.; Combarro, E.F.: Improving performance of text categorization by combining filtering and support vector machines (2004) 0.17
    0.1653829 = sum of:
      0.1653829 = product of:
        0.59065324 = sum of:
          0.020944702 = weight(abstract_txt:documents in 2234) [ClassicSimilarity], result of:
            0.020944702 = score(doc=2234,freq=1.0), product of:
              0.06505035 = queryWeight, product of:
                1.1900553 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.01326319 = queryNorm
              0.32197678 = fieldWeight in 2234, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.078125 = fieldNorm(doc=2234)
          0.027282441 = weight(abstract_txt:method in 2234) [ClassicSimilarity], result of:
            0.027282441 = score(doc=2234,freq=1.0), product of:
              0.07758701 = queryWeight, product of:
                1.2996812 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.01326319 = queryNorm
              0.3516367 = fieldWeight in 2234, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.078125 = fieldNorm(doc=2234)
          0.029679215 = weight(abstract_txt:text in 2234) [ClassicSimilarity], result of:
            0.029679215 = score(doc=2234,freq=1.0), product of:
              0.09394324 = queryWeight, product of:
                1.7515426 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01326319 = queryNorm
              0.3159271 = fieldWeight in 2234, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=2234)
          0.0678629 = weight(abstract_txt:corpus in 2234) [ClassicSimilarity], result of:
            0.0678629 = score(doc=2234,freq=1.0), product of:
              0.14243667 = queryWeight, product of:
                1.7609751 = boost
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.01326319 = queryNorm
              0.4764426 = fieldWeight in 2234, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.078125 = fieldNorm(doc=2234)
          0.09178972 = weight(abstract_txt:words in 2234) [ClassicSimilarity], result of:
            0.09178972 = score(doc=2234,freq=1.0), product of:
              0.2194857 = queryWeight, product of:
                3.0914373 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.01326319 = queryNorm
              0.41820365 = fieldWeight in 2234, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.078125 = fieldNorm(doc=2234)
          0.09609354 = weight(abstract_txt:word in 2234) [ClassicSimilarity], result of:
            0.09609354 = score(doc=2234,freq=1.0), product of:
              0.22629398 = queryWeight, product of:
                3.139018 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.01326319 = queryNorm
              0.4246403 = fieldWeight in 2234, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.078125 = fieldNorm(doc=2234)
          0.2570007 = weight(abstract_txt:categorization in 2234) [ClassicSimilarity], result of:
            0.2570007 = score(doc=2234,freq=1.0), product of:
              0.49911067 = queryWeight, product of:
                5.709543 = boost
                6.590942 = idf(docFreq=164, maxDocs=44218)
                0.01326319 = queryNorm
              0.5149173 = fieldWeight in 2234, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.590942 = idf(docFreq=164, maxDocs=44218)
                0.078125 = fieldNorm(doc=2234)
        0.28 = coord(7/25)
    
  3. Han, K.; Rezapour, R.; Nakamura, K.; Devkota, D.; Miller, D.C.; Diesner, J.: ¬An expert-in-the-loop method for domain-specific document categorization based on small training data (2023) 0.16
    0.16447593 = sum of:
      0.16447593 = product of:
        0.587414 = sum of:
          0.023696223 = weight(abstract_txt:documents in 967) [ClassicSimilarity], result of:
            0.023696223 = score(doc=967,freq=2.0), product of:
              0.06505035 = queryWeight, product of:
                1.1900553 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.01326319 = queryNorm
              0.36427513 = fieldWeight in 967, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=967)
          0.024137916 = weight(abstract_txt:methods in 967) [ClassicSimilarity], result of:
            0.024137916 = score(doc=967,freq=2.0), product of:
              0.0658562 = queryWeight, product of:
                1.1974039 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.01326319 = queryNorm
              0.36652455 = fieldWeight in 967, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.0625 = fieldNorm(doc=967)
          0.021825952 = weight(abstract_txt:method in 967) [ClassicSimilarity], result of:
            0.021825952 = score(doc=967,freq=1.0), product of:
              0.07758701 = queryWeight, product of:
                1.2996812 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.01326319 = queryNorm
              0.28130937 = fieldWeight in 967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=967)
          0.05084091 = weight(abstract_txt:domain in 967) [ClassicSimilarity], result of:
            0.05084091 = score(doc=967,freq=4.0), product of:
              0.08588733 = queryWeight, product of:
                1.3674356 = boost
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.01326319 = queryNorm
              0.5919489 = fieldWeight in 967, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.0625 = fieldNorm(doc=967)
          0.02374337 = weight(abstract_txt:text in 967) [ClassicSimilarity], result of:
            0.02374337 = score(doc=967,freq=1.0), product of:
              0.09394324 = queryWeight, product of:
                1.7515426 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01326319 = queryNorm
              0.25274166 = fieldWeight in 967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=967)
          0.087059 = weight(abstract_txt:computational in 967) [ClassicSimilarity], result of:
            0.087059 = score(doc=967,freq=2.0), product of:
              0.1548838 = queryWeight, product of:
                1.8363072 = boost
                6.3593493 = idf(docFreq=207, maxDocs=44218)
                0.01326319 = queryNorm
              0.56209236 = fieldWeight in 967, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.3593493 = idf(docFreq=207, maxDocs=44218)
                0.0625 = fieldNorm(doc=967)
          0.35611066 = weight(abstract_txt:categorization in 967) [ClassicSimilarity], result of:
            0.35611066 = score(doc=967,freq=3.0), product of:
              0.49911067 = queryWeight, product of:
                5.709543 = boost
                6.590942 = idf(docFreq=164, maxDocs=44218)
                0.01326319 = queryNorm
              0.71349037 = fieldWeight in 967, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.590942 = idf(docFreq=164, maxDocs=44218)
                0.0625 = fieldNorm(doc=967)
        0.28 = coord(7/25)
    
  4. Kim, W.; Wilbur, W.J.: Corpus-based statistical screening for content-bearing terms (2001) 0.16
    0.16002609 = sum of:
      0.16002609 = product of:
        0.4445169 = sum of:
          0.0074563106 = weight(abstract_txt:using in 5188) [ClassicSimilarity], result of:
            0.0074563106 = score(doc=5188,freq=1.0), product of:
              0.045932 = queryWeight, product of:
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.01326319 = queryNorm
              0.16233368 = fieldWeight in 5188, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.046875 = fieldNorm(doc=5188)
          0.05403728 = weight(abstract_txt:stop in 5188) [ClassicSimilarity], result of:
            0.05403728 = score(doc=5188,freq=2.0), product of:
              0.108361505 = queryWeight, product of:
                1.0860876 = boost
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.01326319 = queryNorm
              0.498676 = fieldWeight in 5188, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.046875 = fieldNorm(doc=5188)
          0.028100267 = weight(abstract_txt:documents in 5188) [ClassicSimilarity], result of:
            0.028100267 = score(doc=5188,freq=5.0), product of:
              0.06505035 = queryWeight, product of:
                1.1900553 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.01326319 = queryNorm
              0.43197718 = fieldWeight in 5188, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.046875 = fieldNorm(doc=5188)
          0.05063951 = weight(abstract_txt:removed in 5188) [ClassicSimilarity], result of:
            0.05063951 = score(doc=5188,freq=1.0), product of:
              0.13074218 = queryWeight, product of:
                1.1929855 = boost
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.01326319 = queryNorm
              0.38732344 = fieldWeight in 5188, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.046875 = fieldNorm(doc=5188)
          0.022172092 = weight(abstract_txt:methods in 5188) [ClassicSimilarity], result of:
            0.022172092 = score(doc=5188,freq=3.0), product of:
              0.0658562 = queryWeight, product of:
                1.1974039 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.01326319 = queryNorm
              0.3366743 = fieldWeight in 5188, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.046875 = fieldNorm(doc=5188)
          0.016369464 = weight(abstract_txt:method in 5188) [ClassicSimilarity], result of:
            0.016369464 = score(doc=5188,freq=1.0), product of:
              0.07758701 = queryWeight, product of:
                1.2996812 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.01326319 = queryNorm
              0.21098202 = fieldWeight in 5188, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.046875 = fieldNorm(doc=5188)
          0.017807527 = weight(abstract_txt:text in 5188) [ClassicSimilarity], result of:
            0.017807527 = score(doc=5188,freq=1.0), product of:
              0.09394324 = queryWeight, product of:
                1.7515426 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01326319 = queryNorm
              0.18955624 = fieldWeight in 5188, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.046875 = fieldNorm(doc=5188)
          0.095390685 = weight(abstract_txt:words in 5188) [ClassicSimilarity], result of:
            0.095390685 = score(doc=5188,freq=3.0), product of:
              0.2194857 = queryWeight, product of:
                3.0914373 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.01326319 = queryNorm
              0.43461 = fieldWeight in 5188, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.046875 = fieldNorm(doc=5188)
          0.15254375 = weight(abstract_txt:word in 5188) [ClassicSimilarity], result of:
            0.15254375 = score(doc=5188,freq=7.0), product of:
              0.22629398 = queryWeight, product of:
                3.139018 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.01326319 = queryNorm
              0.6740955 = fieldWeight in 5188, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.046875 = fieldNorm(doc=5188)
        0.36 = coord(9/25)
    
  5. Tseng, Y.-H.: Automatic thesaurus generation for Chinese documents (2002) 0.15
    0.15279862 = sum of:
      0.15279862 = product of:
        0.47749573 = sum of:
          0.009941747 = weight(abstract_txt:using in 5226) [ClassicSimilarity], result of:
            0.009941747 = score(doc=5226,freq=1.0), product of:
              0.045932 = queryWeight, product of:
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.01326319 = queryNorm
              0.21644491 = fieldWeight in 5226, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.0625 = fieldNorm(doc=5226)
          0.040363356 = weight(abstract_txt:reduced in 5226) [ClassicSimilarity], result of:
            0.040363356 = score(doc=5226,freq=1.0), product of:
              0.09278014 = queryWeight, product of:
                1.004974 = boost
                6.9606886 = idf(docFreq=113, maxDocs=44218)
                0.01326319 = queryNorm
              0.43504304 = fieldWeight in 5226, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9606886 = idf(docFreq=113, maxDocs=44218)
                0.0625 = fieldNorm(doc=5226)
          0.05094684 = weight(abstract_txt:stop in 5226) [ClassicSimilarity], result of:
            0.05094684 = score(doc=5226,freq=1.0), product of:
              0.108361505 = queryWeight, product of:
                1.0860876 = boost
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.01326319 = queryNorm
              0.47015625 = fieldWeight in 5226, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.0625 = fieldNorm(doc=5226)
          0.054807425 = weight(abstract_txt:computation in 5226) [ClassicSimilarity], result of:
            0.054807425 = score(doc=5226,freq=1.0), product of:
              0.113768786 = queryWeight, product of:
                1.1128558 = boost
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.01326319 = queryNorm
              0.48174396 = fieldWeight in 5226, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.0625 = fieldNorm(doc=5226)
          0.016755762 = weight(abstract_txt:documents in 5226) [ClassicSimilarity], result of:
            0.016755762 = score(doc=5226,freq=1.0), product of:
              0.06505035 = queryWeight, product of:
                1.1900553 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.01326319 = queryNorm
              0.2575814 = fieldWeight in 5226, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=5226)
          0.02374337 = weight(abstract_txt:text in 5226) [ClassicSimilarity], result of:
            0.02374337 = score(doc=5226,freq=1.0), product of:
              0.09394324 = queryWeight, product of:
                1.7515426 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01326319 = queryNorm
              0.25274166 = fieldWeight in 5226, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=5226)
          0.12718756 = weight(abstract_txt:words in 5226) [ClassicSimilarity], result of:
            0.12718756 = score(doc=5226,freq=3.0), product of:
              0.2194857 = queryWeight, product of:
                3.0914373 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.01326319 = queryNorm
              0.57948 = fieldWeight in 5226, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.0625 = fieldNorm(doc=5226)
          0.15374966 = weight(abstract_txt:word in 5226) [ClassicSimilarity], result of:
            0.15374966 = score(doc=5226,freq=4.0), product of:
              0.22629398 = queryWeight, product of:
                3.139018 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.01326319 = queryNorm
              0.67942446 = fieldWeight in 5226, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.0625 = fieldNorm(doc=5226)
        0.32 = coord(8/25)