Document (#32833)

Author
Peng, F.
Huang, X.
Title
Machine learning for Asian language text classification
Source
Journal of documentation. 63(2007) no.3, S.378-397
Year
2007
Abstract
Purpose - The purpose of this research is to compare several machine learning techniques on the task of Asian language text classification, such as Chinese and Japanese where no word boundary information is available in written text. The paper advocates a simple language modeling based approach for this task. Design/methodology/approach - Naïve Bayes, maximum entropy model, support vector machines, and language modeling approaches were implemented and were applied to Chinese and Japanese text classification. To investigate the influence of word segmentation, different word segmentation approaches were investigated and applied to Chinese text. A segmentation-based approach was compared with the non-segmentation-based approach. Findings - There were two findings: the experiments show that statistical language modeling can significantly outperform standard techniques, given the same set of features; and it was found that classification with word level features normally yields improved classification performance, but that classification performance is not monotonically related to segmentation accuracy. In particular, classification performance may initially improve with increased segmentation accuracy, but eventually classification performance stops improving, and can in fact even decrease, after a certain level of segmentation accuracy. Practical implications - Apply the findings to real web text classification is ongoing work. Originality/value - The paper is very relevant to Chinese and Japanese information processing, e.g. webpage classification, web search.
Theme
Computerlinguistik
Automatisches Klassifizieren

Similar documents (author)

  1. Huang, X.; Peng, F,; An, A.; Schuurmans, D.: Dynamic Web log session identification with statistical language models (2004) 3.65
    3.645877 = sum of:
      3.645877 = sum of:
        1.2304841 = weight(author_txt:huang in 4097) [ClassicSimilarity], result of:
          1.2304841 = score(doc=4097,freq=1.0), product of:
            0.5377732 = queryWeight, product of:
              7.321951 = idf(docFreq=75, maxDocs=42306)
              0.073446706 = queryNorm
            2.2881098 = fieldWeight in 4097, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.321951 = idf(docFreq=75, maxDocs=42306)
              0.3125 = fieldNorm(doc=4097)
        2.4153929 = weight(author_txt:peng in 4097) [ClassicSimilarity], result of:
          2.4153929 = score(doc=4097,freq=1.0), product of:
            0.84308946 = queryWeight, product of:
              1.252095 = boost
              9.167778 = idf(docFreq=11, maxDocs=42306)
              0.073446706 = queryNorm
            2.8649306 = fieldWeight in 4097, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.167778 = idf(docFreq=11, maxDocs=42306)
              0.3125 = fieldNorm(doc=4097)
    
  2. Choi, B.; Peng, X.: Dynamic and hierarchical classification of Web pages (2004) 1.93
    1.9323143 = sum of:
      1.9323143 = product of:
        3.8646286 = sum of:
          3.8646286 = weight(author_txt:peng in 375) [ClassicSimilarity], result of:
            3.8646286 = score(doc=375,freq=1.0), product of:
              0.84308946 = queryWeight, product of:
                1.252095 = boost
                9.167778 = idf(docFreq=11, maxDocs=42306)
                0.073446706 = queryNorm
              4.583889 = fieldWeight in 375, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.167778 = idf(docFreq=11, maxDocs=42306)
                0.5 = fieldNorm(doc=375)
        0.5 = coord(1/2)
    
  3. Peng, T.-Q.; Zhu, J.J.H.: Where you publish matters most : a multilevel analysis of factors affecting citations of internet studies (2012) 1.69
    1.6907749 = sum of:
      1.6907749 = product of:
        3.3815498 = sum of:
          3.3815498 = weight(author_txt:peng in 2387) [ClassicSimilarity], result of:
            3.3815498 = score(doc=2387,freq=1.0), product of:
              0.84308946 = queryWeight, product of:
                1.252095 = boost
                9.167778 = idf(docFreq=11, maxDocs=42306)
                0.073446706 = queryNorm
              4.010903 = fieldWeight in 2387, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.167778 = idf(docFreq=11, maxDocs=42306)
                0.4375 = fieldNorm(doc=2387)
        0.5 = coord(1/2)
    
  4. Huang, G.W.: Accessing information in an information society (1989) 1.23
    1.2304841 = sum of:
      1.2304841 = product of:
        2.4609683 = sum of:
          2.4609683 = weight(author_txt:huang in 2566) [ClassicSimilarity], result of:
            2.4609683 = score(doc=2566,freq=1.0), product of:
              0.5377732 = queryWeight, product of:
                7.321951 = idf(docFreq=75, maxDocs=42306)
                0.073446706 = queryNorm
              4.5762196 = fieldWeight in 2566, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.321951 = idf(docFreq=75, maxDocs=42306)
                0.625 = fieldNorm(doc=2566)
        0.5 = coord(1/2)
    
  5. Huang, X.: Applying a generic function-based topical relevance typology to structure clinical questions and answers (2013) 1.23
    1.2304841 = sum of:
      1.2304841 = product of:
        2.4609683 = sum of:
          2.4609683 = weight(author_txt:huang in 2531) [ClassicSimilarity], result of:
            2.4609683 = score(doc=2531,freq=1.0), product of:
              0.5377732 = queryWeight, product of:
                7.321951 = idf(docFreq=75, maxDocs=42306)
                0.073446706 = queryNorm
              4.5762196 = fieldWeight in 2531, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.321951 = idf(docFreq=75, maxDocs=42306)
                0.625 = fieldNorm(doc=2531)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Yang, C.C.; Li, K.W.: ¬A heuristic method based on a statistical approach for chinese text segmentation (2005) 0.46
    0.45539218 = sum of:
      0.45539218 = product of:
        1.6264006 = sum of:
          0.014156388 = weight(abstract_txt:based in 581) [ClassicSimilarity], result of:
            0.014156388 = score(doc=581,freq=2.0), product of:
              0.049814027 = queryWeight, product of:
                1.0701077 = boost
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.014478327 = queryNorm
              0.28418478 = fieldWeight in 581, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.0625 = fieldNorm(doc=581)
          0.021844594 = weight(abstract_txt:approach in 581) [ClassicSimilarity], result of:
            0.021844594 = score(doc=581,freq=1.0), product of:
              0.092243455 = queryWeight, product of:
                1.6814687 = boost
                3.789033 = idf(docFreq=2600, maxDocs=42306)
                0.014478327 = queryNorm
              0.23681456 = fieldWeight in 581, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.789033 = idf(docFreq=2600, maxDocs=42306)
                0.0625 = fieldNorm(doc=581)
          0.04034189 = weight(abstract_txt:performance in 581) [ClassicSimilarity], result of:
            0.04034189 = score(doc=581,freq=1.0), product of:
              0.13884932 = queryWeight, product of:
                2.06297 = boost
                4.6487103 = idf(docFreq=1100, maxDocs=42306)
                0.014478327 = queryNorm
              0.2905444 = fieldWeight in 581, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6487103 = idf(docFreq=1100, maxDocs=42306)
                0.0625 = fieldNorm(doc=581)
          0.0924546 = weight(abstract_txt:word in 581) [ClassicSimilarity], result of:
            0.0924546 = score(doc=581,freq=2.0), product of:
              0.1915646 = queryWeight, product of:
                2.4231408 = boost
                5.460322 = idf(docFreq=488, maxDocs=42306)
                0.014478327 = queryNorm
              0.48262882 = fieldWeight in 581, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.460322 = idf(docFreq=488, maxDocs=42306)
                0.0625 = fieldNorm(doc=581)
          0.106007636 = weight(abstract_txt:text in 581) [ClassicSimilarity], result of:
            0.106007636 = score(doc=581,freq=7.0), product of:
              0.15822026 = queryWeight, product of:
                2.697104 = boost
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.014478327 = queryNorm
              0.6700004 = fieldWeight in 581, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.0625 = fieldNorm(doc=581)
          0.30621767 = weight(abstract_txt:chinese in 581) [ClassicSimilarity], result of:
            0.30621767 = score(doc=581,freq=9.0), product of:
              0.25781742 = queryWeight, product of:
                2.8111055 = boost
                6.334564 = idf(docFreq=203, maxDocs=42306)
                0.014478327 = queryNorm
              1.1877308 = fieldWeight in 581, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                6.334564 = idf(docFreq=203, maxDocs=42306)
                0.0625 = fieldNorm(doc=581)
          1.0453779 = weight(abstract_txt:segmentation in 581) [ClassicSimilarity], result of:
            1.0453779 = score(doc=581,freq=9.0), product of:
              0.7044015 = queryWeight, product of:
                6.1468153 = boost
                7.9150147 = idf(docFreq=41, maxDocs=42306)
                0.014478327 = queryNorm
              1.4840653 = fieldWeight in 581, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                7.9150147 = idf(docFreq=41, maxDocs=42306)
                0.0625 = fieldNorm(doc=581)
        0.28 = coord(7/25)
    
  2. Wang, F.L.; Yang, C.C.: Mining Web data for Chinese segmentation (2007) 0.38
    0.37892047 = sum of:
      0.37892047 = product of:
        1.5788352 = sum of:
          0.010010078 = weight(abstract_txt:based in 2605) [ClassicSimilarity], result of:
            0.010010078 = score(doc=2605,freq=1.0), product of:
              0.049814027 = queryWeight, product of:
                1.0701077 = boost
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.014478327 = queryNorm
              0.20094898 = fieldWeight in 2605, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.0625 = fieldNorm(doc=2605)
          0.06431994 = weight(abstract_txt:language in 2605) [ClassicSimilarity], result of:
            0.06431994 = score(doc=2605,freq=3.0), product of:
              0.14153579 = queryWeight, product of:
                2.3286765 = boost
                4.197964 = idf(docFreq=1727, maxDocs=42306)
                0.014478327 = queryNorm
              0.45444295 = fieldWeight in 2605, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.197964 = idf(docFreq=1727, maxDocs=42306)
                0.0625 = fieldNorm(doc=2605)
          0.0924546 = weight(abstract_txt:word in 2605) [ClassicSimilarity], result of:
            0.0924546 = score(doc=2605,freq=2.0), product of:
              0.1915646 = queryWeight, product of:
                2.4231408 = boost
                5.460322 = idf(docFreq=488, maxDocs=42306)
                0.014478327 = queryNorm
              0.48262882 = fieldWeight in 2605, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.460322 = idf(docFreq=488, maxDocs=42306)
                0.0625 = fieldNorm(doc=2605)
          0.04006712 = weight(abstract_txt:text in 2605) [ClassicSimilarity], result of:
            0.04006712 = score(doc=2605,freq=1.0), product of:
              0.15822026 = queryWeight, product of:
                2.697104 = boost
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.014478327 = queryNorm
              0.25323635 = fieldWeight in 2605, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.0625 = fieldNorm(doc=2605)
          0.2700586 = weight(abstract_txt:chinese in 2605) [ClassicSimilarity], result of:
            0.2700586 = score(doc=2605,freq=7.0), product of:
              0.25781742 = queryWeight, product of:
                2.8111055 = boost
                6.334564 = idf(docFreq=203, maxDocs=42306)
                0.014478327 = queryNorm
              1.0474801 = fieldWeight in 2605, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.334564 = idf(docFreq=203, maxDocs=42306)
                0.0625 = fieldNorm(doc=2605)
          1.1019249 = weight(abstract_txt:segmentation in 2605) [ClassicSimilarity], result of:
            1.1019249 = score(doc=2605,freq=10.0), product of:
              0.7044015 = queryWeight, product of:
                6.1468153 = boost
                7.9150147 = idf(docFreq=41, maxDocs=42306)
                0.014478327 = queryNorm
              1.5643421 = fieldWeight in 2605, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                7.9150147 = idf(docFreq=41, maxDocs=42306)
                0.0625 = fieldNorm(doc=2605)
        0.24 = coord(6/25)
    
  3. Huang, X.; Robertson, S.E.: Application of probilistic methods to Chinese text retrieval (1997) 0.37
    0.37431043 = sum of:
      0.37431043 = product of:
        1.1697202 = sum of:
          0.028083844 = weight(abstract_txt:purpose in 707) [ClassicSimilarity], result of:
            0.028083844 = score(doc=707,freq=1.0), product of:
              0.066060185 = queryWeight, product of:
                1.0061805 = boost
                4.534668 = idf(docFreq=1233, maxDocs=42306)
                0.014478327 = queryNorm
              0.42512512 = fieldWeight in 707, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.534668 = idf(docFreq=1233, maxDocs=42306)
                0.09375 = fieldNorm(doc=707)
          0.021234581 = weight(abstract_txt:based in 707) [ClassicSimilarity], result of:
            0.021234581 = score(doc=707,freq=2.0), product of:
              0.049814027 = queryWeight, product of:
                1.0701077 = boost
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.014478327 = queryNorm
              0.42627716 = fieldWeight in 707, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.09375 = fieldNorm(doc=707)
          0.034038473 = weight(abstract_txt:applied in 707) [ClassicSimilarity], result of:
            0.034038473 = score(doc=707,freq=1.0), product of:
              0.07509577 = queryWeight, product of:
                1.0727876 = boost
                4.8348536 = idf(docFreq=913, maxDocs=42306)
                0.014478327 = queryNorm
              0.4532675 = fieldWeight in 707, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8348536 = idf(docFreq=913, maxDocs=42306)
                0.09375 = fieldNorm(doc=707)
          0.055702705 = weight(abstract_txt:language in 707) [ClassicSimilarity], result of:
            0.055702705 = score(doc=707,freq=1.0), product of:
              0.14153579 = queryWeight, product of:
                2.3286765 = boost
                4.197964 = idf(docFreq=1727, maxDocs=42306)
                0.014478327 = queryNorm
              0.39355916 = fieldWeight in 707, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.197964 = idf(docFreq=1727, maxDocs=42306)
                0.09375 = fieldNorm(doc=707)
          0.1386819 = weight(abstract_txt:word in 707) [ClassicSimilarity], result of:
            0.1386819 = score(doc=707,freq=2.0), product of:
              0.1915646 = queryWeight, product of:
                2.4231408 = boost
                5.460322 = idf(docFreq=488, maxDocs=42306)
                0.014478327 = queryNorm
              0.72394323 = fieldWeight in 707, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.460322 = idf(docFreq=488, maxDocs=42306)
                0.09375 = fieldNorm(doc=707)
          0.10409743 = weight(abstract_txt:text in 707) [ClassicSimilarity], result of:
            0.10409743 = score(doc=707,freq=3.0), product of:
              0.15822026 = queryWeight, product of:
                2.697104 = boost
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.014478327 = queryNorm
              0.65792733 = fieldWeight in 707, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.09375 = fieldNorm(doc=707)
          0.26519227 = weight(abstract_txt:chinese in 707) [ClassicSimilarity], result of:
            0.26519227 = score(doc=707,freq=3.0), product of:
              0.25781742 = queryWeight, product of:
                2.8111055 = boost
                6.334564 = idf(docFreq=203, maxDocs=42306)
                0.014478327 = queryNorm
              1.028605 = fieldWeight in 707, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.334564 = idf(docFreq=203, maxDocs=42306)
                0.09375 = fieldNorm(doc=707)
          0.5226889 = weight(abstract_txt:segmentation in 707) [ClassicSimilarity], result of:
            0.5226889 = score(doc=707,freq=1.0), product of:
              0.7044015 = queryWeight, product of:
                6.1468153 = boost
                7.9150147 = idf(docFreq=41, maxDocs=42306)
                0.014478327 = queryNorm
              0.74203265 = fieldWeight in 707, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9150147 = idf(docFreq=41, maxDocs=42306)
                0.09375 = fieldNorm(doc=707)
        0.32 = coord(8/25)
    
  4. Lee, K.H.; Ng, M.K.M.; Lu, Q.: Text segmentation for Chinese spell checking (1999) 0.34
    0.34298143 = sum of:
      0.34298143 = product of:
        1.2249336 = sum of:
          0.026548889 = weight(abstract_txt:level in 4914) [ClassicSimilarity], result of:
            0.026548889 = score(doc=4914,freq=2.0), product of:
              0.06617854 = queryWeight, product of:
                1.0070814 = boost
                4.538728 = idf(docFreq=1228, maxDocs=42306)
                0.014478327 = queryNorm
              0.40117067 = fieldWeight in 4914, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.538728 = idf(docFreq=1228, maxDocs=42306)
                0.0625 = fieldNorm(doc=4914)
          0.014156388 = weight(abstract_txt:based in 4914) [ClassicSimilarity], result of:
            0.014156388 = score(doc=4914,freq=2.0), product of:
              0.049814027 = queryWeight, product of:
                1.0701077 = boost
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.014478327 = queryNorm
              0.28418478 = fieldWeight in 4914, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.0625 = fieldNorm(doc=4914)
          0.037135135 = weight(abstract_txt:language in 4914) [ClassicSimilarity], result of:
            0.037135135 = score(doc=4914,freq=1.0), product of:
              0.14153579 = queryWeight, product of:
                2.3286765 = boost
                4.197964 = idf(docFreq=1727, maxDocs=42306)
                0.014478327 = queryNorm
              0.26237276 = fieldWeight in 4914, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.197964 = idf(docFreq=1727, maxDocs=42306)
                0.0625 = fieldNorm(doc=4914)
          0.13075055 = weight(abstract_txt:word in 4914) [ClassicSimilarity], result of:
            0.13075055 = score(doc=4914,freq=4.0), product of:
              0.1915646 = queryWeight, product of:
                2.4231408 = boost
                5.460322 = idf(docFreq=488, maxDocs=42306)
                0.014478327 = queryNorm
              0.68254024 = fieldWeight in 4914, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.460322 = idf(docFreq=488, maxDocs=42306)
                0.0625 = fieldNorm(doc=4914)
          0.06939829 = weight(abstract_txt:text in 4914) [ClassicSimilarity], result of:
            0.06939829 = score(doc=4914,freq=3.0), product of:
              0.15822026 = queryWeight, product of:
                2.697104 = boost
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.014478327 = queryNorm
              0.4386182 = fieldWeight in 4914, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.0625 = fieldNorm(doc=4914)
          0.25002572 = weight(abstract_txt:chinese in 4914) [ClassicSimilarity], result of:
            0.25002572 = score(doc=4914,freq=6.0), product of:
              0.25781742 = queryWeight, product of:
                2.8111055 = boost
                6.334564 = idf(docFreq=203, maxDocs=42306)
                0.014478327 = queryNorm
              0.9697782 = fieldWeight in 4914, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.334564 = idf(docFreq=203, maxDocs=42306)
                0.0625 = fieldNorm(doc=4914)
          0.69691855 = weight(abstract_txt:segmentation in 4914) [ClassicSimilarity], result of:
            0.69691855 = score(doc=4914,freq=4.0), product of:
              0.7044015 = queryWeight, product of:
                6.1468153 = boost
                7.9150147 = idf(docFreq=41, maxDocs=42306)
                0.014478327 = queryNorm
              0.98937684 = fieldWeight in 4914, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.9150147 = idf(docFreq=41, maxDocs=42306)
                0.0625 = fieldNorm(doc=4914)
        0.28 = coord(7/25)
    
  5. Doval, Y.; Gómez-Rodríguez, C.: Comparing neural- and N-gram-based language models for word segmentation (2019) 0.31
    0.3091187 = sum of:
      0.3091187 = product of:
        0.85866296 = sum of:
          0.0187729 = weight(abstract_txt:level in 1594) [ClassicSimilarity], result of:
            0.0187729 = score(doc=1594,freq=1.0), product of:
              0.06617854 = queryWeight, product of:
                1.0070814 = boost
                4.538728 = idf(docFreq=1228, maxDocs=42306)
                0.014478327 = queryNorm
              0.2836705 = fieldWeight in 1594, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.538728 = idf(docFreq=1228, maxDocs=42306)
                0.0625 = fieldNorm(doc=1594)
          0.010010078 = weight(abstract_txt:based in 1594) [ClassicSimilarity], result of:
            0.010010078 = score(doc=1594,freq=1.0), product of:
              0.049814027 = queryWeight, product of:
                1.0701077 = boost
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.014478327 = queryNorm
              0.20094898 = fieldWeight in 1594, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.0625 = fieldNorm(doc=1594)
          0.024327021 = weight(abstract_txt:task in 1594) [ClassicSimilarity], result of:
            0.024327021 = score(doc=1594,freq=1.0), product of:
              0.07866029 = queryWeight, product of:
                1.0979531 = boost
                4.94827 = idf(docFreq=815, maxDocs=42306)
                0.014478327 = queryNorm
              0.30926687 = fieldWeight in 1594, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.94827 = idf(docFreq=815, maxDocs=42306)
                0.0625 = fieldNorm(doc=1594)
          0.021844594 = weight(abstract_txt:approach in 1594) [ClassicSimilarity], result of:
            0.021844594 = score(doc=1594,freq=1.0), product of:
              0.092243455 = queryWeight, product of:
                1.6814687 = boost
                3.789033 = idf(docFreq=2600, maxDocs=42306)
                0.014478327 = queryNorm
              0.23681456 = fieldWeight in 1594, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.789033 = idf(docFreq=2600, maxDocs=42306)
                0.0625 = fieldNorm(doc=1594)
          0.04034189 = weight(abstract_txt:performance in 1594) [ClassicSimilarity], result of:
            0.04034189 = score(doc=1594,freq=1.0), product of:
              0.13884932 = queryWeight, product of:
                2.06297 = boost
                4.6487103 = idf(docFreq=1100, maxDocs=42306)
                0.014478327 = queryNorm
              0.2905444 = fieldWeight in 1594, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6487103 = idf(docFreq=1100, maxDocs=42306)
                0.0625 = fieldNorm(doc=1594)
          0.06431994 = weight(abstract_txt:language in 1594) [ClassicSimilarity], result of:
            0.06431994 = score(doc=1594,freq=3.0), product of:
              0.14153579 = queryWeight, product of:
                2.3286765 = boost
                4.197964 = idf(docFreq=1727, maxDocs=42306)
                0.014478327 = queryNorm
              0.45444295 = fieldWeight in 1594, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.197964 = idf(docFreq=1727, maxDocs=42306)
                0.0625 = fieldNorm(doc=1594)
          0.14618357 = weight(abstract_txt:word in 1594) [ClassicSimilarity], result of:
            0.14618357 = score(doc=1594,freq=5.0), product of:
              0.1915646 = queryWeight, product of:
                2.4231408 = boost
                5.460322 = idf(docFreq=488, maxDocs=42306)
                0.014478327 = queryNorm
              0.7631032 = fieldWeight in 1594, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.460322 = idf(docFreq=488, maxDocs=42306)
                0.0625 = fieldNorm(doc=1594)
          0.04006712 = weight(abstract_txt:text in 1594) [ClassicSimilarity], result of:
            0.04006712 = score(doc=1594,freq=1.0), product of:
              0.15822026 = queryWeight, product of:
                2.697104 = boost
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.014478327 = queryNorm
              0.25323635 = fieldWeight in 1594, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.0625 = fieldNorm(doc=1594)
          0.4927958 = weight(abstract_txt:segmentation in 1594) [ClassicSimilarity], result of:
            0.4927958 = score(doc=1594,freq=2.0), product of:
              0.7044015 = queryWeight, product of:
                6.1468153 = boost
                7.9150147 = idf(docFreq=41, maxDocs=42306)
                0.014478327 = queryNorm
              0.69959503 = fieldWeight in 1594, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.9150147 = idf(docFreq=41, maxDocs=42306)
                0.0625 = fieldNorm(doc=1594)
        0.36 = coord(9/25)