Document (#34079)

Author
Zhou, G.D.
Zhang, M.
Ji, D.H.
Zhu, Q.M.
Title
Hierarchical learning strategy in semantic relation extraction
Source
Information processing and management. 44(2008) no.3, pp.1008-1021
Year
2008
Abstract
This paper proposes a novel hierarchical learning strategy that addresses the data sparseness problem in semantic relation extraction by modeling the commonality among related classes. For each class in the hierarchy, whether manually predefined or automatically clustered, a discriminative function is determined in a top-down way. Because an upper-level class normally has many more positive training examples than a lower-level class, its discriminative function can be determined more reliably and can guide the learning of the lower-level discriminative functions, which would otherwise suffer from limited training data. Two classifier learning approaches, the simple perceptron algorithm and state-of-the-art Support Vector Machines, are applied using the hierarchical learning strategy. Moreover, several kinds of class hierarchies, either manually predefined or automatically clustered, are explored and compared. Evaluation on the ACE RDC 2003 and 2004 corpora shows that the hierarchical learning strategy substantially improves performance on the least- and medium-frequent relations.
Theme
Automatic classification
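
The top-down strategy described in the abstract can be illustrated with a small perceptron sketch. The abstract does not say how an upper-level function guides a lower-level one, so this sketch assumes one simple mechanism: each class starts training from its parent's weight vector. The hierarchy, class names, features, and the helpers `train_perceptron`, `leaves_under`, and `train_hierarchy` are all hypothetical and are not taken from the paper.

```python
def train_perceptron(examples, init_weights, epochs=20):
    """Plain perceptron. examples: (feature_vector, label) with label in {+1, -1}.
    Training starts from init_weights (assumption: this is how a parent guides a child)."""
    w = list(init_weights)
    for _ in range(epochs):
        for x, y in examples:
            score = sum(wi * xi for wi, xi in zip(w, x))
            if y * score <= 0:  # misclassified or on the boundary: update
                w = [wi + y * xi for wi, xi in zip(w, x)]
    return w


def leaves_under(hierarchy, node):
    """Return the set of leaf classes below (or equal to) node."""
    children = hierarchy.get(node, [])
    if not children:
        return {node}
    leaves = set()
    for child in children:
        leaves |= leaves_under(hierarchy, child)
    return leaves


def train_hierarchy(hierarchy, data, dim, root="ROOT", epochs=20):
    """Train one binary classifier per class, top-down, seeding each from its parent."""
    weights = {root: [0.0] * dim}
    stack = [root]
    while stack:
        parent = stack.pop()
        for child in hierarchy.get(parent, []):
            positives = leaves_under(hierarchy, child)
            examples = [(x, 1 if label in positives else -1) for x, label in data]
            weights[child] = train_perceptron(examples, weights[parent], epochs)
            stack.append(child)
    return weights


def predict(w, x):
    """True if the linear score is positive."""
    return sum(wi * xi for wi, xi in zip(w, x)) > 0


# Toy data. Features (hypothetical): [social cue, family cue, business cue, bias].
hierarchy = {"ROOT": ["SOCIAL"], "SOCIAL": ["Family", "Business"]}
data = [
    ([1, 1, 0, 1], "Family"),
    ([1, 0, 1, 1], "Business"),
    ([1, 0, 1, 1], "Business"),
    ([0, 0, 0, 1], "NONE"),
    ([0, 0, 1, 1], "NONE"),
]
weights = train_hierarchy(hierarchy, data, dim=4)
```

In this toy setup the sparse `Family` class has a single positive example, but its classifier starts from the `SOCIAL` weights, which were learned from all three positive examples of the upper-level class.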

Similar documents (author)

  1. Zhou, L.; Zhang, D.: NLPIR: a theoretical framework for applying Natural Language Processing to information retrieval (2003) 5.22
    5.2187114 = sum of:
      5.2187114 = sum of:
        1.7995015 = weight(author_txt:zhang in 149) [ClassicSimilarity], result of:
          1.7995015 = score(doc=149,freq=1.0), product of:
            0.5460803 = queryWeight, product of:
              6.5906115 = idf(docFreq=158, maxDocs=42596)
              0.0828573 = queryNorm
            3.2953057 = fieldWeight in 149, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.5906115 = idf(docFreq=158, maxDocs=42596)
              0.5 = fieldNorm(doc=149)
        3.41921 = weight(author_txt:zhou in 149) [ClassicSimilarity], result of:
          3.41921 = score(doc=149,freq=1.0), product of:
            0.83773285 = queryWeight, product of:
              1.2385813 = boost
              8.163008 = idf(docFreq=32, maxDocs=42596)
              0.0828573 = queryNorm
            4.081504 = fieldWeight in 149, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.163008 = idf(docFreq=32, maxDocs=42596)
              0.5 = fieldNorm(doc=149)
    
  2. Zhou, G.D.; Zhang, M.: Extracting relation information from text documents by exploring various types of knowledge (2007) 5.22
    5.2187114 = sum of:
      5.2187114 = sum of:
        1.7995015 = weight(author_txt:zhang in 2107) [ClassicSimilarity], result of:
          1.7995015 = score(doc=2107,freq=1.0), product of:
            0.5460803 = queryWeight, product of:
              6.5906115 = idf(docFreq=158, maxDocs=42596)
              0.0828573 = queryNorm
            3.2953057 = fieldWeight in 2107, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.5906115 = idf(docFreq=158, maxDocs=42596)
              0.5 = fieldNorm(doc=2107)
        3.41921 = weight(author_txt:zhou in 2107) [ClassicSimilarity], result of:
          3.41921 = score(doc=2107,freq=1.0), product of:
            0.83773285 = queryWeight, product of:
              1.2385813 = boost
              8.163008 = idf(docFreq=32, maxDocs=42596)
              0.0828573 = queryNorm
            4.081504 = fieldWeight in 2107, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.163008 = idf(docFreq=32, maxDocs=42596)
              0.5 = fieldNorm(doc=2107)
    
  3. Zhang, M.; Zhou, G.D.; Aw, A.: Exploring syntactic structured features over parse trees for relation extraction using kernel methods (2008) 3.91
    3.9140334 = sum of:
      3.9140334 = sum of:
        1.3496262 = weight(author_txt:zhang in 3235) [ClassicSimilarity], result of:
          1.3496262 = score(doc=3235,freq=1.0), product of:
            0.5460803 = queryWeight, product of:
              6.5906115 = idf(docFreq=158, maxDocs=42596)
              0.0828573 = queryNorm
            2.4714794 = fieldWeight in 3235, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.5906115 = idf(docFreq=158, maxDocs=42596)
              0.375 = fieldNorm(doc=3235)
        2.5644073 = weight(author_txt:zhou in 3235) [ClassicSimilarity], result of:
          2.5644073 = score(doc=3235,freq=1.0), product of:
            0.83773285 = queryWeight, product of:
              1.2385813 = boost
              8.163008 = idf(docFreq=32, maxDocs=42596)
              0.0828573 = queryNorm
            3.061128 = fieldWeight in 3235, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.163008 = idf(docFreq=32, maxDocs=42596)
              0.375 = fieldNorm(doc=3235)
    
  4. Zhang, D.; Zambrowicz, C.; Zhou, H.; Roderer, N.K.: User information seeking behavior in a medical Web portal environment : a preliminary study (2004) 3.26
    3.2616947 = sum of:
      3.2616947 = sum of:
        1.1246884 = weight(author_txt:zhang in 3262) [ClassicSimilarity], result of:
          1.1246884 = score(doc=3262,freq=1.0), product of:
            0.5460803 = queryWeight, product of:
              6.5906115 = idf(docFreq=158, maxDocs=42596)
              0.0828573 = queryNorm
            2.059566 = fieldWeight in 3262, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.5906115 = idf(docFreq=158, maxDocs=42596)
              0.3125 = fieldNorm(doc=3262)
        2.1370063 = weight(author_txt:zhou in 3262) [ClassicSimilarity], result of:
          2.1370063 = score(doc=3262,freq=1.0), product of:
            0.83773285 = queryWeight, product of:
              1.2385813 = boost
              8.163008 = idf(docFreq=32, maxDocs=42596)
              0.0828573 = queryNorm
            2.55094 = fieldWeight in 3262, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.163008 = idf(docFreq=32, maxDocs=42596)
              0.3125 = fieldNorm(doc=3262)
    
  5. Chang, K.-C.; Zhou, W.; Zhang, S.; Yuan, C.-C.: Threshold effects of the patent H-index in the relationship between patent citations and market value (2015) 3.26
    3.2616947 = sum of:
      3.2616947 = sum of:
        1.1246884 = weight(author_txt:zhang in 3345) [ClassicSimilarity], result of:
          1.1246884 = score(doc=3345,freq=1.0), product of:
            0.5460803 = queryWeight, product of:
              6.5906115 = idf(docFreq=158, maxDocs=42596)
              0.0828573 = queryNorm
            2.059566 = fieldWeight in 3345, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.5906115 = idf(docFreq=158, maxDocs=42596)
              0.3125 = fieldNorm(doc=3345)
        2.1370063 = weight(author_txt:zhou in 3345) [ClassicSimilarity], result of:
          2.1370063 = score(doc=3345,freq=1.0), product of:
            0.83773285 = queryWeight, product of:
              1.2385813 = boost
              8.163008 = idf(docFreq=32, maxDocs=42596)
              0.0828573 = queryNorm
            2.55094 = fieldWeight in 3345, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.163008 = idf(docFreq=32, maxDocs=42596)
              0.3125 = fieldNorm(doc=3345)
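
The per-term numbers in the author matches above can be reproduced with a short sketch of the ClassicSimilarity arithmetic. The formulas here (tf as the square root of the term frequency, idf as 1 + ln(maxDocs / (docFreq + 1)), queryWeight = boost × idf × queryNorm, fieldWeight = tf × idf × fieldNorm) are inferred from the explain output itself and checked against the first match (doc=149). The function names are illustrative and are not Lucene API calls.

```python
import math


def classic_idf(doc_freq, max_docs):
    """idf as it appears in the explain output: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    """Score of one term in one document: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm
    field_weight = math.sqrt(freq) * idf * field_norm
    return query_weight * field_weight


# Values copied from the explanation of the first author match (doc=149).
zhang = term_score(1.0, 158, 42596, query_norm=0.0828573, field_norm=0.5)
zhou = term_score(1.0, 32, 42596, query_norm=0.0828573, field_norm=0.5,
                  boost=1.2385813)
total = zhang + zhou
print(round(zhang, 4), round(zhou, 4), round(total, 4))
```

Small differences in the last digits are expected, because the listing shows single-precision values while Python computes in double precision.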
    

Similar documents (content)

  1. Ru, C.; Tang, J.; Li, S.; Xie, S.; Wang, T.: Using semantic similarity to reduce wrong labels in distant supervision for relation extraction (2018) 0.14
    0.13535747 = sum of:
      0.13535747 = product of:
        0.48341954 = sum of:
          0.07056234 = weight(abstract_txt:suffer in 653) [ClassicSimilarity], result of:
            0.07056234 = score(doc=653,freq=1.0), product of:
              0.1361514 = queryWeight, product of:
                8.29222 = idf(docFreq=28, maxDocs=42596)
                0.016419174 = queryNorm
              0.51826376 = fieldWeight in 653, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.29222 = idf(docFreq=28, maxDocs=42596)
                0.0625 = fieldNorm(doc=653)
          0.045068465 = weight(abstract_txt:semantic in 653) [ClassicSimilarity], result of:
            0.045068465 = score(doc=653,freq=4.0), product of:
              0.08014541 = queryWeight, product of:
                1.0850338 = boost
                4.4986696 = idf(docFreq=1287, maxDocs=42596)
                0.016419174 = queryNorm
              0.5623337 = fieldWeight in 653, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4986696 = idf(docFreq=1287, maxDocs=42596)
                0.0625 = fieldNorm(doc=653)
          0.030730214 = weight(abstract_txt:much in 653) [ClassicSimilarity], result of:
            0.030730214 = score(doc=653,freq=1.0), product of:
              0.09855845 = queryWeight, product of:
                1.2032361 = boost
                4.9887495 = idf(docFreq=788, maxDocs=42596)
                0.016419174 = queryNorm
              0.31179684 = fieldWeight in 653, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9887495 = idf(docFreq=788, maxDocs=42596)
                0.0625 = fieldNorm(doc=653)
          0.03341698 = weight(abstract_txt:training in 653) [ClassicSimilarity], result of:
            0.03341698 = score(doc=653,freq=1.0), product of:
              0.10422253 = queryWeight, product of:
                1.2373277 = boost
                5.130097 = idf(docFreq=684, maxDocs=42596)
                0.016419174 = queryNorm
              0.32063106 = fieldWeight in 653, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.130097 = idf(docFreq=684, maxDocs=42596)
                0.0625 = fieldNorm(doc=653)
          0.12634616 = weight(abstract_txt:relation in 653) [ClassicSimilarity], result of:
            0.12634616 = score(doc=653,freq=10.0), product of:
              0.11740633 = queryWeight, product of:
                1.3132569 = boost
                5.4449077 = idf(docFreq=499, maxDocs=42596)
                0.016419174 = queryNorm
              1.0761443 = fieldWeight in 653, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                5.4449077 = idf(docFreq=499, maxDocs=42596)
                0.0625 = fieldNorm(doc=653)
          0.058587898 = weight(abstract_txt:automatically in 653) [ClassicSimilarity], result of:
            0.058587898 = score(doc=653,freq=2.0), product of:
              0.12027594 = queryWeight, product of:
                1.3292091 = boost
                5.5110474 = idf(docFreq=467, maxDocs=42596)
                0.016419174 = queryNorm
              0.48711237 = fieldWeight in 653, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5110474 = idf(docFreq=467, maxDocs=42596)
                0.0625 = fieldNorm(doc=653)
          0.11870746 = weight(abstract_txt:extraction in 653) [ClassicSimilarity], result of:
            0.11870746 = score(doc=653,freq=4.0), product of:
              0.15285589 = queryWeight, product of:
                1.4984596 = boost
                6.212778 = idf(docFreq=231, maxDocs=42596)
                0.016419174 = queryNorm
              0.77659726 = fieldWeight in 653, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.212778 = idf(docFreq=231, maxDocs=42596)
                0.0625 = fieldNorm(doc=653)
        0.28 = coord(7/25)
    
  2. Jiang, X.; Tan, A.-H.: CRCTOL: a semantic-based domain ontology learning system (2009) 0.13
    0.12819721 = sum of:
      0.12819721 = product of:
        0.4006163 = sum of:
          0.034151632 = weight(abstract_txt:semantic in 4500) [ClassicSimilarity], result of:
            0.034151632 = score(doc=4500,freq=3.0), product of:
              0.08014541 = queryWeight, product of:
                1.0850338 = boost
                4.4986696 = idf(docFreq=1287, maxDocs=42596)
                0.016419174 = queryNorm
              0.42612085 = fieldWeight in 4500, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4986696 = idf(docFreq=1287, maxDocs=42596)
                0.0546875 = fieldNorm(doc=4500)
          0.01308776 = weight(abstract_txt:more in 4500) [ClassicSimilarity], result of:
            0.01308776 = score(doc=4500,freq=1.0), product of:
              0.069810174 = queryWeight, product of:
                1.2402492 = boost
                3.4281397 = idf(docFreq=3756, maxDocs=42596)
                0.016419174 = queryNorm
              0.1874764 = fieldWeight in 4500, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4281397 = idf(docFreq=3756, maxDocs=42596)
                0.0546875 = fieldNorm(doc=4500)
          0.049440753 = weight(abstract_txt:relation in 4500) [ClassicSimilarity], result of:
            0.049440753 = score(doc=4500,freq=2.0), product of:
              0.11740633 = queryWeight, product of:
                1.3132569 = boost
                5.4449077 = idf(docFreq=499, maxDocs=42596)
                0.016419174 = queryNorm
              0.42110807 = fieldWeight in 4500, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4449077 = idf(docFreq=499, maxDocs=42596)
                0.0546875 = fieldNorm(doc=4500)
          0.03624941 = weight(abstract_txt:automatically in 4500) [ClassicSimilarity], result of:
            0.03624941 = score(doc=4500,freq=1.0), product of:
              0.12027594 = queryWeight, product of:
                1.3292091 = boost
                5.5110474 = idf(docFreq=467, maxDocs=42596)
                0.016419174 = queryNorm
              0.3013854 = fieldWeight in 4500, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5110474 = idf(docFreq=467, maxDocs=42596)
                0.0546875 = fieldNorm(doc=4500)
          0.038088866 = weight(abstract_txt:either in 4500) [ClassicSimilarity], result of:
            0.038088866 = score(doc=4500,freq=1.0), product of:
              0.124311164 = queryWeight, product of:
                1.3513225 = boost
                5.6027317 = idf(docFreq=426, maxDocs=42596)
                0.016419174 = queryNorm
              0.3063994 = fieldWeight in 4500, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6027317 = idf(docFreq=426, maxDocs=42596)
                0.0546875 = fieldNorm(doc=4500)
          0.051934518 = weight(abstract_txt:extraction in 4500) [ClassicSimilarity], result of:
            0.051934518 = score(doc=4500,freq=1.0), product of:
              0.15285589 = queryWeight, product of:
                1.4984596 = boost
                6.212778 = idf(docFreq=231, maxDocs=42596)
                0.016419174 = queryNorm
              0.33976132 = fieldWeight in 4500, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.212778 = idf(docFreq=231, maxDocs=42596)
                0.0546875 = fieldNorm(doc=4500)
          0.052396193 = weight(abstract_txt:level in 4500) [ClassicSimilarity], result of:
            0.052396193 = score(doc=4500,freq=3.0), product of:
              0.122039735 = queryWeight, product of:
                1.6398351 = boost
                4.5326247 = idf(docFreq=1244, maxDocs=42596)
                0.016419174 = queryNorm
              0.42933714 = fieldWeight in 4500, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5326247 = idf(docFreq=1244, maxDocs=42596)
                0.0546875 = fieldNorm(doc=4500)
          0.12526716 = weight(abstract_txt:learning in 4500) [ClassicSimilarity], result of:
            0.12526716 = score(doc=4500,freq=3.0), product of:
              0.27491784 = queryWeight, product of:
                3.4806955 = boost
                4.810449 = idf(docFreq=942, maxDocs=42596)
                0.016419174 = queryNorm
              0.4556531 = fieldWeight in 4500, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.810449 = idf(docFreq=942, maxDocs=42596)
                0.0546875 = fieldNorm(doc=4500)
        0.32 = coord(8/25)
    
  3. Yu, N.: Exploring co-training strategies for opinion detection (2014) 0.12
    0.12397178 = sum of:
      0.12397178 = product of:
        0.44275635 = sum of:
          0.030730214 = weight(abstract_txt:much in 2504) [ClassicSimilarity], result of:
            0.030730214 = score(doc=2504,freq=1.0), product of:
              0.09855845 = queryWeight, product of:
                1.2032361 = boost
                4.9887495 = idf(docFreq=788, maxDocs=42596)
                0.016419174 = queryNorm
              0.31179684 = fieldWeight in 2504, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9887495 = idf(docFreq=788, maxDocs=42596)
                0.0625 = fieldNorm(doc=2504)
          0.06683396 = weight(abstract_txt:training in 2504) [ClassicSimilarity], result of:
            0.06683396 = score(doc=2504,freq=4.0), product of:
              0.10422253 = queryWeight, product of:
                1.2373277 = boost
                5.130097 = idf(docFreq=684, maxDocs=42596)
                0.016419174 = queryNorm
              0.6412621 = fieldWeight in 2504, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.130097 = idf(docFreq=684, maxDocs=42596)
                0.0625 = fieldNorm(doc=2504)
          0.014957439 = weight(abstract_txt:more in 2504) [ClassicSimilarity], result of:
            0.014957439 = score(doc=2504,freq=1.0), product of:
              0.069810174 = queryWeight, product of:
                1.2402492 = boost
                3.4281397 = idf(docFreq=3756, maxDocs=42596)
                0.016419174 = queryNorm
              0.21425873 = fieldWeight in 2504, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4281397 = idf(docFreq=3756, maxDocs=42596)
                0.0625 = fieldNorm(doc=2504)
          0.058587898 = weight(abstract_txt:automatically in 2504) [ClassicSimilarity], result of:
            0.058587898 = score(doc=2504,freq=2.0), product of:
              0.12027594 = queryWeight, product of:
                1.3292091 = boost
                5.5110474 = idf(docFreq=467, maxDocs=42596)
                0.016419174 = queryNorm
              0.48711237 = fieldWeight in 2504, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5110474 = idf(docFreq=467, maxDocs=42596)
                0.0625 = fieldNorm(doc=2504)
          0.03457252 = weight(abstract_txt:level in 2504) [ClassicSimilarity], result of:
            0.03457252 = score(doc=2504,freq=1.0), product of:
              0.122039735 = queryWeight, product of:
                1.6398351 = boost
                4.5326247 = idf(docFreq=1244, maxDocs=42596)
                0.016419174 = queryNorm
              0.28328905 = fieldWeight in 2504, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5326247 = idf(docFreq=1244, maxDocs=42596)
                0.0625 = fieldNorm(doc=2504)
          0.09391181 = weight(abstract_txt:strategy in 2504) [ClassicSimilarity], result of:
            0.09391181 = score(doc=2504,freq=1.0), product of:
              0.26150116 = queryWeight, product of:
                2.7717607 = boost
                5.7460127 = idf(docFreq=369, maxDocs=42596)
                0.016419174 = queryNorm
              0.3591258 = fieldWeight in 2504, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7460127 = idf(docFreq=369, maxDocs=42596)
                0.0625 = fieldNorm(doc=2504)
          0.14316247 = weight(abstract_txt:learning in 2504) [ClassicSimilarity], result of:
            0.14316247 = score(doc=2504,freq=3.0), product of:
              0.27491784 = queryWeight, product of:
                3.4806955 = boost
                4.810449 = idf(docFreq=942, maxDocs=42596)
                0.016419174 = queryNorm
              0.5207464 = fieldWeight in 2504, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.810449 = idf(docFreq=942, maxDocs=42596)
                0.0625 = fieldNorm(doc=2504)
        0.28 = coord(7/25)
    
  4. Li, J.; Zhang, Z.; Li, X.; Chen, H.: Kernel-based learning for biomedical relation extraction (2008) 0.11
    0.110300325 = sum of:
      0.110300325 = product of:
        0.55150163 = sum of:
          0.12233416 = weight(abstract_txt:relation in 2791) [ClassicSimilarity], result of:
            0.12233416 = score(doc=2791,freq=6.0), product of:
              0.11740633 = queryWeight, product of:
                1.3132569 = boost
                5.4449077 = idf(docFreq=499, maxDocs=42596)
                0.016419174 = queryNorm
              1.0419724 = fieldWeight in 2791, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.4449077 = idf(docFreq=499, maxDocs=42596)
                0.078125 = fieldNorm(doc=2791)
          0.07323487 = weight(abstract_txt:automatically in 2791) [ClassicSimilarity], result of:
            0.07323487 = score(doc=2791,freq=2.0), product of:
              0.12027594 = queryWeight, product of:
                1.3292091 = boost
                5.5110474 = idf(docFreq=467, maxDocs=42596)
                0.016419174 = queryNorm
              0.6088905 = fieldWeight in 2791, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5110474 = idf(docFreq=467, maxDocs=42596)
                0.078125 = fieldNorm(doc=2791)
          0.1285046 = weight(abstract_txt:extraction in 2791) [ClassicSimilarity], result of:
            0.1285046 = score(doc=2791,freq=3.0), product of:
              0.15285589 = queryWeight, product of:
                1.4984596 = boost
                6.212778 = idf(docFreq=231, maxDocs=42596)
                0.016419174 = queryNorm
              0.8406912 = fieldWeight in 2791, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.212778 = idf(docFreq=231, maxDocs=42596)
                0.078125 = fieldNorm(doc=2791)
          0.081313394 = weight(abstract_txt:function in 2791) [ClassicSimilarity], result of:
            0.081313394 = score(doc=2791,freq=1.0), product of:
              0.186001 = queryWeight, product of:
                2.0244508 = boost
                5.5957303 = idf(docFreq=429, maxDocs=42596)
                0.016419174 = queryNorm
              0.43716642 = fieldWeight in 2791, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5957303 = idf(docFreq=429, maxDocs=42596)
                0.078125 = fieldNorm(doc=2791)
          0.14611459 = weight(abstract_txt:learning in 2791) [ClassicSimilarity], result of:
            0.14611459 = score(doc=2791,freq=2.0), product of:
              0.27491784 = queryWeight, product of:
                3.4806955 = boost
                4.810449 = idf(docFreq=942, maxDocs=42596)
                0.016419174 = queryNorm
              0.53148454 = fieldWeight in 2791, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.810449 = idf(docFreq=942, maxDocs=42596)
                0.078125 = fieldNorm(doc=2791)
        0.2 = coord(5/25)
    
  5. Wu, T.; Pottenger, W.M.: A semi-supervised active learning algorithm for information extraction from textual data (2005) 0.11
    0.108430125 = sum of:
      0.108430125 = product of:
        0.5421506 = sum of:
          0.057879906 = weight(abstract_txt:training in 4238) [ClassicSimilarity], result of:
            0.057879906 = score(doc=4238,freq=3.0), product of:
              0.10422253 = queryWeight, product of:
                1.2373277 = boost
                5.130097 = idf(docFreq=684, maxDocs=42596)
                0.016419174 = queryNorm
              0.5553493 = fieldWeight in 4238, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.130097 = idf(docFreq=684, maxDocs=42596)
                0.0625 = fieldNorm(doc=4238)
          0.058587898 = weight(abstract_txt:automatically in 4238) [ClassicSimilarity], result of:
            0.058587898 = score(doc=4238,freq=2.0), product of:
              0.12027594 = queryWeight, product of:
                1.3292091 = boost
                5.5110474 = idf(docFreq=467, maxDocs=42596)
                0.016419174 = queryNorm
              0.48711237 = fieldWeight in 4238, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5110474 = idf(docFreq=467, maxDocs=42596)
                0.0625 = fieldNorm(doc=4238)
          0.11870746 = weight(abstract_txt:extraction in 4238) [ClassicSimilarity], result of:
            0.11870746 = score(doc=4238,freq=4.0), product of:
              0.15285589 = queryWeight, product of:
                1.4984596 = boost
                6.212778 = idf(docFreq=231, maxDocs=42596)
                0.016419174 = queryNorm
              0.77659726 = fieldWeight in 4238, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.212778 = idf(docFreq=231, maxDocs=42596)
                0.0625 = fieldNorm(doc=4238)
          0.073192015 = weight(abstract_txt:manually in 4238) [ClassicSimilarity], result of:
            0.073192015 = score(doc=4238,freq=1.0), product of:
              0.17577589 = queryWeight, product of:
                1.6068805 = boost
                6.6623034 = idf(docFreq=147, maxDocs=42596)
                0.016419174 = queryNorm
              0.41639397 = fieldWeight in 4238, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6623034 = idf(docFreq=147, maxDocs=42596)
                0.0625 = fieldNorm(doc=4238)
          0.23378333 = weight(abstract_txt:learning in 4238) [ClassicSimilarity], result of:
            0.23378333 = score(doc=4238,freq=8.0), product of:
              0.27491784 = queryWeight, product of:
                3.4806955 = boost
                4.810449 = idf(docFreq=942, maxDocs=42596)
                0.016419174 = queryNorm
              0.8503753 = fieldWeight in 4238, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.810449 = idf(docFreq=942, maxDocs=42596)
                0.0625 = fieldNorm(doc=4238)
        0.2 = coord(5/25)
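
For the content matches, the final score is the sum of the per-term scores multiplied by the coordination factor coord(matched/total). The sketch below recomputes the first content match (doc=653) from the per-term values listed in its explanation. The `coord` helper name is illustrative; the numbers are copied from the listing.

```python
def coord(matched_terms, query_terms):
    """Fraction of query terms that matched the document."""
    return matched_terms / query_terms


# Per-term scores for doc=653, in the order they appear in the explanation:
# suffer, semantic, much, training, relation, automatically, extraction.
term_scores = [
    0.07056234,
    0.045068465,
    0.030730214,
    0.03341698,
    0.12634616,
    0.058587898,
    0.11870746,
]

raw_sum = sum(term_scores)
score = raw_sum * coord(len(term_scores), 25)  # 7 of 25 query terms matched
print(round(raw_sum, 6), round(score, 6))
```

The coordination factor is why doc=653, with 7 matching terms, ranks above doc=2791, whose raw sum is higher (0.55150163) but which matches only 5 of the 25 query terms.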