Document (#32929)

Author
Zhou, G.D.
Zhang, M.
Title
Extracting relation information from text documents by exploring various types of knowledge
Source
Information processing and management. 43(2007) no.4, S.969-982
Year
2007
Abstract
Extracting semantic relationships between entities from text documents is challenging in information extraction and important for deep information processing and management. This paper investigates the incorporation of diverse lexical, syntactic and semantic knowledge in feature-based relation extraction using support vector machines. Our study illustrates that the base phrase chunking information is very effective for relation extraction and contributes to most of the performance improvement from syntactic aspect while current commonly used features from full parsing give limited further enhancement. This suggests that most of useful information in full parse trees for relation extraction is shallow and can be captured by chunking. This indicates that a cheap and robust solution in relation extraction can be achieved without decreasing too much in performance. We also demonstrate how semantic information such as WordNet, can be used in feature-based relation extraction to further improve the performance. Evaluation on the ACE benchmark corpora shows that effective incorporation of diverse features enables our system outperform previously best-reported systems. It also shows that our feature-based system significantly outperforms tree kernel-based systems. This suggests that current tree kernels fail to effectively explore structured syntactic information in relation extraction.
Theme
Theorie verbaler Dokumentationssprachen

Similar documents (author)

  1. Zhou, L.; Zhang, D.: NLPIR: a theoretical framework for applying Natural Language Processing to information retrieval (2003) 5.24
    5.238286 = sum of:
      5.238286 = sum of:
        1.8128607 = weight(author_txt:zhang in 149) [ClassicSimilarity], result of:
          1.8128607 = score(doc=149,freq=1.0), product of:
            0.5475063 = queryWeight, product of:
              6.6222463 = idf(docFreq=152, maxDocs=42306)
              0.08267682 = queryNorm
            3.3111231 = fieldWeight in 149, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.6222463 = idf(docFreq=152, maxDocs=42306)
              0.5 = fieldNorm(doc=149)
        3.4254255 = weight(author_txt:zhou in 149) [ClassicSimilarity], result of:
          3.4254255 = score(doc=149,freq=1.0), product of:
            0.8368016 = queryWeight, product of:
              1.2362796 = boost
              8.186948 = idf(docFreq=31, maxDocs=42306)
              0.08267682 = queryNorm
            4.093474 = fieldWeight in 149, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.186948 = idf(docFreq=31, maxDocs=42306)
              0.5 = fieldNorm(doc=149)
    
  2. Zhang, M.; Zhou, G.D.; Aw, A.: Exploring syntactic structured features over parse trees for relation extraction using kernel methods (2008) 3.93
    3.9287148 = sum of:
      3.9287148 = sum of:
        1.3596456 = weight(author_txt:zhang in 4056) [ClassicSimilarity], result of:
          1.3596456 = score(doc=4056,freq=1.0), product of:
            0.5475063 = queryWeight, product of:
              6.6222463 = idf(docFreq=152, maxDocs=42306)
              0.08267682 = queryNorm
            2.4833424 = fieldWeight in 4056, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.6222463 = idf(docFreq=152, maxDocs=42306)
              0.375 = fieldNorm(doc=4056)
        2.5690691 = weight(author_txt:zhou in 4056) [ClassicSimilarity], result of:
          2.5690691 = score(doc=4056,freq=1.0), product of:
            0.8368016 = queryWeight, product of:
              1.2362796 = boost
              8.186948 = idf(docFreq=31, maxDocs=42306)
              0.08267682 = queryNorm
            3.0701056 = fieldWeight in 4056, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.186948 = idf(docFreq=31, maxDocs=42306)
              0.375 = fieldNorm(doc=4056)
    
  3. Zhang, D.; Zambrowicz, C.; Zhou, H.; Roderer, N.K.: User information seeking behavior in a medical Web portal environment : a preliminary study (2004) 3.27
    3.2739286 = sum of:
      3.2739286 = sum of:
        1.1330379 = weight(author_txt:zhang in 3262) [ClassicSimilarity], result of:
          1.1330379 = score(doc=3262,freq=1.0), product of:
            0.5475063 = queryWeight, product of:
              6.6222463 = idf(docFreq=152, maxDocs=42306)
              0.08267682 = queryNorm
            2.069452 = fieldWeight in 3262, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.6222463 = idf(docFreq=152, maxDocs=42306)
              0.3125 = fieldNorm(doc=3262)
        2.1408908 = weight(author_txt:zhou in 3262) [ClassicSimilarity], result of:
          2.1408908 = score(doc=3262,freq=1.0), product of:
            0.8368016 = queryWeight, product of:
              1.2362796 = boost
              8.186948 = idf(docFreq=31, maxDocs=42306)
              0.08267682 = queryNorm
            2.5584211 = fieldWeight in 3262, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.186948 = idf(docFreq=31, maxDocs=42306)
              0.3125 = fieldNorm(doc=3262)
    
  4. Zhou, G.D.; Zhang, M.; Ji, D.H.; Zhu, Q.M.: Hierarchical learning strategy in semantic relation extraction (2008) 3.27
    3.2739286 = sum of:
      3.2739286 = sum of:
        1.1330379 = weight(author_txt:zhang in 4078) [ClassicSimilarity], result of:
          1.1330379 = score(doc=4078,freq=1.0), product of:
            0.5475063 = queryWeight, product of:
              6.6222463 = idf(docFreq=152, maxDocs=42306)
              0.08267682 = queryNorm
            2.069452 = fieldWeight in 4078, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.6222463 = idf(docFreq=152, maxDocs=42306)
              0.3125 = fieldNorm(doc=4078)
        2.1408908 = weight(author_txt:zhou in 4078) [ClassicSimilarity], result of:
          2.1408908 = score(doc=4078,freq=1.0), product of:
            0.8368016 = queryWeight, product of:
              1.2362796 = boost
              8.186948 = idf(docFreq=31, maxDocs=42306)
              0.08267682 = queryNorm
            2.5584211 = fieldWeight in 4078, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.186948 = idf(docFreq=31, maxDocs=42306)
              0.3125 = fieldNorm(doc=4078)
    
  5. Chang, K.-C.; Zhou, W.; Zhang, S.; Yuan, C,-C.: Threshold effects of the patent H-index in the relationship between patent citations and market value (2015) 3.27
    3.2739286 = sum of:
      3.2739286 = sum of:
        1.1330379 = weight(author_txt:zhang in 4345) [ClassicSimilarity], result of:
          1.1330379 = score(doc=4345,freq=1.0), product of:
            0.5475063 = queryWeight, product of:
              6.6222463 = idf(docFreq=152, maxDocs=42306)
              0.08267682 = queryNorm
            2.069452 = fieldWeight in 4345, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.6222463 = idf(docFreq=152, maxDocs=42306)
              0.3125 = fieldNorm(doc=4345)
        2.1408908 = weight(author_txt:zhou in 4345) [ClassicSimilarity], result of:
          2.1408908 = score(doc=4345,freq=1.0), product of:
            0.8368016 = queryWeight, product of:
              1.2362796 = boost
              8.186948 = idf(docFreq=31, maxDocs=42306)
              0.08267682 = queryNorm
            2.5584211 = fieldWeight in 4345, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.186948 = idf(docFreq=31, maxDocs=42306)
              0.3125 = fieldNorm(doc=4345)
    

Similar documents (content)

  1. Zhang, M.; Zhou, G.D.; Aw, A.: Exploring syntactic structured features over parse trees for relation extraction using kernel methods (2008) 1.55
    1.5497631 = sum of:
      1.5497631 = product of:
        2.1524487 = sum of:
          0.16778722 = weight(abstract_txt:parse in 4056) [ClassicSimilarity], result of:
            0.16778722 = score(doc=4056,freq=3.0), product of:
              0.17055427 = queryWeight, product of:
                1.0520629 = boost
                9.087735 = idf(docFreq=12, maxDocs=42306)
                0.017838782 = queryNorm
              0.98377615 = fieldWeight in 4056, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.087735 = idf(docFreq=12, maxDocs=42306)
                0.0625 = fieldNorm(doc=4056)
          0.049226623 = weight(abstract_txt:features in 4056) [ClassicSimilarity], result of:
            0.049226623 = score(doc=4056,freq=4.0), product of:
              0.086202614 = queryWeight, product of:
                1.0577563 = boost
                4.5684576 = idf(docFreq=1192, maxDocs=42306)
                0.017838782 = queryNorm
              0.5710572 = fieldWeight in 4056, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.5684576 = idf(docFreq=1192, maxDocs=42306)
                0.0625 = fieldNorm(doc=4056)
          0.029455349 = weight(abstract_txt:effective in 4056) [ClassicSimilarity], result of:
            0.029455349 = score(doc=4056,freq=1.0), product of:
              0.097166486 = queryWeight, product of:
                1.12301 = boost
                4.8502893 = idf(docFreq=899, maxDocs=42306)
                0.017838782 = queryNorm
              0.30314308 = fieldWeight in 4056, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8502893 = idf(docFreq=899, maxDocs=42306)
                0.0625 = fieldNorm(doc=4056)
          0.010795705 = weight(abstract_txt:this in 4056) [ClassicSimilarity], result of:
            0.010795705 = score(doc=4056,freq=2.0), product of:
              0.049763136 = queryWeight, product of:
                1.136565 = boost
                2.4544165 = idf(docFreq=9879, maxDocs=42306)
                0.017838782 = queryNorm
              0.21694182 = fieldWeight in 4056, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4544165 = idf(docFreq=9879, maxDocs=42306)
                0.0625 = fieldNorm(doc=4056)
          0.2143605 = weight(abstract_txt:kernels in 4056) [ClassicSimilarity], result of:
            0.2143605 = score(doc=4056,freq=3.0), product of:
              0.20081054 = queryWeight, product of:
                1.141573 = boost
                9.860925 = idf(docFreq=5, maxDocs=42306)
                0.017838782 = queryNorm
              1.0674764 = fieldWeight in 4056, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.860925 = idf(docFreq=5, maxDocs=42306)
                0.0625 = fieldNorm(doc=4056)
          0.060803145 = weight(abstract_txt:shows in 4056) [ClassicSimilarity], result of:
            0.060803145 = score(doc=4056,freq=3.0), product of:
              0.10922381 = queryWeight, product of:
                1.1906499 = boost
                5.142426 = idf(docFreq=671, maxDocs=42306)
                0.017838782 = queryNorm
              0.55668396 = fieldWeight in 4056, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.142426 = idf(docFreq=671, maxDocs=42306)
                0.0625 = fieldNorm(doc=4056)
          0.011295665 = weight(abstract_txt:from in 4056) [ClassicSimilarity], result of:
            0.011295665 = score(doc=4056,freq=1.0), product of:
              0.06461871 = queryWeight, product of:
                1.2951484 = boost
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.017838782 = queryNorm
              0.17480488 = fieldWeight in 4056, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.0625 = fieldNorm(doc=4056)
          0.017159656 = weight(abstract_txt:based in 4056) [ClassicSimilarity], result of:
            0.017159656 = score(doc=4056,freq=1.0), product of:
              0.08539309 = queryWeight, product of:
                1.4888529 = boost
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.017838782 = queryNorm
              0.20094898 = fieldWeight in 4056, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.0625 = fieldNorm(doc=4056)
          0.035257958 = weight(abstract_txt:semantic in 4056) [ClassicSimilarity], result of:
            0.035257958 = score(doc=4056,freq=1.0), product of:
              0.12539366 = queryWeight, product of:
                1.5624597 = boost
                4.4988503 = idf(docFreq=1278, maxDocs=42306)
                0.017838782 = queryNorm
              0.28117815 = fieldWeight in 4056, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4988503 = idf(docFreq=1278, maxDocs=42306)
                0.0625 = fieldNorm(doc=4056)
          0.21939547 = weight(abstract_txt:tree in 4056) [ClassicSimilarity], result of:
            0.21939547 = score(doc=4056,freq=7.0), product of:
              0.1937282 = queryWeight, product of:
                1.5857029 = boost
                6.8486633 = idf(docFreq=121, maxDocs=42306)
                0.017838782 = queryNorm
              1.1324912 = fieldWeight in 4056, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.8486633 = idf(docFreq=121, maxDocs=42306)
                0.0625 = fieldNorm(doc=4056)
          0.08674186 = weight(abstract_txt:extracting in 4056) [ClassicSimilarity], result of:
            0.08674186 = score(doc=4056,freq=1.0), product of:
              0.19963019 = queryWeight, product of:
                1.6096761 = boost
                6.9522038 = idf(docFreq=109, maxDocs=42306)
                0.017838782 = queryNorm
              0.43451273 = fieldWeight in 4056, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9522038 = idf(docFreq=109, maxDocs=42306)
                0.0625 = fieldNorm(doc=4056)
          0.038900025 = weight(abstract_txt:performance in 4056) [ClassicSimilarity], result of:
            0.038900025 = score(doc=4056,freq=1.0), product of:
              0.13388668 = queryWeight, product of:
                1.6145062 = boost
                4.6487103 = idf(docFreq=1100, maxDocs=42306)
                0.017838782 = queryNorm
              0.2905444 = fieldWeight in 4056, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6487103 = idf(docFreq=1100, maxDocs=42306)
                0.0625 = fieldNorm(doc=4056)
          0.02408432 = weight(abstract_txt:that in 4056) [ClassicSimilarity], result of:
            0.02408432 = score(doc=4056,freq=5.0), product of:
              0.07166059 = queryWeight, product of:
                1.6704221 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.017838782 = queryNorm
              0.33608878 = fieldWeight in 4056, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.0625 = fieldNorm(doc=4056)
          0.022617754 = weight(abstract_txt:information in 4056) [ClassicSimilarity], result of:
            0.022617754 = score(doc=4056,freq=3.0), product of:
              0.08577399 = queryWeight, product of:
                1.9739549 = boost
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.017838782 = queryNorm
              0.2636901 = fieldWeight in 4056, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.0625 = fieldNorm(doc=4056)
          0.11587163 = weight(abstract_txt:feature in 4056) [ClassicSimilarity], result of:
            0.11587163 = score(doc=4056,freq=2.0), product of:
              0.2199947 = queryWeight, product of:
                2.0695558 = boost
                5.958952 = idf(docFreq=296, maxDocs=42306)
                0.017838782 = queryNorm
              0.5267019 = fieldWeight in 4056, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.958952 = idf(docFreq=296, maxDocs=42306)
                0.0625 = fieldNorm(doc=4056)
          0.18562369 = weight(abstract_txt:syntactic in 4056) [ClassicSimilarity], result of:
            0.18562369 = score(doc=4056,freq=3.0), product of:
              0.26311928 = queryWeight, product of:
                2.2633274 = boost
                6.5168858 = idf(docFreq=169, maxDocs=42306)
                0.017838782 = queryNorm
              0.70547354 = fieldWeight in 4056, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.5168858 = idf(docFreq=169, maxDocs=42306)
                0.0625 = fieldNorm(doc=4056)
          0.32964495 = weight(abstract_txt:relation in 4056) [ClassicSimilarity], result of:
            0.32964495 = score(doc=4056,freq=5.0), product of:
              0.43165556 = queryWeight, product of:
                4.428209 = boost
                5.46442 = idf(docFreq=486, maxDocs=42306)
                0.017838782 = queryNorm
              0.7636759 = fieldWeight in 4056, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.46442 = idf(docFreq=486, maxDocs=42306)
                0.0625 = fieldNorm(doc=4056)
          0.5334272 = weight(abstract_txt:extraction in 4056) [ClassicSimilarity], result of:
            0.5334272 = score(doc=4056,freq=6.0), product of:
              0.55988145 = queryWeight, product of:
                5.043215 = boost
                6.2233386 = idf(docFreq=227, maxDocs=42306)
                0.017838782 = queryNorm
              0.95275027 = fieldWeight in 4056, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.2233386 = idf(docFreq=227, maxDocs=42306)
                0.0625 = fieldNorm(doc=4056)
        0.72 = coord(18/25)
    
  2. Li, J.; Zhang, Z.; Li, X.; Chen, H.: Kernel-based learning for biomedical relation extraction (2008) 0.62
    0.6215353 = sum of:
      0.6215353 = product of:
        1.5538383 = sum of:
          0.009542145 = weight(abstract_txt:this in 3612) [ClassicSimilarity], result of:
            0.009542145 = score(doc=3612,freq=1.0), product of:
              0.049763136 = queryWeight, product of:
                1.136565 = boost
                2.4544165 = idf(docFreq=9879, maxDocs=42306)
                0.017838782 = queryNorm
              0.19175129 = fieldWeight in 3612, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4544165 = idf(docFreq=9879, maxDocs=42306)
                0.078125 = fieldNorm(doc=3612)
          0.21878077 = weight(abstract_txt:kernels in 3612) [ClassicSimilarity], result of:
            0.21878077 = score(doc=3612,freq=2.0), product of:
              0.20081054 = queryWeight, product of:
                1.141573 = boost
                9.860925 = idf(docFreq=5, maxDocs=42306)
                0.017838782 = queryNorm
              1.0894885 = fieldWeight in 3612, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.860925 = idf(docFreq=5, maxDocs=42306)
                0.078125 = fieldNorm(doc=3612)
          0.019968104 = weight(abstract_txt:from in 3612) [ClassicSimilarity], result of:
            0.019968104 = score(doc=3612,freq=2.0), product of:
              0.06461871 = queryWeight, product of:
                1.2951484 = boost
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.017838782 = queryNorm
              0.3090143 = fieldWeight in 3612, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.078125 = fieldNorm(doc=3612)
          0.037151743 = weight(abstract_txt:based in 3612) [ClassicSimilarity], result of:
            0.037151743 = score(doc=3612,freq=3.0), product of:
              0.08539309 = queryWeight, product of:
                1.4888529 = boost
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.017838782 = queryNorm
              0.4350673 = fieldWeight in 3612, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.078125 = fieldNorm(doc=3612)
          0.20730925 = weight(abstract_txt:tree in 3612) [ClassicSimilarity], result of:
            0.20730925 = score(doc=3612,freq=4.0), product of:
              0.1937282 = queryWeight, product of:
                1.5857029 = boost
                6.8486633 = idf(docFreq=121, maxDocs=42306)
                0.017838782 = queryNorm
              1.0701036 = fieldWeight in 3612, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.8486633 = idf(docFreq=121, maxDocs=42306)
                0.078125 = fieldNorm(doc=3612)
          0.108427316 = weight(abstract_txt:extracting in 3612) [ClassicSimilarity], result of:
            0.108427316 = score(doc=3612,freq=1.0), product of:
              0.19963019 = queryWeight, product of:
                1.6096761 = boost
                6.9522038 = idf(docFreq=109, maxDocs=42306)
                0.017838782 = queryNorm
              0.5431409 = fieldWeight in 3612, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9522038 = idf(docFreq=109, maxDocs=42306)
                0.078125 = fieldNorm(doc=3612)
          0.013463545 = weight(abstract_txt:that in 3612) [ClassicSimilarity], result of:
            0.013463545 = score(doc=3612,freq=1.0), product of:
              0.07166059 = queryWeight, product of:
                1.6704221 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.017838782 = queryNorm
              0.18787934 = fieldWeight in 3612, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.078125 = fieldNorm(doc=3612)
          0.016322957 = weight(abstract_txt:information in 3612) [ClassicSimilarity], result of:
            0.016322957 = score(doc=3612,freq=1.0), product of:
              0.08577399 = queryWeight, product of:
                1.9739549 = boost
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.017838782 = queryNorm
              0.19030195 = fieldWeight in 3612, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.078125 = fieldNorm(doc=3612)
          0.45138496 = weight(abstract_txt:relation in 3612) [ClassicSimilarity], result of:
            0.45138496 = score(doc=3612,freq=6.0), product of:
              0.43165556 = queryWeight, product of:
                4.428209 = boost
                5.46442 = idf(docFreq=486, maxDocs=42306)
                0.017838782 = queryNorm
              1.0457064 = fieldWeight in 3612, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.46442 = idf(docFreq=486, maxDocs=42306)
                0.078125 = fieldNorm(doc=3612)
          0.47148746 = weight(abstract_txt:extraction in 3612) [ClassicSimilarity], result of:
            0.47148746 = score(doc=3612,freq=3.0), product of:
              0.55988145 = queryWeight, product of:
                5.043215 = boost
                6.2233386 = idf(docFreq=227, maxDocs=42306)
                0.017838782 = queryNorm
              0.8421202 = fieldWeight in 3612, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.2233386 = idf(docFreq=227, maxDocs=42306)
                0.078125 = fieldNorm(doc=3612)
        0.4 = coord(10/25)
    
  3. Ru, C.; Tang, J.; Li, S.; Xie, S.; Wang, T.: Using semantic similarity to reduce wrong labels in distant supervision for relation extraction (2018) 0.46
    0.45508766 = sum of:
      0.45508766 = product of:
        1.1377192 = sum of:
          0.03480848 = weight(abstract_txt:features in 1974) [ClassicSimilarity], result of:
            0.03480848 = score(doc=1974,freq=2.0), product of:
              0.086202614 = queryWeight, product of:
                1.0577563 = boost
                4.5684576 = idf(docFreq=1192, maxDocs=42306)
                0.017838782 = queryNorm
              0.4037984 = fieldWeight in 1974, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5684576 = idf(docFreq=1192, maxDocs=42306)
                0.0625 = fieldNorm(doc=1974)
          0.029455349 = weight(abstract_txt:effective in 1974) [ClassicSimilarity], result of:
            0.029455349 = score(doc=1974,freq=1.0), product of:
              0.097166486 = queryWeight, product of:
                1.12301 = boost
                4.8502893 = idf(docFreq=899, maxDocs=42306)
                0.017838782 = queryNorm
              0.30314308 = fieldWeight in 1974, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8502893 = idf(docFreq=899, maxDocs=42306)
                0.0625 = fieldNorm(doc=1974)
          0.0076337163 = weight(abstract_txt:this in 1974) [ClassicSimilarity], result of:
            0.0076337163 = score(doc=1974,freq=1.0), product of:
              0.049763136 = queryWeight, product of:
                1.136565 = boost
                2.4544165 = idf(docFreq=9879, maxDocs=42306)
                0.017838782 = queryNorm
              0.15340103 = fieldWeight in 1974, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4544165 = idf(docFreq=9879, maxDocs=42306)
                0.0625 = fieldNorm(doc=1974)
          0.015974483 = weight(abstract_txt:from in 1974) [ClassicSimilarity], result of:
            0.015974483 = score(doc=1974,freq=2.0), product of:
              0.06461871 = queryWeight, product of:
                1.2951484 = boost
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.017838782 = queryNorm
              0.24721143 = fieldWeight in 1974, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.0625 = fieldNorm(doc=1974)
          0.017159656 = weight(abstract_txt:based in 1974) [ClassicSimilarity], result of:
            0.017159656 = score(doc=1974,freq=1.0), product of:
              0.08539309 = queryWeight, product of:
                1.4888529 = boost
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.017838782 = queryNorm
              0.20094898 = fieldWeight in 1974, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.0625 = fieldNorm(doc=1974)
          0.070515916 = weight(abstract_txt:semantic in 1974) [ClassicSimilarity], result of:
            0.070515916 = score(doc=1974,freq=4.0), product of:
              0.12539366 = queryWeight, product of:
                1.5624597 = boost
                4.4988503 = idf(docFreq=1278, maxDocs=42306)
                0.017838782 = queryNorm
              0.5623563 = fieldWeight in 1974, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4988503 = idf(docFreq=1278, maxDocs=42306)
                0.0625 = fieldNorm(doc=1974)
          0.038900025 = weight(abstract_txt:performance in 1974) [ClassicSimilarity], result of:
            0.038900025 = score(doc=1974,freq=1.0), product of:
              0.13388668 = queryWeight, product of:
                1.6145062 = boost
                4.6487103 = idf(docFreq=1100, maxDocs=42306)
                0.017838782 = queryNorm
              0.2905444 = fieldWeight in 1974, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6487103 = idf(docFreq=1100, maxDocs=42306)
                0.0625 = fieldNorm(doc=1974)
          0.021541672 = weight(abstract_txt:that in 1974) [ClassicSimilarity], result of:
            0.021541672 = score(doc=1974,freq=4.0), product of:
              0.07166059 = queryWeight, product of:
                1.6704221 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.017838782 = queryNorm
              0.30060694 = fieldWeight in 1974, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.0625 = fieldNorm(doc=1974)
          0.46618834 = weight(abstract_txt:relation in 1974) [ClassicSimilarity], result of:
            0.46618834 = score(doc=1974,freq=10.0), product of:
              0.43165556 = queryWeight, product of:
                4.428209 = boost
                5.46442 = idf(docFreq=486, maxDocs=42306)
                0.017838782 = queryNorm
              1.0800008 = fieldWeight in 1974, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                5.46442 = idf(docFreq=486, maxDocs=42306)
                0.0625 = fieldNorm(doc=1974)
          0.43554148 = weight(abstract_txt:extraction in 1974) [ClassicSimilarity], result of:
            0.43554148 = score(doc=1974,freq=4.0), product of:
              0.55988145 = queryWeight, product of:
                5.043215 = boost
                6.2233386 = idf(docFreq=227, maxDocs=42306)
                0.017838782 = queryNorm
              0.7779173 = fieldWeight in 1974, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.2233386 = idf(docFreq=227, maxDocs=42306)
                0.0625 = fieldNorm(doc=1974)
        0.4 = coord(10/25)
    
  4. Vlachidis, A.; Tudhope, D.: ¬A knowledge-based approach to information extraction for semantic interoperability in the archaeology domain (2016) 0.35
    0.35255498 = sum of:
      0.35255498 = product of:
        0.97931933 = sum of:
          0.03510471 = weight(abstract_txt:shows in 4896) [ClassicSimilarity], result of:
            0.03510471 = score(doc=4896,freq=1.0), product of:
              0.10922381 = queryWeight, product of:
                1.1906499 = boost
                5.142426 = idf(docFreq=671, maxDocs=42306)
                0.017838782 = queryNorm
              0.32140163 = fieldWeight in 4896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.142426 = idf(docFreq=671, maxDocs=42306)
                0.0625 = fieldNorm(doc=4896)
          0.019564666 = weight(abstract_txt:from in 4896) [ClassicSimilarity], result of:
            0.019564666 = score(doc=4896,freq=3.0), product of:
              0.06461871 = queryWeight, product of:
                1.2951484 = boost
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.017838782 = queryNorm
              0.30277094 = fieldWeight in 4896, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.0625 = fieldNorm(doc=4896)
          0.029721394 = weight(abstract_txt:based in 4896) [ClassicSimilarity], result of:
            0.029721394 = score(doc=4896,freq=3.0), product of:
              0.08539309 = queryWeight, product of:
                1.4888529 = boost
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.017838782 = queryNorm
              0.34805384 = fieldWeight in 4896, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.0625 = fieldNorm(doc=4896)
          0.086364 = weight(abstract_txt:semantic in 4896) [ClassicSimilarity], result of:
            0.086364 = score(doc=4896,freq=6.0), product of:
              0.12539366 = queryWeight, product of:
                1.5624597 = boost
                4.4988503 = idf(docFreq=1278, maxDocs=42306)
                0.017838782 = queryNorm
              0.688743 = fieldWeight in 4896, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.4988503 = idf(docFreq=1278, maxDocs=42306)
                0.0625 = fieldNorm(doc=4896)
          0.038900025 = weight(abstract_txt:performance in 4896) [ClassicSimilarity], result of:
            0.038900025 = score(doc=4896,freq=1.0), product of:
              0.13388668 = queryWeight, product of:
                1.6145062 = boost
                4.6487103 = idf(docFreq=1100, maxDocs=42306)
                0.017838782 = queryNorm
              0.2905444 = fieldWeight in 4896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6487103 = idf(docFreq=1100, maxDocs=42306)
                0.0625 = fieldNorm(doc=4896)
          0.018467316 = weight(abstract_txt:information in 4896) [ClassicSimilarity], result of:
            0.018467316 = score(doc=4896,freq=2.0), product of:
              0.08577399 = queryWeight, product of:
                1.9739549 = boost
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.017838782 = queryNorm
              0.21530207 = fieldWeight in 4896, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.0625 = fieldNorm(doc=4896)
          0.1071699 = weight(abstract_txt:syntactic in 4896) [ClassicSimilarity], result of:
            0.1071699 = score(doc=4896,freq=1.0), product of:
              0.26311928 = queryWeight, product of:
                2.2633274 = boost
                6.5168858 = idf(docFreq=169, maxDocs=42306)
                0.017838782 = queryNorm
              0.40730536 = fieldWeight in 4896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5168858 = idf(docFreq=169, maxDocs=42306)
                0.0625 = fieldNorm(doc=4896)
          0.20848577 = weight(abstract_txt:relation in 4896) [ClassicSimilarity], result of:
            0.20848577 = score(doc=4896,freq=2.0), product of:
              0.43165556 = queryWeight, product of:
                4.428209 = boost
                5.46442 = idf(docFreq=486, maxDocs=42306)
                0.017838782 = queryNorm
              0.48299104 = fieldWeight in 4896, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.46442 = idf(docFreq=486, maxDocs=42306)
                0.0625 = fieldNorm(doc=4896)
          0.43554148 = weight(abstract_txt:extraction in 4896) [ClassicSimilarity], result of:
            0.43554148 = score(doc=4896,freq=4.0), product of:
              0.55988145 = queryWeight, product of:
                5.043215 = boost
                6.2233386 = idf(docFreq=227, maxDocs=42306)
                0.017838782 = queryNorm
              0.7779173 = fieldWeight in 4896, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.2233386 = idf(docFreq=227, maxDocs=42306)
                0.0625 = fieldNorm(doc=4896)
        0.36 = coord(9/25)
    
  5. Collovini de Abreu, S.; Vieira, R.: RelP: Portuguese open relation extraction (2017) 0.34
    0.3441667 = sum of:
      0.3441667 = product of:
        0.95601857 = sum of:
          0.024613312 = weight(abstract_txt:features in 540) [ClassicSimilarity], result of:
            0.024613312 = score(doc=540,freq=1.0), product of:
              0.086202614 = queryWeight, product of:
                1.0577563 = boost
                4.5684576 = idf(docFreq=1192, maxDocs=42306)
                0.017838782 = queryNorm
              0.2855286 = fieldWeight in 540, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5684576 = idf(docFreq=1192, maxDocs=42306)
                0.0625 = fieldNorm(doc=540)
          0.015267433 = weight(abstract_txt:this in 540) [ClassicSimilarity], result of:
            0.015267433 = score(doc=540,freq=4.0), product of:
              0.049763136 = queryWeight, product of:
                1.136565 = boost
                2.4544165 = idf(docFreq=9879, maxDocs=42306)
                0.017838782 = queryNorm
              0.30680206 = fieldWeight in 540, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4544165 = idf(docFreq=9879, maxDocs=42306)
                0.0625 = fieldNorm(doc=540)
          0.015974483 = weight(abstract_txt:from in 540) [ClassicSimilarity], result of:
            0.015974483 = score(doc=540,freq=2.0), product of:
              0.06461871 = queryWeight, product of:
                1.2951484 = boost
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.017838782 = queryNorm
              0.24721143 = fieldWeight in 540, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.0625 = fieldNorm(doc=540)
          0.017159656 = weight(abstract_txt:based in 540) [ClassicSimilarity], result of:
            0.017159656 = score(doc=540,freq=1.0), product of:
              0.08539309 = queryWeight, product of:
                1.4888529 = boost
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.017838782 = queryNorm
              0.20094898 = fieldWeight in 540, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.0625 = fieldNorm(doc=540)
          0.035257958 = weight(abstract_txt:semantic in 540) [ClassicSimilarity], result of:
            0.035257958 = score(doc=540,freq=1.0), product of:
              0.12539366 = queryWeight, product of:
                1.5624597 = boost
                4.4988503 = idf(docFreq=1278, maxDocs=42306)
                0.017838782 = queryNorm
              0.28117815 = fieldWeight in 540, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4988503 = idf(docFreq=1278, maxDocs=42306)
                0.0625 = fieldNorm(doc=540)
          0.015232262 = weight(abstract_txt:that in 540) [ClassicSimilarity], result of:
            0.015232262 = score(doc=540,freq=2.0), product of:
              0.07166059 = queryWeight, product of:
                1.6704221 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.017838782 = queryNorm
              0.2125612 = fieldWeight in 540, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.0625 = fieldNorm(doc=540)
          0.013058366 = weight(abstract_txt:information in 540) [ClassicSimilarity], result of:
            0.013058366 = score(doc=540,freq=1.0), product of:
              0.08577399 = queryWeight, product of:
                1.9739549 = boost
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.017838782 = queryNorm
              0.15224156 = fieldWeight in 540, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.0625 = fieldNorm(doc=540)
          0.4422651 = weight(abstract_txt:relation in 540) [ClassicSimilarity], result of:
            0.4422651 = score(doc=540,freq=9.0), product of:
              0.43165556 = queryWeight, product of:
                4.428209 = boost
                5.46442 = idf(docFreq=486, maxDocs=42306)
                0.017838782 = queryNorm
              1.0245787 = fieldWeight in 540, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                5.46442 = idf(docFreq=486, maxDocs=42306)
                0.0625 = fieldNorm(doc=540)
          0.37719 = weight(abstract_txt:extraction in 540) [ClassicSimilarity], result of:
            0.37719 = score(doc=540,freq=3.0), product of:
              0.55988145 = queryWeight, product of:
                5.043215 = boost
                6.2233386 = idf(docFreq=227, maxDocs=42306)
                0.017838782 = queryNorm
              0.67369616 = fieldWeight in 540, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.2233386 = idf(docFreq=227, maxDocs=42306)
                0.0625 = fieldNorm(doc=540)
        0.36 = coord(9/25)