Document (#36368)

Author
Suakkaphong, N.
Zhang, Z.
Chen, H.
Title
Disease named entity recognition using semisupervised learning and conditional random fields
Source
Journal of the American Society for Information Science and Technology. 62(2011) no.4, S.727-737
Year
2011
Abstract
Information extraction is an important text-mining task that aims at extracting prespecified types of information from large text collections and making them available in structured representations such as databases. In the biomedical domain, information extraction can be applied to help biologists make the most use of their digital-literature archives. Currently, there are large amounts of biomedical literature that contain rich information about biomedical substances. Extracting such knowledge requires a good named entity recognition technique. In this article, we combine conditional random fields (CRFs), a state-of-the-art sequence-labeling algorithm, with two semisupervised learning techniques, bootstrapping and feature sampling, to recognize disease names from biomedical literature. Two data-processing strategies for each technique also were analyzed: one sequentially processing unlabeled data partitions and another one processing unlabeled data partitions in a round-robin fashion. The experimental results showed the advantage of semisupervised learning techniques given limited labeled training data. Specifically, CRFs with bootstrapping implemented in sequential fashion outperformed strictly supervised CRFs for disease name recognition. The project was supported by NIH/NLM Grant R33 LM07299-01, 2002-2005.
Theme
Data Mining

Similar documents (author)

  1. Chen, Z.; Wenyin, L.; Zhang, F.; Li, M.; Zhang, H.: Web mining for Web image retrieval (2001) 3.39
    3.3916821 = sum of:
      3.3916821 = sum of:
        1.2993189 = weight(author_txt:chen in 6521) [ClassicSimilarity], result of:
          1.2993189 = score(doc=6521,freq=1.0), product of:
            0.6758805 = queryWeight, product of:
              6.1517096 = idf(docFreq=255, maxDocs=44218)
              0.10986873 = queryNorm
            1.9224093 = fieldWeight in 6521, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.1517096 = idf(docFreq=255, maxDocs=44218)
              0.3125 = fieldNorm(doc=6521)
        2.0923634 = weight(author_txt:zhang in 6521) [ClassicSimilarity], result of:
          2.0923634 = score(doc=6521,freq=2.0), product of:
            0.7370113 = queryWeight, product of:
              1.0442443 = boost
              6.4238877 = idf(docFreq=194, maxDocs=44218)
              0.10986873 = queryNorm
            2.838984 = fieldWeight in 6521, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              6.4238877 = idf(docFreq=194, maxDocs=44218)
              0.3125 = fieldNorm(doc=6521)
    
  2. Chen, H.; Zhang, Y.; Houston, A.L.: Semantic indexing and searching using a Hopfield net (1998) 3.33
    3.334612 = sum of:
      3.334612 = sum of:
        1.5591826 = weight(author_txt:chen in 5704) [ClassicSimilarity], result of:
          1.5591826 = score(doc=5704,freq=1.0), product of:
            0.6758805 = queryWeight, product of:
              6.1517096 = idf(docFreq=255, maxDocs=44218)
              0.10986873 = queryNorm
            2.306891 = fieldWeight in 5704, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.1517096 = idf(docFreq=255, maxDocs=44218)
              0.375 = fieldNorm(doc=5704)
        1.7754292 = weight(author_txt:zhang in 5704) [ClassicSimilarity], result of:
          1.7754292 = score(doc=5704,freq=1.0), product of:
            0.7370113 = queryWeight, product of:
              1.0442443 = boost
              6.4238877 = idf(docFreq=194, maxDocs=44218)
              0.10986873 = queryNorm
            2.408958 = fieldWeight in 5704, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.4238877 = idf(docFreq=194, maxDocs=44218)
              0.375 = fieldNorm(doc=5704)
    
  3. Wenyin, L.; Chen, Z.; Li, M.; Zhang, H.: ¬A media agent for automatically builiding a personalized semantic index of Web media objects (2001) 2.78
    2.7788434 = sum of:
      2.7788434 = sum of:
        1.2993189 = weight(author_txt:chen in 6522) [ClassicSimilarity], result of:
          1.2993189 = score(doc=6522,freq=1.0), product of:
            0.6758805 = queryWeight, product of:
              6.1517096 = idf(docFreq=255, maxDocs=44218)
              0.10986873 = queryNorm
            1.9224093 = fieldWeight in 6522, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.1517096 = idf(docFreq=255, maxDocs=44218)
              0.3125 = fieldNorm(doc=6522)
        1.4795244 = weight(author_txt:zhang in 6522) [ClassicSimilarity], result of:
          1.4795244 = score(doc=6522,freq=1.0), product of:
            0.7370113 = queryWeight, product of:
              1.0442443 = boost
              6.4238877 = idf(docFreq=194, maxDocs=44218)
              0.10986873 = queryNorm
            2.007465 = fieldWeight in 6522, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.4238877 = idf(docFreq=194, maxDocs=44218)
              0.3125 = fieldNorm(doc=6522)
    
  4. Li, J.; Zhang, Z.; Li, X.; Chen, H.: Kernel-based learning for biomedical relation extraction (2008) 2.78
    2.7788434 = sum of:
      2.7788434 = sum of:
        1.2993189 = weight(author_txt:chen in 1611) [ClassicSimilarity], result of:
          1.2993189 = score(doc=1611,freq=1.0), product of:
            0.6758805 = queryWeight, product of:
              6.1517096 = idf(docFreq=255, maxDocs=44218)
              0.10986873 = queryNorm
            1.9224093 = fieldWeight in 1611, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.1517096 = idf(docFreq=255, maxDocs=44218)
              0.3125 = fieldNorm(doc=1611)
        1.4795244 = weight(author_txt:zhang in 1611) [ClassicSimilarity], result of:
          1.4795244 = score(doc=1611,freq=1.0), product of:
            0.7370113 = queryWeight, product of:
              1.0442443 = boost
              6.4238877 = idf(docFreq=194, maxDocs=44218)
              0.10986873 = queryNorm
            2.007465 = fieldWeight in 1611, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.4238877 = idf(docFreq=194, maxDocs=44218)
              0.3125 = fieldNorm(doc=1611)
    
  5. Chen, C.; Ibekwe-SanJuan, F.; Pinho, R.; Zhang, J.: ¬The impact of the sloan digital sky survey on astronomical research : the role of culture, identity, and international collaboration (2008) 2.78
    2.7788434 = sum of:
      2.7788434 = sum of:
        1.2993189 = weight(author_txt:chen in 2275) [ClassicSimilarity], result of:
          1.2993189 = score(doc=2275,freq=1.0), product of:
            0.6758805 = queryWeight, product of:
              6.1517096 = idf(docFreq=255, maxDocs=44218)
              0.10986873 = queryNorm
            1.9224093 = fieldWeight in 2275, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.1517096 = idf(docFreq=255, maxDocs=44218)
              0.3125 = fieldNorm(doc=2275)
        1.4795244 = weight(author_txt:zhang in 2275) [ClassicSimilarity], result of:
          1.4795244 = score(doc=2275,freq=1.0), product of:
            0.7370113 = queryWeight, product of:
              1.0442443 = boost
              6.4238877 = idf(docFreq=194, maxDocs=44218)
              0.10986873 = queryNorm
            2.007465 = fieldWeight in 2275, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.4238877 = idf(docFreq=194, maxDocs=44218)
              0.3125 = fieldNorm(doc=2275)
    

Similar documents (content)

  1. Li, J.; Zhang, Z.; Li, X.; Chen, H.: Kernel-based learning for biomedical relation extraction (2008) 0.27
    0.27017817 = sum of:
      0.27017817 = product of:
        0.9649221 = sum of:
          0.008132941 = weight(abstract_txt:information in 1611) [ClassicSimilarity], result of:
            0.008132941 = score(doc=1611,freq=1.0), product of:
              0.043000393 = queryWeight, product of:
                1.1310993 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.01570314 = queryNorm
              0.18913643 = fieldWeight in 1611, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.078125 = fieldNorm(doc=1611)
          0.11782082 = weight(abstract_txt:extraction in 1611) [ClassicSimilarity], result of:
            0.11782082 = score(doc=1611,freq=3.0), product of:
              0.14062794 = queryWeight, product of:
                1.4463898 = boost
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.01570314 = queryNorm
              0.83781946 = fieldWeight in 1611, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.078125 = fieldNorm(doc=1611)
          0.03695598 = weight(abstract_txt:literature in 1611) [ClassicSimilarity], result of:
            0.03695598 = score(doc=1611,freq=1.0), product of:
              0.10718094 = queryWeight, product of:
                1.5465143 = boost
                4.413439 = idf(docFreq=1455, maxDocs=44218)
                0.01570314 = queryNorm
              0.3447999 = fieldWeight in 1611, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.413439 = idf(docFreq=1455, maxDocs=44218)
                0.078125 = fieldNorm(doc=1611)
          0.090978354 = weight(abstract_txt:named in 1611) [ClassicSimilarity], result of:
            0.090978354 = score(doc=1611,freq=1.0), product of:
              0.17070885 = queryWeight, product of:
                1.5935935 = boost
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.01570314 = queryNorm
              0.53294456 = fieldWeight in 1611, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.078125 = fieldNorm(doc=1611)
          0.09557574 = weight(abstract_txt:extracting in 1611) [ClassicSimilarity], result of:
            0.09557574 = score(doc=1611,freq=1.0), product of:
              0.1764124 = queryWeight, product of:
                1.6199965 = boost
                6.9347134 = idf(docFreq=116, maxDocs=44218)
                0.01570314 = queryNorm
              0.5417745 = fieldWeight in 1611, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9347134 = idf(docFreq=116, maxDocs=44218)
                0.078125 = fieldNorm(doc=1611)
          0.06519115 = weight(abstract_txt:learning in 1611) [ClassicSimilarity], result of:
            0.06519115 = score(doc=1611,freq=2.0), product of:
              0.12419674 = queryWeight, product of:
                1.6647547 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.01570314 = queryNorm
              0.5249023 = fieldWeight in 1611, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.078125 = fieldNorm(doc=1611)
          0.5502671 = weight(abstract_txt:biomedical in 1611) [ClassicSimilarity], result of:
            0.5502671 = score(doc=1611,freq=7.0), product of:
              0.37324187 = queryWeight, product of:
                3.3324199 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.01570314 = queryNorm
              1.474291 = fieldWeight in 1611, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.078125 = fieldNorm(doc=1611)
        0.28 = coord(7/25)
    
  2. Song, M.; Kang, K.; An, J.Y.: Investigating drug-disease interactions in drug-symptom-disease triples via citation relations (2018) 0.25
    0.24695663 = sum of:
      0.24695663 = product of:
        1.028986 = sum of:
          0.0065063527 = weight(abstract_txt:information in 4545) [ClassicSimilarity], result of:
            0.0065063527 = score(doc=4545,freq=1.0), product of:
              0.043000393 = queryWeight, product of:
                1.1310993 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.01570314 = queryNorm
              0.15130915 = fieldWeight in 4545, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=4545)
          0.08016578 = weight(abstract_txt:entity in 4545) [ClassicSimilarity], result of:
            0.08016578 = score(doc=4545,freq=2.0), product of:
              0.14450628 = queryWeight, product of:
                1.4661989 = boost
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.01570314 = queryNorm
              0.5547564 = fieldWeight in 4545, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.0625 = fieldNorm(doc=4545)
          0.041810915 = weight(abstract_txt:literature in 4545) [ClassicSimilarity], result of:
            0.041810915 = score(doc=4545,freq=2.0), product of:
              0.10718094 = queryWeight, product of:
                1.5465143 = boost
                4.413439 = idf(docFreq=1455, maxDocs=44218)
                0.01570314 = queryNorm
              0.39009655 = fieldWeight in 4545, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.413439 = idf(docFreq=1455, maxDocs=44218)
                0.0625 = fieldNorm(doc=4545)
          0.108131595 = weight(abstract_txt:extracting in 4545) [ClassicSimilarity], result of:
            0.108131595 = score(doc=4545,freq=2.0), product of:
              0.1764124 = queryWeight, product of:
                1.6199965 = boost
                6.9347134 = idf(docFreq=116, maxDocs=44218)
                0.01570314 = queryNorm
              0.6129478 = fieldWeight in 4545, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9347134 = idf(docFreq=116, maxDocs=44218)
                0.0625 = fieldNorm(doc=4545)
          0.35215765 = weight(abstract_txt:disease in 4545) [ClassicSimilarity], result of:
            0.35215765 = score(doc=4545,freq=5.0), product of:
              0.32691577 = queryWeight, product of:
                2.7009287 = boost
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.01570314 = queryNorm
              1.0772122 = fieldWeight in 4545, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.0625 = fieldNorm(doc=4545)
          0.44021368 = weight(abstract_txt:biomedical in 4545) [ClassicSimilarity], result of:
            0.44021368 = score(doc=4545,freq=7.0), product of:
              0.37324187 = queryWeight, product of:
                3.3324199 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.01570314 = queryNorm
              1.1794327 = fieldWeight in 4545, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.0625 = fieldNorm(doc=4545)
        0.24 = coord(6/25)
    
  3. Collovini de Abreu, S.; Vieira, R.: RelP: Portuguese open relation extraction (2017) 0.19
    0.18890777 = sum of:
      0.18890777 = product of:
        0.5247438 = sum of:
          0.021310981 = weight(abstract_txt:techniques in 3621) [ClassicSimilarity], result of:
            0.021310981 = score(doc=3621,freq=1.0), product of:
              0.075273074 = queryWeight, product of:
                1.0582039 = boost
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.01570314 = queryNorm
              0.2831156 = fieldWeight in 3621, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.0625 = fieldNorm(doc=3621)
          0.0065063527 = weight(abstract_txt:information in 3621) [ClassicSimilarity], result of:
            0.0065063527 = score(doc=3621,freq=1.0), product of:
              0.043000393 = queryWeight, product of:
                1.1310993 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.01570314 = queryNorm
              0.15130915 = fieldWeight in 3621, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=3621)
          0.029198982 = weight(abstract_txt:fields in 3621) [ClassicSimilarity], result of:
            0.029198982 = score(doc=3621,freq=1.0), product of:
              0.09285725 = queryWeight, product of:
                1.1753236 = boost
                5.0312033 = idf(docFreq=784, maxDocs=44218)
                0.01570314 = queryNorm
              0.3144502 = fieldWeight in 3621, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0312033 = idf(docFreq=784, maxDocs=44218)
                0.0625 = fieldNorm(doc=3621)
          0.040045697 = weight(abstract_txt:technique in 3621) [ClassicSimilarity], result of:
            0.040045697 = score(doc=3621,freq=1.0), product of:
              0.11462374 = queryWeight, product of:
                1.3058306 = boost
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.01570314 = queryNorm
              0.34936652 = fieldWeight in 3621, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.0625 = fieldNorm(doc=3621)
          0.094256654 = weight(abstract_txt:extraction in 3621) [ClassicSimilarity], result of:
            0.094256654 = score(doc=3621,freq=3.0), product of:
              0.14062794 = queryWeight, product of:
                1.4463898 = boost
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.01570314 = queryNorm
              0.67025554 = fieldWeight in 3621, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.0625 = fieldNorm(doc=3621)
          0.063901454 = weight(abstract_txt:random in 3621) [ClassicSimilarity], result of:
            0.063901454 = score(doc=3621,freq=1.0), product of:
              0.15652288 = queryWeight, product of:
                1.5259435 = boost
                6.532101 = idf(docFreq=174, maxDocs=44218)
                0.01570314 = queryNorm
              0.40825632 = fieldWeight in 3621, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.532101 = idf(docFreq=174, maxDocs=44218)
                0.0625 = fieldNorm(doc=3621)
          0.017029272 = weight(abstract_txt:data in 3621) [ClassicSimilarity], result of:
            0.017029272 = score(doc=3621,freq=1.0), product of:
              0.08166665 = queryWeight, product of:
                1.5587875 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.01570314 = queryNorm
              0.20852174 = fieldWeight in 3621, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=3621)
          0.1260633 = weight(abstract_txt:named in 3621) [ClassicSimilarity], result of:
            0.1260633 = score(doc=3621,freq=3.0), product of:
              0.17070885 = queryWeight, product of:
                1.5935935 = boost
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.01570314 = queryNorm
              0.7384696 = fieldWeight in 3621, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.0625 = fieldNorm(doc=3621)
          0.12643111 = weight(abstract_txt:conditional in 3621) [ClassicSimilarity], result of:
            0.12643111 = score(doc=3621,freq=1.0), product of:
              0.24668342 = queryWeight, product of:
                1.9156648 = boost
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.01570314 = queryNorm
              0.5125237 = fieldWeight in 3621, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.0625 = fieldNorm(doc=3621)
        0.36 = coord(9/25)
    
  4. Mao, J.; Cui, H.: Identifying bacterial biotope entities using sequence labeling : performance and feature analysis (2018) 0.19
    0.18760285 = sum of:
      0.18760285 = product of:
        0.521119 = sum of:
          0.020259505 = weight(abstract_txt:large in 4462) [ClassicSimilarity], result of:
            0.020259505 = score(doc=4462,freq=1.0), product of:
              0.07277629 = queryWeight, product of:
                1.0405058 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.01570314 = queryNorm
              0.27838057 = fieldWeight in 4462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.0625 = fieldNorm(doc=4462)
          0.021310981 = weight(abstract_txt:techniques in 4462) [ClassicSimilarity], result of:
            0.021310981 = score(doc=4462,freq=1.0), product of:
              0.075273074 = queryWeight, product of:
                1.0582039 = boost
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.01570314 = queryNorm
              0.2831156 = fieldWeight in 4462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.0625 = fieldNorm(doc=4462)
          0.0065063527 = weight(abstract_txt:information in 4462) [ClassicSimilarity], result of:
            0.0065063527 = score(doc=4462,freq=1.0), product of:
              0.043000393 = queryWeight, product of:
                1.1310993 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.01570314 = queryNorm
              0.15130915 = fieldWeight in 4462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=4462)
          0.029198982 = weight(abstract_txt:fields in 4462) [ClassicSimilarity], result of:
            0.029198982 = score(doc=4462,freq=1.0), product of:
              0.09285725 = queryWeight, product of:
                1.1753236 = boost
                5.0312033 = idf(docFreq=784, maxDocs=44218)
                0.01570314 = queryNorm
              0.3144502 = fieldWeight in 4462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0312033 = idf(docFreq=784, maxDocs=44218)
                0.0625 = fieldNorm(doc=4462)
          0.09818263 = weight(abstract_txt:entity in 4462) [ClassicSimilarity], result of:
            0.09818263 = score(doc=4462,freq=3.0), product of:
              0.14450628 = queryWeight, product of:
                1.4661989 = boost
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.01570314 = queryNorm
              0.6794351 = fieldWeight in 4462, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.0625 = fieldNorm(doc=4462)
          0.063901454 = weight(abstract_txt:random in 4462) [ClassicSimilarity], result of:
            0.063901454 = score(doc=4462,freq=1.0), product of:
              0.15652288 = queryWeight, product of:
                1.5259435 = boost
                6.532101 = idf(docFreq=174, maxDocs=44218)
                0.01570314 = queryNorm
              0.40825632 = fieldWeight in 4462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.532101 = idf(docFreq=174, maxDocs=44218)
                0.0625 = fieldNorm(doc=4462)
          0.07646059 = weight(abstract_txt:extracting in 4462) [ClassicSimilarity], result of:
            0.07646059 = score(doc=4462,freq=1.0), product of:
              0.1764124 = queryWeight, product of:
                1.6199965 = boost
                6.9347134 = idf(docFreq=116, maxDocs=44218)
                0.01570314 = queryNorm
              0.4334196 = fieldWeight in 4462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9347134 = idf(docFreq=116, maxDocs=44218)
                0.0625 = fieldNorm(doc=4462)
          0.12643111 = weight(abstract_txt:conditional in 4462) [ClassicSimilarity], result of:
            0.12643111 = score(doc=4462,freq=1.0), product of:
              0.24668342 = queryWeight, product of:
                1.9156648 = boost
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.01570314 = queryNorm
              0.5125237 = fieldWeight in 4462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.0625 = fieldNorm(doc=4462)
          0.078867376 = weight(abstract_txt:recognition in 4462) [ClassicSimilarity], result of:
            0.078867376 = score(doc=4462,freq=1.0), product of:
              0.20615761 = queryWeight, product of:
                2.1448398 = boost
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.01570314 = queryNorm
              0.38255864 = fieldWeight in 4462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.0625 = fieldNorm(doc=4462)
        0.36 = coord(9/25)
    
  5. Wang, P.; Hao, T.; Yan, J.; Jin, L.: Large-scale extraction of drug-disease pairs from the medical literature (2017) 0.18
    0.18330015 = sum of:
      0.18330015 = product of:
        0.7637507 = sum of:
          0.020259505 = weight(abstract_txt:large in 3927) [ClassicSimilarity], result of:
            0.020259505 = score(doc=3927,freq=1.0), product of:
              0.07277629 = queryWeight, product of:
                1.0405058 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.01570314 = queryNorm
              0.27838057 = fieldWeight in 3927, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.0625 = fieldNorm(doc=3927)
          0.0065063527 = weight(abstract_txt:information in 3927) [ClassicSimilarity], result of:
            0.0065063527 = score(doc=3927,freq=1.0), product of:
              0.043000393 = queryWeight, product of:
                1.1310993 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.01570314 = queryNorm
              0.15130915 = fieldWeight in 3927, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=3927)
          0.076960236 = weight(abstract_txt:extraction in 3927) [ClassicSimilarity], result of:
            0.076960236 = score(doc=3927,freq=2.0), product of:
              0.14062794 = queryWeight, product of:
                1.4463898 = boost
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.01570314 = queryNorm
              0.54726136 = fieldWeight in 3927, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.0625 = fieldNorm(doc=3927)
          0.029564781 = weight(abstract_txt:literature in 3927) [ClassicSimilarity], result of:
            0.029564781 = score(doc=3927,freq=1.0), product of:
              0.10718094 = queryWeight, product of:
                1.5465143 = boost
                4.413439 = idf(docFreq=1455, maxDocs=44218)
                0.01570314 = queryNorm
              0.27583992 = fieldWeight in 3927, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.413439 = idf(docFreq=1455, maxDocs=44218)
                0.0625 = fieldNorm(doc=3927)
          0.13243362 = weight(abstract_txt:extracting in 3927) [ClassicSimilarity], result of:
            0.13243362 = score(doc=3927,freq=3.0), product of:
              0.1764124 = queryWeight, product of:
                1.6199965 = boost
                6.9347134 = idf(docFreq=116, maxDocs=44218)
                0.01570314 = queryNorm
              0.7507047 = fieldWeight in 3927, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.9347134 = idf(docFreq=116, maxDocs=44218)
                0.0625 = fieldNorm(doc=3927)
          0.49802616 = weight(abstract_txt:disease in 3927) [ClassicSimilarity], result of:
            0.49802616 = score(doc=3927,freq=10.0), product of:
              0.32691577 = queryWeight, product of:
                2.7009287 = boost
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.01570314 = queryNorm
              1.5234082 = fieldWeight in 3927, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.0625 = fieldNorm(doc=3927)
        0.24 = coord(6/25)