Document (#21545)

Author
Lindsay, R.K.
Gordon, M.D.
Title
Literature-based discovery by lexical statistics
Source
Journal of the American Society for Information Science. 50(1999) no.7, S.574-587
Year
1999
Abstract
We report experiments that use lexical statistics, such as word frequency counts, to discover hidden connections in the medical literature. Hidden connections are those that are unlikely to be found by examination of bibliographic citations or the use of standard indexing methods and yet establish a relationship between topics that might profitably by explored by scientific research. Our experiments were conducted with the MEDLINE medical literature database and follow and extend the work of Swanson
Theme
Informetrie
Field
Medizin
Object
Medline

Similar documents (author)

  1. Gordon, J.A.: Training in indexing : some recent development (1981) 5.66
    5.661144 = sum of:
      5.661144 = weight(author_txt:gordon in 6175) [ClassicSimilarity], result of:
        5.661144 = fieldWeight in 6175, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.05783 = idf(docFreq=13, maxDocs=44218)
          0.625 = fieldNorm(doc=6175)
    
  2. Gordon, M.: Training for indexing : a teacher's view (1987) 5.66
    5.661144 = sum of:
      5.661144 = weight(author_txt:gordon in 6176) [ClassicSimilarity], result of:
        5.661144 = fieldWeight in 6176, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.05783 = idf(docFreq=13, maxDocs=44218)
          0.625 = fieldNorm(doc=6176)
    
  3. Gordon, S.: Museums and the information superhighway (1995) 5.66
    5.661144 = sum of:
      5.661144 = weight(author_txt:gordon in 3844) [ClassicSimilarity], result of:
        5.661144 = fieldWeight in 3844, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.05783 = idf(docFreq=13, maxDocs=44218)
          0.625 = fieldNorm(doc=3844)
    
  4. Gordon, A.S.: Browsing image collections with representations of common-sense activities (2001) 5.66
    5.661144 = sum of:
      5.661144 = weight(author_txt:gordon in 6530) [ClassicSimilarity], result of:
        5.661144 = fieldWeight in 6530, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.05783 = idf(docFreq=13, maxDocs=44218)
          0.625 = fieldNorm(doc=6530)
    
  5. Gordon, A.: ¬The invisibility of science publications in hebrew : a comparative database study (2012) 5.66
    5.661144 = sum of:
      5.661144 = weight(author_txt:gordon in 79) [ClassicSimilarity], result of:
        5.661144 = fieldWeight in 79, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.05783 = idf(docFreq=13, maxDocs=44218)
          0.625 = fieldNorm(doc=79)
    

Similar documents (content)

  1. Srinivasan, P.: Text mining : generating hypotheses from MEDLINE (2004) 0.18
    0.18189533 = sum of:
      0.18189533 = product of:
        0.6496262 = sum of:
          0.0433624 = weight(abstract_txt:report in 2225) [ClassicSimilarity], result of:
            0.0433624 = score(doc=2225,freq=1.0), product of:
              0.10264862 = queryWeight, product of:
                1.0202292 = boost
                5.4071717 = idf(docFreq=538, maxDocs=44218)
                0.018607378 = queryNorm
              0.42243528 = fieldWeight in 2225, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4071717 = idf(docFreq=538, maxDocs=44218)
                0.078125 = fieldNorm(doc=2225)
          0.048667245 = weight(abstract_txt:discovery in 2225) [ClassicSimilarity], result of:
            0.048667245 = score(doc=2225,freq=1.0), product of:
              0.11085843 = queryWeight, product of:
                1.0602434 = boost
                5.619245 = idf(docFreq=435, maxDocs=44218)
                0.018607378 = queryNorm
              0.43900353 = fieldWeight in 2225, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.619245 = idf(docFreq=435, maxDocs=44218)
                0.078125 = fieldNorm(doc=2225)
          0.0862079 = weight(abstract_txt:medline in 2225) [ClassicSimilarity], result of:
            0.0862079 = score(doc=2225,freq=1.0), product of:
              0.16229641 = queryWeight, product of:
                1.2828493 = boost
                6.7990475 = idf(docFreq=133, maxDocs=44218)
                0.018607378 = queryNorm
              0.5311756 = fieldWeight in 2225, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7990475 = idf(docFreq=133, maxDocs=44218)
                0.078125 = fieldNorm(doc=2225)
          0.015480874 = weight(abstract_txt:that in 2225) [ClassicSimilarity], result of:
            0.015480874 = score(doc=2225,freq=2.0), product of:
              0.059134144 = queryWeight, product of:
                1.341223 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.018607378 = queryNorm
              0.26179248 = fieldWeight in 2225, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=2225)
          0.22740312 = weight(abstract_txt:swanson in 2225) [ClassicSimilarity], result of:
            0.22740312 = score(doc=2225,freq=1.0), product of:
              0.30984312 = queryWeight, product of:
                1.772524 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.018607378 = queryNorm
              0.7339299 = fieldWeight in 2225, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.078125 = fieldNorm(doc=2225)
          0.083083786 = weight(abstract_txt:experiments in 2225) [ClassicSimilarity], result of:
            0.083083786 = score(doc=2225,freq=1.0), product of:
              0.19951019 = queryWeight, product of:
                2.011494 = boost
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.018607378 = queryNorm
              0.41643882 = fieldWeight in 2225, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.078125 = fieldNorm(doc=2225)
          0.14542083 = weight(abstract_txt:connections in 2225) [ClassicSimilarity], result of:
            0.14542083 = score(doc=2225,freq=1.0), product of:
              0.28976014 = queryWeight, product of:
                2.4241278 = boost
                6.4238877 = idf(docFreq=194, maxDocs=44218)
                0.018607378 = queryNorm
              0.5018662 = fieldWeight in 2225, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4238877 = idf(docFreq=194, maxDocs=44218)
                0.078125 = fieldNorm(doc=2225)
        0.28 = coord(7/25)
    
  2. Weeber, M.; Klein, H.; Jong-van den Berg, L.T.W. de; Vos, R.: Using concepts in literature-based discovery : simulating Swanson's Raynaud-Fish Oil and Migraine-Manesium discoveries (2001) 0.13
    0.12548621 = sum of:
      0.12548621 = product of:
        0.62743104 = sum of:
          0.08259105 = weight(abstract_txt:discovery in 5910) [ClassicSimilarity], result of:
            0.08259105 = score(doc=5910,freq=2.0), product of:
              0.11085843 = queryWeight, product of:
                1.0602434 = boost
                5.619245 = idf(docFreq=435, maxDocs=44218)
                0.018607378 = queryNorm
              0.7450137 = fieldWeight in 5910, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.619245 = idf(docFreq=435, maxDocs=44218)
                0.09375 = fieldNorm(doc=5910)
          0.022752145 = weight(abstract_txt:that in 5910) [ClassicSimilarity], result of:
            0.022752145 = score(doc=5910,freq=3.0), product of:
              0.059134144 = queryWeight, product of:
                1.341223 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.018607378 = queryNorm
              0.38475478 = fieldWeight in 5910, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.09375 = fieldNorm(doc=5910)
          0.27288374 = weight(abstract_txt:swanson in 5910) [ClassicSimilarity], result of:
            0.27288374 = score(doc=5910,freq=1.0), product of:
              0.30984312 = queryWeight, product of:
                1.772524 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.018607378 = queryNorm
              0.88071585 = fieldWeight in 5910, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.09375 = fieldNorm(doc=5910)
          0.12915704 = weight(abstract_txt:medical in 5910) [ClassicSimilarity], result of:
            0.12915704 = score(doc=5910,freq=1.0), product of:
              0.2370894 = queryWeight, product of:
                2.192766 = boost
                5.8107834 = idf(docFreq=359, maxDocs=44218)
                0.018607378 = queryNorm
              0.54476094 = fieldWeight in 5910, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8107834 = idf(docFreq=359, maxDocs=44218)
                0.09375 = fieldNorm(doc=5910)
          0.120047085 = weight(abstract_txt:literature in 5910) [ClassicSimilarity], result of:
            0.120047085 = score(doc=5910,freq=2.0), product of:
              0.2051579 = queryWeight, product of:
                2.4981928 = boost
                4.413439 = idf(docFreq=1455, maxDocs=44218)
                0.018607378 = queryNorm
              0.5851448 = fieldWeight in 5910, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.413439 = idf(docFreq=1455, maxDocs=44218)
                0.09375 = fieldNorm(doc=5910)
        0.2 = coord(5/25)
    
  3. Sebastian, Y.: Literature-based discovery by learning heterogeneous bibliographic information networks (2017) 0.12
    0.11842719 = sum of:
      0.11842719 = product of:
        0.49344662 = sum of:
          0.030831479 = weight(abstract_txt:word in 535) [ClassicSimilarity], result of:
            0.030831479 = score(doc=535,freq=1.0), product of:
              0.103723004 = queryWeight, product of:
                1.0255545 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.018607378 = queryNorm
              0.2972482 = fieldWeight in 535, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.0546875 = fieldNorm(doc=535)
          0.048178114 = weight(abstract_txt:discovery in 535) [ClassicSimilarity], result of:
            0.048178114 = score(doc=535,freq=2.0), product of:
              0.11085843 = queryWeight, product of:
                1.0602434 = boost
                5.619245 = idf(docFreq=435, maxDocs=44218)
                0.018607378 = queryNorm
              0.43459132 = fieldWeight in 535, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.619245 = idf(docFreq=435, maxDocs=44218)
                0.0546875 = fieldNorm(doc=535)
          0.017134188 = weight(abstract_txt:that in 535) [ClassicSimilarity], result of:
            0.017134188 = score(doc=535,freq=5.0), product of:
              0.059134144 = queryWeight, product of:
                1.341223 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.018607378 = queryNorm
              0.28975117 = fieldWeight in 535, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0546875 = fieldNorm(doc=535)
          0.17631339 = weight(abstract_txt:connections in 535) [ClassicSimilarity], result of:
            0.17631339 = score(doc=535,freq=3.0), product of:
              0.28976014 = queryWeight, product of:
                2.4241278 = boost
                6.4238877 = idf(docFreq=194, maxDocs=44218)
                0.018607378 = queryNorm
              0.60848045 = fieldWeight in 535, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.4238877 = idf(docFreq=194, maxDocs=44218)
                0.0546875 = fieldNorm(doc=535)
          0.15096197 = weight(abstract_txt:lexical in 535) [ClassicSimilarity], result of:
            0.15096197 = score(doc=535,freq=2.0), product of:
              0.29908222 = queryWeight, product of:
                2.4628134 = boost
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.018607378 = queryNorm
              0.5047507 = fieldWeight in 535, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.0546875 = fieldNorm(doc=535)
          0.07002746 = weight(abstract_txt:literature in 535) [ClassicSimilarity], result of:
            0.07002746 = score(doc=535,freq=2.0), product of:
              0.2051579 = queryWeight, product of:
                2.4981928 = boost
                4.413439 = idf(docFreq=1455, maxDocs=44218)
                0.018607378 = queryNorm
              0.34133446 = fieldWeight in 535, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.413439 = idf(docFreq=1455, maxDocs=44218)
                0.0546875 = fieldNorm(doc=535)
        0.24 = coord(6/25)
    
  4. Mohammadi, E.; Thelwall, M.; Haustein, S.; Larivière, V.: Who reads research articles? : an altmetrics analysis of Mendeley user categories (2015) 0.12
    0.11607453 = sum of:
      0.11607453 = product of:
        0.4836439 = sum of:
          0.03339516 = weight(abstract_txt:citations in 2162) [ClassicSimilarity], result of:
            0.03339516 = score(doc=2162,freq=1.0), product of:
              0.1000783 = queryWeight, product of:
                1.007375 = boost
                5.339045 = idf(docFreq=576, maxDocs=44218)
                0.018607378 = queryNorm
              0.33369032 = fieldWeight in 2162, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.339045 = idf(docFreq=576, maxDocs=44218)
                0.0625 = fieldNorm(doc=2162)
          0.09166798 = weight(abstract_txt:counts in 2162) [ClassicSimilarity], result of:
            0.09166798 = score(doc=2162,freq=2.0), product of:
              0.15572298 = queryWeight, product of:
                1.2566015 = boost
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.018607378 = queryNorm
              0.5886606 = fieldWeight in 2162, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.0625 = fieldNorm(doc=2162)
          0.008757305 = weight(abstract_txt:that in 2162) [ClassicSimilarity], result of:
            0.008757305 = score(doc=2162,freq=1.0), product of:
              0.059134144 = queryWeight, product of:
                1.341223 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.018607378 = queryNorm
              0.1480922 = fieldWeight in 2162, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=2162)
          0.0861047 = weight(abstract_txt:medical in 2162) [ClassicSimilarity], result of:
            0.0861047 = score(doc=2162,freq=1.0), product of:
              0.2370894 = queryWeight, product of:
                2.192766 = boost
                5.8107834 = idf(docFreq=359, maxDocs=44218)
                0.018607378 = queryNorm
              0.36317396 = fieldWeight in 2162, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8107834 = idf(docFreq=359, maxDocs=44218)
                0.0625 = fieldNorm(doc=2162)
          0.10850375 = weight(abstract_txt:statistics in 2162) [ClassicSimilarity], result of:
            0.10850375 = score(doc=2162,freq=1.0), product of:
              0.27660334 = queryWeight, product of:
                2.3684537 = boost
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.018607378 = queryNorm
              0.39227203 = fieldWeight in 2162, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.0625 = fieldNorm(doc=2162)
          0.15521501 = weight(abstract_txt:hidden in 2162) [ClassicSimilarity], result of:
            0.15521501 = score(doc=2162,freq=1.0), product of:
              0.35116944 = queryWeight, product of:
                2.668668 = boost
                7.071914 = idf(docFreq=101, maxDocs=44218)
                0.018607378 = queryNorm
              0.44199464 = fieldWeight in 2162, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.071914 = idf(docFreq=101, maxDocs=44218)
                0.0625 = fieldNorm(doc=2162)
        0.24 = coord(6/25)
    
  5. Bruhns, S.: Bibliografisk sogning som forskning : Don R. Swansons projekt (1995) 0.11
    0.11097663 = sum of:
      0.11097663 = product of:
        0.69360393 = sum of:
          0.0131359575 = weight(abstract_txt:that in 4412) [ClassicSimilarity], result of:
            0.0131359575 = score(doc=4412,freq=1.0), product of:
              0.059134144 = queryWeight, product of:
                1.341223 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.018607378 = queryNorm
              0.22213829 = fieldWeight in 4412, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.09375 = fieldNorm(doc=4412)
          0.38591588 = weight(abstract_txt:swanson in 4412) [ClassicSimilarity], result of:
            0.38591588 = score(doc=4412,freq=2.0), product of:
              0.30984312 = queryWeight, product of:
                1.772524 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.018607378 = queryNorm
              1.2455202 = fieldWeight in 4412, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.09375 = fieldNorm(doc=4412)
          0.174505 = weight(abstract_txt:connections in 4412) [ClassicSimilarity], result of:
            0.174505 = score(doc=4412,freq=1.0), product of:
              0.28976014 = queryWeight, product of:
                2.4241278 = boost
                6.4238877 = idf(docFreq=194, maxDocs=44218)
                0.018607378 = queryNorm
              0.6022395 = fieldWeight in 4412, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4238877 = idf(docFreq=194, maxDocs=44218)
                0.09375 = fieldNorm(doc=4412)
          0.120047085 = weight(abstract_txt:literature in 4412) [ClassicSimilarity], result of:
            0.120047085 = score(doc=4412,freq=2.0), product of:
              0.2051579 = queryWeight, product of:
                2.4981928 = boost
                4.413439 = idf(docFreq=1455, maxDocs=44218)
                0.018607378 = queryNorm
              0.5851448 = fieldWeight in 4412, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.413439 = idf(docFreq=1455, maxDocs=44218)
                0.09375 = fieldNorm(doc=4412)
        0.16 = coord(4/25)