Document (#24512)

Author
Swanson, D.R.
Smalheiser, N.R.
Bookstein, A.
Title
Information discovery from complementary literatures : categorizing viruses as potential weapons
Source
Journal of the American Society for Information Science and Technology. 52(2001) no.10, p.797-812
Year
2001
Abstract
Using novel informatics techniques to process the output of Medline searches, we have generated a list of viruses that may have the potential for development as weapons. Our findings are intended as a guide to the virus literature to support further studies that might then lead to appropriate defense and public health measures. This article stresses methods that are more generally relevant to information science. Initial Medline searches identified two kinds of virus literatures: the first concerning the genetic aspects of virulence, and the second concerning the transmission of viral diseases. Both literatures taken together are of central importance in identifying research relevant to the development of biological weapons. Yet, the two literatures had very few articles in common. We downloaded the Medline records for each of the two literatures and used a computer to extract all virus terms common to both. The fact that the resulting virus list includes most of an earlier independently published list of viruses considered by military experts to have the highest threat as potential biological weapons served as a test of the method; the test outcome showed a high degree of statistical significance, thus supporting an inference that the new viruses on the list share certain important characteristics with viruses of known biological
Theme
Informetrics
Field
Microbiology

Similar documents (author)

  1. Swanson, D.R.; Smalheiser, N.R.: Implicit text linkages between Medline records : using Arrowsmith as an aid to scientific discovery (1999) 3.60
    3.602725 = sum of:
      3.602725 = product of:
        5.4040875 = sum of:
          2.0584428 = weight(author_txt:swanson in 5574) [ClassicSimilarity], result of:
            2.0584428 = score(doc=5574,freq=1.0), product of:
              0.49717206 = queryWeight, product of:
                8.280605 = idf(docFreq=29, maxDocs=43556)
                0.060040545 = queryNorm
              4.1403027 = fieldWeight in 5574, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.280605 = idf(docFreq=29, maxDocs=43556)
                0.5 = fieldNorm(doc=5574)
          3.3456447 = weight(author_txt:smalheiser in 5574) [ClassicSimilarity], result of:
            3.3456447 = score(doc=5574,freq=1.0), product of:
              0.68728054 = queryWeight, product of:
                1.1757464 = boost
                9.735892 = idf(docFreq=6, maxDocs=43556)
                0.060040545 = queryNorm
              4.867946 = fieldWeight in 5574, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.735892 = idf(docFreq=6, maxDocs=43556)
                0.5 = fieldNorm(doc=5574)
        0.6666667 = coord(2/3)
    
  2. Bookstein, A.; Swanson, D.R.: Probabilistic models for automatic indexing (1974) 2.88
    2.8809748 = sum of:
      2.8809748 = product of:
        4.321462 = sum of:
          2.0584428 = weight(author_txt:swanson in 5463) [ClassicSimilarity], result of:
            2.0584428 = score(doc=5463,freq=1.0), product of:
              0.49717206 = queryWeight, product of:
                8.280605 = idf(docFreq=29, maxDocs=43556)
                0.060040545 = queryNorm
              4.1403027 = fieldWeight in 5463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.280605 = idf(docFreq=29, maxDocs=43556)
                0.5 = fieldNorm(doc=5463)
          2.2630193 = weight(author_txt:bookstein in 5463) [ClassicSimilarity], result of:
            2.2630193 = score(doc=5463,freq=1.0), product of:
              0.5295899 = queryWeight, product of:
                1.0320874 = boost
                8.5463085 = idf(docFreq=22, maxDocs=43556)
                0.060040545 = queryNorm
              4.2731543 = fieldWeight in 5463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.5463085 = idf(docFreq=22, maxDocs=43556)
                0.5 = fieldNorm(doc=5463)
        0.6666667 = coord(2/3)
    
  3. Bookstein, A.; Swanson, D.R.: A decision theoretic foundation for indexing (1975) 2.88
    2.8809748 = sum of:
      2.8809748 = product of:
        4.321462 = sum of:
          2.0584428 = weight(author_txt:swanson in 142) [ClassicSimilarity], result of:
            2.0584428 = score(doc=142,freq=1.0), product of:
              0.49717206 = queryWeight, product of:
                8.280605 = idf(docFreq=29, maxDocs=43556)
                0.060040545 = queryNorm
              4.1403027 = fieldWeight in 142, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.280605 = idf(docFreq=29, maxDocs=43556)
                0.5 = fieldNorm(doc=142)
          2.2630193 = weight(author_txt:bookstein in 142) [ClassicSimilarity], result of:
            2.2630193 = score(doc=142,freq=1.0), product of:
              0.5295899 = queryWeight, product of:
                1.0320874 = boost
                8.5463085 = idf(docFreq=22, maxDocs=43556)
                0.060040545 = queryNorm
              4.2731543 = fieldWeight in 142, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.5463085 = idf(docFreq=22, maxDocs=43556)
                0.5 = fieldNorm(doc=142)
        0.6666667 = coord(2/3)
    
  4. Swanson, D.R.; Smalheiser, N.R.; Torvik, V.I.: Ranking indirect connections in literature-based discovery : the role of Medical Subject Headings (2006) 2.70
    2.7020435 = sum of:
      2.7020435 = product of:
        4.0530653 = sum of:
          1.5438321 = weight(author_txt:swanson in 1001) [ClassicSimilarity], result of:
            1.5438321 = score(doc=1001,freq=1.0), product of:
              0.49717206 = queryWeight, product of:
                8.280605 = idf(docFreq=29, maxDocs=43556)
                0.060040545 = queryNorm
              3.105227 = fieldWeight in 1001, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.280605 = idf(docFreq=29, maxDocs=43556)
                0.375 = fieldNorm(doc=1001)
          2.5092335 = weight(author_txt:smalheiser in 1001) [ClassicSimilarity], result of:
            2.5092335 = score(doc=1001,freq=1.0), product of:
              0.68728054 = queryWeight, product of:
                1.1757464 = boost
                9.735892 = idf(docFreq=6, maxDocs=43556)
                0.060040545 = queryNorm
              3.6509595 = fieldWeight in 1001, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.735892 = idf(docFreq=6, maxDocs=43556)
                0.375 = fieldNorm(doc=1001)
        0.6666667 = coord(2/3)
    
  5. Torvik, V.I.; Weeber, M.; Swanson, D.R.; Smalheiser, N.R.: A probabilistic similarity metric for Medline records : a model for author name disambiguation (2005) 2.25
    2.2517033 = sum of:
      2.2517033 = product of:
        3.377555 = sum of:
          1.2865268 = weight(author_txt:swanson in 4306) [ClassicSimilarity], result of:
            1.2865268 = score(doc=4306,freq=1.0), product of:
              0.49717206 = queryWeight, product of:
                8.280605 = idf(docFreq=29, maxDocs=43556)
                0.060040545 = queryNorm
              2.5876892 = fieldWeight in 4306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.280605 = idf(docFreq=29, maxDocs=43556)
                0.3125 = fieldNorm(doc=4306)
          2.091028 = weight(author_txt:smalheiser in 4306) [ClassicSimilarity], result of:
            2.091028 = score(doc=4306,freq=1.0), product of:
              0.68728054 = queryWeight, product of:
                1.1757464 = boost
                9.735892 = idf(docFreq=6, maxDocs=43556)
                0.060040545 = queryNorm
              3.0424664 = fieldWeight in 4306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.735892 = idf(docFreq=6, maxDocs=43556)
                0.3125 = fieldNorm(doc=4306)
        0.6666667 = coord(2/3)
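
The score trees above follow the TF-IDF arithmetic of Lucene's ClassicSimilarity: idf = 1 + ln(maxDocs / (docFreq + 1)), tf = sqrt(freq), each term score is queryWeight times fieldWeight, and the sum is scaled by coord. As a rough sketch, the following Python recomputes the 3.602725 shown for the first hit (doc 5574) from the values printed in its tree. The function and variable names are illustrative and not part of the catalog or of Lucene; queryNorm, fieldNorm and boost are simply copied from the listing, and the last digits may differ slightly because Lucene works in 32-bit floats.

```python
import math

MAX_DOCS = 43556          # maxDocs as printed in the tree
QUERY_NORM = 0.060040545  # queryNorm as printed in the tree


def idf(doc_freq, max_docs=MAX_DOCS):
    """ClassicSimilarity inverse document frequency."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, field_norm, boost=1.0):
    """Score of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight


# Values copied from the tree for doc 5574 (two of three author terms match).
swanson = term_score(freq=1.0, doc_freq=29, field_norm=0.5)
smalheiser = term_score(freq=1.0, doc_freq=6, field_norm=0.5, boost=1.1757464)
coord = 2 / 3
score_5574 = (swanson + smalheiser) * coord

print(round(swanson, 4), round(smalheiser, 4), round(score_5574, 4))
```

The same function applied with fieldNorm 0.375 or 0.3125 gives the lower scores of hits 4 and 5: those records list more authors, which lowers the field norm for each matching name.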
    

Similar documents (content)

  1. Aringhieri, R.; Damiani, E.; De Capitani di Vimercati, S.; Paraboschi, S.; Samarati, P.: Fuzzy techniques for trust and reputation management in anonymous peer-to-peer systems (2006) 0.11
    0.10710656 = sum of:
      0.10710656 = product of:
        0.5355328 = sum of:
          0.012389722 = weight(abstract_txt:both in 277) [ClassicSimilarity], result of:
            0.012389722 = score(doc=277,freq=1.0), product of:
              0.04151029 = queryWeight, product of:
                1.0239372 = boost
                3.820461 = idf(docFreq=2594, maxDocs=43556)
                0.010611253 = queryNorm
              0.2984735 = fieldWeight in 277, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.820461 = idf(docFreq=2594, maxDocs=43556)
                0.078125 = fieldNorm(doc=277)
          0.012712623 = weight(abstract_txt:development in 277) [ClassicSimilarity], result of:
            0.012712623 = score(doc=277,freq=1.0), product of:
              0.04222842 = queryWeight, product of:
                1.0327563 = boost
                3.8533664 = idf(docFreq=2510, maxDocs=43556)
                0.010611253 = queryNorm
              0.30104426 = fieldWeight in 277, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8533664 = idf(docFreq=2510, maxDocs=43556)
                0.078125 = fieldNorm(doc=277)
          0.011125948 = weight(abstract_txt:have in 277) [ClassicSimilarity], result of:
            0.011125948 = score(doc=277,freq=1.0), product of:
              0.0442286 = queryWeight, product of:
                1.2944721 = boost
                3.2199109 = idf(docFreq=4730, maxDocs=43556)
                0.010611253 = queryNorm
              0.25155553 = fieldWeight in 277, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2199109 = idf(docFreq=4730, maxDocs=43556)
                0.078125 = fieldNorm(doc=277)
          0.007504515 = weight(abstract_txt:that in 277) [ClassicSimilarity], result of:
            0.007504515 = score(doc=277,freq=1.0), product of:
              0.040331386 = queryWeight, product of:
                1.5958315 = boost
                2.3817132 = idf(docFreq=10938, maxDocs=43556)
                0.010611253 = queryNorm
              0.18607134 = fieldWeight in 277, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3817132 = idf(docFreq=10938, maxDocs=43556)
                0.078125 = fieldNorm(doc=277)
          0.49179998 = weight(abstract_txt:viruses in 277) [ClassicSimilarity], result of:
            0.49179998 = score(doc=277,freq=1.0), product of:
              0.655572 = queryWeight, product of:
                6.43392 = boost
                9.602362 = idf(docFreq=7, maxDocs=43556)
                0.010611253 = queryNorm
              0.75018454 = fieldWeight in 277, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.602362 = idf(docFreq=7, maxDocs=43556)
                0.078125 = fieldNorm(doc=277)
        0.2 = coord(5/25)
    
  2. Waldrop, M.M.: Intelligent agents prepare to sift the riches of cyberspace (1997) 0.10
    0.09859947 = sum of:
      0.09859947 = product of:
        0.82166225 = sum of:
          0.017801518 = weight(abstract_txt:have in 1781) [ClassicSimilarity], result of:
            0.017801518 = score(doc=1781,freq=1.0), product of:
              0.0442286 = queryWeight, product of:
                1.2944721 = boost
                3.2199109 = idf(docFreq=4730, maxDocs=43556)
                0.010611253 = queryNorm
              0.40248886 = fieldWeight in 1781, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2199109 = idf(docFreq=4730, maxDocs=43556)
                0.125 = fieldNorm(doc=1781)
          0.016980778 = weight(abstract_txt:that in 1781) [ClassicSimilarity], result of:
            0.016980778 = score(doc=1781,freq=2.0), product of:
              0.040331386 = queryWeight, product of:
                1.5958315 = boost
                2.3817132 = idf(docFreq=10938, maxDocs=43556)
                0.010611253 = queryNorm
              0.4210314 = fieldWeight in 1781, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3817132 = idf(docFreq=10938, maxDocs=43556)
                0.125 = fieldNorm(doc=1781)
          0.78687996 = weight(abstract_txt:viruses in 1781) [ClassicSimilarity], result of:
            0.78687996 = score(doc=1781,freq=1.0), product of:
              0.655572 = queryWeight, product of:
                6.43392 = boost
                9.602362 = idf(docFreq=7, maxDocs=43556)
                0.010611253 = queryNorm
              1.2002952 = fieldWeight in 1781, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.602362 = idf(docFreq=7, maxDocs=43556)
                0.125 = fieldNorm(doc=1781)
        0.12 = coord(3/25)
    
  3. McKinin, E.J.; Sievert, M.E.; Johnson, D.; Mitchell, J.A.: The Medline/full-text research project (1991) 0.06
    0.063245095 = sum of:
      0.063245095 = product of:
        0.26352122 = sum of:
          0.030885328 = weight(abstract_txt:relevant in 383) [ClassicSimilarity], result of:
            0.030885328 = score(doc=383,freq=3.0), product of:
              0.061401993 = queryWeight, product of:
                1.2453364 = boost
                4.646534 = idf(docFreq=1135, maxDocs=43556)
                0.010611253 = queryNorm
              0.50300205 = fieldWeight in 383, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.646534 = idf(docFreq=1135, maxDocs=43556)
                0.0625 = fieldNorm(doc=383)
          0.008900759 = weight(abstract_txt:have in 383) [ClassicSimilarity], result of:
            0.008900759 = score(doc=383,freq=1.0), product of:
              0.0442286 = queryWeight, product of:
                1.2944721 = boost
                3.2199109 = idf(docFreq=4730, maxDocs=43556)
                0.010611253 = queryNorm
              0.20124443 = fieldWeight in 383, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2199109 = idf(docFreq=4730, maxDocs=43556)
                0.0625 = fieldNorm(doc=383)
          0.02294291 = weight(abstract_txt:test in 383) [ClassicSimilarity], result of:
            0.02294291 = score(doc=383,freq=1.0), product of:
              0.07263631 = queryWeight, product of:
                1.3544792 = boost
                5.0537615 = idf(docFreq=755, maxDocs=43556)
                0.010611253 = queryNorm
              0.3158601 = fieldWeight in 383, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0537615 = idf(docFreq=755, maxDocs=43556)
                0.0625 = fieldNorm(doc=383)
          0.025264433 = weight(abstract_txt:searches in 383) [ClassicSimilarity], result of:
            0.025264433 = score(doc=383,freq=1.0), product of:
              0.07745708 = queryWeight, product of:
                1.3987046 = boost
                5.2187734 = idf(docFreq=640, maxDocs=43556)
                0.010611253 = queryNorm
              0.32617334 = fieldWeight in 383, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2187734 = idf(docFreq=640, maxDocs=43556)
                0.0625 = fieldNorm(doc=383)
          0.008490389 = weight(abstract_txt:that in 383) [ClassicSimilarity], result of:
            0.008490389 = score(doc=383,freq=2.0), product of:
              0.040331386 = queryWeight, product of:
                1.5958315 = boost
                2.3817132 = idf(docFreq=10938, maxDocs=43556)
                0.010611253 = queryNorm
              0.2105157 = fieldWeight in 383, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3817132 = idf(docFreq=10938, maxDocs=43556)
                0.0625 = fieldNorm(doc=383)
          0.1670374 = weight(abstract_txt:medline in 383) [ClassicSimilarity], result of:
            0.1670374 = score(doc=383,freq=4.0), product of:
              0.19676188 = queryWeight, product of:
                2.7303076 = boost
                6.791454 = idf(docFreq=132, maxDocs=43556)
                0.010611253 = queryNorm
              0.8489317 = fieldWeight in 383, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.791454 = idf(docFreq=132, maxDocs=43556)
                0.0625 = fieldNorm(doc=383)
        0.24 = coord(6/25)
    
  4. Zhang, J.; Chen, Y.; Zhao, Y.; Wolfram, D.; Ma, F.: Public health and social media : a study of Zika virus-related posts on Yahoo! Answers (2020) 0.06
    0.05640952 = sum of:
      0.05640952 = product of:
        0.705119 = sum of:
          0.008490389 = weight(abstract_txt:that in 1958) [ClassicSimilarity], result of:
            0.008490389 = score(doc=1958,freq=2.0), product of:
              0.040331386 = queryWeight, product of:
                1.5958315 = boost
                2.3817132 = idf(docFreq=10938, maxDocs=43556)
                0.010611253 = queryNorm
              0.2105157 = fieldWeight in 1958, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3817132 = idf(docFreq=10938, maxDocs=43556)
                0.0625 = fieldNorm(doc=1958)
          0.69662863 = weight(abstract_txt:virus in 1958) [ClassicSimilarity], result of:
            0.69662863 = score(doc=1958,freq=8.0), product of:
              0.44535086 = queryWeight, product of:
                4.7430925 = boost
                8.848589 = idf(docFreq=16, maxDocs=43556)
                0.010611253 = queryNorm
              1.5642242 = fieldWeight in 1958, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                8.848589 = idf(docFreq=16, maxDocs=43556)
                0.0625 = fieldNorm(doc=1958)
        0.08 = coord(2/25)
    
  5. Shachak, A.: Diffusion pattern of the use of genomic databases and analysis of biological sequences from 1970-2003 : bibliographic record analysis of 12 journals (2006) 0.05
    0.054404926 = sum of:
      0.054404926 = product of:
        0.3400308 = sum of:
          0.009911778 = weight(abstract_txt:both in 904) [ClassicSimilarity], result of:
            0.009911778 = score(doc=904,freq=1.0), product of:
              0.04151029 = queryWeight, product of:
                1.0239372 = boost
                3.820461 = idf(docFreq=2594, maxDocs=43556)
                0.010611253 = queryNorm
              0.23877881 = fieldWeight in 904, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.820461 = idf(docFreq=2594, maxDocs=43556)
                0.0625 = fieldNorm(doc=904)
          0.008490389 = weight(abstract_txt:that in 904) [ClassicSimilarity], result of:
            0.008490389 = score(doc=904,freq=2.0), product of:
              0.040331386 = queryWeight, product of:
                1.5958315 = boost
                2.3817132 = idf(docFreq=10938, maxDocs=43556)
                0.010611253 = queryNorm
              0.2105157 = fieldWeight in 904, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3817132 = idf(docFreq=10938, maxDocs=43556)
                0.0625 = fieldNorm(doc=904)
          0.0835187 = weight(abstract_txt:medline in 904) [ClassicSimilarity], result of:
            0.0835187 = score(doc=904,freq=1.0), product of:
              0.19676188 = queryWeight, product of:
                2.7303076 = boost
                6.791454 = idf(docFreq=132, maxDocs=43556)
                0.010611253 = queryNorm
              0.42446586 = fieldWeight in 904, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.791454 = idf(docFreq=132, maxDocs=43556)
                0.0625 = fieldNorm(doc=904)
          0.23810992 = weight(abstract_txt:biological in 904) [ClassicSimilarity], result of:
            0.23810992 = score(doc=904,freq=5.0), product of:
              0.23135567 = queryWeight, product of:
                2.9606097 = boost
                7.3643146 = idf(docFreq=74, maxDocs=43556)
                0.010611253 = queryNorm
              1.0291942 = fieldWeight in 904, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.3643146 = idf(docFreq=74, maxDocs=43556)
                0.0625 = fieldNorm(doc=904)
        0.16 = coord(4/25)
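
In the content-based list, the final ranking depends heavily on the coord factor, the fraction of the 25 query terms that a record matches. The sketch below, using only the per-record sums and coord values printed in the trees above, compares the order by raw term-score sum with the order after the coord scaling. The data structure and variable names are illustrative assumptions, not part of the catalog output.

```python
# (doc id, sum of term scores, matched terms) copied from the five trees above.
TOTAL_QUERY_TERMS = 25
hits = [
    (277, 0.5355328, 5),
    (1781, 0.82166225, 3),
    (383, 0.26352122, 6),
    (1958, 0.705119, 2),
    (904, 0.3400308, 4),
]

# Final score = raw sum * coord, where coord = matched / total query terms.
final = {doc: raw * matched / TOTAL_QUERY_TERMS for doc, raw, matched in hits}

by_raw = [doc for doc, raw, _ in sorted(hits, key=lambda h: h[1], reverse=True)]
by_final = sorted(final, key=final.get, reverse=True)

print("by raw sum:  ", by_raw)
print("by final score:", by_final)
for doc in by_final:
    print(doc, round(final[doc], 5))
```

Under these numbers, doc 1781 has the largest raw sum, driven almost entirely by the rare term "viruses", but matches only 3 of 25 terms and so ends up second, behind doc 277 with 5 matches.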