Search (2 results, page 1 of 1)

Torvik, V.I.; Weeber, M.; Swanson, D.R.; Smalheiser, N.R.: ¬A probabilistic similarity metric for medline mecords : a model for author name disambiguation (2005) 0.00
```
0.0044022407 = product of:
  0.013206721 = sum of:
    0.013206721 = weight(_text_:a in 3308) [ClassicSimilarity], result of:
      0.013206721 = score(doc=3308,freq=22.0), product of:
        0.05209492 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.045180224 = queryNorm
        0.25351265 = fieldWeight in 3308, product of:
          4.690416 = tf(freq=22.0), with freq of:
            22.0 = termFreq=22.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.046875 = fieldNorm(doc=3308)
  0.33333334 = coord(1/3)
```
Abstract

We present a model for estimating the probability that a pair of author names (sharing last name and first initial), appearing an two different Medline articles, refer to the same individual. The model uses a simple yet powerful similarity profile between a pair of articles, based an title, journal name, coauthor names, medical subject headings (MeSH), language, affiliation, and name attributes (prevalence in the literature, middle initial, and suffix). The similarity profile distribution is computed from reference sets consisting of pairs of articles containing almost exclusively author matches versus nonmatches, generated in an unbiased manner. Although the match set is generated automatically and might contain a small proportion of nonmatches, the model is quite robust against contamination with nonmatches. We have created a free, public service ("Author-ity": http://arrowsmith.psych.uic.edu) that takes as input an author's name given an a specific article, and gives as output a list of all articles with that (last name, first initial) ranked by decreasing similarity, with match probability indicated.

Type

a
Swanson, D.R.; Smalheiser, N.R.; Bookstein, A.: Information discovery from complementary literatures : categorizing viruses as potential weapons (2001) 0.00
```
0.0029264777 = product of:
  0.008779433 = sum of:
    0.008779433 = weight(_text_:a in 6513) [ClassicSimilarity], result of:
      0.008779433 = score(doc=6513,freq=14.0), product of:
        0.05209492 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.045180224 = queryNorm
        0.1685276 = fieldWeight in 6513, product of:
          3.7416575 = tf(freq=14.0), with freq of:
            14.0 = termFreq=14.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.0390625 = fieldNorm(doc=6513)
  0.33333334 = coord(1/3)
```
Abstract

Using novel informatics techniques to process the Output of Medline searches, we have generated a list of viruses that may have the potential for development as weapons. Our findings are intended as a guide to the virus literature to support further studies that might then lead to appropriate defense and public health measures. This article stresses methods that are more generally relevant to information science. Initial Medline searches identified two kinds of virus literaturesthe first concerning the genetic aspects of virulence, and the second concerning the transmission of viral diseases. Both literatures taken together are of central importance in identifying research relevant to the development of biological weapons. Yet, the two literatures had very few articles in common. We downloaded the Medline records for each of the two literatures and used a computer to extract all virus terms common to both. The fact that the resulting virus list includes most of an earlier independently published list of viruses considered by military experts to have the highest threat as potential biological weapons served as a test of the method; the test outcome showed a high degree of statistical significance, thus supporting an inference that the new viruses an the list share certain important characteristics with viruses of known biological

Type

a

Search (2 results, page 1 of 1)

Authors