Search (1 results, page 1 of 1)

  • × author_ss:"Amano, K."
  • × theme_ss:"Informetrie"
  1. Onodera, N.; Iwasawa, M.; Midorikawa, N.; Yoshikane, F.; Amano, K.; Ootani, Y.; Kodama, T.; Kiyama, Y.; Tsunoda, H.; Yamazaki, S.: ¬A method for eliminating articles by homonymous authors from the large number of articles retrieved by author search (2011) 0.05
    0.048351478 = sum of:
      0.029973017 = product of:
        0.11989207 = sum of:
          0.11989207 = weight(_text_:authors in 4370) [ClassicSimilarity], result of:
            0.11989207 = score(doc=4370,freq=8.0), product of:
              0.23803101 = queryWeight, product of:
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.052213363 = queryNorm
              0.50368255 = fieldWeight in 4370, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4370)
        0.25 = coord(1/4)
      0.01837846 = product of:
        0.03675692 = sum of:
          0.03675692 = weight(_text_:k in 4370) [ClassicSimilarity], result of:
            0.03675692 = score(doc=4370,freq=2.0), product of:
              0.18639012 = queryWeight, product of:
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.052213363 = queryNorm
              0.19720423 = fieldWeight in 4370, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4370)
        0.5 = coord(1/2)
    
    Abstract
    This paper proposes a methodology which discriminates the articles by the target authors ("true" articles) from those by other homonymous authors ("false" articles). Author name searches for 2,595 "source" authors in six subject fields retrieved about 629,000 articles. In order to extract true articles from the large amount of the retrieved articles, including many false ones, two filtering stages were applied. At the first stage any retrieved article was eliminated as false if either its affiliation addresses had little similarity to those of its source article or there was no citation relationship between the journal of the retrieved article and that of its source article. At the second stage, a sample of retrieved articles was subjected to manual judgment, and utilizing the judgment results, discrimination functions based on logistic regression were defined. These discrimination functions demonstrated both the recall ratio and the precision of about 95% and the accuracy (correct answer ratio) of 90-95%. Existence of common coauthor(s), address similarity, title words similarity, and interjournal citation relationships between the retrieved and source articles were found to be the effective discrimination predictors. Whether or not the source author was from a specific country was also one of the important predictors. Furthermore, it was shown that a retrieved article is almost certainly true if it was cited by, or cocited with, its source article. The method proposed in this study would be effective when dealing with a large number of articles whose subject fields and affiliation addresses vary widely.