Document (#38646)

Author
Zhu, X.
Turney, P.
Lemire, D.
Vellino, A.
Title
Measuring academic influence : not all citations are equal
Source
Journal of the Association for Information Science and Technology. 66(2015) no.2, S.408-427
Year
2015
Abstract
The importance of a research article is routinely measured by counting how many times it has been cited. However, treating all citations with equal weight ignores the wide variety of functions that citations perform. We want to automatically identify the subset of references in a bibliography that have a central academic influence on the citing paper. For this purpose, we examine the effectiveness of a variety of features for determining the academic influence of a citation. By asking authors to identify the key references in their own work, we created a data set in which citations were labeled according to their academic influence. Using automatic feature selection with supervised machine learning, we found a model for predicting academic influence that achieves good performance on this data set using only four features. The best features, among those we evaluated, were those based on the number of times a reference is mentioned in the body of a citing paper. The performance of these features inspired us to design an influence-primed h-index (the hip-index). Unlike the conventional h-index, it weights citations by how many times a reference is mentioned. According to our experiments, the hip-index is a better indicator of researcher performance than the conventional h-index.
Content
Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23179/abstract.
Theme
Informetrie

Similar documents (content)

  1. Wan, X.; Liu, F.: WL-index : leveraging citation mention number to quantify an individual's scientific impact (2014) 0.32
    0.32074654 = sum of:
      0.32074654 = product of:
        1.0023329 = sum of:
          0.03715816 = weight(abstract_txt:reference in 1549) [ClassicSimilarity], result of:
            0.03715816 = score(doc=1549,freq=2.0), product of:
              0.094202094 = queryWeight, product of:
                1.1889094 = boost
                4.46271 = idf(docFreq=1385, maxDocs=44218)
                0.01775469 = queryNorm
              0.39445156 = fieldWeight in 1549, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.46271 = idf(docFreq=1385, maxDocs=44218)
                0.0625 = fieldNorm(doc=1549)
          0.060373526 = weight(abstract_txt:according in 1549) [ClassicSimilarity], result of:
            0.060373526 = score(doc=1549,freq=2.0), product of:
              0.13019334 = queryWeight, product of:
                1.3976965 = boost
                5.2464166 = idf(docFreq=632, maxDocs=44218)
                0.01775469 = queryNorm
              0.46372208 = fieldWeight in 1549, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2464166 = idf(docFreq=632, maxDocs=44218)
                0.0625 = fieldNorm(doc=1549)
          0.051946297 = weight(abstract_txt:references in 1549) [ClassicSimilarity], result of:
            0.051946297 = score(doc=1549,freq=1.0), product of:
              0.14838983 = queryWeight, product of:
                1.4921777 = boost
                5.601063 = idf(docFreq=443, maxDocs=44218)
                0.01775469 = queryNorm
              0.35006642 = fieldWeight in 1549, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.601063 = idf(docFreq=443, maxDocs=44218)
                0.0625 = fieldNorm(doc=1549)
          0.15398028 = weight(abstract_txt:citing in 1549) [ClassicSimilarity], result of:
            0.15398028 = score(doc=1549,freq=3.0), product of:
              0.21231014 = queryWeight, product of:
                1.7848588 = boost
                6.699675 = idf(docFreq=147, maxDocs=44218)
                0.01775469 = queryNorm
              0.7252611 = fieldWeight in 1549, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.699675 = idf(docFreq=147, maxDocs=44218)
                0.0625 = fieldNorm(doc=1549)
          0.19865257 = weight(abstract_txt:mentioned in 1549) [ClassicSimilarity], result of:
            0.19865257 = score(doc=1549,freq=4.0), product of:
              0.2286005 = queryWeight, product of:
                1.8520687 = boost
                6.9519553 = idf(docFreq=114, maxDocs=44218)
                0.01775469 = queryNorm
              0.8689944 = fieldWeight in 1549, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.9519553 = idf(docFreq=114, maxDocs=44218)
                0.0625 = fieldNorm(doc=1549)
          0.09826768 = weight(abstract_txt:times in 1549) [ClassicSimilarity], result of:
            0.09826768 = score(doc=1549,freq=1.0), product of:
              0.25981963 = queryWeight, product of:
                2.418244 = boost
                6.0514402 = idf(docFreq=282, maxDocs=44218)
                0.01775469 = queryNorm
              0.37821501 = fieldWeight in 1549, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0514402 = idf(docFreq=282, maxDocs=44218)
                0.0625 = fieldNorm(doc=1549)
          0.17699504 = weight(abstract_txt:index in 1549) [ClassicSimilarity], result of:
            0.17699504 = score(doc=1549,freq=5.0), product of:
              0.26668492 = queryWeight, product of:
                3.1629167 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.01775469 = queryNorm
              0.663686 = fieldWeight in 1549, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.0625 = fieldNorm(doc=1549)
          0.22495933 = weight(abstract_txt:citations in 1549) [ClassicSimilarity], result of:
            0.22495933 = score(doc=1549,freq=4.0), product of:
              0.337078 = queryWeight, product of:
                3.5559342 = boost
                5.339045 = idf(docFreq=576, maxDocs=44218)
                0.01775469 = queryNorm
              0.66738063 = fieldWeight in 1549, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.339045 = idf(docFreq=576, maxDocs=44218)
                0.0625 = fieldNorm(doc=1549)
        0.32 = coord(8/25)
    
  2. Cronin, B.; Weaver-Wozniak, S.: Online access to acknowledgements (1993) 0.20
    0.20091254 = sum of:
      0.20091254 = product of:
        0.83713555 = sum of:
          0.06057339 = weight(abstract_txt:variety in 7827) [ClassicSimilarity], result of:
            0.06057339 = score(doc=7827,freq=1.0), product of:
              0.12545697 = queryWeight, product of:
                1.3720373 = boost
                5.1501017 = idf(docFreq=696, maxDocs=44218)
                0.01775469 = queryNorm
              0.48282203 = fieldWeight in 7827, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1501017 = idf(docFreq=696, maxDocs=44218)
                0.09375 = fieldNorm(doc=7827)
          0.06603695 = weight(abstract_txt:performance in 7827) [ClassicSimilarity], result of:
            0.06603695 = score(doc=7827,freq=1.0), product of:
              0.15212315 = queryWeight, product of:
                1.8503836 = boost
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.01775469 = queryNorm
              0.43410188 = fieldWeight in 7827, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.09375 = fieldNorm(doc=7827)
          0.1607327 = weight(abstract_txt:academic in 7827) [ClassicSimilarity], result of:
            0.1607327 = score(doc=7827,freq=2.0), product of:
              0.2590278 = queryWeight, product of:
                3.1171787 = boost
                4.6802773 = idf(docFreq=1114, maxDocs=44218)
                0.01775469 = queryNorm
              0.620523 = fieldWeight in 7827, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6802773 = idf(docFreq=1114, maxDocs=44218)
                0.09375 = fieldNorm(doc=7827)
          0.11873188 = weight(abstract_txt:index in 7827) [ClassicSimilarity], result of:
            0.11873188 = score(doc=7827,freq=1.0), product of:
              0.26668492 = queryWeight, product of:
                3.1629167 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.01775469 = queryNorm
              0.44521406 = fieldWeight in 7827, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.09375 = fieldNorm(doc=7827)
          0.2386054 = weight(abstract_txt:citations in 7827) [ClassicSimilarity], result of:
            0.2386054 = score(doc=7827,freq=2.0), product of:
              0.337078 = queryWeight, product of:
                3.5559342 = boost
                5.339045 = idf(docFreq=576, maxDocs=44218)
                0.01775469 = queryNorm
              0.70786405 = fieldWeight in 7827, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.339045 = idf(docFreq=576, maxDocs=44218)
                0.09375 = fieldNorm(doc=7827)
          0.19245525 = weight(abstract_txt:influence in 7827) [ClassicSimilarity], result of:
            0.19245525 = score(doc=7827,freq=1.0), product of:
              0.39105138 = queryWeight, product of:
                4.195619 = boost
                5.2495813 = idf(docFreq=630, maxDocs=44218)
                0.01775469 = queryNorm
              0.49214825 = fieldWeight in 7827, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2495813 = idf(docFreq=630, maxDocs=44218)
                0.09375 = fieldNorm(doc=7827)
        0.24 = coord(6/25)
    
  3. Wan, X.; Liu, F.: Are all literature citations equally important? : automatic citation strength estimation and its applications (2014) 0.17
    0.16593602 = sum of:
      0.16593602 = product of:
        0.8296801 = sum of:
          0.11262284 = weight(abstract_txt:labeled in 1350) [ClassicSimilarity], result of:
            0.11262284 = score(doc=1350,freq=2.0), product of:
              0.1349456 = queryWeight, product of:
                1.0061966 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.01775469 = queryNorm
              0.8345796 = fieldWeight in 1350, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.078125 = fieldNorm(doc=1350)
          0.06912058 = weight(abstract_txt:features in 1350) [ClassicSimilarity], result of:
            0.06912058 = score(doc=1350,freq=1.0), product of:
              0.19491382 = queryWeight, product of:
                2.4185486 = boost
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.01775469 = queryNorm
              0.35462123 = fieldWeight in 1350, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.078125 = fieldNorm(doc=1350)
          0.13992685 = weight(abstract_txt:index in 1350) [ClassicSimilarity], result of:
            0.13992685 = score(doc=1350,freq=2.0), product of:
              0.26668492 = queryWeight, product of:
                3.1629167 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.01775469 = queryNorm
              0.5246898 = fieldWeight in 1350, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.078125 = fieldNorm(doc=1350)
          0.28119916 = weight(abstract_txt:citations in 1350) [ClassicSimilarity], result of:
            0.28119916 = score(doc=1350,freq=4.0), product of:
              0.337078 = queryWeight, product of:
                3.5559342 = boost
                5.339045 = idf(docFreq=576, maxDocs=44218)
                0.01775469 = queryNorm
              0.8342258 = fieldWeight in 1350, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.339045 = idf(docFreq=576, maxDocs=44218)
                0.078125 = fieldNorm(doc=1350)
          0.22681068 = weight(abstract_txt:influence in 1350) [ClassicSimilarity], result of:
            0.22681068 = score(doc=1350,freq=2.0), product of:
              0.39105138 = queryWeight, product of:
                4.195619 = boost
                5.2495813 = idf(docFreq=630, maxDocs=44218)
                0.01775469 = queryNorm
              0.58000225 = fieldWeight in 1350, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2495813 = idf(docFreq=630, maxDocs=44218)
                0.078125 = fieldNorm(doc=1350)
        0.2 = coord(5/25)
    
  4. González, L.; Campanario, J.M.: Structure of the impact factor of journals included in the Social Sciences Citation Index : citations from documents labeled "Editorial Material" (2007) 0.13
    0.12875503 = sum of:
      0.12875503 = product of:
        0.64377517 = sum of:
          0.09556365 = weight(abstract_txt:labeled in 75) [ClassicSimilarity], result of:
            0.09556365 = score(doc=75,freq=1.0), product of:
              0.1349456 = queryWeight, product of:
                1.0061966 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.01775469 = queryNorm
              0.7081643 = fieldWeight in 75, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.09375 = fieldNorm(doc=75)
          0.030141508 = weight(abstract_txt:many in 75) [ClassicSimilarity], result of:
            0.030141508 = score(doc=75,freq=1.0), product of:
              0.07878017 = queryWeight, product of:
                1.0872438 = boost
                4.081096 = idf(docFreq=2029, maxDocs=44218)
                0.01775469 = queryNorm
              0.38260275 = fieldWeight in 75, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.081096 = idf(docFreq=2029, maxDocs=44218)
                0.09375 = fieldNorm(doc=75)
          0.1607327 = weight(abstract_txt:academic in 75) [ClassicSimilarity], result of:
            0.1607327 = score(doc=75,freq=2.0), product of:
              0.2590278 = queryWeight, product of:
                3.1171787 = boost
                4.6802773 = idf(docFreq=1114, maxDocs=44218)
                0.01775469 = queryNorm
              0.620523 = fieldWeight in 75, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6802773 = idf(docFreq=1114, maxDocs=44218)
                0.09375 = fieldNorm(doc=75)
          0.11873188 = weight(abstract_txt:index in 75) [ClassicSimilarity], result of:
            0.11873188 = score(doc=75,freq=1.0), product of:
              0.26668492 = queryWeight, product of:
                3.1629167 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.01775469 = queryNorm
              0.44521406 = fieldWeight in 75, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.09375 = fieldNorm(doc=75)
          0.2386054 = weight(abstract_txt:citations in 75) [ClassicSimilarity], result of:
            0.2386054 = score(doc=75,freq=2.0), product of:
              0.337078 = queryWeight, product of:
                3.5559342 = boost
                5.339045 = idf(docFreq=576, maxDocs=44218)
                0.01775469 = queryNorm
              0.70786405 = fieldWeight in 75, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.339045 = idf(docFreq=576, maxDocs=44218)
                0.09375 = fieldNorm(doc=75)
        0.2 = coord(5/25)
    
  5. Walters, W.H.: Google Scholar coverage of a multidisciplinary field (2007) 0.12
    0.12361235 = sum of:
      0.12361235 = product of:
        0.5150515 = sum of:
          0.025117926 = weight(abstract_txt:many in 928) [ClassicSimilarity], result of:
            0.025117926 = score(doc=928,freq=1.0), product of:
              0.07878017 = queryWeight, product of:
                1.0872438 = boost
                4.081096 = idf(docFreq=2029, maxDocs=44218)
                0.01775469 = queryNorm
              0.31883565 = fieldWeight in 928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.081096 = idf(docFreq=2029, maxDocs=44218)
                0.078125 = fieldNorm(doc=928)
          0.032843485 = weight(abstract_txt:reference in 928) [ClassicSimilarity], result of:
            0.032843485 = score(doc=928,freq=1.0), product of:
              0.094202094 = queryWeight, product of:
                1.1889094 = boost
                4.46271 = idf(docFreq=1385, maxDocs=44218)
                0.01775469 = queryNorm
              0.3486492 = fieldWeight in 928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.46271 = idf(docFreq=1385, maxDocs=44218)
                0.078125 = fieldNorm(doc=928)
          0.12283461 = weight(abstract_txt:times in 928) [ClassicSimilarity], result of:
            0.12283461 = score(doc=928,freq=1.0), product of:
              0.25981963 = queryWeight, product of:
                2.418244 = boost
                6.0514402 = idf(docFreq=282, maxDocs=44218)
                0.01775469 = queryNorm
              0.47276878 = fieldWeight in 928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0514402 = idf(docFreq=282, maxDocs=44218)
                0.078125 = fieldNorm(doc=928)
          0.09471265 = weight(abstract_txt:academic in 928) [ClassicSimilarity], result of:
            0.09471265 = score(doc=928,freq=1.0), product of:
              0.2590278 = queryWeight, product of:
                3.1171787 = boost
                4.6802773 = idf(docFreq=1114, maxDocs=44218)
                0.01775469 = queryNorm
              0.36564666 = fieldWeight in 928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6802773 = idf(docFreq=1114, maxDocs=44218)
                0.078125 = fieldNorm(doc=928)
          0.09894323 = weight(abstract_txt:index in 928) [ClassicSimilarity], result of:
            0.09894323 = score(doc=928,freq=1.0), product of:
              0.26668492 = queryWeight, product of:
                3.1629167 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.01775469 = queryNorm
              0.37101173 = fieldWeight in 928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.078125 = fieldNorm(doc=928)
          0.14059958 = weight(abstract_txt:citations in 928) [ClassicSimilarity], result of:
            0.14059958 = score(doc=928,freq=1.0), product of:
              0.337078 = queryWeight, product of:
                3.5559342 = boost
                5.339045 = idf(docFreq=576, maxDocs=44218)
                0.01775469 = queryNorm
              0.4171129 = fieldWeight in 928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.339045 = idf(docFreq=576, maxDocs=44218)
                0.078125 = fieldNorm(doc=928)
        0.24 = coord(6/25)