Search (37 results, page 1 of 2)

Leydesdorff, L.; Shin, J.C.: How to evaluate universities in terms of their relative citation impacts : fractional counting of citations and the normalization of differences among disciplines (2011) 0.10

0.09789874 = product of:
  0.13053165 = sum of:
    0.046904307 = weight(_text_:science in 4466) [ClassicSimilarity], result of:
      0.046904307 = score(doc=4466,freq=6.0), product of:
        0.1329271 = queryWeight, product of:
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.050463587 = queryNorm
        0.35285735 = fieldWeight in 4466, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.0546875 = fieldNorm(doc=4466)
    0.044925686 = weight(_text_:research in 4466) [ClassicSimilarity], result of:
      0.044925686 = score(doc=4466,freq=4.0), product of:
        0.14397179 = queryWeight, product of:
          2.8529835 = idf(docFreq=6931, maxDocs=44218)
          0.050463587 = queryNorm
        0.31204507 = fieldWeight in 4466, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          2.8529835 = idf(docFreq=6931, maxDocs=44218)
          0.0546875 = fieldNorm(doc=4466)
    0.03870166 = product of:
      0.07740332 = sum of:
        0.07740332 = weight(_text_:network in 4466) [ClassicSimilarity], result of:
          0.07740332 = score(doc=4466,freq=2.0), product of:
            0.22473325 = queryWeight, product of:
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.050463587 = queryNorm
            0.3444231 = fieldWeight in 4466, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.0546875 = fieldNorm(doc=4466)
      0.5 = coord(1/2)
  0.75 = coord(3/4)

Abstract: Fractional counting of citations can improve on ranking of multidisciplinary research units (such as universities) by normalizing the differences among fields of science in terms of differences in citation behavior. Furthermore, normalization in terms of citing papers abolishes the unsolved questions in scientometrics about the delineation of fields of science in terms of journals and normalization when comparing among different (sets of) journals. Using publication and citation data of seven Korean research universities, we demonstrate the advantages and the differences in the rankings, explain the possible statistics, and suggest ways to visualize the differences in (citing) audiences in terms of a network.
Source: Journal of the American Society for Information Science and Technology. 62(2011) no.6, S.1146-1155

Bornmann, L.; Wagner, C.; Leydesdorff, L.: BRICS countries and scientific excellence : a bibliometric analysis of most frequently cited papers (2015) 0.07

0.06531672 = product of:
  0.087088965 = sum of:
    0.027355144 = weight(_text_:science in 2047) [ClassicSimilarity], result of:
      0.027355144 = score(doc=2047,freq=4.0), product of:
        0.1329271 = queryWeight, product of:
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.050463587 = queryNorm
        0.20579056 = fieldWeight in 2047, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2047)
    0.032089777 = weight(_text_:research in 2047) [ClassicSimilarity], result of:
      0.032089777 = score(doc=2047,freq=4.0), product of:
        0.14397179 = queryWeight, product of:
          2.8529835 = idf(docFreq=6931, maxDocs=44218)
          0.050463587 = queryNorm
        0.22288933 = fieldWeight in 2047, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          2.8529835 = idf(docFreq=6931, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2047)
    0.027644044 = product of:
      0.055288088 = sum of:
        0.055288088 = weight(_text_:network in 2047) [ClassicSimilarity], result of:
          0.055288088 = score(doc=2047,freq=2.0), product of:
            0.22473325 = queryWeight, product of:
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.050463587 = queryNorm
            0.2460165 = fieldWeight in 2047, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2047)
      0.5 = coord(1/2)
  0.75 = coord(3/4)

Abstract: The BRICS countries (Brazil, Russia, India, China, and South Africa) are notable for their increasing participation in science and technology. The governments of these countries have been boosting their investments in research and development to become part of the group of nations doing research at a world-class level. This study investigates the development of the BRICS countries in the domain of top-cited papers (top 10% and 1% most frequently cited papers) between 1990 and 2010. To assess the extent to which these countries have become important players at the top level, we compare the BRICS countries with the top-performing countries worldwide. As the analyses of the (annual) growth rates show, with the exception of Russia, the BRICS countries have increased their output in terms of most frequently cited papers at a higher rate than the top-cited countries worldwide. By way of additional analysis, we generate coauthorship networks among authors of highly cited papers for 4 time points to view changes in BRICS participation (1995, 2000, 2005, and 2010). Here, the results show that all BRICS countries succeeded in becoming part of this network, whereby the Chinese collaboration activities focus on the US.
Source: Journal of the Association for Information Science and Technology. 66(2015) no.7, S.1507-1513

Leydesdorff, L.; Bornmann, L.; Wagner, C.S.: ¬The relative influences of government funding and international collaboration on citation impact (2019) 0.06

0.060424954 = product of:
  0.08056661 = sum of:
    0.032826174 = weight(_text_:science in 4681) [ClassicSimilarity], result of:
      0.032826174 = score(doc=4681,freq=4.0), product of:
        0.1329271 = queryWeight, product of:
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.050463587 = queryNorm
        0.24694869 = fieldWeight in 4681, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.046875 = fieldNorm(doc=4681)
    0.027229078 = weight(_text_:research in 4681) [ClassicSimilarity], result of:
      0.027229078 = score(doc=4681,freq=2.0), product of:
        0.14397179 = queryWeight, product of:
          2.8529835 = idf(docFreq=6931, maxDocs=44218)
          0.050463587 = queryNorm
        0.18912788 = fieldWeight in 4681, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.8529835 = idf(docFreq=6931, maxDocs=44218)
          0.046875 = fieldNorm(doc=4681)
    0.020511357 = product of:
      0.041022714 = sum of:
        0.041022714 = weight(_text_:22 in 4681) [ClassicSimilarity], result of:
          0.041022714 = score(doc=4681,freq=2.0), product of:
            0.17671488 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050463587 = queryNorm
            0.23214069 = fieldWeight in 4681, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=4681)
      0.5 = coord(1/2)
  0.75 = coord(3/4)

Abstract: A recent publication in Nature reports that public R&D funding is only weakly correlated with the citation impact of a nation's articles as measured by the field-weighted citation index (FWCI; defined by Scopus). On the basis of the supplementary data, we up-scaled the design using Web of Science data for the decade 2003-2013 and OECD funding data for the corresponding decade assuming a 2-year delay (2001-2011). Using negative binomial regression analysis, we found very small coefficients, but the effects of international collaboration are positive and statistically significant, whereas the effects of government funding are negative, an order of magnitude smaller, and statistically nonsignificant (in two of three analyses). In other words, international collaboration improves the impact of research articles, whereas more government funding tends to have a small adverse effect when comparing OECD countries.
Date: 8. 1.2019 18:22:45
Source: Journal of the Association for Information Science and Technology. 70(2019) no.2, S.198-201

Leydesdorff, L.; Bornmann, L.; Mingers, J.: Statistical significance and effect sizes of differences among research universities at the level of nations and worldwide based on the Leiden rankings (2019) 0.06
```
0.05930762 = product of:
  0.07907683 = sum of:
    0.019343007 = weight(_text_:science in 5225) [ClassicSimilarity], result of:
      0.019343007 = score(doc=5225,freq=2.0), product of:
        0.1329271 = queryWeight, product of:
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.050463587 = queryNorm
        0.1455159 = fieldWeight in 5225, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5225)
    0.032089777 = weight(_text_:research in 5225) [ClassicSimilarity], result of:
      0.032089777 = score(doc=5225,freq=4.0), product of:
        0.14397179 = queryWeight, product of:
          2.8529835 = idf(docFreq=6931, maxDocs=44218)
          0.050463587 = queryNorm
        0.22288933 = fieldWeight in 5225, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          2.8529835 = idf(docFreq=6931, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5225)
    0.027644044 = product of:
      0.055288088 = sum of:
        0.055288088 = weight(_text_:network in 5225) [ClassicSimilarity], result of:
          0.055288088 = score(doc=5225,freq=2.0), product of:
            0.22473325 = queryWeight, product of:
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.050463587 = queryNorm
            0.2460165 = fieldWeight in 5225, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5225)
      0.5 = coord(1/2)
  0.75 = coord(3/4)
```
Abstract

The Leiden Rankings can be used for grouping research universities by considering universities which are not statistically significantly different as homogeneous sets. The groups and intergroup relations can be analyzed and visualized using tools from network analysis. Using the so-called "excellence indicator" PPtop-10%-the proportion of the top-10% most-highly-cited papers assigned to a university-we pursue a classification using (a) overlapping stability intervals, (b) statistical-significance tests, and (c) effect sizes of differences among 902 universities in 54 countries; we focus on the UK, Germany, Brazil, and the USA as national examples. Although the groupings remain largely the same using different statistical significance levels or overlapping stability intervals, these classifications are uncorrelated with those based on effect sizes. Effect sizes for the differences between universities are small (w < .2). The more detailed analysis of universities at the country level suggests that distinctions beyond three or perhaps four groups of universities (high, middle, low) may not be meaningful. Given similar institutional incentives, isomorphism within each eco-system of universities should not be underestimated. Our results suggest that networks based on overlapping stability intervals can provide a first impression of the relevant groupings among universities. However, the clusters are not well-defined divisions between groups of universities.

Source

Journal of the Association for Information Science and Technology. 70(2019) no.5, S.509-525

Leydesdorff, L.; Nerghes, A.: Co-word maps and topic modeling : a comparison using small and medium-sized corpora (N?<?1.000) (2017) 0.05

0.05225846 = product of:
  0.06967795 = sum of:
    0.019343007 = weight(_text_:science in 3538) [ClassicSimilarity], result of:
      0.019343007 = score(doc=3538,freq=2.0), product of:
        0.1329271 = queryWeight, product of:
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.050463587 = queryNorm
        0.1455159 = fieldWeight in 3538, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3538)
    0.0226909 = weight(_text_:research in 3538) [ClassicSimilarity], result of:
      0.0226909 = score(doc=3538,freq=2.0), product of:
        0.14397179 = queryWeight, product of:
          2.8529835 = idf(docFreq=6931, maxDocs=44218)
          0.050463587 = queryNorm
        0.15760657 = fieldWeight in 3538, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.8529835 = idf(docFreq=6931, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3538)
    0.027644044 = product of:
      0.055288088 = sum of:
        0.055288088 = weight(_text_:network in 3538) [ClassicSimilarity], result of:
          0.055288088 = score(doc=3538,freq=2.0), product of:
            0.22473325 = queryWeight, product of:
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.050463587 = queryNorm
            0.2460165 = fieldWeight in 3538, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3538)
      0.5 = coord(1/2)
  0.75 = coord(3/4)

Abstract: Induced by "big data," "topic modeling" has become an attractive alternative to mapping co-words in terms of co-occurrences and co-absences using network techniques. Does topic modeling provide an alternative for co-word mapping in research practices using moderately sized document collections? We return to the word/document matrix using first a single text with a strong argument ("The Leiden Manifesto") and then upscale to a sample of moderate size (n?=?687) to study the pros and cons of the two approaches in terms of the resulting possibilities for making semantic maps that can serve an argument. The results from co-word mapping (using two different routines) versus topic modeling are significantly uncorrelated. Whereas components in the co-word maps can easily be designated, the topic models provide sets of words that are very differently organized. In these samples, the topic models seem to reveal similarities other than semantic ones (e.g., linguistic ones). In other words, topic modeling does not replace co-word mapping in small and medium-sized sets; but the paper leaves open the possibility that topic modeling would work well for the semantic mapping of large sets.
Source: Journal of the Association for Information Science and Technology. 68(2017) no.4, S.1024-1035

Marx, W.; Bornmann, L.; Barth, A.; Leydesdorff, L.: Detecting the historical roots of research fields by reference publication year spectroscopy (RPYS) (2014) 0.05

0.049056977 = product of:
  0.098113954 = sum of:
    0.027080212 = weight(_text_:science in 1238) [ClassicSimilarity], result of:
      0.027080212 = score(doc=1238,freq=2.0), product of:
        0.1329271 = queryWeight, product of:
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.050463587 = queryNorm
        0.20372227 = fieldWeight in 1238, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1238)
    0.071033746 = weight(_text_:research in 1238) [ClassicSimilarity], result of:
      0.071033746 = score(doc=1238,freq=10.0), product of:
        0.14397179 = queryWeight, product of:
          2.8529835 = idf(docFreq=6931, maxDocs=44218)
          0.050463587 = queryNorm
        0.49338657 = fieldWeight in 1238, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          2.8529835 = idf(docFreq=6931, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1238)
  0.5 = coord(2/4)

Abstract: We introduce the quantitative method named "Reference Publication Year Spectroscopy" (RPYS). With this method one can determine the historical roots of research fields and quantify their impact on current research. RPYS is based on the analysis of the frequency with which references are cited in the publications of a specific research field in terms of the publication years of these cited references. The origins show up in the form of more or less pronounced peaks mostly caused by individual publications that are cited particularly frequently. In this study, we use research on graphene and on solar cells to illustrate how RPYS functions, and what results it can deliver.
Source: Journal of the Association for Information Science and Technology. 65(2014) no.4, S.751-764

Leydesdorff, L.; Bornmann, L.: ¬The operationalization of "fields" as WoS subject categories (WCs) in evaluative bibliometrics : the cases of "library and information science" and "science & technology studies" (2016) 0.05
```
0.046440713 = product of:
  0.092881426 = sum of:
    0.06565235 = weight(_text_:science in 2779) [ClassicSimilarity], result of:
      0.06565235 = score(doc=2779,freq=16.0), product of:
        0.1329271 = queryWeight, product of:
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.050463587 = queryNorm
        0.49389738 = fieldWeight in 2779, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.046875 = fieldNorm(doc=2779)
    0.027229078 = weight(_text_:research in 2779) [ClassicSimilarity], result of:
      0.027229078 = score(doc=2779,freq=2.0), product of:
        0.14397179 = queryWeight, product of:
          2.8529835 = idf(docFreq=6931, maxDocs=44218)
          0.050463587 = queryNorm
        0.18912788 = fieldWeight in 2779, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.8529835 = idf(docFreq=6931, maxDocs=44218)
          0.046875 = fieldNorm(doc=2779)
  0.5 = coord(2/4)
```
Abstract

Normalization of citation scores using reference sets based on Web of Science subject categories (WCs) has become an established ("best") practice in evaluative bibliometrics. For example, the Times Higher Education World University Rankings are, among other things, based on this operationalization. However, WCs were developed decades ago for the purpose of information retrieval and evolved incrementally with the database; the classification is machine-based and partially manually corrected. Using the WC "information science & library science" and the WCs attributed to journals in the field of "science and technology studies," we show that WCs do not provide sufficient analytical clarity to carry bibliometric normalization in evaluation practices because of "indexer effects." Can the compliance with "best practices" be replaced with an ambition to develop "best possible practices"? New research questions can then be envisaged.

Aid

Web of Science

Source

Journal of the Association for Information Science and Technology. 67(2016) no.3, S.707-714
Leydesdorff, L.; Moya-Anegón, F. de; Nooy, W. de: Aggregated journal-journal citation relations in scopus and web of science matched and compared in terms of networks, maps, and interactive overlays (2016) 0.04
```
0.04328345 = product of:
  0.0865669 = sum of:
    0.038686015 = weight(_text_:science in 3090) [ClassicSimilarity], result of:
      0.038686015 = score(doc=3090,freq=8.0), product of:
        0.1329271 = queryWeight, product of:
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.050463587 = queryNorm
        0.2910318 = fieldWeight in 3090, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3090)
    0.047880888 = product of:
      0.095761776 = sum of:
        0.095761776 = weight(_text_:network in 3090) [ClassicSimilarity], result of:
          0.095761776 = score(doc=3090,freq=6.0), product of:
            0.22473325 = queryWeight, product of:
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.050463587 = queryNorm
            0.42611307 = fieldWeight in 3090, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3090)
      0.5 = coord(1/2)
  0.5 = coord(2/4)
```
Abstract

We compare the network of aggregated journal-journal citation relations provided by the Journal Citation Reports (JCR) 2012 of the Science Citation Index (SCI) and Social Sciences Citation Index (SSCI) with similar data based on Scopus 2012. First, global and overlay maps were developed for the 2 sets separately. Using fuzzy-string matching and ISSN numbers, we were able to match 10,524 journal names between the 2 sets: 96.4% of the 10,936 journals contained in JCR, or 51.2% of the 20,554 journals covered by Scopus. Network analysis was pursued on the set of journals shared between the 2 databases and the 2 sets of unique journals. Citations among the shared journals are more comprehensively covered in JCR than in Scopus, so the network in JCR is denser and more connected than in Scopus. The ranking of shared journals in terms of indegree (i.e., numbers of citing journals) or total citations is similar in both databases overall (Spearman rank correlation ??>?0.97), but some individual journals rank very differently. Journals that are unique to Scopus seem to be less important-they are citing shared journals rather than being cited by them-but the humanities are covered better in Scopus than in JCR.

Object

Web of science

Source

Journal of the Association for Information Science and Technology. 67(2016) no.9, S.2194-2211
Leydesdorff, L.; Park, H.W.; Wagner, C.: International coauthorship relations in the Social Sciences Citation Index : is internationalization leading the Network? (2014) 0.04
```
0.04069198 = product of:
  0.08138396 = sum of:
    0.033503074 = weight(_text_:science in 1505) [ClassicSimilarity], result of:
      0.033503074 = score(doc=1505,freq=6.0), product of:
        0.1329271 = queryWeight, product of:
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.050463587 = queryNorm
        0.25204095 = fieldWeight in 1505, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1505)
    0.047880888 = product of:
      0.095761776 = sum of:
        0.095761776 = weight(_text_:network in 1505) [ClassicSimilarity], result of:
          0.095761776 = score(doc=1505,freq=6.0), product of:
            0.22473325 = queryWeight, product of:
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.050463587 = queryNorm
            0.42611307 = fieldWeight in 1505, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1505)
      0.5 = coord(1/2)
  0.5 = coord(2/4)
```
Abstract

International coauthorship relations have increasingly shaped another dynamic in the natural and life sciences during recent decades. However, much less is known about such internationalization in the social sciences. In this study, we analyze international and domestic coauthorship relations of all citable items in the DVD version of the Social Sciences Citation Index 2011 (SSCI). Network statistics indicate 4 groups of nations: (a) an Asian-Pacific one to which all Anglo-Saxon nations (including the United Kingdom and Ireland) are attributed, (b) a continental European one including also the Latin-American countries, (c) the Scandinavian nations, and (d) a community of African nations. Within the EU-28, 11 of the EU-15 states have dominant positions. In many respects, the network parameters are not so different from the Science Citation Index. In addition to these descriptive statistics, we address the question of the relative weights of the international versus domestic networks. An information-theoretical test is proposed at the level of organizational addresses within each nation; the results are mixed, but the international dimension is more important than the national one in the aggregated sets (as in the Science Citation Index). In some countries (e.g., France), however, the national distribution is leading more than the international one. Decomposition of the United States in terms of states shows a similarly mixed result; more U.S. states are domestically oriented in the SSCI and more internationally in the SCI. The international networks have grown during the last decades in addition to the national ones but not by replacing them.

Source

Journal of the Association for Information Science and Technology. 65(2014) no.10, S.2111-2126
Leydesdorff, L.; Rotolo, D.; Rafols, I.: Bibliometric perspectives on medical innovation using the medical subject headings of PubMed (2012) 0.04
```
0.03566695 = product of:
  0.0713339 = sum of:
    0.032826174 = weight(_text_:science in 494) [ClassicSimilarity], result of:
      0.032826174 = score(doc=494,freq=4.0), product of:
        0.1329271 = queryWeight, product of:
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.050463587 = queryNorm
        0.24694869 = fieldWeight in 494, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.046875 = fieldNorm(doc=494)
    0.03850773 = weight(_text_:research in 494) [ClassicSimilarity], result of:
      0.03850773 = score(doc=494,freq=4.0), product of:
        0.14397179 = queryWeight, product of:
          2.8529835 = idf(docFreq=6931, maxDocs=44218)
          0.050463587 = queryNorm
        0.2674672 = fieldWeight in 494, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          2.8529835 = idf(docFreq=6931, maxDocs=44218)
          0.046875 = fieldNorm(doc=494)
  0.5 = coord(2/4)
```
Abstract

Multiple perspectives on the nonlinear processes of medical innovations can be distinguished and combined using the Medical Subject Headings (MeSH) of the MEDLINE database. Focusing on three main branches-"diseases," "drugs and chemicals," and "techniques and equipment"-we use base maps and overlay techniques to investigate the translations and interactions and thus to gain a bibliometric perspective on the dynamics of medical innovations. To this end, we first analyze the MEDLINE database, the MeSH index tree, and the various options for a static mapping from different perspectives and at different levels of aggregation. Following a specific innovation (RNA interference) over time, the notion of a trajectory which leaves a signature in the database is elaborated. Can the detailed index terms describing the dynamics of research be used to predict the diffusion dynamics of research results? Possibilities are specified for further integration between the MEDLINE database on one hand, and the Science Citation Index and Scopus (containing citation information) on the other.

Source

Journal of the American Society for Information Science and Technology. 63(2012) no.11, S.2239-2253
Bauer, J.; Leydesdorff, L.; Bornmann, L.: Highly cited papers in Library and Information Science (LIS) : authors, institutions, and network structures (2016) 0.04
```
0.035448164 = product of:
  0.07089633 = sum of:
    0.043252286 = weight(_text_:science in 3231) [ClassicSimilarity], result of:
      0.043252286 = score(doc=3231,freq=10.0), product of:
        0.1329271 = queryWeight, product of:
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.050463587 = queryNorm
        0.32538348 = fieldWeight in 3231, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3231)
    0.027644044 = product of:
      0.055288088 = sum of:
        0.055288088 = weight(_text_:network in 3231) [ClassicSimilarity], result of:
          0.055288088 = score(doc=3231,freq=2.0), product of:
            0.22473325 = queryWeight, product of:
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.050463587 = queryNorm
            0.2460165 = fieldWeight in 3231, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3231)
      0.5 = coord(1/2)
  0.5 = coord(2/4)
```
Abstract

As a follow-up to the highly cited authors list published by Thomson Reuters in June 2014, we analyzed the top 1% most frequently cited papers published between 2002 and 2012 included in the Web of Science (WoS) subject category "Information Science & Library Science." In all, 798 authors contributed to 305 top 1% publications; these authors were employed at 275 institutions. The authors at Harvard University contributed the largest number of papers, when the addresses are whole-number counted. However, Leiden University leads the ranking if fractional counting is used. Twenty-three of the 798 authors were also listed as most highly cited authors by Thomson Reuters in June 2014 (http://highlycited.com/). Twelve of these 23 authors were involved in publishing 4 or more of the 305 papers under study. Analysis of coauthorship relations among the 798 highly cited scientists shows that coauthorships are based on common interests in a specific topic. Three topics were important between 2002 and 2012: (a) collection and exploitation of information in clinical practices; (b) use of the Internet in public communication and commerce; and (c) scientometrics.

Source

Journal of the Association for Information Science and Technology. 67(2016) no.12, S.3095-3100
Leydesdorff, L.; Opthof, T.: Citation analysis with medical subject Headings (MeSH) using the Web of Knowledge : a new routine (2013) 0.03
```
0.033716384 = product of:
  0.06743277 = sum of:
    0.04020369 = weight(_text_:science in 943) [ClassicSimilarity], result of:
      0.04020369 = score(doc=943,freq=6.0), product of:
        0.1329271 = queryWeight, product of:
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.050463587 = queryNorm
        0.30244917 = fieldWeight in 943, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.046875 = fieldNorm(doc=943)
    0.027229078 = weight(_text_:research in 943) [ClassicSimilarity], result of:
      0.027229078 = score(doc=943,freq=2.0), product of:
        0.14397179 = queryWeight, product of:
          2.8529835 = idf(docFreq=6931, maxDocs=44218)
          0.050463587 = queryNorm
        0.18912788 = fieldWeight in 943, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.8529835 = idf(docFreq=6931, maxDocs=44218)
          0.046875 = fieldNorm(doc=943)
  0.5 = coord(2/4)
```
Abstract

Citation analysis of documents retrieved from the Medline database (at the Web of Knowledge) has been possible only on a case-by-case basis. A technique is presented here for citation analysis in batch mode using both Medical Subject Headings (MeSH) at the Web of Knowledge and the Science Citation Index at the Web of Science (WoS). This freeware routine is applied to the case of "Brugada Syndrome," a specific disease and field of research (since 1992). The journals containing these publications, for example, are attributed to WoS categories other than "cardiac and cardiovascular systems", perhaps because of the possibility of genetic testing for this syndrome in the clinic. With this routine, all the instruments available for citation analysis can now be used on the basis of MeSH terms. Other options for crossing between Medline, WoS, and Scopus are also reviewed.

Source

Journal of the American Society for Information Science and Technology. 64(2013) no.5, S.1076-1080
Comins, J.A.; Leydesdorff, L.: Identification of long-term concept-symbols among citations : do common intellectual histories structure citation behavior? (2017) 0.03
```
0.033716384 = product of:
  0.06743277 = sum of:
    0.04020369 = weight(_text_:science in 3599) [ClassicSimilarity], result of:
      0.04020369 = score(doc=3599,freq=6.0), product of:
        0.1329271 = queryWeight, product of:
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.050463587 = queryNorm
        0.30244917 = fieldWeight in 3599, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.046875 = fieldNorm(doc=3599)
    0.027229078 = weight(_text_:research in 3599) [ClassicSimilarity], result of:
      0.027229078 = score(doc=3599,freq=2.0), product of:
        0.14397179 = queryWeight, product of:
          2.8529835 = idf(docFreq=6931, maxDocs=44218)
          0.050463587 = queryNorm
        0.18912788 = fieldWeight in 3599, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.8529835 = idf(docFreq=6931, maxDocs=44218)
          0.046875 = fieldNorm(doc=3599)
  0.5 = coord(2/4)
```
Abstract

"Citation classics" are not only highly cited, but also cited during several decades. We explore whether the peaks in the spectrograms generated by Reference Publication Years Spectroscopy (RPYS) indicate such long-term impact by comparing across RPYS for subsequent time intervals. Multi-RPYS enables us to distinguish between short-term citation peaks at the research front that decay within 10 years versus historically constitutive (long-term) citations that function as concept symbols. Using these constitutive citations, one is able to cluster document sets (e.g., journals) in terms of intellectually shared histories. We test this premise by clustering 40 journals in the Web of Science Category of Information and Library Science using multi-RPYS. It follows that RPYS can not only be used for retrieving roots of sets under study (cited), but also for algorithmic historiography of the citing sets. Significant references are historically rooted symbols among other citations that function as currency.

Source

Journal of the Association for Information Science and Technology. 68(2017) no.5, S.1224-1233
Leydesdorff, L.; Bornmann, L.: Integrated impact indicators compared with impact factors : an alternative research design with policy implications (2011) 0.03
```
0.03297159 = product of:
  0.06594318 = sum of:
    0.043252286 = weight(_text_:science in 4919) [ClassicSimilarity], result of:
      0.043252286 = score(doc=4919,freq=10.0), product of:
        0.1329271 = queryWeight, product of:
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.050463587 = queryNorm
        0.32538348 = fieldWeight in 4919, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.0390625 = fieldNorm(doc=4919)
    0.0226909 = weight(_text_:research in 4919) [ClassicSimilarity], result of:
      0.0226909 = score(doc=4919,freq=2.0), product of:
        0.14397179 = queryWeight, product of:
          2.8529835 = idf(docFreq=6931, maxDocs=44218)
          0.050463587 = queryNorm
        0.15760657 = fieldWeight in 4919, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.8529835 = idf(docFreq=6931, maxDocs=44218)
          0.0390625 = fieldNorm(doc=4919)
  0.5 = coord(2/4)
```
Abstract

In bibliometrics, the association of "impact" with central-tendency statistics is mistaken. Impacts add up, and citation curves therefore should be integrated instead of averaged. For example, the journals MIS Quarterly and Journal of the American Society for Information Science and Technology differ by a factor of 2 in terms of their respective impact factors (IF), but the journal with the lower IF has the higher impact. Using percentile ranks (e.g., top-1%, top-10%, etc.), an Integrated Impact Indicator (I3) can be based on integration of the citation curves, but after normalization of the citation curves to the same scale. The results across document sets can be compared as percentages of the total impact of a reference set. Total number of citations, however, should not be used instead because the shape of the citation curves is then not appreciated. I3 can be applied to any document set and any citation window. The results of the integration (summation) are fully decomposable in terms of journals or institutional units such as nations, universities, and so on because percentile ranks are determined at the paper level. In this study, we first compare I3 with IFs for the journals in two Institute for Scientific Information subject categories ("Information Science & Library Science" and "Multidisciplinary Sciences"). The library and information science set is additionally decomposed in terms of nations. Policy implications of this possible paradigm shift in citation impact analysis are specified.

Source

Journal of the American Society for Information Science and Technology. 62(2011) no.11, S.2133-2146
Leydesdorff, L.; Bornmann, L.: How fractional counting of citations affects the impact factor : normalization in terms of differences in citation potentials among fields of science (2011) 0.03
```
0.03223665 = product of:
  0.0644733 = sum of:
    0.0473805 = weight(_text_:science in 4186) [ClassicSimilarity], result of:
      0.0473805 = score(doc=4186,freq=12.0), product of:
        0.1329271 = queryWeight, product of:
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.050463587 = queryNorm
        0.3564397 = fieldWeight in 4186, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.0390625 = fieldNorm(doc=4186)
    0.017092798 = product of:
      0.034185596 = sum of:
        0.034185596 = weight(_text_:22 in 4186) [ClassicSimilarity], result of:
          0.034185596 = score(doc=4186,freq=2.0), product of:
            0.17671488 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050463587 = queryNorm
            0.19345059 = fieldWeight in 4186, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4186)
      0.5 = coord(1/2)
  0.5 = coord(2/4)
```
Abstract

The Impact Factors (IFs) of the Institute for Scientific Information suffer from a number of drawbacks, among them the statistics-Why should one use the mean and not the median?-and the incomparability among fields of science because of systematic differences in citation behavior among fields. Can these drawbacks be counteracted by fractionally counting citation weights instead of using whole numbers in the numerators? (a) Fractional citation counts are normalized in terms of the citing sources and thus would take into account differences in citation behavior among fields of science. (b) Differences in the resulting distributions can be tested statistically for their significance at different levels of aggregation. (c) Fractional counting can be generalized to any document set including journals or groups of journals, and thus the significance of differences among both small and large sets can be tested. A list of fractionally counted IFs for 2008 is available online at http:www.leydesdorff.net/weighted_if/weighted_if.xls The between-group variance among the 13 fields of science identified in the U.S. Science and Engineering Indicators is no longer statistically significant after this normalization. Although citation behavior differs largely between disciplines, the reflection of these differences in fractionally counted citation distributions can not be used as a reliable instrument for the classification.

Date

22. 1.2011 12:51:07

Source

Journal of the American Society for Information Science and Technology. 62(2011) no.2, S.217-229
Ye, F.Y.; Yu, S.S.; Leydesdorff, L.: ¬The Triple Helix of university-industry-government relations at the country level and its dynamic evolution under the pressures of globalization (2013) 0.03
```
0.030027626 = product of:
  0.060055252 = sum of:
    0.032826174 = weight(_text_:science in 1110) [ClassicSimilarity], result of:
      0.032826174 = score(doc=1110,freq=4.0), product of:
        0.1329271 = queryWeight, product of:
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.050463587 = queryNorm
        0.24694869 = fieldWeight in 1110, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.046875 = fieldNorm(doc=1110)
    0.027229078 = weight(_text_:research in 1110) [ClassicSimilarity], result of:
      0.027229078 = score(doc=1110,freq=2.0), product of:
        0.14397179 = queryWeight, product of:
          2.8529835 = idf(docFreq=6931, maxDocs=44218)
          0.050463587 = queryNorm
        0.18912788 = fieldWeight in 1110, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.8529835 = idf(docFreq=6931, maxDocs=44218)
          0.046875 = fieldNorm(doc=1110)
  0.5 = coord(2/4)
```
Abstract

Using data from the Web of Science (WoS), we analyze the mutual information among university, industry, and government addresses (U-I-G) at the country level for a number of countries. The dynamic evolution of the Triple Helix can thus be compared among developed and developing nations in terms of cross-sectional coauthorship relations. The results show that the Triple Helix interactions among the three subsystems U-I-G become less intensive over time, but unequally for different countries. We suggest that globalization erodes local Triple Helix relations and thus can be expected to have increased differentiation in national systems since the mid-1990s. This effect of globalization is more pronounced in developed countries than in developing ones. In the dynamic analysis, we focus on a more detailed comparison between China and the United States. Specifically, the Chinese Academy of the (Social) Sciences is changing increasingly from a public research institute to an academic one, and this has a measurable effect on China's position in the globalization.

Source

Journal of the American Society for Information Science and Technology. 64(2013) no.11, S.2317-2325
Leydesdorff, L.; Nooy, W. de: Can "hot spots" in the sciences be mapped using the dynamics of aggregated journal-journal citation relations (2017) 0.03
```
0.029218795 = product of:
  0.05843759 = sum of:
    0.019343007 = weight(_text_:science in 3328) [ClassicSimilarity], result of:
      0.019343007 = score(doc=3328,freq=2.0), product of:
        0.1329271 = queryWeight, product of:
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.050463587 = queryNorm
        0.1455159 = fieldWeight in 3328, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3328)
    0.039094582 = product of:
      0.078189164 = sum of:
        0.078189164 = weight(_text_:network in 3328) [ClassicSimilarity], result of:
          0.078189164 = score(doc=3328,freq=4.0), product of:
            0.22473325 = queryWeight, product of:
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.050463587 = queryNorm
            0.34791988 = fieldWeight in 3328, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3328)
      0.5 = coord(1/2)
  0.5 = coord(2/4)
```
Abstract

Using 3 years of the Journal Citation Reports (2011, 2012, and 2013), indicators of transitions in 2012 (between 2011 and 2013) were studied using methodologies based on entropy statistics. Changes can be indicated at the level of journals using the margin totals of entropy production along the row or column vectors, but also at the level of links among journals by importing the transition matrices into network analysis and visualization programs (and using community-finding algorithms). Seventy-four journals were flagged in terms of discontinuous changes in their citations, but 3,114 journals were involved in "hot" links. Most of these links are embedded in a main component; 78 clusters (containing 172 journals) were flagged as potential "hot spots" emerging at the network level. An additional finding was that PLoS ONE introduced a new communication dynamic into the database. The limitations of the methodology were elaborated using an example. The results of the study indicate where developments in the citation dynamics can be considered as significantly unexpected. This can be used as heuristic information, but what a "hot spot" in terms of the entropy statistics of aggregated citation relations means substantively can be expected to vary from case to case.

Source

Journal of the Association for Information Science and Technology. 68(2017) no.1, S.197-213
Chen, C.; Leydesdorff, L.: Patterns of connections and movements in dual-map overlays : a new method of publication portfolio analysis (2014) 0.03
```
0.028096987 = product of:
  0.056193974 = sum of:
    0.033503074 = weight(_text_:science in 1200) [ClassicSimilarity], result of:
      0.033503074 = score(doc=1200,freq=6.0), product of:
        0.1329271 = queryWeight, product of:
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.050463587 = queryNorm
        0.25204095 = fieldWeight in 1200, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1200)
    0.0226909 = weight(_text_:research in 1200) [ClassicSimilarity], result of:
      0.0226909 = score(doc=1200,freq=2.0), product of:
        0.14397179 = queryWeight, product of:
          2.8529835 = idf(docFreq=6931, maxDocs=44218)
          0.050463587 = queryNorm
        0.15760657 = fieldWeight in 1200, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.8529835 = idf(docFreq=6931, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1200)
  0.5 = coord(2/4)
```
Abstract

Portfolio analysis of the publication profile of a unit of interest, ranging from individuals and organizations to a scientific field or interdisciplinary programs, aims to inform analysts and decision makers about the position of the unit, where it has been, and where it may go in a complex adaptive environment. A portfolio analysis may aim to identify the gap between the current position of an organization and a goal that it intends to achieve or identify competencies of multiple institutions. We introduce a new visual analytic method for analyzing, comparing, and contrasting characteristics of publication portfolios. The new method introduces a novel design of dual-map thematic overlays on global maps of science. Each publication portfolio can be added as one layer of dual-map overlays over 2 related, but distinct, global maps of science: one for citing journals and the other for cited journals. We demonstrate how the new design facilitates a portfolio analysis in terms of patterns emerging from the distributions of citation threads and the dynamics of trajectories as a function of space and time. We first demonstrate the analysis of portfolios defined on a single source article. Then we contrast publication portfolios of multiple comparable units of interest; namely, colleges in universities and corporate research organizations. We also include examples of overlays of scientific fields. We expect that our method will provide new insights to portfolio analysis.

Source

Journal of the Association for Information Science and Technology. 65(2014) no.2, S.334-351
Shelton, R.D.; Leydesdorff, L.: Publish or patent : bibliometric evidence for empirical trade-offs in national funding strategies (2012) 0.03
```
0.025716392 = product of:
  0.051432785 = sum of:
    0.019343007 = weight(_text_:science in 70) [ClassicSimilarity], result of:
      0.019343007 = score(doc=70,freq=2.0), product of:
        0.1329271 = queryWeight, product of:
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.050463587 = queryNorm
        0.1455159 = fieldWeight in 70, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.0390625 = fieldNorm(doc=70)
    0.032089777 = weight(_text_:research in 70) [ClassicSimilarity], result of:
      0.032089777 = score(doc=70,freq=4.0), product of:
        0.14397179 = queryWeight, product of:
          2.8529835 = idf(docFreq=6931, maxDocs=44218)
          0.050463587 = queryNorm
        0.22288933 = fieldWeight in 70, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          2.8529835 = idf(docFreq=6931, maxDocs=44218)
          0.0390625 = fieldNorm(doc=70)
  0.5 = coord(2/4)
```
Abstract

Multivariate linear regression models suggest a trade-off in allocations of national research and development (R&D). Government funding and spending in the higher education sector encourage publications as a long-term research benefit. Conversely, other components such as industrial funding and spending in the business sector encourage patenting. Our results help explain why the United States trails the European Union in publications: The focus in the United States is on industrial funding-some 70% of its total R&D investment. Likewise, our results also help explain why the European Union trails the United States in patenting, since its focus on government funding is less effective than industrial funding in predicting triadic patenting. Government funding contributes negatively to patenting in a multiple regression, and this relationship is significant in the case of triadic patenting. We provide new forecasts about the relationships of the United States, the European Union, and China for publishing; these results suggest much later dates for changes than previous forecasts because Chinese growth has been slowing down since 2003. Models for individual countries might be more successful than regression models whose parameters are averaged over a set of countries because nations can be expected to differ historically in terms of the institutional arrangements and funding schemes.

Source

Journal of the American Society for Information Science and Technology. 63(2012) no.3, S.498-511
Zhou, Q.; Leydesdorff, L.: ¬The normalization of occurrence and co-occurrence matrices in bibliometrics using Cosine similarities and Ochiai coefficients (2016) 0.03
```
0.025220342 = product of:
  0.050440684 = sum of:
    0.023211608 = weight(_text_:science in 3161) [ClassicSimilarity], result of:
      0.023211608 = score(doc=3161,freq=2.0), product of:
        0.1329271 = queryWeight, product of:
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.050463587 = queryNorm
        0.17461908 = fieldWeight in 3161, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.6341193 = idf(docFreq=8627, maxDocs=44218)
          0.046875 = fieldNorm(doc=3161)
    0.027229078 = weight(_text_:research in 3161) [ClassicSimilarity], result of:
      0.027229078 = score(doc=3161,freq=2.0), product of:
        0.14397179 = queryWeight, product of:
          2.8529835 = idf(docFreq=6931, maxDocs=44218)
          0.050463587 = queryNorm
        0.18912788 = fieldWeight in 3161, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.8529835 = idf(docFreq=6931, maxDocs=44218)
          0.046875 = fieldNorm(doc=3161)
  0.5 = coord(2/4)
```
Abstract

We prove that Ochiai similarity of the co-occurrence matrix is equal to cosine similarity in the underlying occurrence matrix. Neither the cosine nor the Pearson correlation should be used for the normalization of co-occurrence matrices because the similarity is then normalized twice, and therefore overestimated; the Ochiai coefficient can be used instead. Results are shown using a small matrix (5 cases, 4 variables) for didactic reasons, and also Ahlgren et?al.'s (2003) co-occurrence matrix of 24 authors in library and information sciences. The overestimation is shown numerically and will be illustrated using multidimensional scaling and cluster dendograms. If the occurrence matrix is not available (such as in internet research or author cocitation analysis) using Ochiai for the normalization is preferable to using the cosine.

Source

Journal of the Association for Information Science and Technology. 67(2016) no.11, S.2805-2814

Search (37 results, page 1 of 2)

Authors