Document (#41216)

Author
Colavizza, G.
Boyack, K.W.
Eck, N.J. van
Waltman, L.
Title
¬The closer the better : similarity of publication pairs at different cocitation levels
Source
Journal of the Association for Information Science and Technology. 69(2018) no.4, pp. 600-609
Year
2018
Abstract
We investigated the similarities of pairs of articles that are cocited at the different cocitation levels of the journal, article, section, paragraph, sentence, and bracket. Our results indicate that textual similarity, intellectual overlap (shared references), author overlap (shared authors), and proximity in publication time all rise monotonically as the cocitation level gets lower (from journal to bracket). While the main gain in similarity happens when moving from journal to article cocitation, all level changes entail an increase in similarity, especially section to paragraph and paragraph to sentence/bracket levels. We compared the results from four journals over the years 2010-2015: Cell, the European Journal of Operational Research, Physics Letters B, and Research Policy, with consistent general outcomes and some interesting differences. Our findings motivate the use of granular cocitation information as defined by meaningful units of text, with implications for, among others, the elaboration of maps of science and the retrieval of scholarly literature.
Content
Cf.: https://onlinelibrary.wiley.com/doi/abs/10.1002/asi.23981.
Theme
Informetrics

Similar documents (author)

  1. Boyack, K.W.; Börner, K.: Indicator-assisted evaluation and funding of research : visualizing the influence of grants on the number and citation counts of research papers (2003) 1.62
    1.6195996 = sum of:
      1.6195996 = product of:
        3.2391992 = sum of:
          3.2391992 = weight(author_txt:boyack in 2472) [ClassicSimilarity], result of:
            3.2391992 = score(doc=2472,freq=1.0), product of:
              0.7128727 = queryWeight, product of:
                1.0082217 = boost
                9.087735 = idf(docFreq=12, maxDocs=42306)
                0.07780369 = queryNorm
              4.5438676 = fieldWeight in 2472, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.087735 = idf(docFreq=12, maxDocs=42306)
                0.5 = fieldNorm(doc=2472)
        0.5 = coord(1/2)
    
  2. Klavans, R.; Boyack, K.W.: Identifying a better measure of relatedness for mapping science (2006) 1.62
    1.6195996 = sum of:
      1.6195996 = product of:
        3.2391992 = sum of:
          3.2391992 = weight(author_txt:boyack in 253) [ClassicSimilarity], result of:
            3.2391992 = score(doc=253,freq=1.0), product of:
              0.7128727 = queryWeight, product of:
                1.0082217 = boost
                9.087735 = idf(docFreq=12, maxDocs=42306)
                0.07780369 = queryNorm
              4.5438676 = fieldWeight in 253, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.087735 = idf(docFreq=12, maxDocs=42306)
                0.5 = fieldNorm(doc=253)
        0.5 = coord(1/2)
    
  3. Klavans, R.; Boyack, K.W.: Toward a consensus map of science (2009) 1.62
    1.6195996 = sum of:
      1.6195996 = product of:
        3.2391992 = sum of:
          3.2391992 = weight(author_txt:boyack in 556) [ClassicSimilarity], result of:
            3.2391992 = score(doc=556,freq=1.0), product of:
              0.7128727 = queryWeight, product of:
                1.0082217 = boost
                9.087735 = idf(docFreq=12, maxDocs=42306)
                0.07780369 = queryNorm
              4.5438676 = fieldWeight in 556, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.087735 = idf(docFreq=12, maxDocs=42306)
                0.5 = fieldNorm(doc=556)
        0.5 = coord(1/2)
    
  4. Boyack, K.W.; Klavans, R.: Co-citation analysis, bibliographic coupling, and direct citation : which citation approach represents the research front most accurately? (2010) 1.62
    1.6195996 = sum of:
      1.6195996 = product of:
        3.2391992 = sum of:
          3.2391992 = weight(author_txt:boyack in 1112) [ClassicSimilarity], result of:
            3.2391992 = score(doc=1112,freq=1.0), product of:
              0.7128727 = queryWeight, product of:
                1.0082217 = boost
                9.087735 = idf(docFreq=12, maxDocs=42306)
                0.07780369 = queryNorm
              4.5438676 = fieldWeight in 1112, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.087735 = idf(docFreq=12, maxDocs=42306)
                0.5 = fieldNorm(doc=1112)
        0.5 = coord(1/2)
    
  5. Klavans, R.; Boyack, K.W.: Using global mapping to create more accurate document-level maps of research fields (2011) 1.62
    1.6195996 = sum of:
      1.6195996 = product of:
        3.2391992 = sum of:
          3.2391992 = weight(author_txt:boyack in 1957) [ClassicSimilarity], result of:
            3.2391992 = score(doc=1957,freq=1.0), product of:
              0.7128727 = queryWeight, product of:
                1.0082217 = boost
                9.087735 = idf(docFreq=12, maxDocs=42306)
                0.07780369 = queryNorm
              4.5438676 = fieldWeight in 1957, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.087735 = idf(docFreq=12, maxDocs=42306)
                0.5 = fieldNorm(doc=1957)
        0.5 = coord(1/2)
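
The five author-based scores above are identical because each hit matches the single term "boyack" once, with the same field norm. As a rough check of the arithmetic, the sketch below recomputes one of them, assuming the usual ClassicSimilarity definitions idf = 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq). The function names are hypothetical and chosen for illustration; the numeric inputs are copied from the explain tree.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency, assuming the ClassicSimilarity formula."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    """Score of one matching term: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm            # 0.7128727 in the tree
    field_weight = math.sqrt(freq) * idf * field_norm  # 4.5438676 in the tree
    return query_weight * field_weight


# Values taken from the explain output for author_txt:boyack
term_score = classic_term_score(
    freq=1.0,
    doc_freq=12,
    max_docs=42306,
    boost=1.0082217,
    query_norm=0.07780369,
    field_norm=0.5,
)

# coord(1/2): one of the two query clauses matched
final_score = term_score * (1 / 2)
print(round(term_score, 4), round(final_score, 4))
```

Small differences in the last digits are expected, since the explain output was computed in single precision.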
    

Similar documents (content)

  1. Wang, F.; Wolfram, D.: Assessment of journal similarity based on citing discipline analysis (2015) 0.15
    0.15473679 = sum of:
      0.15473679 = product of:
        0.7736839 = sum of:
          0.011957081 = weight(abstract_txt:different in 3850) [ClassicSimilarity], result of:
            0.011957081 = score(doc=3850,freq=1.0), product of:
              0.05164629 = queryWeight, product of:
                3.704299 = idf(docFreq=2830, maxDocs=42306)
                0.0139422575 = queryNorm
              0.23151869 = fieldWeight in 3850, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.704299 = idf(docFreq=2830, maxDocs=42306)
                0.0625 = fieldNorm(doc=3850)
          0.007720029 = weight(abstract_txt:from in 3850) [ClassicSimilarity], result of:
            0.007720029 = score(doc=3850,freq=1.0), product of:
              0.044163693 = queryWeight, product of:
                1.1325536 = boost
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.0139422575 = queryNorm
              0.17480488 = fieldWeight in 3850, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.0625 = fieldNorm(doc=3850)
          0.113794625 = weight(abstract_txt:journal in 3850) [ClassicSimilarity], result of:
            0.113794625 = score(doc=3850,freq=3.0), product of:
              0.20261571 = queryWeight, product of:
                2.8011217 = boost
                5.188096 = idf(docFreq=641, maxDocs=42306)
                0.0139422575 = queryNorm
              0.56162786 = fieldWeight in 3850, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.188096 = idf(docFreq=641, maxDocs=42306)
                0.0625 = fieldNorm(doc=3850)
          0.18501455 = weight(abstract_txt:similarity in 3850) [ClassicSimilarity], result of:
            0.18501455 = score(doc=3850,freq=4.0), product of:
              0.2545362 = queryWeight, product of:
                3.1395705 = boost
                5.814954 = idf(docFreq=342, maxDocs=42306)
                0.0139422575 = queryNorm
              0.7268692 = fieldWeight in 3850, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.814954 = idf(docFreq=342, maxDocs=42306)
                0.0625 = fieldNorm(doc=3850)
          0.45519763 = weight(abstract_txt:cocitation in 3850) [ClassicSimilarity], result of:
            0.45519763 = score(doc=3850,freq=3.0), product of:
              0.54999906 = queryWeight, product of:
                5.1597824 = boost
                7.645351 = idf(docFreq=54, maxDocs=42306)
                0.0139422575 = queryNorm
              0.8276335 = fieldWeight in 3850, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.645351 = idf(docFreq=54, maxDocs=42306)
                0.0625 = fieldNorm(doc=3850)
        0.2 = coord(5/25)
    
  2. White, H.D.: Author cocitation analysis and Pearson's r (2003) 0.14
    0.14004892 = sum of:
      0.14004892 = product of:
        0.7002446 = sum of:
          0.011795628 = weight(abstract_txt:article in 3120) [ClassicSimilarity], result of:
            0.011795628 = score(doc=3120,freq=1.0), product of:
              0.055945393 = queryWeight, product of:
                1.0407888 = boost
                3.855393 = idf(docFreq=2433, maxDocs=42306)
                0.0139422575 = queryNorm
              0.2108418 = fieldWeight in 3120, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.855393 = idf(docFreq=2433, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3120)
          0.006755025 = weight(abstract_txt:from in 3120) [ClassicSimilarity], result of:
            0.006755025 = score(doc=3120,freq=1.0), product of:
              0.044163693 = queryWeight, product of:
                1.1325536 = boost
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.0139422575 = queryNorm
              0.15295427 = fieldWeight in 3120, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3120)
          0.0815802 = weight(abstract_txt:cocited in 3120) [ClassicSimilarity], result of:
            0.0815802 = score(doc=3120,freq=1.0), product of:
              0.16118705 = queryWeight, product of:
                1.2491958 = boost
                9.254789 = idf(docFreq=10, maxDocs=42306)
                0.0139422575 = queryNorm
              0.5061213 = fieldWeight in 3120, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.254789 = idf(docFreq=10, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3120)
          0.14019889 = weight(abstract_txt:similarity in 3120) [ClassicSimilarity], result of:
            0.14019889 = score(doc=3120,freq=3.0), product of:
              0.2545362 = queryWeight, product of:
                3.1395705 = boost
                5.814954 = idf(docFreq=342, maxDocs=42306)
                0.0139422575 = queryNorm
              0.55080134 = fieldWeight in 3120, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.814954 = idf(docFreq=342, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3120)
          0.45991486 = weight(abstract_txt:cocitation in 3120) [ClassicSimilarity], result of:
            0.45991486 = score(doc=3120,freq=4.0), product of:
              0.54999906 = queryWeight, product of:
                5.1597824 = boost
                7.645351 = idf(docFreq=54, maxDocs=42306)
                0.0139422575 = queryNorm
              0.83621025 = fieldWeight in 3120, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.645351 = idf(docFreq=54, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3120)
        0.2 = coord(5/25)
    
  3. White, H.D.: Pathfinder networks and author cocitation analysis : a remapping of paradigmatic information scientists (2003) 0.13
    0.13079605 = sum of:
      0.13079605 = product of:
        0.6539802 = sum of:
          0.019064613 = weight(abstract_txt:article in 2460) [ClassicSimilarity], result of:
            0.019064613 = score(doc=2460,freq=2.0), product of:
              0.055945393 = queryWeight, product of:
                1.0407888 = boost
                3.855393 = idf(docFreq=2433, maxDocs=42306)
                0.0139422575 = queryNorm
              0.3407718 = fieldWeight in 2460, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.855393 = idf(docFreq=2433, maxDocs=42306)
                0.0625 = fieldNorm(doc=2460)
          0.01091777 = weight(abstract_txt:from in 2460) [ClassicSimilarity], result of:
            0.01091777 = score(doc=2460,freq=2.0), product of:
              0.044163693 = queryWeight, product of:
                1.1325536 = boost
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.0139422575 = queryNorm
              0.24721143 = fieldWeight in 2460, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.0625 = fieldNorm(doc=2460)
          0.09323452 = weight(abstract_txt:cocited in 2460) [ClassicSimilarity], result of:
            0.09323452 = score(doc=2460,freq=1.0), product of:
              0.16118705 = queryWeight, product of:
                1.2491958 = boost
                9.254789 = idf(docFreq=10, maxDocs=42306)
                0.0139422575 = queryNorm
              0.57842433 = fieldWeight in 2460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.254789 = idf(docFreq=10, maxDocs=42306)
                0.0625 = fieldNorm(doc=2460)
          0.075565666 = weight(abstract_txt:pairs in 2460) [ClassicSimilarity], result of:
            0.075565666 = score(doc=2460,freq=1.0), product of:
              0.1765382 = queryWeight, product of:
                1.8488419 = boost
                6.8486633 = idf(docFreq=121, maxDocs=42306)
                0.0139422575 = queryNorm
              0.42804146 = fieldWeight in 2460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8486633 = idf(docFreq=121, maxDocs=42306)
                0.0625 = fieldNorm(doc=2460)
          0.45519763 = weight(abstract_txt:cocitation in 2460) [ClassicSimilarity], result of:
            0.45519763 = score(doc=2460,freq=3.0), product of:
              0.54999906 = queryWeight, product of:
                5.1597824 = boost
                7.645351 = idf(docFreq=54, maxDocs=42306)
                0.0139422575 = queryNorm
              0.8276335 = fieldWeight in 2460, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.645351 = idf(docFreq=54, maxDocs=42306)
                0.0625 = fieldNorm(doc=2460)
        0.2 = coord(5/25)
    
  4. Pirkola, A.; Jarvelin, K.: ¬The effect of anaphor and ellipsis resolution on proximity searching in a text database (1995) 0.12
    0.12496954 = sum of:
      0.12496954 = product of:
        0.6248477 = sum of:
          0.011957081 = weight(abstract_txt:different in 4157) [ClassicSimilarity], result of:
            0.011957081 = score(doc=4157,freq=1.0), product of:
              0.05164629 = queryWeight, product of:
                3.704299 = idf(docFreq=2830, maxDocs=42306)
                0.0139422575 = queryNorm
              0.23151869 = fieldWeight in 4157, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.704299 = idf(docFreq=2830, maxDocs=42306)
                0.0625 = fieldNorm(doc=4157)
          0.013480717 = weight(abstract_txt:article in 4157) [ClassicSimilarity], result of:
            0.013480717 = score(doc=4157,freq=1.0), product of:
              0.055945393 = queryWeight, product of:
                1.0407888 = boost
                3.855393 = idf(docFreq=2433, maxDocs=42306)
                0.0139422575 = queryNorm
              0.24096206 = fieldWeight in 4157, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.855393 = idf(docFreq=2433, maxDocs=42306)
                0.0625 = fieldNorm(doc=4157)
          0.106865995 = weight(abstract_txt:pairs in 4157) [ClassicSimilarity], result of:
            0.106865995 = score(doc=4157,freq=2.0), product of:
              0.1765382 = queryWeight, product of:
                1.8488419 = boost
                6.8486633 = idf(docFreq=121, maxDocs=42306)
                0.0139422575 = queryNorm
              0.60534203 = fieldWeight in 4157, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8486633 = idf(docFreq=121, maxDocs=42306)
                0.0625 = fieldNorm(doc=4157)
          0.108035736 = weight(abstract_txt:sentence in 4157) [ClassicSimilarity], result of:
            0.108035736 = score(doc=4157,freq=2.0), product of:
              0.17782411 = queryWeight, product of:
                1.8555632 = boost
                6.873561 = idf(docFreq=118, maxDocs=42306)
                0.0139422575 = queryNorm
              0.6075427 = fieldWeight in 4157, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.873561 = idf(docFreq=118, maxDocs=42306)
                0.0625 = fieldNorm(doc=4157)
          0.3845082 = weight(abstract_txt:paragraph in 4157) [ClassicSimilarity], result of:
            0.3845082 = score(doc=4157,freq=2.0), product of:
              0.47451124 = queryWeight, product of:
                3.7123535 = boost
                9.167778 = idf(docFreq=11, maxDocs=42306)
                0.0139422575 = queryNorm
              0.8103247 = fieldWeight in 4157, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.167778 = idf(docFreq=11, maxDocs=42306)
                0.0625 = fieldNorm(doc=4157)
        0.2 = coord(5/25)
    
  5. Tang, X.; Yang, C.C.; Song, M.: Understanding the evolution of multiple scientific research domains using a content and network approach (2013) 0.12
    0.12107989 = sum of:
      0.12107989 = product of:
        0.50449955 = sum of:
          0.011957081 = weight(abstract_txt:different in 2745) [ClassicSimilarity], result of:
            0.011957081 = score(doc=2745,freq=1.0), product of:
              0.05164629 = queryWeight, product of:
                3.704299 = idf(docFreq=2830, maxDocs=42306)
                0.0139422575 = queryNorm
              0.23151869 = fieldWeight in 2745, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.704299 = idf(docFreq=2830, maxDocs=42306)
                0.0625 = fieldNorm(doc=2745)
          0.04830593 = weight(abstract_txt:closer in 2745) [ClassicSimilarity], result of:
            0.04830593 = score(doc=2745,freq=1.0), product of:
              0.10397908 = queryWeight, product of:
                1.0033176 = boost
                7.4331765 = idf(docFreq=67, maxDocs=42306)
                0.0139422575 = queryNorm
              0.46457353 = fieldWeight in 2745, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4331765 = idf(docFreq=67, maxDocs=42306)
                0.0625 = fieldNorm(doc=2745)
          0.013480717 = weight(abstract_txt:article in 2745) [ClassicSimilarity], result of:
            0.013480717 = score(doc=2745,freq=1.0), product of:
              0.055945393 = queryWeight, product of:
                1.0407888 = boost
                3.855393 = idf(docFreq=2433, maxDocs=42306)
                0.0139422575 = queryNorm
              0.24096206 = fieldWeight in 2745, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.855393 = idf(docFreq=2433, maxDocs=42306)
                0.0625 = fieldNorm(doc=2745)
          0.007720029 = weight(abstract_txt:from in 2745) [ClassicSimilarity], result of:
            0.007720029 = score(doc=2745,freq=1.0), product of:
              0.044163693 = queryWeight, product of:
                1.1325536 = boost
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.0139422575 = queryNorm
              0.17480488 = fieldWeight in 2745, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.0625 = fieldNorm(doc=2745)
          0.16022728 = weight(abstract_txt:similarity in 2745) [ClassicSimilarity], result of:
            0.16022728 = score(doc=2745,freq=3.0), product of:
              0.2545362 = queryWeight, product of:
                3.1395705 = boost
                5.814954 = idf(docFreq=342, maxDocs=42306)
                0.0139422575 = queryNorm
              0.6294872 = fieldWeight in 2745, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.814954 = idf(docFreq=342, maxDocs=42306)
                0.0625 = fieldNorm(doc=2745)
          0.2628085 = weight(abstract_txt:cocitation in 2745) [ClassicSimilarity], result of:
            0.2628085 = score(doc=2745,freq=1.0), product of:
              0.54999906 = queryWeight, product of:
                5.1597824 = boost
                7.645351 = idf(docFreq=54, maxDocs=42306)
                0.0139422575 = queryNorm
              0.47783443 = fieldWeight in 2745, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.645351 = idf(docFreq=54, maxDocs=42306)
                0.0625 = fieldNorm(doc=2745)
        0.24 = coord(6/25)
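
In the content-based list, each final score is the sum of the per-term weights multiplied by the coordination factor (matched query terms divided by the 25 terms in the query). The sketch below reproduces that aggregation from the per-term scores reported above; the dictionary layout and the helper name `combine` are assumptions made for illustration only.

```python
# Per-term scores copied from the explain trees, keyed by document id.
# Each entry: (list of term scores, number of matched terms, total query terms)
hits = {
    3850: ([0.011957081, 0.007720029, 0.113794625, 0.18501455, 0.45519763], 5, 25),
    3120: ([0.011795628, 0.006755025, 0.0815802, 0.14019889, 0.45991486], 5, 25),
    2460: ([0.019064613, 0.01091777, 0.09323452, 0.075565666, 0.45519763], 5, 25),
    4157: ([0.011957081, 0.013480717, 0.106865995, 0.108035736, 0.3845082], 5, 25),
    2745: ([0.011957081, 0.04830593, 0.013480717, 0.007720029, 0.16022728,
            0.2628085], 6, 25),
}


def combine(term_scores, matched, total):
    """Sum the term scores and scale by coord(matched/total)."""
    return sum(term_scores) * (matched / total)


scores = {doc: combine(*entry) for doc, entry in hits.items()}

# Rank documents by descending score, as in the list above
ranking = sorted(scores, key=scores.get, reverse=True)
for doc in ranking:
    print(doc, round(scores[doc], 4))
```

Document 2745 matches six terms rather than five, so it gets a larger coordination factor (0.24 instead of 0.2), yet it still ranks last because its summed term weights are the lowest of the five.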