Document (#40539)

Author
Leydesdorff, L.
Nerghes, A.
Title
Co-word maps and topic modeling : a comparison using small and medium-sized corpora (N?<?1.000)
Source
Journal of the Association for Information Science and Technology. 68(2017) no.4, S.1024-1035
Year
2017
Abstract
Induced by "big data," "topic modeling" has become an attractive alternative to mapping co-words in terms of co-occurrences and co-absences using network techniques. Does topic modeling provide an alternative for co-word mapping in research practices using moderately sized document collections? We return to the word/document matrix using first a single text with a strong argument ("The Leiden Manifesto") and then upscale to a sample of moderate size (n?=?687) to study the pros and cons of the two approaches in terms of the resulting possibilities for making semantic maps that can serve an argument. The results from co-word mapping (using two different routines) versus topic modeling are significantly uncorrelated. Whereas components in the co-word maps can easily be designated, the topic models provide sets of words that are very differently organized. In these samples, the topic models seem to reveal similarities other than semantic ones (e.g., linguistic ones). In other words, topic modeling does not replace co-word mapping in small and medium-sized sets; but the paper leaves open the possibility that topic modeling would work well for the semantic mapping of large sets.
Content
Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23740/full.
Theme
Informetrie

Similar documents (author)

  1. Leydesdorff, L.: ¬The generation of aggregated journal-journal citation maps on the basis of the CD-ROM version of the Science Citation Index (1994) 4.51
    4.512219 = sum of:
      4.512219 = weight(author_txt:leydesdorff in 8281) [ClassicSimilarity], result of:
        4.512219 = fieldWeight in 8281, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.2195506 = idf(docFreq=87, maxDocs=44218)
          0.625 = fieldNorm(doc=8281)
    
  2. Leydesdorff, L.: Why words and co-word cannot map the development of the science (1997) 4.51
    4.512219 = sum of:
      4.512219 = weight(author_txt:leydesdorff in 147) [ClassicSimilarity], result of:
        4.512219 = fieldWeight in 147, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.2195506 = idf(docFreq=87, maxDocs=44218)
          0.625 = fieldNorm(doc=147)
    
  3. Leydesdorff, L.: Theories of citation? (1999) 4.51
    4.512219 = sum of:
      4.512219 = weight(author_txt:leydesdorff in 5130) [ClassicSimilarity], result of:
        4.512219 = fieldWeight in 5130, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.2195506 = idf(docFreq=87, maxDocs=44218)
          0.625 = fieldNorm(doc=5130)
    
  4. Leydesdorff, L.: ¬A sociological theory of communication : the self-organization of the knowledge-based society (2001) 4.51
    4.512219 = sum of:
      4.512219 = weight(author_txt:leydesdorff in 184) [ClassicSimilarity], result of:
        4.512219 = fieldWeight in 184, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.2195506 = idf(docFreq=87, maxDocs=44218)
          0.625 = fieldNorm(doc=184)
    
  5. Leydesdorff, L.: Dynamic and evolutionary updates of classificatory schemes in scientific journal structures (2002) 4.51
    4.512219 = sum of:
      4.512219 = weight(author_txt:leydesdorff in 1249) [ClassicSimilarity], result of:
        4.512219 = fieldWeight in 1249, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.2195506 = idf(docFreq=87, maxDocs=44218)
          0.625 = fieldNorm(doc=1249)
    

Similar documents (content)

  1. Li, X.; Zhang, A.; Li, C.; Ouyang, J.; Cai, Y.: Exploring coherent topics by topic modeling with term weighting (2018) 0.29
    0.28736332 = sum of:
      0.28736332 = product of:
        1.0262976 = sum of:
          0.018656787 = weight(abstract_txt:document in 5045) [ClassicSimilarity], result of:
            0.018656787 = score(doc=5045,freq=1.0), product of:
              0.06954014 = queryWeight, product of:
                1.0469292 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.015473801 = queryNorm
              0.26828802 = fieldWeight in 5045, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=5045)
          0.0235868 = weight(abstract_txt:models in 5045) [ClassicSimilarity], result of:
            0.0235868 = score(doc=5045,freq=1.0), product of:
              0.081306204 = queryWeight, product of:
                1.132039 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.015473801 = queryNorm
              0.2900984 = fieldWeight in 5045, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.0625 = fieldNorm(doc=5045)
          0.049322635 = weight(abstract_txt:sets in 5045) [ClassicSimilarity], result of:
            0.049322635 = score(doc=5045,freq=1.0), product of:
              0.15219682 = queryWeight, product of:
                1.8969153 = boost
                5.185142 = idf(docFreq=672, maxDocs=44218)
                0.015473801 = queryNorm
              0.32407138 = fieldWeight in 5045, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.185142 = idf(docFreq=672, maxDocs=44218)
                0.0625 = fieldNorm(doc=5045)
          0.171616 = weight(abstract_txt:words in 5045) [ClassicSimilarity], result of:
            0.171616 = score(doc=5045,freq=10.0), product of:
              0.16221087 = queryWeight, product of:
                1.9583266 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.015473801 = queryNorm
              1.0579809 = fieldWeight in 5045, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.0625 = fieldNorm(doc=5045)
          0.19681056 = weight(abstract_txt:word in 5045) [ClassicSimilarity], result of:
            0.19681056 = score(doc=5045,freq=3.0), product of:
              0.33448496 = queryWeight, product of:
                3.9769347 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.015473801 = queryNorm
              0.5883988 = fieldWeight in 5045, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.0625 = fieldNorm(doc=5045)
          0.26649874 = weight(abstract_txt:modeling in 5045) [ClassicSimilarity], result of:
            0.26649874 = score(doc=5045,freq=3.0), product of:
              0.40939367 = queryWeight, product of:
                4.3997774 = boost
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.015473801 = queryNorm
              0.6509596 = fieldWeight in 5045, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.0625 = fieldNorm(doc=5045)
          0.29980612 = weight(abstract_txt:topic in 5045) [ClassicSimilarity], result of:
            0.29980612 = score(doc=5045,freq=6.0), product of:
              0.38684848 = queryWeight, product of:
                4.9385557 = boost
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.015473801 = queryNorm
              0.7749962 = fieldWeight in 5045, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.0625 = fieldNorm(doc=5045)
        0.28 = coord(7/25)
    
  2. Lu, K.; Wolfram, D.: Measuring author research relatedness : a comparison of word-based, topic-based, and author cocitation approaches (2012) 0.17
    0.16593707 = sum of:
      0.16593707 = product of:
        0.82968533 = sum of:
          0.042420503 = weight(abstract_txt:using in 453) [ClassicSimilarity], result of:
            0.042420503 = score(doc=453,freq=3.0), product of:
              0.11315345 = queryWeight, product of:
                2.11156 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.015473801 = queryNorm
              0.37489358 = fieldWeight in 453, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.0625 = fieldNorm(doc=453)
          0.1608641 = weight(abstract_txt:mapping in 453) [ClassicSimilarity], result of:
            0.1608641 = score(doc=453,freq=2.0), product of:
              0.31498298 = queryWeight, product of:
                3.5230036 = boost
                5.777993 = idf(docFreq=371, maxDocs=44218)
                0.015473801 = queryNorm
              0.51070726 = fieldWeight in 453, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.777993 = idf(docFreq=371, maxDocs=44218)
                0.0625 = fieldNorm(doc=453)
          0.19681056 = weight(abstract_txt:word in 453) [ClassicSimilarity], result of:
            0.19681056 = score(doc=453,freq=3.0), product of:
              0.33448496 = queryWeight, product of:
                3.9769347 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.015473801 = queryNorm
              0.5883988 = fieldWeight in 453, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.0625 = fieldNorm(doc=453)
          0.21759531 = weight(abstract_txt:modeling in 453) [ClassicSimilarity], result of:
            0.21759531 = score(doc=453,freq=2.0), product of:
              0.40939367 = queryWeight, product of:
                4.3997774 = boost
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.015473801 = queryNorm
              0.5315063 = fieldWeight in 453, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.0625 = fieldNorm(doc=453)
          0.21199492 = weight(abstract_txt:topic in 453) [ClassicSimilarity], result of:
            0.21199492 = score(doc=453,freq=3.0), product of:
              0.38684848 = queryWeight, product of:
                4.9385557 = boost
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.015473801 = queryNorm
              0.54800504 = fieldWeight in 453, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.0625 = fieldNorm(doc=453)
        0.2 = coord(5/25)
    
  3. Siebers, Q.H.J.F.: Implementing inference rules in the Topic maps model (2006) 0.16
    0.1573092 = sum of:
      0.1573092 = product of:
        0.786546 = sum of:
          0.030614361 = weight(abstract_txt:using in 4730) [ClassicSimilarity], result of:
            0.030614361 = score(doc=4730,freq=1.0), product of:
              0.11315345 = queryWeight, product of:
                2.11156 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.015473801 = queryNorm
              0.27055615 = fieldWeight in 4730, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.078125 = fieldNorm(doc=4730)
          0.15642393 = weight(abstract_txt:maps in 4730) [ClassicSimilarity], result of:
            0.15642393 = score(doc=4730,freq=3.0), product of:
              0.19630429 = queryWeight, product of:
                2.1543193 = boost
                5.888745 = idf(docFreq=332, maxDocs=44218)
                0.015473801 = queryNorm
              0.7968441 = fieldWeight in 4730, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.888745 = idf(docFreq=332, maxDocs=44218)
                0.078125 = fieldNorm(doc=4730)
          0.1421851 = weight(abstract_txt:mapping in 4730) [ClassicSimilarity], result of:
            0.1421851 = score(doc=4730,freq=1.0), product of:
              0.31498298 = queryWeight, product of:
                3.5230036 = boost
                5.777993 = idf(docFreq=371, maxDocs=44218)
                0.015473801 = queryNorm
              0.4514057 = fieldWeight in 4730, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.777993 = idf(docFreq=371, maxDocs=44218)
                0.078125 = fieldNorm(doc=4730)
          0.19232891 = weight(abstract_txt:modeling in 4730) [ClassicSimilarity], result of:
            0.19232891 = score(doc=4730,freq=1.0), product of:
              0.40939367 = queryWeight, product of:
                4.3997774 = boost
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.015473801 = queryNorm
              0.46978965 = fieldWeight in 4730, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.078125 = fieldNorm(doc=4730)
          0.26499367 = weight(abstract_txt:topic in 4730) [ClassicSimilarity], result of:
            0.26499367 = score(doc=4730,freq=3.0), product of:
              0.38684848 = queryWeight, product of:
                4.9385557 = boost
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.015473801 = queryNorm
              0.6850063 = fieldWeight in 4730, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.078125 = fieldNorm(doc=4730)
        0.2 = coord(5/25)
    
  4. Liu, Y.; Xu, S.; Blanchard, E.: ¬A local context-aware LDA model for topic modeling in a document network (2017) 0.15
    0.1499736 = sum of:
      0.1499736 = product of:
        0.62489 = sum of:
          0.052769363 = weight(abstract_txt:document in 3642) [ClassicSimilarity], result of:
            0.052769363 = score(doc=3642,freq=8.0), product of:
              0.06954014 = queryWeight, product of:
                1.0469292 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.015473801 = queryNorm
              0.7588331 = fieldWeight in 3642, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=3642)
          0.03335677 = weight(abstract_txt:models in 3642) [ClassicSimilarity], result of:
            0.03335677 = score(doc=3642,freq=2.0), product of:
              0.081306204 = queryWeight, product of:
                1.132039 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.015473801 = queryNorm
              0.4102611 = fieldWeight in 3642, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.0625 = fieldNorm(doc=3642)
          0.059851043 = weight(abstract_txt:ones in 3642) [ClassicSimilarity], result of:
            0.059851043 = score(doc=3642,freq=1.0), product of:
              0.15126048 = queryWeight, product of:
                1.5440532 = boost
                6.330911 = idf(docFreq=213, maxDocs=44218)
                0.015473801 = queryNorm
              0.39568195 = fieldWeight in 3642, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.330911 = idf(docFreq=213, maxDocs=44218)
                0.0625 = fieldNorm(doc=3642)
          0.049322635 = weight(abstract_txt:sets in 3642) [ClassicSimilarity], result of:
            0.049322635 = score(doc=3642,freq=1.0), product of:
              0.15219682 = queryWeight, product of:
                1.8969153 = boost
                5.185142 = idf(docFreq=672, maxDocs=44218)
                0.015473801 = queryNorm
              0.32407138 = fieldWeight in 3642, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.185142 = idf(docFreq=672, maxDocs=44218)
                0.0625 = fieldNorm(doc=3642)
          0.21759531 = weight(abstract_txt:modeling in 3642) [ClassicSimilarity], result of:
            0.21759531 = score(doc=3642,freq=2.0), product of:
              0.40939367 = queryWeight, product of:
                4.3997774 = boost
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.015473801 = queryNorm
              0.5315063 = fieldWeight in 3642, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.0625 = fieldNorm(doc=3642)
          0.21199492 = weight(abstract_txt:topic in 3642) [ClassicSimilarity], result of:
            0.21199492 = score(doc=3642,freq=3.0), product of:
              0.38684848 = queryWeight, product of:
                4.9385557 = boost
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.015473801 = queryNorm
              0.54800504 = fieldWeight in 3642, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.0625 = fieldNorm(doc=3642)
        0.24 = coord(6/25)
    
  5. Potha, N.; Stamatatos, E.: Improving author verification based on topic modeling (2019) 0.14
    0.14442037 = sum of:
      0.14442037 = product of:
        0.60175157 = sum of:
          0.018656787 = weight(abstract_txt:document in 5385) [ClassicSimilarity], result of:
            0.018656787 = score(doc=5385,freq=1.0), product of:
              0.06954014 = queryWeight, product of:
                1.0469292 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.015473801 = queryNorm
              0.26828802 = fieldWeight in 5385, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=5385)
          0.0235868 = weight(abstract_txt:models in 5385) [ClassicSimilarity], result of:
            0.0235868 = score(doc=5385,freq=1.0), product of:
              0.081306204 = queryWeight, product of:
                1.132039 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.015473801 = queryNorm
              0.2900984 = fieldWeight in 5385, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.0625 = fieldNorm(doc=5385)
          0.031691723 = weight(abstract_txt:semantic in 5385) [ClassicSimilarity], result of:
            0.031691723 = score(doc=5385,freq=1.0), product of:
              0.113328375 = queryWeight, product of:
                1.6368711 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.015473801 = queryNorm
              0.2796451 = fieldWeight in 5385, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0625 = fieldNorm(doc=5385)
          0.049322635 = weight(abstract_txt:sets in 5385) [ClassicSimilarity], result of:
            0.049322635 = score(doc=5385,freq=1.0), product of:
              0.15219682 = queryWeight, product of:
                1.8969153 = boost
                5.185142 = idf(docFreq=672, maxDocs=44218)
                0.015473801 = queryNorm
              0.32407138 = fieldWeight in 5385, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.185142 = idf(docFreq=672, maxDocs=44218)
                0.0625 = fieldNorm(doc=5385)
          0.26649874 = weight(abstract_txt:modeling in 5385) [ClassicSimilarity], result of:
            0.26649874 = score(doc=5385,freq=3.0), product of:
              0.40939367 = queryWeight, product of:
                4.3997774 = boost
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.015473801 = queryNorm
              0.6509596 = fieldWeight in 5385, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.0625 = fieldNorm(doc=5385)
          0.21199492 = weight(abstract_txt:topic in 5385) [ClassicSimilarity], result of:
            0.21199492 = score(doc=5385,freq=3.0), product of:
              0.38684848 = queryWeight, product of:
                4.9385557 = boost
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.015473801 = queryNorm
              0.54800504 = fieldWeight in 5385, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.0625 = fieldNorm(doc=5385)
        0.24 = coord(6/25)