Document (#43621)

Author
Abdo, A.H.
Cointet, J.-P.
Bourret, P.
Cambrosio, A,
Title
Domain-topic models with chained dimensions : charting an emergent domain of a major oncology conference
Source
Journal of the Association for Information Science and Technology. 73(2022) no.7, S.992-1011
Year
2022
Abstract
This paper presents a contribution to the study of bibliographic corpora through science mapping. From a graph representation of documents and their textual dimension, stochastic block models can provide a simultaneous clustering of documents and words that we call a domain-topic model. Previous work investigated the resulting topics, or word clusters, while ours focuses on the study of the document clusters we call domains. To enable the description and interactive navigation of domains, we introduce measures and interfaces that consider the structure of the model to relate both types of clusters. We then present a procedure that extends the block model to cluster metadata attributes of documents, which we call a domain-chained model, noting that our measures and interfaces transpose to metadata clusters. We provide an example application to a corpus relevant to current science, technology and society (STS) research and an interesting case for our approach: the abstracts presented between 1995 and 2017 at the American Society of Clinical Oncology Annual Meeting, the major oncology research conference. Through a sequence of domain-topic and domain-chained models, we identify and describe a group of domains that have notably grown through the last decades and which we relate to the establishment of "oncopolicy" as a major concern in oncology.
Content
Vgl.: https://asistdl.onlinelibrary.wiley.com/doi/10.1002/asi.24606. Vgl.: https://doi.org/10.1002/asi.24606.

Similar documents (content)

  1. Hjoerland, B.; Hartel, J.: Introduction to a Special Issue of Knowledge Organization (2003) 0.22
    0.21611927 = sum of:
      0.21611927 = product of:
        0.49118018 = sum of:
          0.018938877 = weight(abstract_txt:science in 3013) [ClassicSimilarity], result of:
            0.018938877 = score(doc=3013,freq=6.0), product of:
              0.073237 = queryWeight, product of:
                1.017903 = boost
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.018635206 = queryNorm
              0.2585971 = fieldWeight in 3013, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.02734375 = fieldNorm(doc=3013)
          0.022096122 = weight(abstract_txt:metadata in 3013) [ClassicSimilarity], result of:
            0.022096122 = score(doc=3013,freq=2.0), product of:
              0.11706099 = queryWeight, product of:
                1.2869071 = boost
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.018635206 = queryNorm
              0.18875735 = fieldWeight in 3013, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.02734375 = fieldNorm(doc=3013)
          0.021392679 = weight(abstract_txt:society in 3013) [ClassicSimilarity], result of:
            0.021392679 = score(doc=3013,freq=1.0), product of:
              0.14434052 = queryWeight, product of:
                1.4290098 = boost
                5.4202437 = idf(docFreq=531, maxDocs=44218)
                0.018635206 = queryNorm
              0.1482098 = fieldWeight in 3013, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4202437 = idf(docFreq=531, maxDocs=44218)
                0.02734375 = fieldNorm(doc=3013)
          0.02313398 = weight(abstract_txt:conference in 3013) [ClassicSimilarity], result of:
            0.02313398 = score(doc=3013,freq=1.0), product of:
              0.15207052 = queryWeight, product of:
                1.4667754 = boost
                5.563489 = idf(docFreq=460, maxDocs=44218)
                0.018635206 = queryNorm
              0.15212665 = fieldWeight in 3013, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.563489 = idf(docFreq=460, maxDocs=44218)
                0.02734375 = fieldNorm(doc=3013)
          0.016109306 = weight(abstract_txt:that in 3013) [ClassicSimilarity], result of:
            0.016109306 = score(doc=3013,freq=13.0), product of:
              0.06895963 = queryWeight, product of:
                1.5617394 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.018635206 = queryNorm
              0.23360488 = fieldWeight in 3013, product of:
                3.6055512 = tf(freq=13.0), with freq of:
                  13.0 = termFreq=13.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.02734375 = fieldNorm(doc=3013)
          0.01300522 = weight(abstract_txt:through in 3013) [ClassicSimilarity], result of:
            0.01300522 = score(doc=3013,freq=1.0), product of:
              0.11857334 = queryWeight, product of:
                1.5862814 = boost
                4.011184 = idf(docFreq=2176, maxDocs=44218)
                0.018635206 = queryNorm
              0.10968082 = fieldWeight in 3013, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.011184 = idf(docFreq=2176, maxDocs=44218)
                0.02734375 = fieldNorm(doc=3013)
          0.020150965 = weight(abstract_txt:models in 3013) [ClassicSimilarity], result of:
            0.020150965 = score(doc=3013,freq=1.0), product of:
              0.15877147 = queryWeight, product of:
                1.8355784 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.018635206 = queryNorm
              0.12691805 = fieldWeight in 3013, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.02734375 = fieldNorm(doc=3013)
          0.0287708 = weight(abstract_txt:major in 3013) [ClassicSimilarity], result of:
            0.0287708 = score(doc=3013,freq=2.0), product of:
              0.15978397 = queryWeight, product of:
                1.841422 = boost
                4.6563506 = idf(docFreq=1141, maxDocs=44218)
                0.018635206 = queryNorm
              0.18006061 = fieldWeight in 3013, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6563506 = idf(docFreq=1141, maxDocs=44218)
                0.02734375 = fieldNorm(doc=3013)
          0.024068112 = weight(abstract_txt:model in 3013) [ClassicSimilarity], result of:
            0.024068112 = score(doc=3013,freq=2.0), product of:
              0.15613712 = queryWeight, product of:
                2.101886 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.018635206 = queryNorm
              0.15414727 = fieldWeight in 3013, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.02734375 = fieldNorm(doc=3013)
          0.12192542 = weight(abstract_txt:domains in 3013) [ClassicSimilarity], result of:
            0.12192542 = score(doc=3013,freq=12.0), product of:
              0.2302737 = queryWeight, product of:
                2.2105935 = boost
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.018635206 = queryNorm
              0.52948046 = fieldWeight in 3013, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.02734375 = fieldNorm(doc=3013)
          0.18158868 = weight(abstract_txt:domain in 3013) [ClassicSimilarity], result of:
            0.18158868 = score(doc=3013,freq=18.0), product of:
              0.33053714 = queryWeight, product of:
                3.7455177 = boost
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.018635206 = queryNorm
              0.5493745 = fieldWeight in 3013, product of:
                4.2426405 = tf(freq=18.0), with freq of:
                  18.0 = termFreq=18.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.02734375 = fieldNorm(doc=3013)
        0.44 = coord(11/25)
    
  2. Christensen, H.D.: ¬The framing of scientific domains : about UNISIST, domain analysis and art history (2014) 0.17
    0.17011967 = sum of:
      0.17011967 = product of:
        0.70883197 = sum of:
          0.12728998 = weight(abstract_txt:charting in 1773) [ClassicSimilarity], result of:
            0.12728998 = score(doc=1773,freq=1.0), product of:
              0.2167952 = queryWeight, product of:
                1.2383715 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.018635206 = queryNorm
              0.5871439 = fieldWeight in 1773, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.0625 = fieldNorm(doc=1773)
          0.017688366 = weight(abstract_txt:that in 1773) [ClassicSimilarity], result of:
            0.017688366 = score(doc=1773,freq=3.0), product of:
              0.06895963 = queryWeight, product of:
                1.5617394 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.018635206 = queryNorm
              0.2565032 = fieldWeight in 1773, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=1773)
          0.092118695 = weight(abstract_txt:models in 1773) [ClassicSimilarity], result of:
            0.092118695 = score(doc=1773,freq=4.0), product of:
              0.15877147 = queryWeight, product of:
                1.8355784 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.018635206 = queryNorm
              0.5801968 = fieldWeight in 1773, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.0625 = fieldNorm(doc=1773)
          0.038899943 = weight(abstract_txt:model in 1773) [ClassicSimilarity], result of:
            0.038899943 = score(doc=1773,freq=1.0), product of:
              0.15613712 = queryWeight, product of:
                2.101886 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.018635206 = queryNorm
              0.24913962 = fieldWeight in 1773, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.0625 = fieldNorm(doc=1773)
          0.13934334 = weight(abstract_txt:domains in 1773) [ClassicSimilarity], result of:
            0.13934334 = score(doc=1773,freq=3.0), product of:
              0.2302737 = queryWeight, product of:
                2.2105935 = boost
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.018635206 = queryNorm
              0.60512054 = fieldWeight in 1773, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.0625 = fieldNorm(doc=1773)
          0.29349166 = weight(abstract_txt:domain in 1773) [ClassicSimilarity], result of:
            0.29349166 = score(doc=1773,freq=9.0), product of:
              0.33053714 = queryWeight, product of:
                3.7455177 = boost
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.018635206 = queryNorm
              0.88792336 = fieldWeight in 1773, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.0625 = fieldNorm(doc=1773)
        0.24 = coord(6/25)
    
  3. Chen, T.T.: ¬The congruity between linkage-based factors and content-based clusters : an experimental study using multiple document corpora (2016) 0.17
    0.17007002 = sum of:
      0.17007002 = product of:
        0.70862514 = sum of:
          0.017672604 = weight(abstract_txt:science in 2775) [ClassicSimilarity], result of:
            0.017672604 = score(doc=2775,freq=1.0), product of:
              0.073237 = queryWeight, product of:
                1.017903 = boost
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.018635206 = queryNorm
              0.24130704 = fieldWeight in 2775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.0625 = fieldNorm(doc=2775)
          0.017688366 = weight(abstract_txt:that in 2775) [ClassicSimilarity], result of:
            0.017688366 = score(doc=2775,freq=3.0), product of:
              0.06895963 = queryWeight, product of:
                1.5617394 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.018635206 = queryNorm
              0.2565032 = fieldWeight in 2775, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=2775)
          0.07897706 = weight(abstract_txt:documents in 2775) [ClassicSimilarity], result of:
            0.07897706 = score(doc=2775,freq=6.0), product of:
              0.12517305 = queryWeight, product of:
                1.6298294 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.018635206 = queryNorm
              0.63094306 = fieldWeight in 2775, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=2775)
          0.08044992 = weight(abstract_txt:domains in 2775) [ClassicSimilarity], result of:
            0.08044992 = score(doc=2775,freq=1.0), product of:
              0.2302737 = queryWeight, product of:
                2.2105935 = boost
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.018635206 = queryNorm
              0.34936652 = fieldWeight in 2775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.0625 = fieldNorm(doc=2775)
          0.41600665 = weight(abstract_txt:clusters in 2775) [ClassicSimilarity], result of:
            0.41600665 = score(doc=2775,freq=6.0), product of:
              0.41708377 = queryWeight, product of:
                3.4353242 = boost
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.018635206 = queryNorm
              0.9974175 = fieldWeight in 2775, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.0625 = fieldNorm(doc=2775)
          0.09783056 = weight(abstract_txt:domain in 2775) [ClassicSimilarity], result of:
            0.09783056 = score(doc=2775,freq=1.0), product of:
              0.33053714 = queryWeight, product of:
                3.7455177 = boost
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.018635206 = queryNorm
              0.29597446 = fieldWeight in 2775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.0625 = fieldNorm(doc=2775)
        0.24 = coord(6/25)
    
  4. Yang, P.; Gao, W.; Tan, Q.; Wong, K.-F.: ¬A link-bridged topic model for cross-domain document classification (2013) 0.16
    0.16068782 = sum of:
      0.16068782 = product of:
        0.6695326 = sum of:
          0.01444249 = weight(abstract_txt:that in 2706) [ClassicSimilarity], result of:
            0.01444249 = score(doc=2706,freq=2.0), product of:
              0.06895963 = queryWeight, product of:
                1.5617394 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.018635206 = queryNorm
              0.20943399 = fieldWeight in 2706, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=2706)
          0.0644845 = weight(abstract_txt:documents in 2706) [ClassicSimilarity], result of:
            0.0644845 = score(doc=2706,freq=4.0), product of:
              0.12517305 = queryWeight, product of:
                1.6298294 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.018635206 = queryNorm
              0.5151628 = fieldWeight in 2706, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=2706)
          0.10349378 = weight(abstract_txt:topic in 2706) [ClassicSimilarity], result of:
            0.10349378 = score(doc=2706,freq=3.0), product of:
              0.18885553 = queryWeight, product of:
                2.0019424 = boost
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.018635206 = queryNorm
              0.54800504 = fieldWeight in 2706, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.0625 = fieldNorm(doc=2706)
          0.06737668 = weight(abstract_txt:model in 2706) [ClassicSimilarity], result of:
            0.06737668 = score(doc=2706,freq=3.0), product of:
              0.15613712 = queryWeight, product of:
                2.101886 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.018635206 = queryNorm
              0.4315225 = fieldWeight in 2706, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.0625 = fieldNorm(doc=2706)
          0.16089983 = weight(abstract_txt:domains in 2706) [ClassicSimilarity], result of:
            0.16089983 = score(doc=2706,freq=4.0), product of:
              0.2302737 = queryWeight, product of:
                2.2105935 = boost
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.018635206 = queryNorm
              0.69873303 = fieldWeight in 2706, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.0625 = fieldNorm(doc=2706)
          0.25883532 = weight(abstract_txt:domain in 2706) [ClassicSimilarity], result of:
            0.25883532 = score(doc=2706,freq=7.0), product of:
              0.33053714 = queryWeight, product of:
                3.7455177 = boost
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.018635206 = queryNorm
              0.7830748 = fieldWeight in 2706, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.0625 = fieldNorm(doc=2706)
        0.24 = coord(6/25)
    
  5. Meireles, N.R.G.; Cendón, B.V.; Almeida, P.E.M. de: Bibliometric knowledge organization : a domain analytic method using artificial neural networks (2014) 0.16
    0.15591112 = sum of:
      0.15591112 = product of:
        0.64962965 = sum of:
          0.010212383 = weight(abstract_txt:that in 1377) [ClassicSimilarity], result of:
            0.010212383 = score(doc=1377,freq=1.0), product of:
              0.06895963 = queryWeight, product of:
                1.5617394 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.018635206 = queryNorm
              0.1480922 = fieldWeight in 1377, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=1377)
          0.029726218 = weight(abstract_txt:through in 1377) [ClassicSimilarity], result of:
            0.029726218 = score(doc=1377,freq=1.0), product of:
              0.11857334 = queryWeight, product of:
                1.5862814 = boost
                4.011184 = idf(docFreq=2176, maxDocs=44218)
                0.018635206 = queryNorm
              0.250699 = fieldWeight in 1377, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.011184 = idf(docFreq=2176, maxDocs=44218)
                0.0625 = fieldNorm(doc=1377)
          0.09672675 = weight(abstract_txt:documents in 1377) [ClassicSimilarity], result of:
            0.09672675 = score(doc=1377,freq=9.0), product of:
              0.12517305 = queryWeight, product of:
                1.6298294 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.018635206 = queryNorm
              0.77274424 = fieldWeight in 1377, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=1377)
          0.08044992 = weight(abstract_txt:domains in 1377) [ClassicSimilarity], result of:
            0.08044992 = score(doc=1377,freq=1.0), product of:
              0.2302737 = queryWeight, product of:
                2.2105935 = boost
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.018635206 = queryNorm
              0.34936652 = fieldWeight in 1377, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.0625 = fieldNorm(doc=1377)
          0.2941611 = weight(abstract_txt:clusters in 1377) [ClassicSimilarity], result of:
            0.2941611 = score(doc=1377,freq=3.0), product of:
              0.41708377 = queryWeight, product of:
                3.4353242 = boost
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.018635206 = queryNorm
              0.70528066 = fieldWeight in 1377, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.0625 = fieldNorm(doc=1377)
          0.13835329 = weight(abstract_txt:domain in 1377) [ClassicSimilarity], result of:
            0.13835329 = score(doc=1377,freq=2.0), product of:
              0.33053714 = queryWeight, product of:
                3.7455177 = boost
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.018635206 = queryNorm
              0.41857108 = fieldWeight in 1377, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.0625 = fieldNorm(doc=1377)
        0.24 = coord(6/25)