Document (#38139)

Author
Darányi, S.
Wittek, P.
Title
Demonstrating conceptual dynamics in an evolving text collection
Source
Journal of the American Society for Information Science and Technology. 64(2013) no.12, p.2564-2572
Year
2013
Abstract
Based on real-world user demands, we demonstrate how animated visualization of evolving text corpora displays the underlying dynamics of semantic content. To interpret the results, one needs a dynamic theory of word meaning. We suggest that conceptual dynamics as the interaction between kinds of intellectual and emotional content and language is key for such a theory. We demonstrate our method by two-way seriation, which is a popular technique to analyze groups of similar instances and their features as well as the connections between the groups themselves. The two-way seriated data may be visualized as a two-dimensional heat map or as a three-dimensional landscape in which color codes or height correspond to the values in the matrix. In this article, we focus on two-way seriation of sparse data in the Reuters-21578 test collection. To achieve a meaningful visualization, we introduce a compactly supported convolution kernel similar to filter kernels used in image reconstruction and geostatistics. This filter populates the high-dimensional sparse space with values that interpolate nearby elements and provides insight into the clustering structure. We also extend two-way seriation to deal with online updates of both the row and column spaces and, combined with the convolution kernel, demonstrate a three-dimensional visualization of dynamics.
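To make the idea of a compactly supported convolution kernel concrete, the following is a minimal sketch and not the kernel used in the article. It assumes a tent-shaped kernel, a Euclidean distance between matrix cells, and a sparse matrix stored as a dictionary; the names `tent_kernel` and `smooth_sparse` are hypothetical. It shows how a few nonzero cells can be spread into a height field suitable for a heat map or landscape plot.

```python
def tent_kernel(distance, radius):
    """Compactly supported kernel: positive inside the radius, exactly zero outside."""
    return max(0.0, 1.0 - distance / radius)


def smooth_sparse(entries, n_rows, n_cols, radius=2.0):
    """Spread each nonzero cell of a sparse matrix onto its neighbours.

    entries maps (row, col) -> value. The kernel shape and radius are
    illustrative assumptions, not values taken from the article.
    """
    dense = [[0.0] * n_cols for _ in range(n_rows)]
    reach = int(radius)  # cells farther away than this get zero weight anyway
    for (i, j), value in entries.items():
        for r in range(max(0, i - reach), min(n_rows, i + reach + 1)):
            for c in range(max(0, j - reach), min(n_cols, j + reach + 1)):
                distance = ((r - i) ** 2 + (c - j) ** 2) ** 0.5
                dense[r][c] += value * tent_kernel(distance, radius)
    return dense


# Two isolated nonzero cells in a 6 x 7 matrix (toy data).
entries = {(1, 1): 1.0, (4, 5): 2.0}
heights = smooth_sparse(entries, n_rows=6, n_cols=7)
```

Because the kernel vanishes outside its radius, each nonzero cell only touches a small neighbourhood, so the cost grows with the number of nonzero entries rather than with the full size of the matrix.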
Theme
Visualization
Semantic environment in indexing and retrieval

Similar documents (content)

  1. Li, J.; Zhang, Z.; Li, X.; Chen, H.: Kernel-based learning for biomedical relation extraction (2008) 0.10
    0.10118073 = sum of:
      0.10118073 = product of:
        0.8431728 = sum of:
          0.030530134 = weight(abstract_txt:text in 3612) [ClassicSimilarity], result of:
            0.030530134 = score(doc=3612,freq=2.0), product of:
              0.06819895 = queryWeight, product of:
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.016831841 = queryNorm
              0.44766283 = fieldWeight in 3612, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.078125 = fieldNorm(doc=3612)
          0.22004603 = weight(abstract_txt:kernels in 3612) [ClassicSimilarity], result of:
            0.22004603 = score(doc=3612,freq=2.0), product of:
              0.20197187 = queryWeight, product of:
                1.2168628 = boost
                9.860925 = idf(docFreq=5, maxDocs=42306)
                0.016831841 = queryNorm
              1.0894885 = fieldWeight in 3612, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.860925 = idf(docFreq=5, maxDocs=42306)
                0.078125 = fieldNorm(doc=3612)
          0.5925966 = weight(abstract_txt:kernel in 3612) [ClassicSimilarity], result of:
            0.5925966 = score(doc=3612,freq=9.0), product of:
              0.29835072 = queryWeight, product of:
                2.091581 = boost
                8.47463 = idf(docFreq=23, maxDocs=42306)
                0.016831841 = queryNorm
              1.9862415 = fieldWeight in 3612, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                8.47463 = idf(docFreq=23, maxDocs=42306)
                0.078125 = fieldNorm(doc=3612)
        0.12 = coord(3/25)
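
The explain tree above can be checked by hand. The sketch below recomputes the score of document 3612 from the numbers shown, assuming the usual ClassicSimilarity decomposition visible in the tree: queryWeight = boost × idf × queryNorm, fieldWeight = sqrt(termFreq) × idf × fieldNorm, and a final multiplication by the coord factor. The function name `classic_term_score` is hypothetical; small differences in the last digits are expected because the search engine computes in single precision.

```python
import math


def classic_term_score(freq, idf, query_norm, field_norm, boost=1.0):
    """Per-term score as laid out in the explain tree: queryWeight * fieldWeight."""
    query_weight = boost * idf * query_norm
    field_weight = math.sqrt(freq) * idf * field_norm  # tf = sqrt(termFreq)
    return query_weight * field_weight


QUERY_NORM = 0.016831841
FIELD_NORM = 0.078125  # fieldNorm(doc=3612)

# (termFreq, idf, boost) for the three matching terms of document 3612.
terms = {
    "text": (2.0, 4.0517817, 1.0),
    "kernels": (2.0, 9.860925, 1.2168628),
    "kernel": (9.0, 8.47463, 2.091581),
}

term_scores = {
    name: classic_term_score(freq, idf, QUERY_NORM, FIELD_NORM, boost)
    for name, (freq, idf, boost) in terms.items()
}

coord = 3 / 25  # 3 of the 25 query terms matched
total = sum(term_scores.values()) * coord
print(round(total, 5))
```

The same arithmetic applies to the other entries in the list; only the matched terms, fieldNorm, and coord factor change.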
    
  2. Zhang, M.; Zhou, G.D.; Aw, A.: Exploring syntactic structured features over parse trees for relation extraction using kernel methods (2008) 0.08
    0.08158019 = sum of:
      0.08158019 = product of:
        0.67983496 = sum of:
          0.017270453 = weight(abstract_txt:text in 4056) [ClassicSimilarity], result of:
            0.017270453 = score(doc=4056,freq=1.0), product of:
              0.06819895 = queryWeight, product of:
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.016831841 = queryNorm
              0.25323635 = fieldWeight in 4056, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.0625 = fieldNorm(doc=4056)
          0.2156002 = weight(abstract_txt:kernels in 4056) [ClassicSimilarity], result of:
            0.2156002 = score(doc=4056,freq=3.0), product of:
              0.20197187 = queryWeight, product of:
                1.2168628 = boost
                9.860925 = idf(docFreq=5, maxDocs=42306)
                0.016831841 = queryNorm
              1.0674764 = fieldWeight in 4056, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.860925 = idf(docFreq=5, maxDocs=42306)
                0.0625 = fieldNorm(doc=4056)
          0.44696432 = weight(abstract_txt:kernel in 4056) [ClassicSimilarity], result of:
            0.44696432 = score(doc=4056,freq=8.0), product of:
              0.29835072 = queryWeight, product of:
                2.091581 = boost
                8.47463 = idf(docFreq=23, maxDocs=42306)
                0.016831841 = queryNorm
              1.4981171 = fieldWeight in 4056, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                8.47463 = idf(docFreq=23, maxDocs=42306)
                0.0625 = fieldNorm(doc=4056)
        0.12 = coord(3/25)
    
  3. Oh, K.E.; Halpern, D.; Tremaine, M.; Chiang, J.; Silver, D.; Bemis, K.: Blocked: when the information is hidden by the visualization (2016) 0.08
    0.075947486 = sum of:
      0.075947486 = product of:
        0.47467178 = sum of:
          0.032768555 = weight(abstract_txt:three in 4889) [ClassicSimilarity], result of:
            0.032768555 = score(doc=4889,freq=2.0), product of:
              0.08296025 = queryWeight, product of:
                1.1029255 = boost
                4.4688134 = idf(docFreq=1317, maxDocs=42306)
                0.016831841 = queryNorm
              0.39499104 = fieldWeight in 4889, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4688134 = idf(docFreq=1317, maxDocs=42306)
                0.0625 = fieldNorm(doc=4889)
          0.025143761 = weight(abstract_txt:theory in 4889) [ClassicSimilarity], result of:
            0.025143761 = score(doc=4889,freq=1.0), product of:
              0.087604955 = queryWeight, product of:
                1.1333799 = boost
                4.592208 = idf(docFreq=1164, maxDocs=42306)
                0.016831841 = queryNorm
              0.287013 = fieldWeight in 4889, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.592208 = idf(docFreq=1164, maxDocs=42306)
                0.0625 = fieldNorm(doc=4889)
          0.18813822 = weight(abstract_txt:visualization in 4889) [ClassicSimilarity], result of:
            0.18813822 = score(doc=4889,freq=4.0), product of:
              0.24167791 = queryWeight, product of:
                2.305554 = boost
                6.227734 = idf(docFreq=226, maxDocs=42306)
                0.016831841 = queryNorm
              0.77846676 = fieldWeight in 4889, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.227734 = idf(docFreq=226, maxDocs=42306)
                0.0625 = fieldNorm(doc=4889)
          0.22862126 = weight(abstract_txt:dimensional in 4889) [ClassicSimilarity], result of:
            0.22862126 = score(doc=4889,freq=2.0), product of:
              0.38163918 = queryWeight, product of:
                3.3454354 = boost
                6.777487 = idf(docFreq=130, maxDocs=42306)
                0.016831841 = queryNorm
              0.5990508 = fieldWeight in 4889, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.777487 = idf(docFreq=130, maxDocs=42306)
                0.0625 = fieldNorm(doc=4889)
        0.16 = coord(4/25)
    
  4. Lin, N.; Li, D.; Ding, Y.; He, B.; Qin, Z.; Tang, J.; Li, J.; Dong, T.: The dynamic features of Delicious, Flickr, and YouTube (2012) 0.07
    0.07138136 = sum of:
      0.07138136 = product of:
        0.3569068 = sum of:
          0.04013312 = weight(abstract_txt:three in 1971) [ClassicSimilarity], result of:
            0.04013312 = score(doc=1971,freq=3.0), product of:
              0.08296025 = queryWeight, product of:
                1.1029255 = boost
                4.4688134 = idf(docFreq=1317, maxDocs=42306)
                0.016831841 = queryNorm
              0.48376325 = fieldWeight in 1971, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4688134 = idf(docFreq=1317, maxDocs=42306)
                0.0625 = fieldNorm(doc=1971)
          0.03473653 = weight(abstract_txt:groups in 1971) [ClassicSimilarity], result of:
            0.03473653 = score(doc=1971,freq=1.0), product of:
              0.10866745 = queryWeight, product of:
                1.2622951 = boost
                5.1145444 = idf(docFreq=690, maxDocs=42306)
                0.016831841 = queryNorm
              0.31965902 = fieldWeight in 1971, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1145444 = idf(docFreq=690, maxDocs=42306)
                0.0625 = fieldNorm(doc=1971)
          0.03740962 = weight(abstract_txt:similar in 1971) [ClassicSimilarity], result of:
            0.03740962 = score(doc=1971,freq=1.0), product of:
              0.11417317 = queryWeight, product of:
                1.2938776 = boost
                5.2425094 = idf(docFreq=607, maxDocs=42306)
                0.016831841 = queryNorm
              0.32765684 = fieldWeight in 1971, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2425094 = idf(docFreq=607, maxDocs=42306)
                0.0625 = fieldNorm(doc=1971)
          0.07346614 = weight(abstract_txt:evolving in 1971) [ClassicSimilarity], result of:
            0.07346614 = score(doc=1971,freq=1.0), product of:
              0.17904684 = queryWeight, product of:
                1.6202966 = boost
                6.565088 = idf(docFreq=161, maxDocs=42306)
                0.016831841 = queryNorm
              0.410318 = fieldWeight in 1971, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.565088 = idf(docFreq=161, maxDocs=42306)
                0.0625 = fieldNorm(doc=1971)
          0.1711614 = weight(abstract_txt:dynamics in 1971) [ClassicSimilarity], result of:
            0.1711614 = score(doc=1971,freq=1.0), product of:
              0.39645058 = queryWeight, product of:
                3.4097357 = boost
                6.907752 = idf(docFreq=114, maxDocs=42306)
                0.016831841 = queryNorm
              0.4317345 = fieldWeight in 1971, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.907752 = idf(docFreq=114, maxDocs=42306)
                0.0625 = fieldNorm(doc=1971)
        0.2 = coord(5/25)
    
  5. Kageura, K.: The dynamics of terminology : a descriptive theory of term formation and terminological growth (2002) 0.07
    0.06981998 = sum of:
      0.06981998 = product of:
        0.4363749 = sum of:
          0.0314297 = weight(abstract_txt:theory in 3788) [ClassicSimilarity], result of:
            0.0314297 = score(doc=3788,freq=1.0), product of:
              0.087604955 = queryWeight, product of:
                1.1333799 = boost
                4.592208 = idf(docFreq=1164, maxDocs=42306)
                0.016831841 = queryNorm
              0.35876626 = fieldWeight in 3788, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.592208 = idf(docFreq=1164, maxDocs=42306)
                0.078125 = fieldNorm(doc=3788)
          0.055609725 = weight(abstract_txt:conceptual in 3788) [ClassicSimilarity], result of:
            0.055609725 = score(doc=3788,freq=2.0), product of:
              0.10171672 = queryWeight, product of:
                1.2212578 = boost
                4.94827 = idf(docFreq=815, maxDocs=42306)
                0.016831841 = queryNorm
              0.54671174 = fieldWeight in 3788, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.94827 = idf(docFreq=815, maxDocs=42306)
                0.078125 = fieldNorm(doc=3788)
          0.046762023 = weight(abstract_txt:similar in 3788) [ClassicSimilarity], result of:
            0.046762023 = score(doc=3788,freq=1.0), product of:
              0.11417317 = queryWeight, product of:
                1.2938776 = boost
                5.2425094 = idf(docFreq=607, maxDocs=42306)
                0.016831841 = queryNorm
              0.40957105 = fieldWeight in 3788, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2425094 = idf(docFreq=607, maxDocs=42306)
                0.078125 = fieldNorm(doc=3788)
          0.30257344 = weight(abstract_txt:dynamics in 3788) [ClassicSimilarity], result of:
            0.30257344 = score(doc=3788,freq=2.0), product of:
              0.39645058 = queryWeight, product of:
                3.4097357 = boost
                6.907752 = idf(docFreq=114, maxDocs=42306)
                0.016831841 = queryNorm
              0.76320595 = fieldWeight in 3788, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.907752 = idf(docFreq=114, maxDocs=42306)
                0.078125 = fieldNorm(doc=3788)
        0.16 = coord(4/25)