Document (#37509)

Author
Dunne, C.
Shneiderman, B.
Gove, R.
Klavans, J.
Dorr, B.
Title
Rapid understanding of scientific paper collections : integrating statistics, text analytics, and visualization
Source
Journal of the American Society for Information Science and Technology. 63(2012) no.12, S.2351-2369
Year
2012
Abstract
Keeping up with rapidly growing research fields, especially when there are multiple interdisciplinary sources, requires substantial effort for researchers, program managers, or venture capital investors. Current theories and tools are directed at finding a paper or website, not gaining an understanding of the key papers, authors, controversies, and hypotheses. This report presents an effort to integrate statistics, text analytics, and visualization in a multiple coordinated window environment that supports exploration. Our prototype system, Action Science Explorer (ASE), provides an environment for demonstrating principles of coordination and conducting iterative usability tests of them with interested and knowledgeable users. We developed an understanding of the value of reference management, statistics, citation text extraction, natural language summarization for single and multiple documents, filters to interactively select key papers, and network visualization to see citation patterns and identify clusters. A three-phase usability study guided our revisions to ASE and led us to improve the testing methods.

Similar documents (author)

  1. Dorr, B.J.: Large-scale dictionary construction for foreign language tutoring and interlingual machine translation (1997) 1.19
    1.1861744 = sum of:
      1.1861744 = product of:
        3.5585232 = sum of:
          3.5585232 = weight(author_txt:dorr in 3244) [ClassicSimilarity], result of:
            3.5585232 = score(doc=3244,freq=1.0), product of:
              0.6122854 = queryWeight, product of:
                1.0915805 = boost
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.060320124 = queryNorm
              5.81187 = fieldWeight in 3244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.625 = fieldNorm(doc=3244)
        0.33333334 = coord(1/3)
    
  2. Dorr, B.J.; Olsen, M.B.: Multilingual generation : the role of telicity in lexical choice and syntactic realization (1996) 0.95
    0.9489395 = sum of:
      0.9489395 = product of:
        2.8468184 = sum of:
          2.8468184 = weight(author_txt:dorr in 536) [ClassicSimilarity], result of:
            2.8468184 = score(doc=536,freq=1.0), product of:
              0.6122854 = queryWeight, product of:
                1.0915805 = boost
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.060320124 = queryNorm
              4.649496 = fieldWeight in 536, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.5 = fieldNorm(doc=536)
        0.33333334 = coord(1/3)
    
  3. Oard, D.W.; Dorr, B.J.: Evaluating cross-laguage text filtering effectiveness (1998) 0.95
    0.9489395 = sum of:
      0.9489395 = product of:
        2.8468184 = sum of:
          2.8468184 = weight(author_txt:dorr in 6214) [ClassicSimilarity], result of:
            2.8468184 = score(doc=6214,freq=1.0), product of:
              0.6122854 = queryWeight, product of:
                1.0915805 = boost
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.060320124 = queryNorm
              4.649496 = fieldWeight in 6214, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.5 = fieldNorm(doc=6214)
        0.33333334 = coord(1/3)
    
  4. Dorr, B.J.; Gaasterland, T.: Exploiting aspectual features and connecting words for summarization-inspired temporal-relation extraction (2007) 0.95
    0.9489395 = sum of:
      0.9489395 = product of:
        2.8468184 = sum of:
          2.8468184 = weight(author_txt:dorr in 950) [ClassicSimilarity], result of:
            2.8468184 = score(doc=950,freq=1.0), product of:
              0.6122854 = queryWeight, product of:
                1.0915805 = boost
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.060320124 = queryNorm
              4.649496 = fieldWeight in 950, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.5 = fieldNorm(doc=950)
        0.33333334 = coord(1/3)
    
  5. Klavans, R.; Boyack, K.W.: Identifying a better measure of relatedness for mapping science (2006) 0.92
    0.92255014 = sum of:
      0.92255014 = product of:
        2.7676504 = sum of:
          2.7676504 = weight(author_txt:klavans in 5252) [ClassicSimilarity], result of:
            2.7676504 = score(doc=5252,freq=1.0), product of:
              0.6008806 = queryWeight, product of:
                1.0813665 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.060320124 = queryNorm
              4.6059904 = fieldWeight in 5252, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.5 = fieldNorm(doc=5252)
        0.33333334 = coord(1/3)
    

Similar documents (content)

  1. Aris, A.; Shneiderman, B.; Qazvinian, V.; Radev, D.: Visual overviews for discovering key papers and influences across research fronts (2009) 0.11
    0.107127644 = sum of:
      0.107127644 = product of:
        0.5356382 = sum of:
          0.09719446 = weight(abstract_txt:gaining in 3156) [ClassicSimilarity], result of:
            0.09719446 = score(doc=3156,freq=1.0), product of:
              0.1617895 = queryWeight, product of:
                1.0323777 = boost
                7.689554 = idf(docFreq=54, maxDocs=44218)
                0.0203803 = queryNorm
              0.6007464 = fieldWeight in 3156, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.689554 = idf(docFreq=54, maxDocs=44218)
                0.078125 = fieldNorm(doc=3156)
          0.07099016 = weight(abstract_txt:citation in 3156) [ClassicSimilarity], result of:
            0.07099016 = score(doc=3156,freq=2.0), product of:
              0.13121639 = queryWeight, product of:
                1.3148388 = boost
                4.896717 = idf(docFreq=897, maxDocs=44218)
                0.0203803 = queryNorm
              0.5410159 = fieldWeight in 3156, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.896717 = idf(docFreq=897, maxDocs=44218)
                0.078125 = fieldNorm(doc=3156)
          0.12552518 = weight(abstract_txt:papers in 3156) [ClassicSimilarity], result of:
            0.12552518 = score(doc=3156,freq=4.0), product of:
              0.1522883 = queryWeight, product of:
                1.4164842 = boost
                5.2752647 = idf(docFreq=614, maxDocs=44218)
                0.0203803 = queryNorm
              0.8242601 = fieldWeight in 3156, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.2752647 = idf(docFreq=614, maxDocs=44218)
                0.078125 = fieldNorm(doc=3156)
          0.08694747 = weight(abstract_txt:multiple in 3156) [ClassicSimilarity], result of:
            0.08694747 = score(doc=3156,freq=1.0), product of:
              0.21663786 = queryWeight, product of:
                2.0691466 = boost
                5.137272 = idf(docFreq=705, maxDocs=44218)
                0.0203803 = queryNorm
              0.40134937 = fieldWeight in 3156, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.137272 = idf(docFreq=705, maxDocs=44218)
                0.078125 = fieldNorm(doc=3156)
          0.1549809 = weight(abstract_txt:visualization in 3156) [ClassicSimilarity], result of:
            0.1549809 = score(doc=3156,freq=1.0), product of:
              0.31847978 = queryWeight, product of:
                2.508794 = boost
                6.228827 = idf(docFreq=236, maxDocs=44218)
                0.0203803 = queryNorm
              0.4866271 = fieldWeight in 3156, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.228827 = idf(docFreq=236, maxDocs=44218)
                0.078125 = fieldNorm(doc=3156)
        0.2 = coord(5/25)
    
  2. Adler, R.; Ewing, J.; Taylor, P.: Citation statistics : A report from the International Mathematical Union (IMU) in cooperation with the International Council of Industrial and Applied Mathematics (ICIAM) and the Institute of Mathematical Statistics (IMS) (2008) 0.09
    0.09439302 = sum of:
      0.09439302 = product of:
        0.4719651 = sum of:
          0.038877785 = weight(abstract_txt:gaining in 2417) [ClassicSimilarity], result of:
            0.038877785 = score(doc=2417,freq=1.0), product of:
              0.1617895 = queryWeight, product of:
                1.0323777 = boost
                7.689554 = idf(docFreq=54, maxDocs=44218)
                0.0203803 = queryNorm
              0.24029857 = fieldWeight in 2417, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.689554 = idf(docFreq=54, maxDocs=44218)
                0.03125 = fieldNorm(doc=2417)
          0.087522544 = weight(abstract_txt:citation in 2417) [ClassicSimilarity], result of:
            0.087522544 = score(doc=2417,freq=19.0), product of:
              0.13121639 = queryWeight, product of:
                1.3148388 = boost
                4.896717 = idf(docFreq=897, maxDocs=44218)
                0.0203803 = queryNorm
              0.66700923 = fieldWeight in 2417, product of:
                4.358899 = tf(freq=19.0), with freq of:
                  19.0 = termFreq=19.0
                4.896717 = idf(docFreq=897, maxDocs=44218)
                0.03125 = fieldNorm(doc=2417)
          0.05021007 = weight(abstract_txt:papers in 2417) [ClassicSimilarity], result of:
            0.05021007 = score(doc=2417,freq=4.0), product of:
              0.1522883 = queryWeight, product of:
                1.4164842 = boost
                5.2752647 = idf(docFreq=614, maxDocs=44218)
                0.0203803 = queryNorm
              0.32970405 = fieldWeight in 2417, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.2752647 = idf(docFreq=614, maxDocs=44218)
                0.03125 = fieldNorm(doc=2417)
          0.03385831 = weight(abstract_txt:understanding in 2417) [ClassicSimilarity], result of:
            0.03385831 = score(doc=2417,freq=2.0), product of:
              0.16889752 = queryWeight, product of:
                1.8269881 = boost
                4.5360413 = idf(docFreq=1287, maxDocs=44218)
                0.0203803 = queryNorm
              0.20046659 = fieldWeight in 2417, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5360413 = idf(docFreq=1287, maxDocs=44218)
                0.03125 = fieldNorm(doc=2417)
          0.2614964 = weight(abstract_txt:statistics in 2417) [ClassicSimilarity], result of:
            0.2614964 = score(doc=2417,freq=17.0), product of:
              0.32335824 = queryWeight, product of:
                2.527936 = boost
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.0203803 = queryNorm
              0.8086895 = fieldWeight in 2417, product of:
                4.1231055 = tf(freq=17.0), with freq of:
                  17.0 = termFreq=17.0
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.03125 = fieldNorm(doc=2417)
        0.2 = coord(5/25)
    
  3. Zhu, B.; Chen, H.: Information visualization (2004) 0.09
    0.09073102 = sum of:
      0.09073102 = product of:
        0.45365506 = sum of:
          0.021293286 = weight(abstract_txt:environment in 4276) [ClassicSimilarity], result of:
            0.021293286 = score(doc=4276,freq=1.0), product of:
              0.1175929 = queryWeight, product of:
                1.2447124 = boost
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.0203803 = queryNorm
              0.18107629 = fieldWeight in 4276, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4276)
          0.04007071 = weight(abstract_txt:effort in 4276) [ClassicSimilarity], result of:
            0.04007071 = score(doc=4276,freq=1.0), product of:
              0.17924099 = queryWeight, product of:
                1.5367284 = boost
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.0203803 = queryNorm
              0.22355773 = fieldWeight in 4276, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4276)
          0.021204097 = weight(abstract_txt:text in 4276) [ClassicSimilarity], result of:
            0.021204097 = score(doc=4276,freq=1.0), product of:
              0.13423412 = queryWeight, product of:
                1.6287541 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0203803 = queryNorm
              0.15796354 = fieldWeight in 4276, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4276)
          0.04232289 = weight(abstract_txt:understanding in 4276) [ClassicSimilarity], result of:
            0.04232289 = score(doc=4276,freq=2.0), product of:
              0.16889752 = queryWeight, product of:
                1.8269881 = boost
                4.5360413 = idf(docFreq=1287, maxDocs=44218)
                0.0203803 = queryNorm
              0.25058323 = fieldWeight in 4276, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5360413 = idf(docFreq=1287, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4276)
          0.32876408 = weight(abstract_txt:visualization in 4276) [ClassicSimilarity], result of:
            0.32876408 = score(doc=4276,freq=18.0), product of:
              0.31847978 = queryWeight, product of:
                2.508794 = boost
                6.228827 = idf(docFreq=236, maxDocs=44218)
                0.0203803 = queryNorm
              1.0322919 = fieldWeight in 4276, product of:
                4.2426405 = tf(freq=18.0), with freq of:
                  18.0 = termFreq=18.0
                6.228827 = idf(docFreq=236, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4276)
        0.2 = coord(5/25)
    
  4. Information visualization : human-centered issues and perspectives (2008) 0.08
    0.083296835 = sum of:
      0.083296835 = product of:
        0.6941403 = sum of:
          0.06276259 = weight(abstract_txt:papers in 3285) [ClassicSimilarity], result of:
            0.06276259 = score(doc=3285,freq=1.0), product of:
              0.1522883 = queryWeight, product of:
                1.4164842 = boost
                5.2752647 = idf(docFreq=614, maxDocs=44218)
                0.0203803 = queryNorm
              0.41213006 = fieldWeight in 3285, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2752647 = idf(docFreq=614, maxDocs=44218)
                0.078125 = fieldNorm(doc=3285)
          0.1930256 = weight(abstract_txt:analytics in 3285) [ClassicSimilarity], result of:
            0.1930256 = score(doc=3285,freq=1.0), product of:
              0.3220643 = queryWeight, product of:
                2.0599172 = boost
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.0203803 = queryNorm
              0.5993387 = fieldWeight in 3285, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.078125 = fieldNorm(doc=3285)
          0.43835214 = weight(abstract_txt:visualization in 3285) [ClassicSimilarity], result of:
            0.43835214 = score(doc=3285,freq=8.0), product of:
              0.31847978 = queryWeight, product of:
                2.508794 = boost
                6.228827 = idf(docFreq=236, maxDocs=44218)
                0.0203803 = queryNorm
              1.3763893 = fieldWeight in 3285, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                6.228827 = idf(docFreq=236, maxDocs=44218)
                0.078125 = fieldNorm(doc=3285)
        0.12 = coord(3/25)
    
  5. Parsons, P.; Sedig, K.: Adjustable properties of visual representations : improving the quality of human-information interaction (2014) 0.08
    0.07803069 = sum of:
      0.07803069 = product of:
        0.48769182 = sum of:
          0.07066683 = weight(abstract_txt:coordination in 1214) [ClassicSimilarity], result of:
            0.07066683 = score(doc=1214,freq=1.0), product of:
              0.15180045 = queryWeight, product of:
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.0203803 = queryNorm
              0.4655245 = fieldWeight in 1214, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.0625 = fieldNorm(doc=1214)
          0.07465671 = weight(abstract_txt:coordinated in 1214) [ClassicSimilarity], result of:
            0.07465671 = score(doc=1214,freq=1.0), product of:
              0.1574618 = queryWeight, product of:
                1.0184766 = boost
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.0203803 = queryNorm
              0.47412583 = fieldWeight in 1214, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.0625 = fieldNorm(doc=1214)
          0.21838355 = weight(abstract_txt:analytics in 1214) [ClassicSimilarity], result of:
            0.21838355 = score(doc=1214,freq=2.0), product of:
              0.3220643 = queryWeight, product of:
                2.0599172 = boost
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.0203803 = queryNorm
              0.67807436 = fieldWeight in 1214, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.0625 = fieldNorm(doc=1214)
          0.12398472 = weight(abstract_txt:visualization in 1214) [ClassicSimilarity], result of:
            0.12398472 = score(doc=1214,freq=1.0), product of:
              0.31847978 = queryWeight, product of:
                2.508794 = boost
                6.228827 = idf(docFreq=236, maxDocs=44218)
                0.0203803 = queryNorm
              0.3893017 = fieldWeight in 1214, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.228827 = idf(docFreq=236, maxDocs=44218)
                0.0625 = fieldNorm(doc=1214)
        0.16 = coord(4/25)