Document (#36228)

Author
Berendt, B.
Krause, B.
Kolbe-Nusser, S.
Title
Intelligent scientific authoring tools : interactive data mining for constructive uses of citation networks
Source
Information processing and management. 46(2010) no.1, S.1-10
Year
2010
Abstract
Many powerful methods and tools exist for extracting meaning from scientific publications, their texts, and their citation links. However, existing proposals often neglect a fundamental aspect of learning: that understanding and learning require an active and constructive exploration of a domain. In this paper, we describe a new method and a tool that use data mining and interactivity to turn the typical search and retrieve dialogue, in which the user asks questions and a system gives answers, into a dialogue that also involves sense-making, in which the user has to become active by constructing a bibliography and a domain model of the search term(s). This model starts from an automatically generated and annotated clustering solution that is iteratively modified by users. The tool is part of an integrated authoring system covering all phases from search through reading and sense-making to writing. Two evaluation studies demonstrate the usability of this interactive and constructive approach, and they show that clusters and groups represent identifiable sub-topics.
Theme
Data Mining

Similar documents (author)

  1. Krause, J.: Praxisorientierte natürlichsprachliche Frage-Antwort-Systeme : zur Entwicklung vor allem in der Bundesrepublik Deutschland (1983) 4.92
    4.924188 = sum of:
      4.924188 = weight(author_txt:krause in 5188) [ClassicSimilarity], result of:
        4.924188 = fieldWeight in 5188, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.8787007 = idf(docFreq=43, maxDocs=42740)
          0.625 = fieldNorm(doc=5188)
    
  2. Krause, J.: Mensch-Maschine-Interaktion in natürlicherSprache : zur Bewertung eines natürlichsprachigen Frage-Antwort-Systems (1980) 4.92
    4.924188 = sum of:
      4.924188 = weight(author_txt:krause in 5498) [ClassicSimilarity], result of:
        4.924188 = fieldWeight in 5498, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.8787007 = idf(docFreq=43, maxDocs=42740)
          0.625 = fieldNorm(doc=5498)
    
  3. Krause, M.G.: Intellectual problems of indexing picture collections (1988) 4.92
    4.924188 = sum of:
      4.924188 = weight(author_txt:krause in 5638) [ClassicSimilarity], result of:
        4.924188 = fieldWeight in 5638, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.8787007 = idf(docFreq=43, maxDocs=42740)
          0.625 = fieldNorm(doc=5638)
    
  4. Krause, J.: Was leisten informationslinguistische Komponenten von Referenz-Retrievalsystemen für Massendaten? : Von der 'Pragmatik im Computer' zur Pragmatikanalyse als Designgrundlage (1986) 4.92
    4.924188 = sum of:
      4.924188 = weight(author_txt:krause in 7395) [ClassicSimilarity], result of:
        4.924188 = fieldWeight in 7395, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.8787007 = idf(docFreq=43, maxDocs=42740)
          0.625 = fieldNorm(doc=7395)
    
  5. Krause, J.: Mensch-Maschine-Interaktion in natürlicher Sprache : Evaluierungsstudien zu praxisorientierten Frage-Antwort-Systemen und ihre Methodik (1982) 4.92
    4.924188 = sum of:
      4.924188 = weight(author_txt:krause in 964) [ClassicSimilarity], result of:
        4.924188 = fieldWeight in 964, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.8787007 = idf(docFreq=43, maxDocs=42740)
          0.625 = fieldNorm(doc=964)
    

Similar documents (content)

  1. Cooper, L.; Kuhlthau, C.C.: Imagery for constructing meaning in the information search process : a study of middle school students (1999) 0.16
    0.15578641 = sum of:
      0.15578641 = product of:
        0.6491101 = sum of:
          0.014633407 = weight(abstract_txt:user in 1281) [ClassicSimilarity], result of:
            0.014633407 = score(doc=1281,freq=1.0), product of:
              0.0727167 = queryWeight, product of:
                3.6797917 = idf(docFreq=2930, maxDocs=42740)
                0.019761091 = queryNorm
              0.2012386 = fieldWeight in 1281, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6797917 = idf(docFreq=2930, maxDocs=42740)
                0.0546875 = fieldNorm(doc=1281)
          0.013511323 = weight(abstract_txt:from in 1281) [ClassicSimilarity], result of:
            0.013511323 = score(doc=1281,freq=2.0), product of:
              0.06264545 = queryWeight, product of:
                1.1367718 = boost
                2.7887225 = idf(docFreq=7144, maxDocs=42740)
                0.019761091 = queryNorm
              0.21567924 = fieldWeight in 1281, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7887225 = idf(docFreq=7144, maxDocs=42740)
                0.0546875 = fieldNorm(doc=1281)
          0.032630876 = weight(abstract_txt:learning in 1281) [ClassicSimilarity], result of:
            0.032630876 = score(doc=1281,freq=1.0), product of:
              0.12411463 = queryWeight, product of:
                1.3064549 = boost
                4.807482 = idf(docFreq=948, maxDocs=42740)
                0.019761091 = queryNorm
              0.26290917 = fieldWeight in 1281, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.807482 = idf(docFreq=948, maxDocs=42740)
                0.0546875 = fieldNorm(doc=1281)
          0.037721165 = weight(abstract_txt:making in 1281) [ClassicSimilarity], result of:
            0.037721165 = score(doc=1281,freq=1.0), product of:
              0.13670799 = queryWeight, product of:
                1.3711339 = boost
                5.0454874 = idf(docFreq=747, maxDocs=42740)
                0.019761091 = queryNorm
              0.2759251 = fieldWeight in 1281, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0454874 = idf(docFreq=747, maxDocs=42740)
                0.0546875 = fieldNorm(doc=1281)
          0.020163994 = weight(abstract_txt:that in 1281) [ClassicSimilarity], result of:
            0.020163994 = score(doc=1281,freq=4.0), product of:
              0.07698656 = queryWeight, product of:
                1.6268983 = boost
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.019761091 = queryNorm
              0.26191577 = fieldWeight in 1281, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.0546875 = fieldNorm(doc=1281)
          0.53044933 = weight(abstract_txt:constructive in 1281) [ClassicSimilarity], result of:
            0.53044933 = score(doc=1281,freq=4.0), product of:
              0.57435036 = queryWeight, product of:
                3.4420485 = boost
                8.444015 = idf(docFreq=24, maxDocs=42740)
                0.019761091 = queryNorm
              0.9235641 = fieldWeight in 1281, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.444015 = idf(docFreq=24, maxDocs=42740)
                0.0546875 = fieldNorm(doc=1281)
        0.24 = coord(6/25)
    
  2. Kuo, J.-S.; Li, H.; Yang, Y.-K.: Active learning for constructing transliteration lexicons from the Web (2008) 0.14
    0.14247157 = sum of:
      0.14247157 = product of:
        0.59363157 = sum of:
          0.11721411 = weight(abstract_txt:starts in 3346) [ClassicSimilarity], result of:
            0.11721411 = score(doc=3346,freq=1.0), product of:
              0.1613089 = queryWeight, product of:
                1.0531666 = boost
                7.7508674 = idf(docFreq=49, maxDocs=42740)
                0.019761091 = queryNorm
              0.7266438 = fieldWeight in 3346, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7508674 = idf(docFreq=49, maxDocs=42740)
                0.09375 = fieldNorm(doc=3346)
          0.016378198 = weight(abstract_txt:from in 3346) [ClassicSimilarity], result of:
            0.016378198 = score(doc=3346,freq=1.0), product of:
              0.06264545 = queryWeight, product of:
                1.1367718 = boost
                2.7887225 = idf(docFreq=7144, maxDocs=42740)
                0.019761091 = queryNorm
              0.26144272 = fieldWeight in 3346, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7887225 = idf(docFreq=7144, maxDocs=42740)
                0.09375 = fieldNorm(doc=3346)
          0.16389252 = weight(abstract_txt:iteratively in 3346) [ClassicSimilarity], result of:
            0.16389252 = score(doc=3346,freq=1.0), product of:
              0.20170243 = queryWeight, product of:
                1.1776696 = boost
                8.667158 = idf(docFreq=19, maxDocs=42740)
                0.019761091 = queryNorm
              0.8125461 = fieldWeight in 3346, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.667158 = idf(docFreq=19, maxDocs=42740)
                0.09375 = fieldNorm(doc=3346)
          0.13702112 = weight(abstract_txt:learning in 3346) [ClassicSimilarity], result of:
            0.13702112 = score(doc=3346,freq=6.0), product of:
              0.12411463 = queryWeight, product of:
                1.3064549 = boost
                4.807482 = idf(docFreq=948, maxDocs=42740)
                0.019761091 = queryNorm
              1.1039885 = fieldWeight in 3346, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.807482 = idf(docFreq=948, maxDocs=42740)
                0.09375 = fieldNorm(doc=3346)
          0.029935768 = weight(abstract_txt:that in 3346) [ClassicSimilarity], result of:
            0.029935768 = score(doc=3346,freq=3.0), product of:
              0.07698656 = queryWeight, product of:
                1.6268983 = boost
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.019761091 = queryNorm
              0.38884407 = fieldWeight in 3346, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.09375 = fieldNorm(doc=3346)
          0.12918983 = weight(abstract_txt:active in 3346) [ClassicSimilarity], result of:
            0.12918983 = score(doc=3346,freq=1.0), product of:
              0.21685392 = queryWeight, product of:
                1.7268975 = boost
                6.354623 = idf(docFreq=201, maxDocs=42740)
                0.019761091 = queryNorm
              0.5957459 = fieldWeight in 3346, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.354623 = idf(docFreq=201, maxDocs=42740)
                0.09375 = fieldNorm(doc=3346)
        0.24 = coord(6/25)
    
  3. Lee, S.-C.: ¬The utilization and selection of authoring software (1993) 0.14
    0.14000154 = sum of:
      0.14000154 = product of:
        0.87500966 = sum of:
          0.059562005 = weight(abstract_txt:tools in 4567) [ClassicSimilarity], result of:
            0.059562005 = score(doc=4567,freq=1.0), product of:
              0.106832184 = queryWeight, product of:
                1.2120875 = boost
                4.4602294 = idf(docFreq=1342, maxDocs=42740)
                0.019761091 = queryNorm
              0.5575287 = fieldWeight in 4567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4602294 = idf(docFreq=1342, maxDocs=42740)
                0.125 = fieldNorm(doc=4567)
          0.08263336 = weight(abstract_txt:tool in 4567) [ClassicSimilarity], result of:
            0.08263336 = score(doc=4567,freq=1.0), product of:
              0.13289016 = queryWeight, product of:
                1.3518525 = boost
                4.974536 = idf(docFreq=802, maxDocs=42740)
                0.019761091 = queryNorm
              0.621817 = fieldWeight in 4567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.974536 = idf(docFreq=802, maxDocs=42740)
                0.125 = fieldNorm(doc=4567)
          0.16222051 = weight(abstract_txt:interactive in 4567) [ClassicSimilarity], result of:
            0.16222051 = score(doc=4567,freq=2.0), product of:
              0.16536734 = queryWeight, product of:
                1.5080223 = boost
                5.549208 = idf(docFreq=451, maxDocs=42740)
                0.019761091 = queryNorm
              0.9809707 = fieldWeight in 4567, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.549208 = idf(docFreq=451, maxDocs=42740)
                0.125 = fieldNorm(doc=4567)
          0.5705938 = weight(abstract_txt:authoring in 4567) [ClassicSimilarity], result of:
            0.5705938 = score(doc=4567,freq=5.0), product of:
              0.2818062 = queryWeight, product of:
                1.9686033 = boost
                7.24405 = idf(docFreq=82, maxDocs=42740)
                0.019761091 = queryNorm
              2.0247736 = fieldWeight in 4567, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.24405 = idf(docFreq=82, maxDocs=42740)
                0.125 = fieldNorm(doc=4567)
        0.16 = coord(4/25)
    
  4. Patton, M.; Reynolds, D.; Choudhury, G.S.; DiLauro, T.: Toward a metadata generation framework : a case study at Johns Hopkins University (2004) 0.14
    0.13522588 = sum of:
      0.13522588 = product of:
        0.48294955 = sum of:
          0.031587522 = weight(abstract_txt:tools in 3193) [ClassicSimilarity], result of:
            0.031587522 = score(doc=3193,freq=2.0), product of:
              0.106832184 = queryWeight, product of:
                1.2120875 = boost
                4.4602294 = idf(docFreq=1342, maxDocs=42740)
                0.019761091 = queryNorm
              0.29567423 = fieldWeight in 3193, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4602294 = idf(docFreq=1342, maxDocs=42740)
                0.046875 = fieldNorm(doc=3193)
          0.02586478 = weight(abstract_txt:scientific in 3193) [ClassicSimilarity], result of:
            0.02586478 = score(doc=3193,freq=1.0), product of:
              0.11780785 = queryWeight, product of:
                1.2728289 = boost
                4.6837454 = idf(docFreq=1073, maxDocs=42740)
                0.019761091 = queryNorm
              0.21955056 = fieldWeight in 3193, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6837454 = idf(docFreq=1073, maxDocs=42740)
                0.046875 = fieldNorm(doc=3193)
          0.043822955 = weight(abstract_txt:tool in 3193) [ClassicSimilarity], result of:
            0.043822955 = score(doc=3193,freq=2.0), product of:
              0.13289016 = queryWeight, product of:
                1.3518525 = boost
                4.974536 = idf(docFreq=802, maxDocs=42740)
                0.019761091 = queryNorm
              0.32976824 = fieldWeight in 3193, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.974536 = idf(docFreq=802, maxDocs=42740)
                0.046875 = fieldNorm(doc=3193)
          0.032332428 = weight(abstract_txt:making in 3193) [ClassicSimilarity], result of:
            0.032332428 = score(doc=3193,freq=1.0), product of:
              0.13670799 = queryWeight, product of:
                1.3711339 = boost
                5.0454874 = idf(docFreq=747, maxDocs=42740)
                0.019761091 = queryNorm
              0.23650722 = fieldWeight in 3193, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0454874 = idf(docFreq=747, maxDocs=42740)
                0.046875 = fieldNorm(doc=3193)
          0.014967884 = weight(abstract_txt:that in 3193) [ClassicSimilarity], result of:
            0.014967884 = score(doc=3193,freq=3.0), product of:
              0.07698656 = queryWeight, product of:
                1.6268983 = boost
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.019761091 = queryNorm
              0.19442204 = fieldWeight in 3193, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.046875 = fieldNorm(doc=3193)
          0.10703854 = weight(abstract_txt:dialogue in 3193) [ClassicSimilarity], result of:
            0.10703854 = score(doc=3193,freq=1.0), product of:
              0.3036653 = queryWeight, product of:
                2.0435276 = boost
                7.519756 = idf(docFreq=62, maxDocs=42740)
                0.019761091 = queryNorm
              0.35248855 = fieldWeight in 3193, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.519756 = idf(docFreq=62, maxDocs=42740)
                0.046875 = fieldNorm(doc=3193)
          0.22733544 = weight(abstract_txt:constructive in 3193) [ClassicSimilarity], result of:
            0.22733544 = score(doc=3193,freq=1.0), product of:
              0.57435036 = queryWeight, product of:
                3.4420485 = boost
                8.444015 = idf(docFreq=24, maxDocs=42740)
                0.019761091 = queryNorm
              0.39581317 = fieldWeight in 3193, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.444015 = idf(docFreq=24, maxDocs=42740)
                0.046875 = fieldNorm(doc=3193)
        0.28 = coord(7/25)
    
  5. Kuhlthau, C.C.; Tama, S.L.: Information search process of lawyers : a call for 'just for me' information services (2001) 0.12
    0.119023584 = sum of:
      0.119023584 = product of:
        0.4959316 = sum of:
          0.021841718 = weight(abstract_txt:model in 493) [ClassicSimilarity], result of:
            0.021841718 = score(doc=493,freq=1.0), product of:
              0.086882785 = queryWeight, product of:
                1.0930746 = boost
                4.022287 = idf(docFreq=2080, maxDocs=42740)
                0.019761091 = queryNorm
              0.25139293 = fieldWeight in 493, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.022287 = idf(docFreq=2080, maxDocs=42740)
                0.0625 = fieldNorm(doc=493)
          0.03729243 = weight(abstract_txt:learning in 493) [ClassicSimilarity], result of:
            0.03729243 = score(doc=493,freq=1.0), product of:
              0.12411463 = queryWeight, product of:
                1.3064549 = boost
                4.807482 = idf(docFreq=948, maxDocs=42740)
                0.019761091 = queryNorm
              0.3004676 = fieldWeight in 493, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.807482 = idf(docFreq=948, maxDocs=42740)
                0.0625 = fieldNorm(doc=493)
          0.024512386 = weight(abstract_txt:search in 493) [ClassicSimilarity], result of:
            0.024512386 = score(doc=493,freq=1.0), product of:
              0.10740637 = queryWeight, product of:
                1.488482 = boost
                3.6515355 = idf(docFreq=3014, maxDocs=42740)
                0.019761091 = queryNorm
              0.22822097 = fieldWeight in 493, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6515355 = idf(docFreq=3014, maxDocs=42740)
                0.0625 = fieldNorm(doc=493)
          0.023044566 = weight(abstract_txt:that in 493) [ClassicSimilarity], result of:
            0.023044566 = score(doc=493,freq=4.0), product of:
              0.07698656 = queryWeight, product of:
                1.6268983 = boost
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.019761091 = queryNorm
              0.29933232 = fieldWeight in 493, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.0625 = fieldNorm(doc=493)
          0.08612655 = weight(abstract_txt:active in 493) [ClassicSimilarity], result of:
            0.08612655 = score(doc=493,freq=1.0), product of:
              0.21685392 = queryWeight, product of:
                1.7268975 = boost
                6.354623 = idf(docFreq=201, maxDocs=42740)
                0.019761091 = queryNorm
              0.39716393 = fieldWeight in 493, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.354623 = idf(docFreq=201, maxDocs=42740)
                0.0625 = fieldNorm(doc=493)
          0.30311394 = weight(abstract_txt:constructive in 493) [ClassicSimilarity], result of:
            0.30311394 = score(doc=493,freq=1.0), product of:
              0.57435036 = queryWeight, product of:
                3.4420485 = boost
                8.444015 = idf(docFreq=24, maxDocs=42740)
                0.019761091 = queryNorm
              0.5277509 = fieldWeight in 493, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.444015 = idf(docFreq=24, maxDocs=42740)
                0.0625 = fieldNorm(doc=493)
        0.24 = coord(6/25)