Document (#36227)

Berendt, B.
Krause, B.
Kolbe-Nusser, S.
Intelligent scientific authoring tools : interactive data mining for constructive uses of citation networks
Information processing and management. 46(2010) no.1, pp. 1-10
Many powerful methods and tools exist for extracting meaning from scientific publications, their texts, and their citation links. However, existing proposals often neglect a fundamental aspect of learning: that understanding and learning require an active and constructive exploration of a domain. In this paper, we describe a new method and a tool that use data mining and interactivity to turn the typical search and retrieve dialogue, in which the user asks questions and a system gives answers, into a dialogue that also involves sense-making, in which the user has to become active by constructing a bibliography and a domain model of the search term(s). This model starts from an automatically generated and annotated clustering solution that is iteratively modified by users. The tool is part of an integrated authoring system covering all phases from search through reading and sense-making to writing. Two evaluation studies demonstrate the usability of this interactive and constructive approach, and they show that clusters and groups represent identifiable sub-topics.
Data Mining

Similar documents (author)

  1. Krause, J.: Praxisorientierte natürlichsprachliche Frage-Antwort-Systeme : zur Entwicklung vor allem in der Bundesrepublik Deutschland (1983) 4.95
    4.945436 = sum of:
      4.945436 = weight(author_txt:krause in 5188) [ClassicSimilarity], result of:
        4.945436 = fieldWeight in 5188, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.912698 = idf(docFreq=43, maxDocs=44218)
          0.625 = fieldNorm(doc=5188)
  2. Krause, J.: Mensch-Maschine-Interaktion in natürlicher Sprache : zur Bewertung eines natürlichsprachigen Frage-Antwort-Systems (1980) 4.95
    4.945436 = sum of:
      4.945436 = weight(author_txt:krause in 5498) [ClassicSimilarity], result of:
        4.945436 = fieldWeight in 5498, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.912698 = idf(docFreq=43, maxDocs=44218)
          0.625 = fieldNorm(doc=5498)
  3. Krause, M.G.: Intellectual problems of indexing picture collections (1988) 4.95
    4.945436 = sum of:
      4.945436 = weight(author_txt:krause in 5638) [ClassicSimilarity], result of:
        4.945436 = fieldWeight in 5638, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.912698 = idf(docFreq=43, maxDocs=44218)
          0.625 = fieldNorm(doc=5638)
  4. Krause, J.: Was leisten informationslinguistische Komponenten von Referenz-Retrievalsystemen für Massendaten? : Von der 'Pragmatik im Computer' zur Pragmatikanalyse als Designgrundlage (1986) 4.95
    4.945436 = sum of:
      4.945436 = weight(author_txt:krause in 7395) [ClassicSimilarity], result of:
        4.945436 = fieldWeight in 7395, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.912698 = idf(docFreq=43, maxDocs=44218)
          0.625 = fieldNorm(doc=7395)
  5. Krause, J.: Mensch-Maschine-Interaktion in natürlicher Sprache : Evaluierungsstudien zu praxisorientierten Frage-Antwort-Systemen und ihre Methodik (1982) 4.95
    4.945436 = sum of:
      4.945436 = weight(author_txt:krause in 8964) [ClassicSimilarity], result of:
        4.945436 = fieldWeight in 8964, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.912698 = idf(docFreq=43, maxDocs=44218)
          0.625 = fieldNorm(doc=8964)
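
The five author matches above share one score because each tree multiplies the same three factors: tf, idf, and fieldNorm. The sketch below reproduces that arithmetic. It assumes tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), formulas inferred from the numbers shown here (some Lucene versions use a slightly different idf numerator). The function names are illustrative, not Lucene API calls.

```python
import math

def classic_idf(doc_freq, max_docs):
    # Assumed idf formula; it reproduces idf(docFreq=43, maxDocs=44218) ~ 7.9127.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def classic_tf(freq):
    # Term frequency is dampened with a square root.
    return math.sqrt(freq)

def field_weight(freq, doc_freq, max_docs, field_norm):
    # fieldWeight = tf * idf * fieldNorm, as in the explain trees above.
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm

# Values taken from the author_txt:krause explanations above.
score = field_weight(freq=1.0, doc_freq=43, max_docs=44218, field_norm=0.625)
print(round(score, 4))
```

Lucene computes in single precision, so the last digits of a double-precision result may differ slightly from the listed 4.945436.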

Similar documents (content)

  1. Cooper, L.; Kuhlthau, C.C.: Imagery for constructing meaning in the information search process : a study of middle school students (1999) 0.15
    0.15301311 = sum of:
      0.15301311 = product of:
        0.63755465 = sum of:
          0.014823349 = weight(abstract_txt:user in 280) [ClassicSimilarity], result of:
            0.014823349 = score(doc=280,freq=1.0), product of:
              0.073585525 = queryWeight, product of:
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.019976826 = queryNorm
              0.20144382 = fieldWeight in 280, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.0546875 = fieldNorm(doc=280)
          0.013283537 = weight(abstract_txt:from in 280) [ClassicSimilarity], result of:
            0.013283537 = score(doc=280,freq=2.0), product of:
              0.062142834 = queryWeight, product of:
                1.1254987 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.019976826 = queryNorm
              0.21375814 = fieldWeight in 280, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0546875 = fieldNorm(doc=280)
          0.031803064 = weight(abstract_txt:learning in 280) [ClassicSimilarity], result of:
            0.031803064 = score(doc=280,freq=1.0), product of:
              0.12240734 = queryWeight, product of:
                1.289756 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.019976826 = queryNorm
              0.25981337 = fieldWeight in 280, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.0546875 = fieldNorm(doc=280)
          0.037180036 = weight(abstract_txt:making in 280) [ClassicSimilarity], result of:
            0.037180036 = score(doc=280,freq=1.0), product of:
              0.13584219 = queryWeight, product of:
                1.3586924 = boost
                5.0048037 = idf(docFreq=805, maxDocs=44218)
                0.019976826 = queryNorm
              0.2737002 = fieldWeight in 280, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0048037 = idf(docFreq=805, maxDocs=44218)
                0.0546875 = fieldNorm(doc=280)
          0.019727642 = weight(abstract_txt:that in 280) [ClassicSimilarity], result of:
            0.019727642 = score(doc=280,freq=4.0), product of:
              0.07612108 = queryWeight, product of:
                1.608149 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.019976826 = queryNorm
              0.25916135 = fieldWeight in 280, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0546875 = fieldNorm(doc=280)
          0.520737 = weight(abstract_txt:constructive in 280) [ClassicSimilarity], result of:
            0.520737 = score(doc=280,freq=4.0), product of:
              0.5691816 = queryWeight, product of:
                3.4062371 = boost
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.019976826 = queryNorm
              0.9148872 = fieldWeight in 280, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.0546875 = fieldNorm(doc=280)
        0.24 = coord(6/25)
  2. Lee, S.-C.: The utilization and selection of authoring software (1993) 0.14
    0.14252841 = sum of:
      0.14252841 = product of:
        0.8908026 = sum of:
          0.059787627 = weight(abstract_txt:tools in 4567) [ClassicSimilarity], result of:
            0.059787627 = score(doc=4567,freq=1.0), product of:
              0.10745363 = queryWeight, product of:
                1.2084101 = boost
                4.451232 = idf(docFreq=1401, maxDocs=44218)
                0.019976826 = queryNorm
              0.556404 = fieldWeight in 4567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.451232 = idf(docFreq=1401, maxDocs=44218)
                0.125 = fieldNorm(doc=4567)
          0.08230396 = weight(abstract_txt:tool in 4567) [ClassicSimilarity], result of:
            0.08230396 = score(doc=4567,freq=1.0), product of:
              0.13297215 = queryWeight, product of:
                1.3442627 = boost
                4.951651 = idf(docFreq=849, maxDocs=44218)
                0.019976826 = queryNorm
              0.6189564 = fieldWeight in 4567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.951651 = idf(docFreq=849, maxDocs=44218)
                0.125 = fieldNorm(doc=4567)
          0.16432457 = weight(abstract_txt:interactive in 4567) [ClassicSimilarity], result of:
            0.16432457 = score(doc=4567,freq=2.0), product of:
              0.1673421 = queryWeight, product of:
                1.5080177 = boost
                5.5548496 = idf(docFreq=464, maxDocs=44218)
                0.019976826 = queryNorm
              0.9819679 = fieldWeight in 4567, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5548496 = idf(docFreq=464, maxDocs=44218)
                0.125 = fieldNorm(doc=4567)
          0.58438647 = weight(abstract_txt:authoring in 4567) [ClassicSimilarity], result of:
            0.58438647 = score(doc=4567,freq=5.0), product of:
              0.28727 = queryWeight, product of:
                1.9758272 = boost
                7.2780466 = idf(docFreq=82, maxDocs=44218)
                0.019976826 = queryNorm
              2.034276 = fieldWeight in 4567, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.2780466 = idf(docFreq=82, maxDocs=44218)
                0.125 = fieldNorm(doc=4567)
        0.16 = coord(4/25)
  3. Kuo, J.-S.; Li, H.; Yang, Y.-K.: Active learning for constructing transliteration lexicons from the Web (2008) 0.14
    0.14199327 = sum of:
      0.14199327 = product of:
        0.5916386 = sum of:
          0.11902456 = weight(abstract_txt:starts in 1345) [ClassicSimilarity], result of:
            0.11902456 = score(doc=1345,freq=1.0), product of:
              0.163501 = queryWeight, product of:
                1.0540204 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.019976826 = queryNorm
              0.72797453 = fieldWeight in 1345, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.09375 = fieldNorm(doc=1345)
          0.016102077 = weight(abstract_txt:from in 1345) [ClassicSimilarity], result of:
            0.016102077 = score(doc=1345,freq=1.0), product of:
              0.062142834 = queryWeight, product of:
                1.1254987 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.019976826 = queryNorm
              0.259114 = fieldWeight in 1345, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.09375 = fieldNorm(doc=1345)
          0.16466692 = weight(abstract_txt:iteratively in 1345) [ClassicSimilarity], result of:
            0.16466692 = score(doc=1345,freq=1.0), product of:
              0.20300198 = queryWeight, product of:
                1.174462 = boost
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.019976826 = queryNorm
              0.8111592 = fieldWeight in 1345, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.09375 = fieldNorm(doc=1345)
          0.13354506 = weight(abstract_txt:learning in 1345) [ClassicSimilarity], result of:
            0.13354506 = score(doc=1345,freq=6.0), product of:
              0.12240734 = queryWeight, product of:
                1.289756 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.019976826 = queryNorm
              1.090989 = fieldWeight in 1345, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.09375 = fieldNorm(doc=1345)
          0.029287947 = weight(abstract_txt:that in 1345) [ClassicSimilarity], result of:
            0.029287947 = score(doc=1345,freq=3.0), product of:
              0.07612108 = queryWeight, product of:
                1.608149 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.019976826 = queryNorm
              0.38475478 = fieldWeight in 1345, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.09375 = fieldNorm(doc=1345)
          0.12901212 = weight(abstract_txt:active in 1345) [ClassicSimilarity], result of:
            0.12901212 = score(doc=1345,freq=1.0), product of:
              0.21736671 = queryWeight, product of:
                1.718701 = boost
                6.330911 = idf(docFreq=213, maxDocs=44218)
                0.019976826 = queryNorm
              0.5935229 = fieldWeight in 1345, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.330911 = idf(docFreq=213, maxDocs=44218)
                0.09375 = fieldNorm(doc=1345)
        0.24 = coord(6/25)
  4. Patton, M.; Reynolds, D.; Choudhury, G.S.; DiLauro, T.: Toward a metadata generation framework : a case study at Johns Hopkins University (2004) 0.13
    0.13348775 = sum of:
      0.13348775 = product of:
        0.47674194 = sum of:
          0.03170718 = weight(abstract_txt:tools in 1192) [ClassicSimilarity], result of:
            0.03170718 = score(doc=1192,freq=2.0), product of:
              0.10745363 = queryWeight, product of:
                1.2084101 = boost
                4.451232 = idf(docFreq=1401, maxDocs=44218)
                0.019976826 = queryNorm
              0.29507777 = fieldWeight in 1192, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.451232 = idf(docFreq=1401, maxDocs=44218)
                0.046875 = fieldNorm(doc=1192)
          0.025421303 = weight(abstract_txt:scientific in 1192) [ClassicSimilarity], result of:
            0.025421303 = score(doc=1192,freq=1.0), product of:
              0.116839916 = queryWeight, product of:
                1.2600838 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.019976826 = queryNorm
              0.21757379 = fieldWeight in 1192, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.046875 = fieldNorm(doc=1192)
          0.043648265 = weight(abstract_txt:tool in 1192) [ClassicSimilarity], result of:
            0.043648265 = score(doc=1192,freq=2.0), product of:
              0.13297215 = queryWeight, product of:
                1.3442627 = boost
                4.951651 = idf(docFreq=849, maxDocs=44218)
                0.019976826 = queryNorm
              0.32825118 = fieldWeight in 1192, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.951651 = idf(docFreq=849, maxDocs=44218)
                0.046875 = fieldNorm(doc=1192)
          0.0318686 = weight(abstract_txt:making in 1192) [ClassicSimilarity], result of:
            0.0318686 = score(doc=1192,freq=1.0), product of:
              0.13584219 = queryWeight, product of:
                1.3586924 = boost
                5.0048037 = idf(docFreq=805, maxDocs=44218)
                0.019976826 = queryNorm
              0.23460017 = fieldWeight in 1192, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0048037 = idf(docFreq=805, maxDocs=44218)
                0.046875 = fieldNorm(doc=1192)
          0.014643974 = weight(abstract_txt:that in 1192) [ClassicSimilarity], result of:
            0.014643974 = score(doc=1192,freq=3.0), product of:
              0.07612108 = queryWeight, product of:
                1.608149 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.019976826 = queryNorm
              0.19237739 = fieldWeight in 1192, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.046875 = fieldNorm(doc=1192)
          0.10627966 = weight(abstract_txt:dialogue in 1192) [ClassicSimilarity], result of:
            0.10627966 = score(doc=1192,freq=1.0), product of:
              0.3032211 = queryWeight, product of:
                2.0299416 = boost
                7.4773793 = idf(docFreq=67, maxDocs=44218)
                0.019976826 = queryNorm
              0.35050216 = fieldWeight in 1192, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4773793 = idf(docFreq=67, maxDocs=44218)
                0.046875 = fieldNorm(doc=1192)
          0.22317299 = weight(abstract_txt:constructive in 1192) [ClassicSimilarity], result of:
            0.22317299 = score(doc=1192,freq=1.0), product of:
              0.5691816 = queryWeight, product of:
                3.4062371 = boost
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.019976826 = queryNorm
              0.39209452 = fieldWeight in 1192, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.046875 = fieldNorm(doc=1192)
        0.28 = coord(7/25)
  5. Kuhlthau, C.C.; Tama, S.L.: Information search process of lawyers : a call for 'just for me' information services (2001) 0.12
    0.1173172 = sum of:
      0.1173172 = product of:
        0.4888217 = sum of:
          0.021469846 = weight(abstract_txt:model in 4492) [ClassicSimilarity], result of:
            0.021469846 = score(doc=4492,freq=1.0), product of:
              0.08617596 = queryWeight, product of:
                1.0821735 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.019976826 = queryNorm
              0.24913962 = fieldWeight in 4492, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.0625 = fieldNorm(doc=4492)
          0.036346357 = weight(abstract_txt:learning in 4492) [ClassicSimilarity], result of:
            0.036346357 = score(doc=4492,freq=1.0), product of:
              0.12240734 = queryWeight, product of:
                1.289756 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.019976826 = queryNorm
              0.29692957 = fieldWeight in 4492, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.0625 = fieldNorm(doc=4492)
          0.024887523 = weight(abstract_txt:search in 4492) [ClassicSimilarity], result of:
            0.024887523 = score(doc=4492,freq=1.0), product of:
              0.108855836 = queryWeight, product of:
                1.4896194 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.019976826 = queryNorm
              0.22862828 = fieldWeight in 4492, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0625 = fieldNorm(doc=4492)
          0.022545874 = weight(abstract_txt:that in 4492) [ClassicSimilarity], result of:
            0.022545874 = score(doc=4492,freq=4.0), product of:
              0.07612108 = queryWeight, product of:
                1.608149 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.019976826 = queryNorm
              0.2961844 = fieldWeight in 4492, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=4492)
          0.08600809 = weight(abstract_txt:active in 4492) [ClassicSimilarity], result of:
            0.08600809 = score(doc=4492,freq=1.0), product of:
              0.21736671 = queryWeight, product of:
                1.718701 = boost
                6.330911 = idf(docFreq=213, maxDocs=44218)
                0.019976826 = queryNorm
              0.39568195 = fieldWeight in 4492, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.330911 = idf(docFreq=213, maxDocs=44218)
                0.0625 = fieldNorm(doc=4492)
          0.297564 = weight(abstract_txt:constructive in 4492) [ClassicSimilarity], result of:
            0.297564 = score(doc=4492,freq=1.0), product of:
              0.5691816 = queryWeight, product of:
                3.4062371 = boost
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.019976826 = queryNorm
              0.5227927 = fieldWeight in 4492, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.0625 = fieldNorm(doc=4492)
        0.24 = coord(6/25)
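
The content-based scores above combine per-term scores (queryWeight times fieldWeight), sum them, and scale the sum by the coord factor (matched query terms over total query terms). The following sketch recomputes the first content match (doc 280) from the values listed in its tree. It assumes the same tf and idf formulas as inferred from the numbers on this page and takes queryNorm and the boosts as given inputs rather than deriving them. All names are illustrative, not Lucene API calls.

```python
import math

QUERY_NORM = 0.019976826  # taken from the explain output, not derived here
MAX_DOCS = 44218

def idf(doc_freq, max_docs=MAX_DOCS):
    # Assumed idf formula, consistent with the idf values listed above.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, field_norm, boost=1.0):
    i = idf(doc_freq)
    query_weight = boost * i * QUERY_NORM            # queryWeight
    field_weight = math.sqrt(freq) * i * field_norm  # fieldWeight
    return query_weight * field_weight

# (term, freq, docFreq, boost) for doc 280, copied from the tree above.
terms_280 = [
    ("user", 1.0, 3020, 1.0),
    ("from", 2.0, 7577, 1.1254987),
    ("learning", 1.0, 1038, 1.289756),
    ("making", 1.0, 805, 1.3586924),
    ("that", 4.0, 11241, 1.608149),
    ("constructive", 4.0, 27, 3.4062371),
]
FIELD_NORM = 0.0546875

raw = sum(term_score(f, df, FIELD_NORM, b) for _, f, df, b in terms_280)
total = raw * (6 / 25)  # coord(6/25): 6 of 25 query terms matched
print(round(raw, 4), round(total, 4))
```

The rare term "constructive" (docFreq=27) contributes most of the sum, which is why documents sharing only that word with the abstract still rank near the top.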