Document (#31017)

Author
Endres-Niggemeyer, B.
Jauris-Heipke, S.
Pinsky, S.M.
Ulbricht, U.
Title
Wissen gewinnen durch Wissen : Ontologiebasierte Informationsextraktion
Source
Information - Wissenschaft und Praxis. 57(2006) H.6/7, S.301-308
Year
2006
Abstract
Die ontologiebasierte Informationsextraktion, über die hier berichtet wird, ist Teil eines Systems zum automatischen Zusammenfassen, das sich am Vorgehen kompetenter Menschen orientiert. Dahinter steht die Annahme, dass Menschen die Ergebnisse eines Systems leichter übernehmen können, wenn sie mit Verfahren erarbeitet worden sind, die sie selbst auch benutzen. Das erste Anwendungsgebiet ist Knochenmarktransplantation (KMT). Im Kern des Systems Summit-BMT (Summarize It in Bone Marrow Transplantation) steht eine Ontologie des Fachgebietes. Sie ist als MySQL-Datenbank realisiert und versorgt menschliche Benutzer und Systemkomponenten mit Wissen. Summit-BMT unterstützt die Frageformulierung mit einem empirisch fundierten Szenario-Interface. Die Retrievalergebnisse werden durch ein Textpassagenretrieval vorselektiert und dann kognitiv fundierten Agenten unterbreitet, die unter Einsatz ihrer Wissensbasis / Ontologie genauer prüfen, ob die Propositionen aus der Benutzerfrage getroffen werden. Die relevanten Textclips aus dem Duelldokument werden in das Szenarioformular eingetragen und mit einem Link zu ihrem Vorkommen im Original präsentiert. In diesem Artikel stehen die Ontologie und ihr Gebrauch zur wissensbasierten Informationsextraktion im Mittelpunkt. Die Ontologiedatenbank hält unterschiedliche Wissenstypen so bereit, dass sie leicht kombiniert werden können: Konzepte, Propositionen und ihre syntaktisch-semantischen Schemata, Unifikatoren, Paraphrasen und Definitionen von Frage-Szenarios. Auf sie stützen sich die Systemagenten, welche von Menschen adaptierte Zusammenfassungsstrategien ausführen. Mängel in anderen Verarbeitungsschritten führen zu Verlusten, aber die eigentliche Qualität der Ergebnisse steht und fällt mit der Qualität der Ontologie. Erste Tests der Extraktionsleistung fallen verblüffend positiv aus.
Theme
Automatisches Abstracting
Wissensrepräsentation

Similar documents (author)

  1. Endres-Niggemeyer, B.: Sprachverarbeitung im Informationsbereich (1989) 5.96
    5.9568186 = sum of:
      5.9568186 = sum of:
        2.8780825 = weight(author_txt:niggemeyer in 4860) [ClassicSimilarity], result of:
          2.8780825 = score(doc=4860,freq=1.0), product of:
            0.69105005 = queryWeight, product of:
              8.329592 = idf(docFreq=28, maxDocs=44218)
              0.08296326 = queryNorm
            4.164796 = fieldWeight in 4860, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.329592 = idf(docFreq=28, maxDocs=44218)
              0.5 = fieldNorm(doc=4860)
        3.0787358 = weight(author_txt:endres in 4860) [ClassicSimilarity], result of:
          3.0787358 = score(doc=4860,freq=1.0), product of:
            0.72280693 = queryWeight, product of:
              1.0227191 = boost
              8.518833 = idf(docFreq=23, maxDocs=44218)
              0.08296326 = queryNorm
            4.2594166 = fieldWeight in 4860, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.518833 = idf(docFreq=23, maxDocs=44218)
              0.5 = fieldNorm(doc=4860)
    
  2. Endres-Niggemeyer, B.: ¬An empirical process model of abstracting (1992) 5.96
    5.9568186 = sum of:
      5.9568186 = sum of:
        2.8780825 = weight(author_txt:niggemeyer in 8834) [ClassicSimilarity], result of:
          2.8780825 = score(doc=8834,freq=1.0), product of:
            0.69105005 = queryWeight, product of:
              8.329592 = idf(docFreq=28, maxDocs=44218)
              0.08296326 = queryNorm
            4.164796 = fieldWeight in 8834, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.329592 = idf(docFreq=28, maxDocs=44218)
              0.5 = fieldNorm(doc=8834)
        3.0787358 = weight(author_txt:endres in 8834) [ClassicSimilarity], result of:
          3.0787358 = score(doc=8834,freq=1.0), product of:
            0.72280693 = queryWeight, product of:
              1.0227191 = boost
              8.518833 = idf(docFreq=23, maxDocs=44218)
              0.08296326 = queryNorm
            4.2594166 = fieldWeight in 8834, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.518833 = idf(docFreq=23, maxDocs=44218)
              0.5 = fieldNorm(doc=8834)
    
  3. Endres-Niggemeyer, B.: Summarising text for intelligent communication : results of the Dagstuhl seminar (1994) 5.96
    5.9568186 = sum of:
      5.9568186 = sum of:
        2.8780825 = weight(author_txt:niggemeyer in 8867) [ClassicSimilarity], result of:
          2.8780825 = score(doc=8867,freq=1.0), product of:
            0.69105005 = queryWeight, product of:
              8.329592 = idf(docFreq=28, maxDocs=44218)
              0.08296326 = queryNorm
            4.164796 = fieldWeight in 8867, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.329592 = idf(docFreq=28, maxDocs=44218)
              0.5 = fieldNorm(doc=8867)
        3.0787358 = weight(author_txt:endres in 8867) [ClassicSimilarity], result of:
          3.0787358 = score(doc=8867,freq=1.0), product of:
            0.72280693 = queryWeight, product of:
              1.0227191 = boost
              8.518833 = idf(docFreq=23, maxDocs=44218)
              0.08296326 = queryNorm
            4.2594166 = fieldWeight in 8867, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.518833 = idf(docFreq=23, maxDocs=44218)
              0.5 = fieldNorm(doc=8867)
    
  4. Endres-Niggemeyer, B.: Wissensbasierte Ansätze zur Formalerfassung (1988) 5.96
    5.9568186 = sum of:
      5.9568186 = sum of:
        2.8780825 = weight(author_txt:niggemeyer in 524) [ClassicSimilarity], result of:
          2.8780825 = score(doc=524,freq=1.0), product of:
            0.69105005 = queryWeight, product of:
              8.329592 = idf(docFreq=28, maxDocs=44218)
              0.08296326 = queryNorm
            4.164796 = fieldWeight in 524, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.329592 = idf(docFreq=28, maxDocs=44218)
              0.5 = fieldNorm(doc=524)
        3.0787358 = weight(author_txt:endres in 524) [ClassicSimilarity], result of:
          3.0787358 = score(doc=524,freq=1.0), product of:
            0.72280693 = queryWeight, product of:
              1.0227191 = boost
              8.518833 = idf(docFreq=23, maxDocs=44218)
              0.08296326 = queryNorm
            4.2594166 = fieldWeight in 524, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.518833 = idf(docFreq=23, maxDocs=44218)
              0.5 = fieldNorm(doc=524)
    
  5. Endres-Niggemeyer, B.: Content analysis : a special case of text compression (1989) 5.96
    5.9568186 = sum of:
      5.9568186 = sum of:
        2.8780825 = weight(author_txt:niggemeyer in 3549) [ClassicSimilarity], result of:
          2.8780825 = score(doc=3549,freq=1.0), product of:
            0.69105005 = queryWeight, product of:
              8.329592 = idf(docFreq=28, maxDocs=44218)
              0.08296326 = queryNorm
            4.164796 = fieldWeight in 3549, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.329592 = idf(docFreq=28, maxDocs=44218)
              0.5 = fieldNorm(doc=3549)
        3.0787358 = weight(author_txt:endres in 3549) [ClassicSimilarity], result of:
          3.0787358 = score(doc=3549,freq=1.0), product of:
            0.72280693 = queryWeight, product of:
              1.0227191 = boost
              8.518833 = idf(docFreq=23, maxDocs=44218)
              0.08296326 = queryNorm
            4.2594166 = fieldWeight in 3549, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.518833 = idf(docFreq=23, maxDocs=44218)
              0.5 = fieldNorm(doc=3549)
    

Similar documents (content)

  1. Endres-Niggemeyer, B.; Ziegert, C.: SummIt-BMT : (Summarize It in BMT) in Diagnose und Therapie, Abschlussbericht (2002) 0.20
    0.19560854 = sum of:
      0.19560854 = product of:
        0.9780427 = sum of:
          0.1287224 = weight(abstract_txt:zusammenfassen in 4497) [ClassicSimilarity], result of:
            0.1287224 = score(doc=4497,freq=2.0), product of:
              0.15502244 = queryWeight, product of:
                1.0102495 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.016334333 = queryNorm
              0.8303468 = fieldWeight in 4497, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.0625 = fieldNorm(doc=4497)
          0.09411742 = weight(abstract_txt:syntaktisch in 4497) [ClassicSimilarity], result of:
            0.09411742 = score(doc=4497,freq=1.0), product of:
              0.15851918 = queryWeight, product of:
                1.0215797 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.016334333 = queryNorm
              0.5937289 = fieldWeight in 4497, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.0625 = fieldNorm(doc=4497)
          0.05008241 = weight(abstract_txt:werden in 4497) [ClassicSimilarity], result of:
            0.05008241 = score(doc=4497,freq=7.0), product of:
              0.086379886 = queryWeight, product of:
                1.5082303 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.016334333 = queryNorm
              0.5797925 = fieldWeight in 4497, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.0625 = fieldNorm(doc=4497)
          0.36408192 = weight(abstract_txt:summit in 4497) [ClassicSimilarity], result of:
            0.36408192 = score(doc=4497,freq=4.0), product of:
              0.31004488 = queryWeight, product of:
                2.020499 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.016334333 = queryNorm
              1.1742878 = fieldWeight in 4497, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.0625 = fieldNorm(doc=4497)
          0.34103858 = weight(abstract_txt:ontologie in 4497) [ClassicSimilarity], result of:
            0.34103858 = score(doc=4497,freq=3.0), product of:
              0.41160795 = queryWeight, product of:
                3.292329 = boost
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.016334333 = queryNorm
              0.828552 = fieldWeight in 4497, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.0625 = fieldNorm(doc=4497)
        0.2 = coord(5/25)
    
  2. Stollberg, M.: Ontologiebasierte Wissensmodellierung : Verwendung als semantischer Grundbaustein des Semantic Web (2002) 0.14
    0.14383423 = sum of:
      0.14383423 = product of:
        0.5993093 = sum of:
          0.06361653 = weight(abstract_txt:anwendungsgebiet in 4495) [ClassicSimilarity], result of:
            0.06361653 = score(doc=4495,freq=1.0), product of:
              0.16701743 = queryWeight, product of:
                1.0486058 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.016334333 = queryNorm
              0.38089755 = fieldWeight in 4495, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4495)
          0.02258729 = weight(abstract_txt:ergebnisse in 4495) [ClassicSimilarity], result of:
            0.02258729 = score(doc=4495,freq=1.0), product of:
              0.10551185 = queryWeight, product of:
                1.1786829 = boost
                5.4802814 = idf(docFreq=500, maxDocs=44218)
                0.016334333 = queryNorm
              0.2140735 = fieldWeight in 4495, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4802814 = idf(docFreq=500, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4495)
          0.048779875 = weight(abstract_txt:werden in 4495) [ClassicSimilarity], result of:
            0.048779875 = score(doc=4495,freq=17.0), product of:
              0.086379886 = queryWeight, product of:
                1.5082303 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.016334333 = queryNorm
              0.56471336 = fieldWeight in 4495, product of:
                4.1231055 = tf(freq=17.0), with freq of:
                  17.0 = termFreq=17.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4495)
          0.08483906 = weight(abstract_txt:fundierten in 4495) [ClassicSimilarity], result of:
            0.08483906 = score(doc=4495,freq=1.0), product of:
              0.2549504 = queryWeight, product of:
                1.8322057 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.016334333 = queryNorm
              0.33276692 = fieldWeight in 4495, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4495)
          0.13336313 = weight(abstract_txt:ontologiebasierte in 4495) [ClassicSimilarity], result of:
            0.13336313 = score(doc=4495,freq=1.0), product of:
              0.34467965 = queryWeight, product of:
                2.1303658 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.016334333 = queryNorm
              0.38691905 = fieldWeight in 4495, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4495)
          0.24612342 = weight(abstract_txt:ontologie in 4495) [ClassicSimilarity], result of:
            0.24612342 = score(doc=4495,freq=4.0), product of:
              0.41160795 = queryWeight, product of:
                3.292329 = boost
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.016334333 = queryNorm
              0.59795594 = fieldWeight in 4495, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4495)
        0.24 = coord(6/25)
    
  3. Werrmann, J.: Modellierung im Kontext : Ontologie-basiertes Information Retrieval (2011) 0.08
    0.07565821 = sum of:
      0.07565821 = product of:
        0.6304851 = sum of:
          0.033126403 = weight(abstract_txt:werden in 1141) [ClassicSimilarity], result of:
            0.033126403 = score(doc=1141,freq=1.0), product of:
              0.086379886 = queryWeight, product of:
                1.5082303 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.016334333 = queryNorm
              0.3834967 = fieldWeight in 1141, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.109375 = fieldNorm(doc=1141)
          0.11005923 = weight(abstract_txt:wissen in 1141) [ClassicSimilarity], result of:
            0.11005923 = score(doc=1141,freq=2.0), product of:
              0.13869406 = queryWeight, product of:
                1.6550874 = boost
                5.1302147 = idf(docFreq=710, maxDocs=44218)
                0.016334333 = queryNorm
              0.7935396 = fieldWeight in 1141, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1302147 = idf(docFreq=710, maxDocs=44218)
                0.109375 = fieldNorm(doc=1141)
          0.48729947 = weight(abstract_txt:ontologie in 1141) [ClassicSimilarity], result of:
            0.48729947 = score(doc=1141,freq=2.0), product of:
              0.41160795 = queryWeight, product of:
                3.292329 = boost
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.016334333 = queryNorm
              1.1838923 = fieldWeight in 1141, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.109375 = fieldNorm(doc=1141)
        0.12 = coord(3/25)
    
  4. Aprin, L.: Entwicklung eines semantisch operierenden Risikomanagement-Informationssystems am Beispiel der Europäischen Organisation für Kernforschung (CERN) (2012) 0.07
    0.06706232 = sum of:
      0.06706232 = product of:
        0.3353116 = sum of:
          0.039320193 = weight(abstract_txt:qualität in 2286) [ClassicSimilarity], result of:
            0.039320193 = score(doc=2286,freq=1.0), product of:
              0.13521186 = queryWeight, product of:
                1.3343008 = boost
                6.203826 = idf(docFreq=242, maxDocs=44218)
                0.016334333 = queryNorm
              0.29080433 = fieldWeight in 2286, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.203826 = idf(docFreq=242, maxDocs=44218)
                0.046875 = fieldNorm(doc=2286)
          0.014197029 = weight(abstract_txt:werden in 2286) [ClassicSimilarity], result of:
            0.014197029 = score(doc=2286,freq=1.0), product of:
              0.086379886 = queryWeight, product of:
                1.5082303 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.016334333 = queryNorm
              0.16435573 = fieldWeight in 2286, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.046875 = fieldNorm(doc=2286)
          0.08824369 = weight(abstract_txt:wissen in 2286) [ClassicSimilarity], result of:
            0.08824369 = score(doc=2286,freq=7.0), product of:
              0.13869406 = queryWeight, product of:
                1.6550874 = boost
                5.1302147 = idf(docFreq=710, maxDocs=44218)
                0.016334333 = queryNorm
              0.6362471 = fieldWeight in 2286, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.1302147 = idf(docFreq=710, maxDocs=44218)
                0.046875 = fieldNorm(doc=2286)
          0.045876633 = weight(abstract_txt:steht in 2286) [ClassicSimilarity], result of:
            0.045876633 = score(doc=2286,freq=1.0), product of:
              0.17153883 = queryWeight, product of:
                1.8406584 = boost
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.016334333 = queryNorm
              0.2674417 = fieldWeight in 2286, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.046875 = fieldNorm(doc=2286)
          0.14767405 = weight(abstract_txt:ontologie in 2286) [ClassicSimilarity], result of:
            0.14767405 = score(doc=2286,freq=1.0), product of:
              0.41160795 = queryWeight, product of:
                3.292329 = boost
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.016334333 = queryNorm
              0.35877356 = fieldWeight in 2286, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.046875 = fieldNorm(doc=2286)
        0.2 = coord(5/25)
    
  5. Smith, B.; Siebert, D.; Ceusters, W.: Was die philosophische Ontologie zur biomedizinischen Informatik beitragen kann (2004) 0.06
    0.061238106 = sum of:
      0.061238106 = product of:
        0.51031756 = sum of:
          0.051109202 = weight(abstract_txt:ergebnisse in 2181) [ClassicSimilarity], result of:
            0.051109202 = score(doc=2181,freq=2.0), product of:
              0.10551185 = queryWeight, product of:
                1.1786829 = boost
                5.4802814 = idf(docFreq=500, maxDocs=44218)
                0.016334333 = queryNorm
              0.484393 = fieldWeight in 2181, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4802814 = idf(docFreq=500, maxDocs=44218)
                0.0625 = fieldNorm(doc=2181)
          0.018929372 = weight(abstract_txt:werden in 2181) [ClassicSimilarity], result of:
            0.018929372 = score(doc=2181,freq=1.0), product of:
              0.086379886 = queryWeight, product of:
                1.5082303 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.016334333 = queryNorm
              0.21914098 = fieldWeight in 2181, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.0625 = fieldNorm(doc=2181)
          0.44027898 = weight(abstract_txt:ontologie in 2181) [ClassicSimilarity], result of:
            0.44027898 = score(doc=2181,freq=5.0), product of:
              0.41160795 = queryWeight, product of:
                3.292329 = boost
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.016334333 = queryNorm
              1.0696561 = fieldWeight in 2181, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.0625 = fieldNorm(doc=2181)
        0.12 = coord(3/25)