Document (#10438)

Author
Sarre, F.
Güntzer, U.
Myka, A.
Jüttner, G.
Title
Maschinelles Lernen von Relationen für Thesauri und Hypertext
Source
Kognitive Ansätze zum Ordnen und Darstellen von Wissen. 2. Tagung der Deutschen ISKO Sektion einschl. der Vorträge des Workshops "Thesauri als Werkzeuge der Sprachtechnologie", Weilburg, 15.-18.10.1991
Imprint
Frankfurt : Indeks
Year
1992
Pages
pp. 265-276
Series
Fortschritte in der Wissensorganisation; Bd.2
Abstract
Advanced information systems offer their users two important search methods: targeted (full-text) search and navigation through the collection of objects by means of hypertext links. The reason these two concepts have not yet been widely adopted in every information system is that manually building comprehensive hypertext structures on the one hand, and large thesauri, which substantially improve the success of full-text searches, on the other, has so far required enormous effort and therefore incurred high costs. In the long run, however, information systems will gain broad acceptance among users only if they support users with both techniques and can adapt dynamically to new information needs, that is, if they are capable of learning. For the individual user, the key benefit is being able to profit from the search experience of other users. In this paper we present a learning component that was developed and implemented for the hypertext system 'HyperMan' at TU München. We show by example how the learning component analyzes the full-text queries of HyperMan users in order to derive thesaurus entries. The development of these learning techniques for (automatic) thesaurus construction drew on experience with the adaptive information retrieval system 'Tegen'. In the HyperMan system, however, relations are learned not only between terms but also between pieces of text. We therefore also discuss how, based on an analysis of user behavior, new hypertext links are learned and existing links, previously created automatically by HyperMan's generation component, are modified.
Theme
Hypertext
Design and application of the thesaurus principle
Object
HyperMan
TEGEN

Similar documents (content)

  1. Ellis, R.; Hindersmann, J.: Volltext- und Katalogverlinkungen in Online-Datenbanken (2003) 0.21
    0.21247487 = sum of:
      0.21247487 = product of:
        0.59020793 = sum of:
          0.026588459 = weight(abstract_txt:zwischen in 2487) [ClassicSimilarity], result of:
            0.026588459 = score(doc=2487,freq=2.0), product of:
              0.11874601 = queryWeight, product of:
                1.1489381 = boost
                5.0665126 = idf(docFreq=724, maxDocs=42306)
                0.020399207 = queryNorm
              0.22391033 = fieldWeight in 2487, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0665126 = idf(docFreq=724, maxDocs=42306)
                0.03125 = fieldNorm(doc=2487)
          0.028169347 = weight(abstract_txt:wenn in 2487) [ClassicSimilarity], result of:
            0.028169347 = score(doc=2487,freq=2.0), product of:
              0.12340747 = queryWeight, product of:
                1.1712722 = boost
                5.165 = idf(docFreq=656, maxDocs=42306)
                0.020399207 = queryNorm
              0.2282629 = fieldWeight in 2487, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.165 = idf(docFreq=656, maxDocs=42306)
                0.03125 = fieldNorm(doc=2487)
          0.026113782 = weight(abstract_txt:auch in 2487) [ClassicSimilarity], result of:
            0.026113782 = score(doc=2487,freq=5.0), product of:
              0.09895867 = queryWeight, product of:
                1.2845756 = boost
                3.7764254 = idf(docFreq=2633, maxDocs=42306)
                0.020399207 = queryNorm
              0.26388574 = fieldWeight in 2487, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.7764254 = idf(docFreq=2633, maxDocs=42306)
                0.03125 = fieldNorm(doc=2487)
          0.03716567 = weight(abstract_txt:aber in 2487) [ClassicSimilarity], result of:
            0.03716567 = score(doc=2487,freq=3.0), product of:
              0.14845161 = queryWeight, product of:
                1.5733494 = boost
                4.6253695 = idf(docFreq=1126, maxDocs=42306)
                0.020399207 = queryNorm
              0.25035545 = fieldWeight in 2487, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6253695 = idf(docFreq=1126, maxDocs=42306)
                0.03125 = fieldNorm(doc=2487)
          0.053660307 = weight(abstract_txt:informationssysteme in 2487) [ClassicSimilarity], result of:
            0.053660307 = score(doc=2487,freq=1.0), product of:
              0.23892908 = queryWeight, product of:
                1.6297523 = boost
                7.186776 = idf(docFreq=86, maxDocs=42306)
                0.020399207 = queryNorm
              0.22458676 = fieldWeight in 2487, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.186776 = idf(docFreq=86, maxDocs=42306)
                0.03125 = fieldNorm(doc=2487)
          0.16050044 = weight(abstract_txt:volltext in 2487) [ClassicSimilarity], result of:
            0.16050044 = score(doc=2487,freq=8.0), product of:
              0.24800156 = queryWeight, product of:
                1.660406 = boost
                7.321951 = idf(docFreq=75, maxDocs=42306)
                0.020399207 = queryNorm
              0.64717513 = fieldWeight in 2487, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                7.321951 = idf(docFreq=75, maxDocs=42306)
                0.03125 = fieldNorm(doc=2487)
          0.09806342 = weight(abstract_txt:links in 2487) [ClassicSimilarity], result of:
            0.09806342 = score(doc=2487,freq=10.0), product of:
              0.18975884 = queryWeight, product of:
                1.7788271 = boost
                5.2294374 = idf(docFreq=615, maxDocs=42306)
                0.020399207 = queryNorm
              0.5167792 = fieldWeight in 2487, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                5.2294374 = idf(docFreq=615, maxDocs=42306)
                0.03125 = fieldNorm(doc=2487)
          0.04497545 = weight(abstract_txt:werden in 2487) [ClassicSimilarity], result of:
            0.04497545 = score(doc=2487,freq=8.0), product of:
              0.14413427 = queryWeight, product of:
                2.0014315 = boost
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.020399207 = queryNorm
              0.31203854 = fieldWeight in 2487, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.03125 = fieldNorm(doc=2487)
          0.11497106 = weight(abstract_txt:benutzer in 2487) [ClassicSimilarity], result of:
            0.11497106 = score(doc=2487,freq=3.0), product of:
              0.34689298 = queryWeight, product of:
                2.7771533 = boost
                6.1232553 = idf(docFreq=251, maxDocs=42306)
                0.020399207 = queryNorm
              0.3314309 = fieldWeight in 2487, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1232553 = idf(docFreq=251, maxDocs=42306)
                0.03125 = fieldNorm(doc=2487)
        0.36 = coord(9/25)
    
  2. Nikolai, R.: Thesaurusföderationen : Ein Rahmenwerk für die flexible Integration von heterogenen, autonomen Thesauri (2002) 0.16
    0.16476187 = sum of:
      0.16476187 = product of:
        0.51488084 = sum of:
          0.07012346 = weight(abstract_txt:manuelle in 2166) [ClassicSimilarity], result of:
            0.07012346 = score(doc=2166,freq=2.0), product of:
              0.1799102 = queryWeight, product of:
                8.81947 = idf(docFreq=16, maxDocs=42306)
                0.020399207 = queryNorm
              0.3897692 = fieldWeight in 2166, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.81947 = idf(docFreq=16, maxDocs=42306)
                0.03125 = fieldNorm(doc=2166)
          0.019918736 = weight(abstract_txt:wenn in 2166) [ClassicSimilarity], result of:
            0.019918736 = score(doc=2166,freq=1.0), product of:
              0.12340747 = queryWeight, product of:
                1.1712722 = boost
                5.165 = idf(docFreq=656, maxDocs=42306)
                0.020399207 = queryNorm
              0.16140625 = fieldWeight in 2166, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.165 = idf(docFreq=656, maxDocs=42306)
                0.03125 = fieldNorm(doc=2166)
          0.0955382 = weight(abstract_txt:thesauri in 2166) [ClassicSimilarity], result of:
            0.0955382 = score(doc=2166,freq=17.0), product of:
              0.13650084 = queryWeight, product of:
                1.2318413 = boost
                5.432094 = idf(docFreq=502, maxDocs=42306)
                0.020399207 = queryNorm
              0.69990927 = fieldWeight in 2166, product of:
                4.1231055 = tf(freq=17.0), with freq of:
                  17.0 = termFreq=17.0
                5.432094 = idf(docFreq=502, maxDocs=42306)
                0.03125 = fieldNorm(doc=2166)
          0.023356877 = weight(abstract_txt:auch in 2166) [ClassicSimilarity], result of:
            0.023356877 = score(doc=2166,freq=4.0), product of:
              0.09895867 = queryWeight, product of:
                1.2845756 = boost
                3.7764254 = idf(docFreq=2633, maxDocs=42306)
                0.020399207 = queryNorm
              0.23602659 = fieldWeight in 2166, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.7764254 = idf(docFreq=2633, maxDocs=42306)
                0.03125 = fieldNorm(doc=2166)
          0.030345645 = weight(abstract_txt:aber in 2166) [ClassicSimilarity], result of:
            0.030345645 = score(doc=2166,freq=2.0), product of:
              0.14845161 = queryWeight, product of:
                1.5733494 = boost
                4.6253695 = idf(docFreq=1126, maxDocs=42306)
                0.020399207 = queryNorm
              0.20441438 = fieldWeight in 2166, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6253695 = idf(docFreq=1126, maxDocs=42306)
                0.03125 = fieldNorm(doc=2166)
          0.13144037 = weight(abstract_txt:informationssysteme in 2166) [ClassicSimilarity], result of:
            0.13144037 = score(doc=2166,freq=6.0), product of:
              0.23892908 = queryWeight, product of:
                1.6297523 = boost
                7.186776 = idf(docFreq=86, maxDocs=42306)
                0.020399207 = queryNorm
              0.550123 = fieldWeight in 2166, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.186776 = idf(docFreq=86, maxDocs=42306)
                0.03125 = fieldNorm(doc=2166)
          0.050284076 = weight(abstract_txt:werden in 2166) [ClassicSimilarity], result of:
            0.050284076 = score(doc=2166,freq=10.0), product of:
              0.14413427 = queryWeight, product of:
                2.0014315 = boost
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.020399207 = queryNorm
              0.34886968 = fieldWeight in 2166, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.03125 = fieldNorm(doc=2166)
          0.09387348 = weight(abstract_txt:benutzer in 2166) [ClassicSimilarity], result of:
            0.09387348 = score(doc=2166,freq=2.0), product of:
              0.34689298 = queryWeight, product of:
                2.7771533 = boost
                6.1232553 = idf(docFreq=251, maxDocs=42306)
                0.020399207 = queryNorm
              0.2706122 = fieldWeight in 2166, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1232553 = idf(docFreq=251, maxDocs=42306)
                0.03125 = fieldNorm(doc=2166)
        0.32 = coord(8/25)
    
  3. Hutzler, E.; Scheuplein, M.: Elektronische Zeitschriftenbibliothek : Neue Dienste im Rahmen von vascoda (2004) 0.15
    0.14884937 = sum of:
      0.14884937 = product of:
        0.5316049 = sum of:
          0.024898421 = weight(abstract_txt:wenn in 3986) [ClassicSimilarity], result of:
            0.024898421 = score(doc=3986,freq=1.0), product of:
              0.12340747 = queryWeight, product of:
                1.1712722 = boost
                5.165 = idf(docFreq=656, maxDocs=42306)
                0.020399207 = queryNorm
              0.20175782 = fieldWeight in 3986, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.165 = idf(docFreq=656, maxDocs=42306)
                0.0390625 = fieldNorm(doc=3986)
          0.025284562 = weight(abstract_txt:auch in 3986) [ClassicSimilarity], result of:
            0.025284562 = score(doc=3986,freq=3.0), product of:
              0.09895867 = queryWeight, product of:
                1.2845756 = boost
                3.7764254 = idf(docFreq=2633, maxDocs=42306)
                0.020399207 = queryNorm
              0.25550628 = fieldWeight in 3986, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.7764254 = idf(docFreq=2633, maxDocs=42306)
                0.0390625 = fieldNorm(doc=3986)
          0.054617353 = weight(abstract_txt:seite in 3986) [ClassicSimilarity], result of:
            0.054617353 = score(doc=3986,freq=1.0), product of:
              0.20834383 = queryWeight, product of:
                1.5218695 = boost
                6.711042 = idf(docFreq=139, maxDocs=42306)
                0.020399207 = queryNorm
              0.26215008 = fieldWeight in 3986, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.711042 = idf(docFreq=139, maxDocs=42306)
                0.0390625 = fieldNorm(doc=3986)
          0.18766803 = weight(abstract_txt:volltext in 3986) [ClassicSimilarity], result of:
            0.18766803 = score(doc=3986,freq=7.0), product of:
              0.24800156 = queryWeight, product of:
                1.660406 = boost
                7.321951 = idf(docFreq=75, maxDocs=42306)
                0.020399207 = queryNorm
              0.75672114 = fieldWeight in 3986, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.321951 = idf(docFreq=75, maxDocs=42306)
                0.0390625 = fieldNorm(doc=3986)
          0.038762964 = weight(abstract_txt:links in 3986) [ClassicSimilarity], result of:
            0.038762964 = score(doc=3986,freq=1.0), product of:
              0.18975884 = queryWeight, product of:
                1.7788271 = boost
                5.2294374 = idf(docFreq=615, maxDocs=42306)
                0.020399207 = queryNorm
              0.2042749 = fieldWeight in 3986, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2294374 = idf(docFreq=615, maxDocs=42306)
                0.0390625 = fieldNorm(doc=3986)
          0.03442715 = weight(abstract_txt:werden in 3986) [ClassicSimilarity], result of:
            0.03442715 = score(doc=3986,freq=3.0), product of:
              0.14413427 = queryWeight, product of:
                2.0014315 = boost
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.020399207 = queryNorm
              0.23885474 = fieldWeight in 3986, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.0390625 = fieldNorm(doc=3986)
          0.16594642 = weight(abstract_txt:benutzer in 3986) [ClassicSimilarity], result of:
            0.16594642 = score(doc=3986,freq=4.0), product of:
              0.34689298 = queryWeight, product of:
                2.7771533 = boost
                6.1232553 = idf(docFreq=251, maxDocs=42306)
                0.020399207 = queryNorm
              0.4783793 = fieldWeight in 3986, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.1232553 = idf(docFreq=251, maxDocs=42306)
                0.0390625 = fieldNorm(doc=3986)
        0.28 = coord(7/25)
    
  4. Hammwöhner, R.: Kohärenzrelationen in Hypertexten : Textparsing (1990) 0.13
    0.13356152 = sum of:
      0.13356152 = product of:
        0.83475953 = sum of:
          0.30806014 = weight(abstract_txt:relationen in 429) [ClassicSimilarity], result of:
            0.30806014 = score(doc=429,freq=1.0), product of:
              0.26198548 = queryWeight, product of:
                1.7065763 = boost
                7.52555 = idf(docFreq=61, maxDocs=42306)
                0.020399207 = queryNorm
              1.1758672 = fieldWeight in 429, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.52555 = idf(docFreq=61, maxDocs=42306)
                0.15625 = fieldNorm(doc=429)
          0.15505186 = weight(abstract_txt:links in 429) [ClassicSimilarity], result of:
            0.15505186 = score(doc=429,freq=1.0), product of:
              0.18975884 = queryWeight, product of:
                1.7788271 = boost
                5.2294374 = idf(docFreq=615, maxDocs=42306)
                0.020399207 = queryNorm
              0.8170996 = fieldWeight in 429, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2294374 = idf(docFreq=615, maxDocs=42306)
                0.15625 = fieldNorm(doc=429)
          0.11243862 = weight(abstract_txt:werden in 429) [ClassicSimilarity], result of:
            0.11243862 = score(doc=429,freq=2.0), product of:
              0.14413427 = queryWeight, product of:
                2.0014315 = boost
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.020399207 = queryNorm
              0.78009635 = fieldWeight in 429, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.15625 = fieldNorm(doc=429)
          0.25920892 = weight(abstract_txt:hypertext in 429) [ClassicSimilarity], result of:
            0.25920892 = score(doc=429,freq=1.0), product of:
              0.29419154 = queryWeight, product of:
                2.5575092 = boost
                5.638969 = idf(docFreq=408, maxDocs=42306)
                0.020399207 = queryNorm
              0.8810889 = fieldWeight in 429, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.638969 = idf(docFreq=408, maxDocs=42306)
                0.15625 = fieldNorm(doc=429)
        0.16 = coord(4/25)
    
  5. Block, B.; Hengel, C.; Heuvelmann, R.; Katz, C.; Rusch, B.; Schmidgall, K.; Sigrist, B.: Maschinelles Austauschformat für Bibliotheken und die Functional Requirements for Bibliographic Records : Oder: Wieviel FRBR verträgt MAB? (2005) 0.13
    0.13161163 = sum of:
      0.13161163 = product of:
        0.65805817 = sum of:
          0.44555676 = weight(title_txt:maschinelles in 2468) [ClassicSimilarity], result of:
            0.44555676 = score(doc=2468,freq=1.0), product of:
              0.1944012 = queryWeight, product of:
                1.0394931 = boost
                9.167778 = idf(docFreq=11, maxDocs=42306)
                0.020399207 = queryNorm
              2.2919445 = fieldWeight in 2468, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.167778 = idf(docFreq=11, maxDocs=42306)
                0.25 = fieldNorm(doc=2468)
          0.03760176 = weight(abstract_txt:zwischen in 2468) [ClassicSimilarity], result of:
            0.03760176 = score(doc=2468,freq=4.0), product of:
              0.11874601 = queryWeight, product of:
                1.1489381 = boost
                5.0665126 = idf(docFreq=724, maxDocs=42306)
                0.020399207 = queryNorm
              0.31665704 = fieldWeight in 2468, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.0665126 = idf(docFreq=724, maxDocs=42306)
                0.03125 = fieldNorm(doc=2468)
          0.026113782 = weight(abstract_txt:auch in 2468) [ClassicSimilarity], result of:
            0.026113782 = score(doc=2468,freq=5.0), product of:
              0.09895867 = queryWeight, product of:
                1.2845756 = boost
                3.7764254 = idf(docFreq=2633, maxDocs=42306)
                0.020399207 = queryNorm
              0.26388574 = fieldWeight in 2468, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.7764254 = idf(docFreq=2633, maxDocs=42306)
                0.03125 = fieldNorm(doc=2468)
          0.10671516 = weight(abstract_txt:relationen in 2468) [ClassicSimilarity], result of:
            0.10671516 = score(doc=2468,freq=3.0), product of:
              0.26198548 = queryWeight, product of:
                1.7065763 = boost
                7.52555 = idf(docFreq=61, maxDocs=42306)
                0.020399207 = queryNorm
              0.40733233 = fieldWeight in 2468, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.52555 = idf(docFreq=61, maxDocs=42306)
                0.03125 = fieldNorm(doc=2468)
          0.042070676 = weight(abstract_txt:werden in 2468) [ClassicSimilarity], result of:
            0.042070676 = score(doc=2468,freq=7.0), product of:
              0.14413427 = queryWeight, product of:
                2.0014315 = boost
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.020399207 = queryNorm
              0.29188532 = fieldWeight in 2468, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.03125 = fieldNorm(doc=2468)
        0.2 = coord(5/25)
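
The score breakdowns above follow the classic TF-IDF pattern labelled [ClassicSimilarity]: each matching term contributes queryWeight times fieldWeight, and the sum over terms is scaled by the coord factor. The following is a minimal Python sketch of that arithmetic, not the search engine's actual code. The function names are hypothetical, and the formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)) are assumptions inferred from the figures in the listing (they reproduce the displayed values to within rounding).

```python
import math


def idf(doc_freq, max_docs):
    # Assumed formula; matches e.g. idf(docFreq=724, maxDocs=42306) = 5.0665126.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    # One "weight(field:term in doc)" node of the explanation tree.
    tf = math.sqrt(freq)
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm   # "queryWeight, product of"
    field_weight = tf * term_idf * field_norm      # "fieldWeight in doc, product of"
    return query_weight * field_weight


def document_score(term_scores, matched_terms, total_terms):
    # Sum of the term scores, scaled by coord(matched/total).
    return sum(term_scores) * (matched_terms / total_terms)


if __name__ == "__main__":
    # Term "zwischen" in document 2487, with values copied from entry 1.
    zwischen = term_score(freq=2.0, doc_freq=724, max_docs=42306,
                          boost=1.1489381, query_norm=0.020399207,
                          field_norm=0.03125)
    print(f"zwischen in 2487: {zwischen:.9f}")

    # The nine term scores listed for document 2487, with coord(9/25).
    scores_2487 = [0.026588459, 0.028169347, 0.026113782, 0.03716567,
                   0.053660307, 0.16050044, 0.09806342, 0.04497545,
                   0.11497106]
    print(f"document 2487: {document_score(scores_2487, 9, 25):.8f}")
```

Reading the listing this way shows why entry 4 ranks lower despite its large per-term weights: its short field gives a high fieldNorm (0.15625), but only 4 of the 25 query terms match, so coord(4/25) scales the sum down sharply.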