Document (#23796)

Author
Ladewig, C.
Henkes, M.
Title
Verfahren zur automatischen inhaltlichen Erschließung von elektronischen Texten : ASPECTIX
Source
nfd Information - Wissenschaft und Praxis. 52(2001) H.3, S.159-164
Year
2001
Abstract
Das Verfahren zur automatischen syntaktischen inhaltlichen Erschließung von elektronischen Texten, AspectiX, basiert auf einem Index, dessen Elemente mit einer universellen Aspekt-Klassifikation verknüpft sind, die es erlauben, ein syntaktisches Retrieval durchzuführen. Mit diesen, auf den jeweiligen Suchgegenstand inhaltlich bezogenen Klassifikationselementen, werden die Informationen in elektronischen Texten mit bekannten Suchalgorithmen abgefragt und die Ergebnisse entsprechend der Aspektverknüpfung ausgewertet. Mit diesen Aspekten ist es möglich, unbekannte Textdokumente automatisch fachgebiets- und sprachunabhängig nach Inhalten zu klassifizieren und beim Suchen in einem Textcorpus nicht nur auf die Verwendung von Zeichenfolgen angewiesen zu sein wie bei Suchmaschinen im WWW. Der Index kann bei diesen Vorgängen intellektuell und automatisch weiter ausgebaut werden und liefert Ergebnisse im Retrieval von nahezu 100 Prozent Precision, bei gleichzeitig nahezu 100 Prozent Recall. Damit ist das Verfahren AspectiX allen anderen Recherchetools um bis zu 40 Prozent an Precision bzw. Recall überlegen, wie an zahlreichen Recherchen in drei Datenbanken, die unterschiedlich groß und thematisch unähnlich sind, nachgewiesen wird
Theme
Automatisches Indexieren
Object
AspectiX

Similar documents (author)

  1. Ladewig, C.: ¬Die Ausbildung am Institut für Information und Dokumentation der Fachhochschule Potsdam (IID) (1994) 6.16
    6.163078 = sum of:
      6.163078 = weight(author_txt:ladewig in 384) [ClassicSimilarity], result of:
        6.163078 = fieldWeight in 384, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.860925 = idf(docFreq=5, maxDocs=42306)
          0.625 = fieldNorm(doc=384)
    
  2. Ladewig, C.: 'Information Retrieval ohne Linguistik?' : Erwiderung zu dem Artikel von Gerda Ruge und Sebastian Goeser, Nfd 49(1998) H.6, S.361-369 (1998) 6.16
    6.163078 = sum of:
      6.163078 = weight(author_txt:ladewig in 3514) [ClassicSimilarity], result of:
        6.163078 = fieldWeight in 3514, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.860925 = idf(docFreq=5, maxDocs=42306)
          0.625 = fieldNorm(doc=3514)
    
  3. Ladewig, C.: Grundlagen der inhaltlichen Erschließung (1997) 6.16
    6.163078 = sum of:
      6.163078 = weight(author_txt:ladewig in 1696) [ClassicSimilarity], result of:
        6.163078 = fieldWeight in 1696, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.860925 = idf(docFreq=5, maxDocs=42306)
          0.625 = fieldNorm(doc=1696)
    
  4. Ladewig, C.; Rieger, M.: Ähnlichkeitsmessung mit und ohne aspektische Indexierung (1998) 4.93
    4.9304624 = sum of:
      4.9304624 = weight(author_txt:ladewig in 3527) [ClassicSimilarity], result of:
        4.9304624 = fieldWeight in 3527, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.860925 = idf(docFreq=5, maxDocs=42306)
          0.5 = fieldNorm(doc=3527)
    

Similar documents (content)

  1. Scherer, B.: Automatische Indexierung und ihre Anwendung im DFG-Projekt "Gemeinsames Portal für Bibliotheken, Archive und Museen (BAM)" (2003) 0.15
    0.14943895 = sum of:
      0.14943895 = product of:
        0.6226623 = sum of:
          0.0381753 = weight(abstract_txt:einem in 1284) [ClassicSimilarity], result of:
            0.0381753 = score(doc=1284,freq=3.0), product of:
              0.08093928 = queryWeight, product of:
                4.3569493 = idf(docFreq=1473, maxDocs=42306)
                0.018577052 = queryNorm
              0.47165358 = fieldWeight in 1284, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.3569493 = idf(docFreq=1473, maxDocs=42306)
                0.0625 = fieldNorm(doc=1284)
          0.04449046 = weight(abstract_txt:ergebnisse in 1284) [ClassicSimilarity], result of:
            0.04449046 = score(doc=1284,freq=1.0), product of:
              0.12927742 = queryWeight, product of:
                1.2638097 = boost
                5.506355 = idf(docFreq=466, maxDocs=42306)
                0.018577052 = queryNorm
              0.34414718 = fieldWeight in 1284, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.506355 = idf(docFreq=466, maxDocs=42306)
                0.0625 = fieldNorm(doc=1284)
          0.07974417 = weight(abstract_txt:erschließung in 1284) [ClassicSimilarity], result of:
            0.07974417 = score(doc=1284,freq=2.0), product of:
              0.15140285 = queryWeight, product of:
                1.367689 = boost
                5.958952 = idf(docFreq=296, maxDocs=42306)
                0.018577052 = queryNorm
              0.5267019 = fieldWeight in 1284, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.958952 = idf(docFreq=296, maxDocs=42306)
                0.0625 = fieldNorm(doc=1284)
          0.17308111 = weight(abstract_txt:automatischen in 1284) [ClassicSimilarity], result of:
            0.17308111 = score(doc=1284,freq=4.0), product of:
              0.20144564 = queryWeight, product of:
                1.5776086 = boost
                6.873561 = idf(docFreq=118, maxDocs=42306)
                0.018577052 = queryNorm
              0.8591951 = fieldWeight in 1284, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.873561 = idf(docFreq=118, maxDocs=42306)
                0.0625 = fieldNorm(doc=1284)
          0.13353749 = weight(abstract_txt:verfahren in 1284) [ClassicSimilarity], result of:
            0.13353749 = score(doc=1284,freq=3.0), product of:
              0.21350278 = queryWeight, product of:
                1.9891509 = boost
                5.7777534 = idf(docFreq=355, maxDocs=42306)
                0.018577052 = queryNorm
              0.62546015 = fieldWeight in 1284, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.7777534 = idf(docFreq=355, maxDocs=42306)
                0.0625 = fieldNorm(doc=1284)
          0.15363374 = weight(abstract_txt:texten in 1284) [ClassicSimilarity], result of:
            0.15363374 = score(doc=1284,freq=1.0), product of:
              0.33809045 = queryWeight, product of:
                2.5031245 = boost
                7.2706575 = idf(docFreq=79, maxDocs=42306)
                0.018577052 = queryNorm
              0.4544161 = fieldWeight in 1284, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2706575 = idf(docFreq=79, maxDocs=42306)
                0.0625 = fieldNorm(doc=1284)
        0.24 = coord(6/25)
    
  2. Heyer, G.; Quasthoff, U.; Wittig, T.: Text Mining : Wissensrohstoff Text. Konzepte, Algorithmen, Ergebnisse (2006) 0.13
    0.12834841 = sum of:
      0.12834841 = product of:
        0.45838717 = sum of:
          0.019481253 = weight(abstract_txt:einem in 219) [ClassicSimilarity], result of:
            0.019481253 = score(doc=219,freq=2.0), product of:
              0.08093928 = queryWeight, product of:
                4.3569493 = idf(docFreq=1473, maxDocs=42306)
                0.018577052 = queryNorm
              0.24068972 = fieldWeight in 219, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3569493 = idf(docFreq=1473, maxDocs=42306)
                0.0390625 = fieldNorm(doc=219)
          0.059595313 = weight(abstract_txt:unbekannte in 219) [ClassicSimilarity], result of:
            0.059595313 = score(doc=219,freq=1.0), product of:
              0.17056483 = queryWeight, product of:
                1.026479 = boost
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.018577052 = queryNorm
              0.34939978 = fieldWeight in 219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.0390625 = fieldNorm(doc=219)
          0.027806537 = weight(abstract_txt:ergebnisse in 219) [ClassicSimilarity], result of:
            0.027806537 = score(doc=219,freq=1.0), product of:
              0.12927742 = queryWeight, product of:
                1.2638097 = boost
                5.506355 = idf(docFreq=466, maxDocs=42306)
                0.018577052 = queryNorm
              0.21509199 = fieldWeight in 219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.506355 = idf(docFreq=466, maxDocs=42306)
                0.0390625 = fieldNorm(doc=219)
          0.05663614 = weight(abstract_txt:automatisch in 219) [ClassicSimilarity], result of:
            0.05663614 = score(doc=219,freq=1.0), product of:
              0.20772424 = queryWeight, product of:
                1.6020052 = boost
                6.9798555 = idf(docFreq=106, maxDocs=42306)
                0.018577052 = queryNorm
              0.2726506 = fieldWeight in 219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9798555 = idf(docFreq=106, maxDocs=42306)
                0.0390625 = fieldNorm(doc=219)
          0.06270126 = weight(abstract_txt:diesen in 219) [ClassicSimilarity], result of:
            0.06270126 = score(doc=219,freq=2.0), product of:
              0.20197426 = queryWeight, product of:
                1.9347016 = boost
                5.619598 = idf(docFreq=416, maxDocs=42306)
                0.018577052 = queryNorm
              0.31044185 = fieldWeight in 219, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.619598 = idf(docFreq=416, maxDocs=42306)
                0.0390625 = fieldNorm(doc=219)
          0.09637237 = weight(abstract_txt:verfahren in 219) [ClassicSimilarity], result of:
            0.09637237 = score(doc=219,freq=4.0), product of:
              0.21350278 = queryWeight, product of:
                1.9891509 = boost
                5.7777534 = idf(docFreq=355, maxDocs=42306)
                0.018577052 = queryNorm
              0.451387 = fieldWeight in 219, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.7777534 = idf(docFreq=355, maxDocs=42306)
                0.0390625 = fieldNorm(doc=219)
          0.13579431 = weight(abstract_txt:texten in 219) [ClassicSimilarity], result of:
            0.13579431 = score(doc=219,freq=2.0), product of:
              0.33809045 = queryWeight, product of:
                2.5031245 = boost
                7.2706575 = idf(docFreq=79, maxDocs=42306)
                0.018577052 = queryNorm
              0.40165085 = fieldWeight in 219, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.2706575 = idf(docFreq=79, maxDocs=42306)
                0.0390625 = fieldNorm(doc=219)
        0.28 = coord(7/25)
    
  3. Lepsky, K.; Zimmermann, H.H.: Katalogerweiterung durch Scanning und automatische Dokumenterschließung : Ergebnisse des DFG-Projekts KASCADE (2000) 0.13
    0.12664811 = sum of:
      0.12664811 = product of:
        0.6332406 = sum of:
          0.054547507 = weight(abstract_txt:einem in 5967) [ClassicSimilarity], result of:
            0.054547507 = score(doc=5967,freq=2.0), product of:
              0.08093928 = queryWeight, product of:
                4.3569493 = idf(docFreq=1473, maxDocs=42306)
                0.018577052 = queryNorm
              0.67393124 = fieldWeight in 5967, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3569493 = idf(docFreq=1473, maxDocs=42306)
                0.109375 = fieldNorm(doc=5967)
          0.07785831 = weight(abstract_txt:ergebnisse in 5967) [ClassicSimilarity], result of:
            0.07785831 = score(doc=5967,freq=1.0), product of:
              0.12927742 = queryWeight, product of:
                1.2638097 = boost
                5.506355 = idf(docFreq=466, maxDocs=42306)
                0.018577052 = queryNorm
              0.60225755 = fieldWeight in 5967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.506355 = idf(docFreq=466, maxDocs=42306)
                0.109375 = fieldNorm(doc=5967)
          0.15144597 = weight(abstract_txt:automatischen in 5967) [ClassicSimilarity], result of:
            0.15144597 = score(doc=5967,freq=1.0), product of:
              0.20144564 = queryWeight, product of:
                1.5776086 = boost
                6.873561 = idf(docFreq=118, maxDocs=42306)
                0.018577052 = queryNorm
              0.7517957 = fieldWeight in 5967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.873561 = idf(docFreq=118, maxDocs=42306)
                0.109375 = fieldNorm(doc=5967)
          0.1585812 = weight(abstract_txt:automatisch in 5967) [ClassicSimilarity], result of:
            0.1585812 = score(doc=5967,freq=1.0), product of:
              0.20772424 = queryWeight, product of:
                1.6020052 = boost
                6.9798555 = idf(docFreq=106, maxDocs=42306)
                0.018577052 = queryNorm
              0.7634217 = fieldWeight in 5967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9798555 = idf(docFreq=106, maxDocs=42306)
                0.109375 = fieldNorm(doc=5967)
          0.19080757 = weight(abstract_txt:verfahren in 5967) [ClassicSimilarity], result of:
            0.19080757 = score(doc=5967,freq=2.0), product of:
              0.21350278 = queryWeight, product of:
                1.9891509 = boost
                5.7777534 = idf(docFreq=355, maxDocs=42306)
                0.018577052 = queryNorm
              0.8937006 = fieldWeight in 5967, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7777534 = idf(docFreq=355, maxDocs=42306)
                0.109375 = fieldNorm(doc=5967)
        0.2 = coord(5/25)
    
  4. Hänger, C.; Krätzsch, C.; Niemann, C.: Was vom Tagging übrig blieb : Erkenntnisse und Einsichten aus zwei Jahren Projektarbeit (2011) 0.12
    0.12304078 = sum of:
      0.12304078 = product of:
        0.5126699 = sum of:
          0.038929153 = weight(abstract_txt:ergebnisse in 1520) [ClassicSimilarity], result of:
            0.038929153 = score(doc=1520,freq=1.0), product of:
              0.12927742 = queryWeight, product of:
                1.2638097 = boost
                5.506355 = idf(docFreq=466, maxDocs=42306)
                0.018577052 = queryNorm
              0.30112877 = fieldWeight in 1520, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.506355 = idf(docFreq=466, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1520)
          0.09867837 = weight(abstract_txt:erschließung in 1520) [ClassicSimilarity], result of:
            0.09867837 = score(doc=1520,freq=4.0), product of:
              0.15140285 = queryWeight, product of:
                1.367689 = boost
                5.958952 = idf(docFreq=296, maxDocs=42306)
                0.018577052 = queryNorm
              0.65176034 = fieldWeight in 1520, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.958952 = idf(docFreq=296, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1520)
          0.10708848 = weight(abstract_txt:automatischen in 1520) [ClassicSimilarity], result of:
            0.10708848 = score(doc=1520,freq=2.0), product of:
              0.20144564 = queryWeight, product of:
                1.5776086 = boost
                6.873561 = idf(docFreq=118, maxDocs=42306)
                0.018577052 = queryNorm
              0.5315999 = fieldWeight in 1520, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.873561 = idf(docFreq=118, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1520)
          0.0792906 = weight(abstract_txt:automatisch in 1520) [ClassicSimilarity], result of:
            0.0792906 = score(doc=1520,freq=1.0), product of:
              0.20772424 = queryWeight, product of:
                1.6020052 = boost
                6.9798555 = idf(docFreq=106, maxDocs=42306)
                0.018577052 = queryNorm
              0.38171086 = fieldWeight in 1520, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9798555 = idf(docFreq=106, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1520)
          0.116845295 = weight(abstract_txt:verfahren in 1520) [ClassicSimilarity], result of:
            0.116845295 = score(doc=1520,freq=3.0), product of:
              0.21350278 = queryWeight, product of:
                1.9891509 = boost
                5.7777534 = idf(docFreq=355, maxDocs=42306)
                0.018577052 = queryNorm
              0.5472776 = fieldWeight in 1520, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.7777534 = idf(docFreq=355, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1520)
          0.071838014 = weight(abstract_txt:elektronischen in 1520) [ClassicSimilarity], result of:
            0.071838014 = score(doc=1520,freq=1.0), product of:
              0.22264145 = queryWeight, product of:
                2.0312762 = boost
                5.9001117 = idf(docFreq=314, maxDocs=42306)
                0.018577052 = queryNorm
              0.32266235 = fieldWeight in 1520, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9001117 = idf(docFreq=314, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1520)
        0.24 = coord(6/25)
    
  5. Glaesener, L.: Automatisches Indexieren einer informationswissenschaftlichen Datenbank mit Mehrwortgruppen (2012) 0.12
    0.11969014 = sum of:
      0.11969014 = product of:
        0.7480634 = sum of:
          0.12583803 = weight(abstract_txt:ergebnisse in 2402) [ClassicSimilarity], result of:
            0.12583803 = score(doc=2402,freq=2.0), product of:
              0.12927742 = queryWeight, product of:
                1.2638097 = boost
                5.506355 = idf(docFreq=466, maxDocs=42306)
                0.018577052 = queryNorm
              0.97339517 = fieldWeight in 2402, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.506355 = idf(docFreq=466, maxDocs=42306)
                0.125 = fieldNorm(doc=2402)
          0.17308111 = weight(abstract_txt:automatischen in 2402) [ClassicSimilarity], result of:
            0.17308111 = score(doc=2402,freq=1.0), product of:
              0.20144564 = queryWeight, product of:
                1.5776086 = boost
                6.873561 = idf(docFreq=118, maxDocs=42306)
                0.018577052 = queryNorm
              0.8591951 = fieldWeight in 2402, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.873561 = idf(docFreq=118, maxDocs=42306)
                0.125 = fieldNorm(doc=2402)
          0.14187677 = weight(abstract_txt:diesen in 2402) [ClassicSimilarity], result of:
            0.14187677 = score(doc=2402,freq=1.0), product of:
              0.20197426 = queryWeight, product of:
                1.9347016 = boost
                5.619598 = idf(docFreq=416, maxDocs=42306)
                0.018577052 = queryNorm
              0.70244974 = fieldWeight in 2402, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.619598 = idf(docFreq=416, maxDocs=42306)
                0.125 = fieldNorm(doc=2402)
          0.3072675 = weight(abstract_txt:texten in 2402) [ClassicSimilarity], result of:
            0.3072675 = score(doc=2402,freq=1.0), product of:
              0.33809045 = queryWeight, product of:
                2.5031245 = boost
                7.2706575 = idf(docFreq=79, maxDocs=42306)
                0.018577052 = queryNorm
              0.9088322 = fieldWeight in 2402, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2706575 = idf(docFreq=79, maxDocs=42306)
                0.125 = fieldNorm(doc=2402)
        0.16 = coord(4/25)