Document (#23795)

Author
Ladewig, C.
Henkes, M.
Title
Verfahren zur automatischen inhaltlichen Erschließung von elektronischen Texten : ASPECTIX
Source
nfd Information - Wissenschaft und Praxis. 52(2001) H.3, S.159-164
Year
2001
Abstract
AspectiX, a method for the automatic syntactic subject indexing of electronic texts, is based on an index whose elements are linked to a universal aspect classification, which makes syntactic retrieval possible. Using these classification elements, which relate in content to the respective search object, the information in electronic texts is queried with known search algorithms and the results are evaluated according to the aspect linkage. These aspects make it possible to classify unknown text documents by content automatically, independent of subject field and language, and to search a text corpus without relying solely on character strings, as WWW search engines do. During these operations the index can be extended further, both intellectually and automatically, and it delivers retrieval results of nearly 100 percent precision with, at the same time, nearly 100 percent recall. AspectiX is thus superior to all other search tools by up to 40 percent in precision or recall, as demonstrated by numerous searches in three databases that differ in size and subject matter.
Theme
Automatic indexing
Object
AspectiX

Similar documents (author)

  1. Ladewig, C.: ¬Die Ausbildung am Institut für Information und Dokumentation der Fachhochschule Potsdam (IID) (1994) 6.19
    6.190705 = sum of:
      6.190705 = weight(author_txt:ladewig in 8384) [ClassicSimilarity], result of:
        6.190705 = fieldWeight in 8384, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.625 = fieldNorm(doc=8384)
    
  2. Ladewig, C.: 'Information Retrieval ohne Linguistik?' : Erwiderung zu dem Artikel von Gerda Ruge und Sebastian Goeser, Nfd 49(1998) H.6, S.361-369 (1998) 6.19
    6.190705 = sum of:
      6.190705 = weight(author_txt:ladewig in 2513) [ClassicSimilarity], result of:
        6.190705 = fieldWeight in 2513, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.625 = fieldNorm(doc=2513)
    
  3. Ladewig, C.: Grundlagen der inhaltlichen Erschließung (1997) 6.19
    6.190705 = sum of:
      6.190705 = weight(author_txt:ladewig in 695) [ClassicSimilarity], result of:
        6.190705 = fieldWeight in 695, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.625 = fieldNorm(doc=695)
    
  4. Ladewig, C.; Rieger, M.: Ähnlichkeitsmessung mit und ohne aspektische Indexierung (1998) 4.95
    4.952564 = sum of:
      4.952564 = weight(author_txt:ladewig in 2526) [ClassicSimilarity], result of:
        4.952564 = fieldWeight in 2526, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.5 = fieldNorm(doc=2526)
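
The author scores above follow the usual Lucene ClassicSimilarity pattern: fieldWeight = tf * idf * fieldNorm, with tf = sqrt(termFreq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The following is a minimal sketch that recomputes these numbers, not the catalog's actual code. The function names are hypothetical, and fieldNorm is taken from the listing as given rather than derived from field length.

```python
import math

def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as used by Lucene's ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def classic_tf(freq: float) -> float:
    """Term frequency factor: square root of the raw count."""
    return math.sqrt(freq)

def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm (fieldNorm copied from the explain output)."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm

# Values taken from the author listing: docFreq=5, maxDocs=44218, termFreq=1.
hit_with_norm_0625 = field_weight(1.0, 5, 44218, 0.625)  # hits 1 to 3
hit_with_norm_05 = field_weight(1.0, 5, 44218, 0.5)      # hit 4
print(round(hit_with_norm_0625, 4), round(hit_with_norm_05, 4))
```

Since the term, its frequency, and idf are identical for all four hits, the difference between 6.19 and 4.95 comes only from fieldNorm (0.625 versus 0.5): the fourth record has two authors, so its author field is longer.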
    

Similar documents (content)

  1. Scherer, B.: Automatische Indexierung und ihre Anwendung im DFG-Projekt "Gemeinsames Portal für Bibliotheken, Archive und Museen (BAM)" (2003) 0.15
    0.14890128 = sum of:
      0.14890128 = product of:
        0.620422 = sum of:
          0.038005784 = weight(abstract_txt:einem in 4283) [ClassicSimilarity], result of:
            0.038005784 = score(doc=4283,freq=3.0), product of:
              0.080966435 = queryWeight, product of:
                1.0077215 = boost
                4.3361473 = idf(docFreq=1572, maxDocs=44218)
                0.018529361 = queryNorm
              0.46940172 = fieldWeight in 4283, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.3361473 = idf(docFreq=1572, maxDocs=44218)
                0.0625 = fieldNorm(doc=4283)
          0.044298146 = weight(abstract_txt:ergebnisse in 4283) [ClassicSimilarity], result of:
            0.044298146 = score(doc=4283,freq=1.0), product of:
              0.12933101 = queryWeight, product of:
                1.2736185 = boost
                5.4802814 = idf(docFreq=500, maxDocs=44218)
                0.018529361 = queryNorm
              0.34251758 = fieldWeight in 4283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4802814 = idf(docFreq=500, maxDocs=44218)
                0.0625 = fieldNorm(doc=4283)
          0.07881486 = weight(abstract_txt:erschließung in 4283) [ClassicSimilarity], result of:
            0.07881486 = score(doc=4283,freq=2.0), product of:
              0.1507212 = queryWeight, product of:
                1.374913 = boost
                5.916144 = idf(docFreq=323, maxDocs=44218)
                0.018529361 = queryNorm
              0.52291816 = fieldWeight in 4283, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.916144 = idf(docFreq=323, maxDocs=44218)
                0.0625 = fieldNorm(doc=4283)
          0.17503677 = weight(abstract_txt:automatischen in 4283) [ClassicSimilarity], result of:
            0.17503677 = score(doc=4283,freq=4.0), product of:
              0.2036316 = queryWeight, product of:
                1.5981245 = boost
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.018529361 = queryNorm
              0.8595757 = fieldWeight in 4283, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.0625 = fieldNorm(doc=4283)
          0.13376638 = weight(abstract_txt:verfahren in 4283) [ClassicSimilarity], result of:
            0.13376638 = score(doc=4283,freq=3.0), product of:
              0.21445374 = queryWeight, product of:
                2.0086324 = boost
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.018529361 = queryNorm
              0.623754 = fieldWeight in 4283, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.0625 = fieldNorm(doc=4283)
          0.15050009 = weight(abstract_txt:texten in 4283) [ClassicSimilarity], result of:
            0.15050009 = score(doc=4283,freq=1.0), product of:
              0.33458045 = queryWeight, product of:
                2.5089033 = boost
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.018529361 = queryNorm
              0.44981736 = fieldWeight in 4283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.0625 = fieldNorm(doc=4283)
        0.24 = coord(6/25)
    
  2. Heyer, G.; Quasthoff, U.; Wittig, T.: Text Mining : Wissensrohstoff Text. Konzepte, Algorithmen, Ergebnisse (2006) 0.13
    0.12812461 = sum of:
      0.12812461 = product of:
        0.45758787 = sum of:
          0.019394746 = weight(abstract_txt:einem in 5218) [ClassicSimilarity], result of:
            0.019394746 = score(doc=5218,freq=2.0), product of:
              0.080966435 = queryWeight, product of:
                1.0077215 = boost
                4.3361473 = idf(docFreq=1572, maxDocs=44218)
                0.018529361 = queryNorm
              0.23954056 = fieldWeight in 5218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3361473 = idf(docFreq=1572, maxDocs=44218)
                0.0390625 = fieldNorm(doc=5218)
          0.061085418 = weight(abstract_txt:unbekannte in 5218) [ClassicSimilarity], result of:
            0.061085418 = score(doc=5218,freq=1.0), product of:
              0.17396985 = queryWeight, product of:
                1.0445038 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.018529361 = queryNorm
              0.35112646 = fieldWeight in 5218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.0390625 = fieldNorm(doc=5218)
          0.02768634 = weight(abstract_txt:ergebnisse in 5218) [ClassicSimilarity], result of:
            0.02768634 = score(doc=5218,freq=1.0), product of:
              0.12933101 = queryWeight, product of:
                1.2736185 = boost
                5.4802814 = idf(docFreq=500, maxDocs=44218)
                0.018529361 = queryNorm
              0.2140735 = fieldWeight in 5218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4802814 = idf(docFreq=500, maxDocs=44218)
                0.0390625 = fieldNorm(doc=5218)
          0.057607923 = weight(abstract_txt:automatisch in 5218) [ClassicSimilarity], result of:
            0.057607923 = score(doc=5218,freq=1.0), product of:
              0.2107886 = queryWeight, product of:
                1.6259664 = boost
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.018529361 = queryNorm
              0.27329716 = fieldWeight in 5218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.0390625 = fieldNorm(doc=5218)
          0.062251307 = weight(abstract_txt:diesen in 5218) [ClassicSimilarity], result of:
            0.062251307 = score(doc=5218,freq=2.0), product of:
              0.20167175 = queryWeight, product of:
                1.9478531 = boost
                5.58764 = idf(docFreq=449, maxDocs=44218)
                0.018529361 = queryNorm
              0.3086764 = fieldWeight in 5218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.58764 = idf(docFreq=449, maxDocs=44218)
                0.0390625 = fieldNorm(doc=5218)
          0.096537575 = weight(abstract_txt:verfahren in 5218) [ClassicSimilarity], result of:
            0.096537575 = score(doc=5218,freq=4.0), product of:
              0.21445374 = queryWeight, product of:
                2.0086324 = boost
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.018529361 = queryNorm
              0.4501557 = fieldWeight in 5218, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.0390625 = fieldNorm(doc=5218)
          0.13302454 = weight(abstract_txt:texten in 5218) [ClassicSimilarity], result of:
            0.13302454 = score(doc=5218,freq=2.0), product of:
              0.33458045 = queryWeight, product of:
                2.5089033 = boost
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.018529361 = queryNorm
              0.3975861 = fieldWeight in 5218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.0390625 = fieldNorm(doc=5218)
        0.28 = coord(7/25)
    
  3. Lepsky, K.; Zimmermann, H.H.: Katalogerweiterung durch Scanning und automatische Dokumenterschließung : Ergebnisse des DFG-Projekts KASCADE (2000) 0.13
    0.1274842 = sum of:
      0.1274842 = product of:
        0.637421 = sum of:
          0.054305285 = weight(abstract_txt:einem in 4966) [ClassicSimilarity], result of:
            0.054305285 = score(doc=4966,freq=2.0), product of:
              0.080966435 = queryWeight, product of:
                1.0077215 = boost
                4.3361473 = idf(docFreq=1572, maxDocs=44218)
                0.018529361 = queryNorm
              0.67071354 = fieldWeight in 4966, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3361473 = idf(docFreq=1572, maxDocs=44218)
                0.109375 = fieldNorm(doc=4966)
          0.07752175 = weight(abstract_txt:ergebnisse in 4966) [ClassicSimilarity], result of:
            0.07752175 = score(doc=4966,freq=1.0), product of:
              0.12933101 = queryWeight, product of:
                1.2736185 = boost
                5.4802814 = idf(docFreq=500, maxDocs=44218)
                0.018529361 = queryNorm
              0.59940577 = fieldWeight in 4966, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4802814 = idf(docFreq=500, maxDocs=44218)
                0.109375 = fieldNorm(doc=4966)
          0.15315717 = weight(abstract_txt:automatischen in 4966) [ClassicSimilarity], result of:
            0.15315717 = score(doc=4966,freq=1.0), product of:
              0.2036316 = queryWeight, product of:
                1.5981245 = boost
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.018529361 = queryNorm
              0.7521287 = fieldWeight in 4966, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.109375 = fieldNorm(doc=4966)
          0.16130218 = weight(abstract_txt:automatisch in 4966) [ClassicSimilarity], result of:
            0.16130218 = score(doc=4966,freq=1.0), product of:
              0.2107886 = queryWeight, product of:
                1.6259664 = boost
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.018529361 = queryNorm
              0.765232 = fieldWeight in 4966, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.109375 = fieldNorm(doc=4966)
          0.19113463 = weight(abstract_txt:verfahren in 4966) [ClassicSimilarity], result of:
            0.19113463 = score(doc=4966,freq=2.0), product of:
              0.21445374 = queryWeight, product of:
                2.0086324 = boost
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.018529361 = queryNorm
              0.89126277 = fieldWeight in 4966, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.109375 = fieldNorm(doc=4966)
        0.2 = coord(5/25)
    
  4. Hänger, C.; Krätzsch, C.; Niemann, C.: Was vom Tagging übrig blieb : Erkenntnisse und Einsichten aus zwei Jahren Projektarbeit (2011) 0.12
    0.12387029 = sum of:
      0.12387029 = product of:
        0.5161262 = sum of:
          0.038760874 = weight(abstract_txt:ergebnisse in 4519) [ClassicSimilarity], result of:
            0.038760874 = score(doc=4519,freq=1.0), product of:
              0.12933101 = queryWeight, product of:
                1.2736185 = boost
                5.4802814 = idf(docFreq=500, maxDocs=44218)
                0.018529361 = queryNorm
              0.29970288 = fieldWeight in 4519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4802814 = idf(docFreq=500, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4519)
          0.09752841 = weight(abstract_txt:erschließung in 4519) [ClassicSimilarity], result of:
            0.09752841 = score(doc=4519,freq=4.0), product of:
              0.1507212 = queryWeight, product of:
                1.374913 = boost
                5.916144 = idf(docFreq=323, maxDocs=44218)
                0.018529361 = queryNorm
              0.6470782 = fieldWeight in 4519, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.916144 = idf(docFreq=323, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4519)
          0.10829847 = weight(abstract_txt:automatischen in 4519) [ClassicSimilarity], result of:
            0.10829847 = score(doc=4519,freq=2.0), product of:
              0.2036316 = queryWeight, product of:
                1.5981245 = boost
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.018529361 = queryNorm
              0.5318353 = fieldWeight in 4519, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4519)
          0.08065109 = weight(abstract_txt:automatisch in 4519) [ClassicSimilarity], result of:
            0.08065109 = score(doc=4519,freq=1.0), product of:
              0.2107886 = queryWeight, product of:
                1.6259664 = boost
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.018529361 = queryNorm
              0.382616 = fieldWeight in 4519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4519)
          0.11704559 = weight(abstract_txt:verfahren in 4519) [ClassicSimilarity], result of:
            0.11704559 = score(doc=4519,freq=3.0), product of:
              0.21445374 = queryWeight, product of:
                2.0086324 = boost
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.018529361 = queryNorm
              0.5457848 = fieldWeight in 4519, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4519)
          0.073841825 = weight(abstract_txt:elektronischen in 4519) [ClassicSimilarity], result of:
            0.073841825 = score(doc=4519,freq=1.0), product of:
              0.2275127 = queryWeight, product of:
                2.0688856 = boost
                5.934836 = idf(docFreq=317, maxDocs=44218)
                0.018529361 = queryNorm
              0.32456133 = fieldWeight in 4519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.934836 = idf(docFreq=317, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4519)
        0.24 = coord(6/25)
    
  5. Glaesener, L.: Automatisches Indexieren einer informationswissenschaftlichen Datenbank mit Mehrwortgruppen (2012) 0.12
    0.11875034 = sum of:
      0.11875034 = product of:
        0.74218965 = sum of:
          0.12529407 = weight(abstract_txt:ergebnisse in 401) [ClassicSimilarity], result of:
            0.12529407 = score(doc=401,freq=2.0), product of:
              0.12933101 = queryWeight, product of:
                1.2736185 = boost
                5.4802814 = idf(docFreq=500, maxDocs=44218)
                0.018529361 = queryNorm
              0.968786 = fieldWeight in 401, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4802814 = idf(docFreq=500, maxDocs=44218)
                0.125 = fieldNorm(doc=401)
          0.17503677 = weight(abstract_txt:automatischen in 401) [ClassicSimilarity], result of:
            0.17503677 = score(doc=401,freq=1.0), product of:
              0.2036316 = queryWeight, product of:
                1.5981245 = boost
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.018529361 = queryNorm
              0.8595757 = fieldWeight in 401, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.125 = fieldNorm(doc=401)
          0.14085864 = weight(abstract_txt:diesen in 401) [ClassicSimilarity], result of:
            0.14085864 = score(doc=401,freq=1.0), product of:
              0.20167175 = queryWeight, product of:
                1.9478531 = boost
                5.58764 = idf(docFreq=449, maxDocs=44218)
                0.018529361 = queryNorm
              0.698455 = fieldWeight in 401, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.58764 = idf(docFreq=449, maxDocs=44218)
                0.125 = fieldNorm(doc=401)
          0.30100018 = weight(abstract_txt:texten in 401) [ClassicSimilarity], result of:
            0.30100018 = score(doc=401,freq=1.0), product of:
              0.33458045 = queryWeight, product of:
                2.5089033 = boost
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.018529361 = queryNorm
              0.8996347 = fieldWeight in 401, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.125 = fieldNorm(doc=401)
        0.16 = coord(4/25)
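
The content scores above combine several terms. Each term contributes queryWeight * fieldWeight, where queryWeight = boost * idf * queryNorm and fieldWeight = tf * idf * fieldNorm. The sum is then multiplied by coord, the fraction of query clauses that matched. The sketch below recomputes the score of the first content hit (doc 4283) from the boosts, document frequencies, and term frequencies shown in its explain tree. It is an illustration under those assumptions, with hypothetical function names, and not the search engine's implementation.

```python
import math

MAX_DOCS = 44218           # maxDocs from the listing
QUERY_NORM = 0.018529361   # queryNorm from the listing

def classic_idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """idf = 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(boost: float, doc_freq: int, freq: float, field_norm: float) -> float:
    """Score of one matching term: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq)
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * field_norm
    return query_weight * field_weight

def document_score(terms, field_norm: float, total_clauses: int) -> float:
    """Sum the term scores, then scale by coord = matched / total clauses."""
    total = sum(term_score(b, df, f, field_norm) for b, df, f in terms)
    coord = len(terms) / total_clauses
    return coord * total

# (boost, docFreq, termFreq) for the six terms matched in doc 4283.
doc_4283_terms = [
    (1.0077215, 1572, 3.0),  # einem
    (1.2736185, 500, 1.0),   # ergebnisse
    (1.374913, 323, 2.0),    # erschließung
    (1.5981245, 123, 4.0),   # automatischen
    (2.0086324, 377, 3.0),   # verfahren
    (2.5089033, 89, 1.0),    # texten
]

score_4283 = document_score(doc_4283_terms, field_norm=0.0625, total_clauses=25)
print(round(score_4283, 4))
```

The coord factor explains why a hit such as doc 401, whose four matching terms sum to a higher raw value (0.742), still ranks below doc 4283: only 4 of 25 clauses matched, so its sum is scaled by 0.16 instead of 0.24.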