Document (#20572)

Author
Volk, M.
Mittermaier, H.
Schurig, A.
Biedassek, T.
Title
Halbautomatische Volltextanalyse, Datenbankaufbau und Document Retrieval
Source
Datenanalyse, Klassifikation und Informationsverarbeitung: Methoden und Anwendungen in verschiedenen Fachgebieten. Hrsg.: H. Goebl u. M. Schader
Imprint
Heidelberg : Physica-Verlag
Year
1992
Pages
S.205-214
Abstract
In diesem Aufsatz beschreiben wir ein System zur Analyse von Kurzartikeln. Das System arbeitet halbautomatisch. Das heißt, zunächst wird der Artikel vom System analysiert und dann dem benutzer zur Nachberarbeitung vorgelegt. Die so gewonnene Information wird in einem Datenbankeintrag abgelegt. Über die Datenbank - in dBase IV implementiert - sind dann Abfragen und Zugriffe auf die Originaltexte effizient möglich. Der Kern dieses Aufsatzes betrifft die halbautomatische Analyse. Wir beschreiben unser Verfahren für parametrisiertes Pattern Matching sowie linguistische Heuristiken zur Ermittlung von Nominalphrasen und Präpositionalphrasen. Das System wurde für den praktischen Einsatz im Bonner Büro des 'Forums InformatikerInnen Für Frieden und gesellschaftliche Verantwortung e.V. (FIFF)' entwickelt
Theme
Automatisches Indexieren
Computerlinguistik

Similar documents (content)

  1. Bernhardt, U.; Ruhmann, I.: ¬Die Informationsgesellschaft ist keine Jobmaschine : Trotz der Dynamik im Medien- und Telekommunikationsmarkt werden die ökonomischen Erwartungen nicht erfüllt (1998) 0.06
    0.064783156 = sum of:
      0.064783156 = product of:
        0.53985965 = sum of:
          0.13048278 = weight(abstract_txt:gesellschaftliche in 2317) [ClassicSimilarity], result of:
            0.13048278 = score(doc=2317,freq=1.0), product of:
              0.13730435 = queryWeight, product of:
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.01806032 = queryNorm
              0.95031786 = fieldWeight in 2317, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.125 = fieldNorm(doc=2317)
          0.13406748 = weight(abstract_txt:verantwortung in 2317) [ClassicSimilarity], result of:
            0.13406748 = score(doc=2317,freq=1.0), product of:
              0.13980772 = queryWeight, product of:
                1.0090749 = boost
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.01806032 = queryNorm
              0.95894194 = fieldWeight in 2317, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.125 = fieldNorm(doc=2317)
          0.27530938 = weight(abstract_txt:frieden in 2317) [ClassicSimilarity], result of:
            0.27530938 = score(doc=2317,freq=1.0), product of:
              0.22587223 = queryWeight, product of:
                1.2825942 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.01806032 = queryNorm
              1.2188722 = fieldWeight in 2317, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.125 = fieldNorm(doc=2317)
        0.12 = coord(3/25)
    
  2. Datenschutz-Folgenabschätzung (DSFA) für die Corona-App (2020) 0.05
    0.048587363 = sum of:
      0.048587363 = product of:
        0.4048947 = sum of:
          0.09786208 = weight(abstract_txt:gesellschaftliche in 5827) [ClassicSimilarity], result of:
            0.09786208 = score(doc=5827,freq=1.0), product of:
              0.13730435 = queryWeight, product of:
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.01806032 = queryNorm
              0.7127384 = fieldWeight in 5827, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.09375 = fieldNorm(doc=5827)
          0.100550614 = weight(abstract_txt:verantwortung in 5827) [ClassicSimilarity], result of:
            0.100550614 = score(doc=5827,freq=1.0), product of:
              0.13980772 = queryWeight, product of:
                1.0090749 = boost
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.01806032 = queryNorm
              0.71920645 = fieldWeight in 5827, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.09375 = fieldNorm(doc=5827)
          0.20648204 = weight(abstract_txt:frieden in 5827) [ClassicSimilarity], result of:
            0.20648204 = score(doc=5827,freq=1.0), product of:
              0.22587223 = queryWeight, product of:
                1.2825942 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.01806032 = queryNorm
              0.9141542 = fieldWeight in 5827, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.09375 = fieldNorm(doc=5827)
        0.12 = coord(3/25)
    
  3. Oberhauser, O.: Automatisches Klassifizieren : Verfahren zur Erschließung elektronischer Dokumente (2004) 0.04
    0.043361653 = sum of:
      0.043361653 = product of:
        0.3613471 = sum of:
          0.058654524 = weight(abstract_txt:betrifft in 2487) [ClassicSimilarity], result of:
            0.058654524 = score(doc=2487,freq=1.0), product of:
              0.13980772 = queryWeight, product of:
                1.0090749 = boost
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.01806032 = queryNorm
              0.4195371 = fieldWeight in 2487, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2487)
          0.050190613 = weight(abstract_txt:analyse in 2487) [ClassicSimilarity], result of:
            0.050190613 = score(doc=2487,freq=1.0), product of:
              0.15876514 = queryWeight, product of:
                1.520724 = boost
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.01806032 = queryNorm
              0.3161312 = fieldWeight in 2487, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2487)
          0.25250196 = weight(abstract_txt:halbautomatische in 2487) [ClassicSimilarity], result of:
            0.25250196 = score(doc=2487,freq=1.0), product of:
              0.4661403 = queryWeight, product of:
                2.6057405 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.01806032 = queryNorm
              0.54168665 = fieldWeight in 2487, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2487)
        0.12 = coord(3/25)
    
  4. Oberhauser, O.: Automatisches Klassifizieren : Entwicklungsstand - Methodik - Anwendungsbereiche (2005) 0.04
    0.043361653 = sum of:
      0.043361653 = product of:
        0.3613471 = sum of:
          0.058654524 = weight(abstract_txt:betrifft in 38) [ClassicSimilarity], result of:
            0.058654524 = score(doc=38,freq=1.0), product of:
              0.13980772 = queryWeight, product of:
                1.0090749 = boost
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.01806032 = queryNorm
              0.4195371 = fieldWeight in 38, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.0546875 = fieldNorm(doc=38)
          0.050190613 = weight(abstract_txt:analyse in 38) [ClassicSimilarity], result of:
            0.050190613 = score(doc=38,freq=1.0), product of:
              0.15876514 = queryWeight, product of:
                1.520724 = boost
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.01806032 = queryNorm
              0.3161312 = fieldWeight in 38, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.0546875 = fieldNorm(doc=38)
          0.25250196 = weight(abstract_txt:halbautomatische in 38) [ClassicSimilarity], result of:
            0.25250196 = score(doc=38,freq=1.0), product of:
              0.4661403 = queryWeight, product of:
                2.6057405 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.01806032 = queryNorm
              0.54168665 = fieldWeight in 38, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.0546875 = fieldNorm(doc=38)
        0.12 = coord(3/25)
    
  5. Hotho, A.; Jäschke, R.; Benz, D.; Grahl, M.; Krause, B.; Schmitz, C.; Stumme, G.: Social Bookmarking am Beispiel BibSonomy (2009) 0.04
    0.036895767 = sum of:
      0.036895767 = product of:
        0.30746472 = sum of:
          0.100381225 = weight(abstract_txt:analyse in 4873) [ClassicSimilarity], result of:
            0.100381225 = score(doc=4873,freq=1.0), product of:
              0.15876514 = queryWeight, product of:
                1.520724 = boost
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.01806032 = queryNorm
              0.6322624 = fieldWeight in 4873, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.109375 = fieldNorm(doc=4873)
          0.039859008 = weight(abstract_txt:system in 4873) [ClassicSimilarity], result of:
            0.039859008 = score(doc=4873,freq=1.0), product of:
              0.108064026 = queryWeight, product of:
                1.7743056 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.01806032 = queryNorm
              0.36884624 = fieldWeight in 4873, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.109375 = fieldNorm(doc=4873)
          0.16722447 = weight(abstract_txt:beschreiben in 4873) [ClassicSimilarity], result of:
            0.16722447 = score(doc=4873,freq=1.0), product of:
              0.2231105 = queryWeight, product of:
                1.802739 = boost
                6.8527 = idf(docFreq=126, maxDocs=44218)
                0.01806032 = queryNorm
              0.7495141 = fieldWeight in 4873, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8527 = idf(docFreq=126, maxDocs=44218)
                0.109375 = fieldNorm(doc=4873)
        0.12 = coord(3/25)