Document (#17102)

Author
Panyr, J.
Title
Automatische thematische Textklassifikation und ihre Interpretation in der Dokumentengrobrecherche
Source
Wissensstrukturen und Ordnungsmuster. Proc. der 4. Fachtagung der Gesellschaft für Klassifikation, Salzburg, 16.-19.4.1980. Red.: W. Dahlberg
Imprint
Frankfurt : Indeks
Year
1980
Pages
pp. 284-293
Series
Studien zur Klassifikation; Bd.9
Abstract
For the automatic indexing of natural-language documents in an information system, a method for the automatic thematic hierarchical classification of texts was developed. The resulting ordering structure (a concept network) is offered as a search aid during retrieval. Classification proceeds in four stages: text indexing, formation of priority classes, linking of the concepts, and interlinking of the priority classes. The importance levels produced in this way are the hierarchy levels of the classification. The concept and document groupings generated during the clustering procedure form the nodes of the classification network. The links between nodes of adjacent priority classes represent the paths in this network. The mapping of the search query onto this concept network is used to assess the relevance of the retrieved texts.
Theme
Automatic classification (Automatisches Klassifizieren)

Similar documents (author)

  1. Panyr, J.: Thesaurus und wissensbasierte Systeme - Thesauri und Wissensbasen (1988) 5.44
    5.4375505 = sum of:
      5.4375505 = weight(author_txt:panyr in 22) [ClassicSimilarity], result of:
        5.4375505 = fieldWeight in 22, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.700081 = idf(docFreq=18, maxDocs=41962)
          0.625 = fieldNorm(doc=22)
    
  2. Panyr, J.: Information-Retrieval-Methoden in regelbasierten Expertensystemen (1990) 5.44
    5.4375505 = sum of:
      5.4375505 = weight(author_txt:panyr in 260) [ClassicSimilarity], result of:
        5.4375505 = fieldWeight in 260, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.700081 = idf(docFreq=18, maxDocs=41962)
          0.625 = fieldNorm(doc=260)
    
  3. Panyr, J.: Vom Wissen zur Information : Notwendigkeit der Kooperation der Fachleute aus dem Bereich der Informations-Retrieval-Systeme und der Systeme mit formaler Intelligenz (1988) 5.44
    5.4375505 = sum of:
      5.4375505 = weight(author_txt:panyr in 768) [ClassicSimilarity], result of:
        5.4375505 = fieldWeight in 768, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.700081 = idf(docFreq=18, maxDocs=41962)
          0.625 = fieldNorm(doc=768)
    
  4. Panyr, J.: Die Theorie der Fuzzy-Mengen und Information-Retrieval-Systeme (1986) 5.44
    5.4375505 = sum of:
      5.4375505 = weight(author_txt:panyr in 788) [ClassicSimilarity], result of:
        5.4375505 = fieldWeight in 788, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.700081 = idf(docFreq=18, maxDocs=41962)
          0.625 = fieldNorm(doc=788)
    
  5. Panyr, J.: Probabilistische Modelle in Information-Retrieval-Systemen (1986) 5.44
    5.4375505 = sum of:
      5.4375505 = weight(author_txt:panyr in 1460) [ClassicSimilarity], result of:
        5.4375505 = fieldWeight in 1460, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.700081 = idf(docFreq=18, maxDocs=41962)
          0.625 = fieldNorm(doc=1460)
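
The author-match scores above all follow the same pattern: a single term hit whose weight is the product of tf, idf and fieldNorm. The following is a minimal sketch of that arithmetic, assuming the standard Lucene ClassicSimilarity definitions (tf as the square root of the term frequency, idf as 1 + ln(maxDocs / (docFreq + 1))). The function names are illustrative and not part of any library; the fieldNorm value is simply copied from the explain output rather than derived.

```python
import math


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as listed in the explain tree."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values taken from the author_txt:panyr entries above.
author_score = field_weight(freq=1.0, doc_freq=18, max_docs=41962, field_norm=0.625)
print(author_score)
```

Because every author match has the same frequency, document frequency and field norm, all five entries receive the same score; small differences in the last digits against the listing would be expected, since Lucene computes in single precision.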
    

Similar documents (content)

  1. Manecke, H.-J.: Klassifikation, Klassieren (2004) 0.21
    0.20550126 = sum of:
      0.20550126 = product of:
        1.2843829 = sum of:
          0.03825348 = weight(abstract_txt:erfolgt in 3903) [ClassicSimilarity], result of:
            0.03825348 = score(doc=3903,freq=1.0), product of:
              0.118867755 = queryWeight, product of:
                1.028569 = boost
                6.865396 = idf(docFreq=118, maxDocs=41962)
                0.016833136 = queryNorm
              0.32181543 = fieldWeight in 3903, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.865396 = idf(docFreq=118, maxDocs=41962)
                0.046875 = fieldNorm(doc=3903)
          0.018413924 = weight(abstract_txt:wird in 3903) [ClassicSimilarity], result of:
            0.018413924 = score(doc=3903,freq=2.0), product of:
              0.073009774 = queryWeight, product of:
                1.1400054 = boost
                3.8046005 = idf(docFreq=2539, maxDocs=41962)
                0.016833136 = queryNorm
              0.25221175 = fieldWeight in 3903, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.8046005 = idf(docFreq=2539, maxDocs=41962)
                0.046875 = fieldNorm(doc=3903)
          0.0625729 = weight(abstract_txt:begriffs in 3903) [ClassicSimilarity], result of:
            0.0625729 = score(doc=3903,freq=1.0), product of:
              0.16502166 = queryWeight, product of:
                1.2119142 = boost
                8.089171 = idf(docFreq=34, maxDocs=41962)
                0.016833136 = queryNorm
              0.3791799 = fieldWeight in 3903, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.089171 = idf(docFreq=34, maxDocs=41962)
                0.046875 = fieldNorm(doc=3903)
          1.1651427 = weight(title_txt:klassifikation in 3903) [ClassicSimilarity], result of:
            1.1651427 = score(doc=3903,freq=1.0), product of:
              0.29736105 = queryWeight, product of:
                2.8177605 = boost
                6.2692413 = idf(docFreq=215, maxDocs=41962)
                0.016833136 = queryNorm
              3.9182758 = fieldWeight in 3903, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2692413 = idf(docFreq=215, maxDocs=41962)
                0.625 = fieldNorm(doc=3903)
        0.16 = coord(4/25)
    
  2. Hoppe, A.: Klassifikation innersprachlicher semantischer Komplexe (1978) 0.20
    0.20225708 = sum of:
      0.20225708 = product of:
        1.2641068 = sum of:
          0.100439504 = weight(abstract_txt:benutzt in 148) [ClassicSimilarity], result of:
            0.100439504 = score(doc=148,freq=1.0), product of:
              0.14251693 = queryWeight, product of:
                1.1262496 = boost
                7.5173855 = idf(docFreq=61, maxDocs=41962)
                0.016833136 = queryNorm
              0.7047549 = fieldWeight in 148, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5173855 = idf(docFreq=61, maxDocs=41962)
                0.09375 = fieldNorm(doc=148)
          0.036827847 = weight(abstract_txt:wird in 148) [ClassicSimilarity], result of:
            0.036827847 = score(doc=148,freq=2.0), product of:
              0.073009774 = queryWeight, product of:
                1.1400054 = boost
                3.8046005 = idf(docFreq=2539, maxDocs=41962)
                0.016833136 = queryNorm
              0.5044235 = fieldWeight in 148, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.8046005 = idf(docFreq=2539, maxDocs=41962)
                0.09375 = fieldNorm(doc=148)
          0.19472541 = weight(abstract_txt:verknüpfung in 148) [ClassicSimilarity], result of:
            0.19472541 = score(doc=148,freq=1.0), product of:
              0.2791827 = queryWeight, product of:
                2.2292597 = boost
                7.439827 = idf(docFreq=66, maxDocs=41962)
                0.016833136 = queryNorm
              0.6974838 = fieldWeight in 148, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.439827 = idf(docFreq=66, maxDocs=41962)
                0.09375 = fieldNorm(doc=148)
          0.93211406 = weight(title_txt:klassifikation in 148) [ClassicSimilarity], result of:
            0.93211406 = score(doc=148,freq=1.0), product of:
              0.29736105 = queryWeight, product of:
                2.8177605 = boost
                6.2692413 = idf(docFreq=215, maxDocs=41962)
                0.016833136 = queryNorm
              3.1346207 = fieldWeight in 148, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2692413 = idf(docFreq=215, maxDocs=41962)
                0.5 = fieldNorm(doc=148)
        0.16 = coord(4/25)
    
  3. Degens, P.O.: Hierarchische Klassifikation (1980) 0.19
    0.18540521 = sum of:
      0.18540521 = product of:
        1.5450435 = sum of:
          0.17845002 = weight(abstract_txt:hierarchischen in 503) [ClassicSimilarity], result of:
            0.17845002 = score(doc=503,freq=1.0), product of:
              0.18864414 = queryWeight, product of:
                1.2957555 = boost
                8.6487875 = idf(docFreq=19, maxDocs=41962)
                0.016833136 = queryNorm
              0.9459611 = fieldWeight in 503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.6487875 = idf(docFreq=19, maxDocs=41962)
                0.109375 = fieldNorm(doc=503)
          0.2014508 = weight(abstract_txt:gewonnene in 503) [ClassicSimilarity], result of:
            0.2014508 = score(doc=503,freq=1.0), product of:
              0.2045243 = queryWeight, product of:
                1.3491923 = boost
                9.005463 = idf(docFreq=13, maxDocs=41962)
                0.016833136 = queryNorm
              0.9849725 = fieldWeight in 503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.005463 = idf(docFreq=13, maxDocs=41962)
                0.109375 = fieldNorm(doc=503)
          1.1651427 = weight(title_txt:klassifikation in 503) [ClassicSimilarity], result of:
            1.1651427 = score(doc=503,freq=1.0), product of:
              0.29736105 = queryWeight, product of:
                2.8177605 = boost
                6.2692413 = idf(docFreq=215, maxDocs=41962)
                0.016833136 = queryNorm
              3.9182758 = fieldWeight in 503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2692413 = idf(docFreq=215, maxDocs=41962)
                0.625 = fieldNorm(doc=503)
        0.12 = coord(3/25)
    
  4. Dahmen, E.: Klassifikation als Ordnungssystem im elektronischen Pressearchiv (2003) 0.18
    0.1790506 = sum of:
      0.1790506 = product of:
        0.89525294 = sum of:
          0.038827866 = weight(abstract_txt:natürlich in 2514) [ClassicSimilarity], result of:
            0.038827866 = score(doc=2514,freq=1.0), product of:
              0.120054685 = queryWeight, product of:
                1.0336915 = boost
                6.8995876 = idf(docFreq=114, maxDocs=41962)
                0.016833136 = queryNorm
              0.32341817 = fieldWeight in 2514, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8995876 = idf(docFreq=114, maxDocs=41962)
                0.046875 = fieldNorm(doc=2514)
          0.018413924 = weight(abstract_txt:wird in 2514) [ClassicSimilarity], result of:
            0.018413924 = score(doc=2514,freq=2.0), product of:
              0.073009774 = queryWeight, product of:
                1.1400054 = boost
                3.8046005 = idf(docFreq=2539, maxDocs=41962)
                0.016833136 = queryNorm
              0.25221175 = fieldWeight in 2514, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.8046005 = idf(docFreq=2539, maxDocs=41962)
                0.046875 = fieldNorm(doc=2514)
          0.063248 = weight(abstract_txt:thematischen in 2514) [ClassicSimilarity], result of:
            0.063248 = score(doc=2514,freq=1.0), product of:
              0.1662065 = queryWeight, product of:
                1.2162571 = boost
                8.118159 = idf(docFreq=33, maxDocs=41962)
                0.016833136 = queryNorm
              0.3805387 = fieldWeight in 2514, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.118159 = idf(docFreq=33, maxDocs=41962)
                0.046875 = fieldNorm(doc=2514)
          0.07567761 = weight(abstract_txt:texte in 2514) [ClassicSimilarity], result of:
            0.07567761 = score(doc=2514,freq=1.0), product of:
              0.23601432 = queryWeight, product of:
                2.0496776 = boost
                6.8404984 = idf(docFreq=121, maxDocs=41962)
                0.016833136 = queryNorm
              0.32064837 = fieldWeight in 2514, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8404984 = idf(docFreq=121, maxDocs=41962)
                0.046875 = fieldNorm(doc=2514)
          0.69908553 = weight(title_txt:klassifikation in 2514) [ClassicSimilarity], result of:
            0.69908553 = score(doc=2514,freq=1.0), product of:
              0.29736105 = queryWeight, product of:
                2.8177605 = boost
                6.2692413 = idf(docFreq=215, maxDocs=41962)
                0.016833136 = queryNorm
              2.3509655 = fieldWeight in 2514, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2692413 = idf(docFreq=215, maxDocs=41962)
                0.375 = fieldNorm(doc=2514)
        0.2 = coord(5/25)
    
  5. Hoffmann, R.: Entwicklung einer benutzerunterstützten automatisierten Klassifikation von Web - Dokumenten : Untersuchung gegenwärtiger Methoden zur automatisierten Dokumentklassifikation und Implementierung eines Prototyps zum verbesserten Information Retrieval für das xFIND System (2002) 0.16
    0.15569936 = sum of:
      0.15569936 = product of:
        0.6487473 = sum of:
          0.08585319 = weight(abstract_txt:automatischen in 198) [ClassicSimilarity], result of:
            0.08585319 = score(doc=198,freq=5.0), product of:
              0.11916015 = queryWeight, product of:
                1.0298333 = boost
                6.873835 = idf(docFreq=117, maxDocs=41962)
                0.016833136 = queryNorm
              0.72048575 = fieldWeight in 198, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.873835 = idf(docFreq=117, maxDocs=41962)
                0.046875 = fieldNorm(doc=198)
          0.01302061 = weight(abstract_txt:wird in 198) [ClassicSimilarity], result of:
            0.01302061 = score(doc=198,freq=1.0), product of:
              0.073009774 = queryWeight, product of:
                1.1400054 = boost
                3.8046005 = idf(docFreq=2539, maxDocs=41962)
                0.016833136 = queryNorm
              0.17834064 = fieldWeight in 198, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8046005 = idf(docFreq=2539, maxDocs=41962)
                0.046875 = fieldNorm(doc=198)
          0.05791639 = weight(abstract_txt:thematische in 198) [ClassicSimilarity], result of:
            0.05791639 = score(doc=198,freq=1.0), product of:
              0.15672962 = queryWeight, product of:
                1.1810735 = boost
                7.8833194 = idf(docFreq=42, maxDocs=41962)
                0.016833136 = queryNorm
              0.3695306 = fieldWeight in 198, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8833194 = idf(docFreq=42, maxDocs=41962)
                0.046875 = fieldNorm(doc=198)
          0.063248 = weight(abstract_txt:thematischen in 198) [ClassicSimilarity], result of:
            0.063248 = score(doc=198,freq=1.0), product of:
              0.1662065 = queryWeight, product of:
                1.2162571 = boost
                8.118159 = idf(docFreq=33, maxDocs=41962)
                0.016833136 = queryNorm
              0.3805387 = fieldWeight in 198, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.118159 = idf(docFreq=33, maxDocs=41962)
                0.046875 = fieldNorm(doc=198)
          0.07916636 = weight(abstract_txt:automatische in 198) [ClassicSimilarity], result of:
            0.07916636 = score(doc=198,freq=1.0), product of:
              0.24321324 = queryWeight, product of:
                2.0807025 = boost
                6.9440393 = idf(docFreq=109, maxDocs=41962)
                0.016833136 = queryNorm
              0.32550186 = fieldWeight in 198, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9440393 = idf(docFreq=109, maxDocs=41962)
                0.046875 = fieldNorm(doc=198)
          0.34954277 = weight(title_txt:klassifikation in 198) [ClassicSimilarity], result of:
            0.34954277 = score(doc=198,freq=1.0), product of:
              0.29736105 = queryWeight, product of:
                2.8177605 = boost
                6.2692413 = idf(docFreq=215, maxDocs=41962)
                0.016833136 = queryNorm
              1.1754827 = fieldWeight in 198, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2692413 = idf(docFreq=215, maxDocs=41962)
                0.1875 = fieldNorm(doc=198)
        0.24 = coord(6/25)
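
The content-match scores combine several term hits: each term contributes queryWeight times fieldWeight, the contributions are summed, and the sum is scaled by the coord factor (matching clauses divided by total query clauses). The sketch below reproduces this for the first content match (doc 3903), again assuming the standard ClassicSimilarity formulas. The helper names are hypothetical; boost, queryNorm and fieldNorm are copied from the explain output, not derived.

```python
import math

MAX_DOCS = 41962          # maxDocs from the explain output
QUERY_NORM = 0.016833136  # queryNorm from the explain output
TOTAL_CLAUSES = 25        # denominator of coord(4/25)


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, boost: float, field_norm: float) -> float:
    """Score of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq, MAX_DOCS)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


# (freq, docFreq, boost, fieldNorm) for the four terms matching doc 3903.
matches = [
    (1.0, 118, 1.028569, 0.046875),    # abstract_txt:erfolgt
    (2.0, 2539, 1.1400054, 0.046875),  # abstract_txt:wird
    (1.0, 34, 1.2119142, 0.046875),    # abstract_txt:begriffs
    (1.0, 215, 2.8177605, 0.625),      # title_txt:klassifikation
]

raw_sum = sum(term_score(*m) for m in matches)
coord = len(matches) / TOTAL_CLAUSES
final_score = coord * raw_sum
print(raw_sum, coord, final_score)
```

The title match dominates the sum because its field norm (0.625) is much larger than that of the abstract field (0.046875), which is why all five content matches are documents with "Klassifikation" in the title.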