Document (#9153)

Author
Schulze, U.
Title
Erfahrungen bei der Anwendung automatischer Klassifizierungsverfahren zur Inhaltsanalyse einer Dokumentenmenge
Source
Kooperation in der Klassifikation I. Proc. der Sekt.1-3 der 2. Fachtagung der Gesellschaft für Klassifikation, Frankfurt-Hoechst, 6.-7.4.1978. Bearb.: W. Dahlberg
Imprint
Frankfurt : Gesellschaft für Klassifikation
Year
1978
Pages
S.166-185
Series
Studien zur Klassifikation; Bd.2
Abstract
Die der Analyse zugrundeliegende Dokumentenmenge besteht aus 1.000 Entscheidungen des Bundesverfassungsgerichtes, deren volle Texte maschinenlesbar zur Verfügung standen. Vorgestellt werden die Anwendung eines iterativen Centroidverfahrens auf etwa 1.000 Wörter und die Anwendung eines Single-Linkage-Verfahrens in einer nicht-hierarchischen Variante, sowie die auf der Graphentheorie basierenden Verfahren und die verschiedener Ähnlichkeitsfunktionen und der Einfluß auf die Ergebnisse
Theme
Automatisches Klassifizieren

Similar documents (author)

  1. Schulze, E.: ¬Der Terminus : Eigenschaften und Wesen sowie seine Abgrenzung von anderen Lexemarten (1993) 5.59
    5.585293 = sum of:
      5.585293 = weight(author_txt:schulze in 4696) [ClassicSimilarity], result of:
        5.585293 = score(doc=4696,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.936469 = idf(docFreq=14, maxDocs=41962)
            0.111901015 = queryNorm
          5.5852933 = fieldWeight in 4696, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.936469 = idf(docFreq=14, maxDocs=41962)
            0.625 = fieldNorm(doc=4696)
    
  2. Schulze, G.: ¬Die Rolle der Europäischen Union beim Aufbau transeuropäischer Netze (1996) 5.59
    5.585293 = sum of:
      5.585293 = weight(author_txt:schulze in 6105) [ClassicSimilarity], result of:
        5.585293 = score(doc=6105,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.936469 = idf(docFreq=14, maxDocs=41962)
            0.111901015 = queryNorm
          5.5852933 = fieldWeight in 6105, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.936469 = idf(docFreq=14, maxDocs=41962)
            0.625 = fieldNorm(doc=6105)
    
  3. Schulze, S.: Ahnenforschung und Alterszucker : Noch sind sie eine Minderheit - Senioren surfen durchs Internet, sammeln Informationen und schließen online Freundschaften (1998) 5.59
    5.585293 = sum of:
      5.585293 = weight(author_txt:schulze in 1888) [ClassicSimilarity], result of:
        5.585293 = score(doc=1888,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.936469 = idf(docFreq=14, maxDocs=41962)
            0.111901015 = queryNorm
          5.5852933 = fieldWeight in 1888, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.936469 = idf(docFreq=14, maxDocs=41962)
            0.625 = fieldNorm(doc=1888)
    
  4. Schulze, M.: ¬Das Projekt "nestor" : Aufbau eines Kompetenznetzwerks Langzeitarchivierung und Langzeitverfügbarkeit digitaler Ressourcen für Deutschland (2004) 5.59
    5.585293 = sum of:
      5.585293 = weight(author_txt:schulze in 5948) [ClassicSimilarity], result of:
        5.585293 = score(doc=5948,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.936469 = idf(docFreq=14, maxDocs=41962)
            0.111901015 = queryNorm
          5.5852933 = fieldWeight in 5948, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.936469 = idf(docFreq=14, maxDocs=41962)
            0.625 = fieldNorm(doc=5948)
    
  5. Schulze, V.: ¬Die Klassifikation der Kunstgeschichte : Geschichte der Ordnungsgrundsätze und Erörterung des Entwurfs der Systematik 'Kunst' für die Universitätsbibliothek Bremen (1967) 5.59
    5.585293 = sum of:
      5.585293 = weight(author_txt:schulze in 6682) [ClassicSimilarity], result of:
        5.585293 = score(doc=6682,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.936469 = idf(docFreq=14, maxDocs=41962)
            0.111901015 = queryNorm
          5.5852933 = fieldWeight in 6682, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.936469 = idf(docFreq=14, maxDocs=41962)
            0.625 = fieldNorm(doc=6682)
    

Similar documents (content)

  1. Lepsky, K.: Automatische Indexierung zur Erschließung deutschsprachiger Dokumente (1999) 0.09
    0.092973806 = sum of:
      0.092973806 = product of:
        0.5810863 = sum of:
          0.08756472 = weight(abstract_txt:texte in 6070) [ClassicSimilarity], result of:
            0.08756472 = score(doc=6070,freq=1.0), product of:
              0.13654321 = queryWeight, product of:
                1.1750458 = boost
                6.8404984 = idf(docFreq=121, maxDocs=41962)
                0.016987426 = queryNorm
              0.64129674 = fieldWeight in 6070, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8404984 = idf(docFreq=121, maxDocs=41962)
                0.09375 = fieldNorm(doc=6070)
          0.25631943 = weight(abstract_txt:verfahrens in 6070) [ClassicSimilarity], result of:
            0.25631943 = score(doc=6070,freq=3.0), product of:
              0.19373049 = queryWeight, product of:
                1.3996477 = boost
                8.148012 = idf(docFreq=32, maxDocs=41962)
                0.016987426 = queryNorm
              1.3230723 = fieldWeight in 6070, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.148012 = idf(docFreq=32, maxDocs=41962)
                0.09375 = fieldNorm(doc=6070)
          0.05309182 = weight(abstract_txt:eines in 6070) [ClassicSimilarity], result of:
            0.05309182 = score(doc=6070,freq=1.0), product of:
              0.12323832 = queryWeight, product of:
                1.5787292 = boost
                4.595265 = idf(docFreq=1151, maxDocs=41962)
                0.016987426 = queryNorm
              0.4308061 = fieldWeight in 6070, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.595265 = idf(docFreq=1151, maxDocs=41962)
                0.09375 = fieldNorm(doc=6070)
          0.18411027 = weight(abstract_txt:anwendung in 6070) [ClassicSimilarity], result of:
            0.18411027 = score(doc=6070,freq=1.0), product of:
              0.3232038 = queryWeight, product of:
                3.131256 = boost
                6.076175 = idf(docFreq=261, maxDocs=41962)
                0.016987426 = queryNorm
              0.5696414 = fieldWeight in 6070, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.076175 = idf(docFreq=261, maxDocs=41962)
                0.09375 = fieldNorm(doc=6070)
        0.16 = coord(4/25)
    
  2. Scheele, M.: ¬Die automatische Indexierung beliebiger Titel und Schlagwörter auf der Grundlage eines Modells für einen Gesamtthesaurus des Wissens (1983) 0.09
    0.08990728 = sum of:
      0.08990728 = product of:
        0.44953638 = sum of:
          0.0681662 = weight(abstract_txt:erfahrungen in 524) [ClassicSimilarity], result of:
            0.0681662 = score(doc=524,freq=1.0), product of:
              0.11554826 = queryWeight, product of:
                1.0809397 = boost
                6.2926617 = idf(docFreq=210, maxDocs=41962)
                0.016987426 = queryNorm
              0.58993703 = fieldWeight in 524, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2926617 = idf(docFreq=210, maxDocs=41962)
                0.09375 = fieldNorm(doc=524)
          0.12274185 = weight(abstract_txt:wörter in 524) [ClassicSimilarity], result of:
            0.12274185 = score(doc=524,freq=1.0), product of:
              0.17101957 = queryWeight, product of:
                1.3150512 = boost
                7.6555357 = idf(docFreq=53, maxDocs=41962)
                0.016987426 = queryNorm
              0.71770644 = fieldWeight in 524, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6555357 = idf(docFreq=53, maxDocs=41962)
                0.09375 = fieldNorm(doc=524)
          0.04634508 = weight(abstract_txt:einer in 524) [ClassicSimilarity], result of:
            0.04634508 = score(doc=524,freq=2.0), product of:
              0.089341484 = queryWeight, product of:
                1.344192 = boost
                3.912589 = idf(docFreq=2279, maxDocs=41962)
                0.016987426 = queryNorm
              0.5187409 = fieldWeight in 524, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.912589 = idf(docFreq=2279, maxDocs=41962)
                0.09375 = fieldNorm(doc=524)
          0.15919144 = weight(abstract_txt:automatischer in 524) [ClassicSimilarity], result of:
            0.15919144 = score(doc=524,freq=1.0), product of:
              0.20339043 = queryWeight, product of:
                1.4341184 = boost
                8.348682 = idf(docFreq=26, maxDocs=41962)
                0.016987426 = queryNorm
              0.782689 = fieldWeight in 524, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.348682 = idf(docFreq=26, maxDocs=41962)
                0.09375 = fieldNorm(doc=524)
          0.05309182 = weight(abstract_txt:eines in 524) [ClassicSimilarity], result of:
            0.05309182 = score(doc=524,freq=1.0), product of:
              0.12323832 = queryWeight, product of:
                1.5787292 = boost
                4.595265 = idf(docFreq=1151, maxDocs=41962)
                0.016987426 = queryNorm
              0.4308061 = fieldWeight in 524, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.595265 = idf(docFreq=1151, maxDocs=41962)
                0.09375 = fieldNorm(doc=524)
        0.2 = coord(5/25)
    
  3. Umlauf, K.: Sacherschließung auf der VLBPlus-CD-ROM durch Klassifikation : Die Warengruppen-Systematik des Buchhandels (2001) 0.09
    0.08712251 = sum of:
      0.08712251 = product of:
        0.54451567 = sum of:
          0.044976275 = weight(abstract_txt:analyse in 3405) [ClassicSimilarity], result of:
            0.044976275 = score(doc=3405,freq=1.0), product of:
              0.098891854 = queryWeight, product of:
                5.8214736 = idf(docFreq=337, maxDocs=41962)
                0.016987426 = queryNorm
              0.45480263 = fieldWeight in 3405, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8214736 = idf(docFreq=337, maxDocs=41962)
                0.078125 = fieldNorm(doc=3405)
          0.038620904 = weight(abstract_txt:einer in 3405) [ClassicSimilarity], result of:
            0.038620904 = score(doc=3405,freq=2.0), product of:
              0.089341484 = queryWeight, product of:
                1.344192 = boost
                3.912589 = idf(docFreq=2279, maxDocs=41962)
                0.016987426 = queryNorm
              0.43228412 = fieldWeight in 3405, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.912589 = idf(docFreq=2279, maxDocs=41962)
                0.078125 = fieldNorm(doc=3405)
          0.24394247 = weight(abstract_txt:1.000 in 3405) [ClassicSimilarity], result of:
            0.24394247 = score(doc=3405,freq=1.0), product of:
              0.38462704 = queryWeight, product of:
                2.7890394 = boost
                8.118159 = idf(docFreq=33, maxDocs=41962)
                0.016987426 = queryNorm
              0.6342312 = fieldWeight in 3405, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.118159 = idf(docFreq=33, maxDocs=41962)
                0.078125 = fieldNorm(doc=3405)
          0.21697603 = weight(abstract_txt:anwendung in 3405) [ClassicSimilarity], result of:
            0.21697603 = score(doc=3405,freq=2.0), product of:
              0.3232038 = queryWeight, product of:
                3.131256 = boost
                6.076175 = idf(docFreq=261, maxDocs=41962)
                0.016987426 = queryNorm
              0.67132884 = fieldWeight in 3405, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.076175 = idf(docFreq=261, maxDocs=41962)
                0.078125 = fieldNorm(doc=3405)
        0.16 = coord(4/25)
    
  4. Nöther, I.: Modell einer Konkordanz-Klassifikation für systematische Kataloge : T.1-2 (1994) 0.08
    0.081984214 = sum of:
      0.081984214 = product of:
        0.40992108 = sum of:
          0.054136775 = weight(abstract_txt:etwa in 3549) [ClassicSimilarity], result of:
            0.054136775 = score(doc=3549,freq=1.0), product of:
              0.0990936 = queryWeight, product of:
                1.0010195 = boost
                5.827409 = idf(docFreq=335, maxDocs=41962)
                0.016987426 = queryNorm
              0.5463196 = fieldWeight in 3549, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.827409 = idf(docFreq=335, maxDocs=41962)
                0.09375 = fieldNorm(doc=3549)
          0.068948545 = weight(abstract_txt:besteht in 3549) [ClassicSimilarity], result of:
            0.068948545 = score(doc=3549,freq=1.0), product of:
              0.11643068 = queryWeight, product of:
                1.0850593 = boost
                6.3166437 = idf(docFreq=205, maxDocs=41962)
                0.016987426 = queryNorm
              0.5921854 = fieldWeight in 3549, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3166437 = idf(docFreq=205, maxDocs=41962)
                0.09375 = fieldNorm(doc=3549)
          0.056760907 = weight(abstract_txt:einer in 3549) [ClassicSimilarity], result of:
            0.056760907 = score(doc=3549,freq=3.0), product of:
              0.089341484 = queryWeight, product of:
                1.344192 = boost
                3.912589 = idf(docFreq=2279, maxDocs=41962)
                0.016987426 = queryNorm
              0.6353253 = fieldWeight in 3549, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.912589 = idf(docFreq=2279, maxDocs=41962)
                0.09375 = fieldNorm(doc=3549)
          0.17698303 = weight(abstract_txt:variante in 3549) [ClassicSimilarity], result of:
            0.17698303 = score(doc=3549,freq=1.0), product of:
              0.21827558 = queryWeight, product of:
                1.4856699 = boost
                8.6487875 = idf(docFreq=19, maxDocs=41962)
                0.016987426 = queryNorm
              0.8108238 = fieldWeight in 3549, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.6487875 = idf(docFreq=19, maxDocs=41962)
                0.09375 = fieldNorm(doc=3549)
          0.05309182 = weight(abstract_txt:eines in 3549) [ClassicSimilarity], result of:
            0.05309182 = score(doc=3549,freq=1.0), product of:
              0.12323832 = queryWeight, product of:
                1.5787292 = boost
                4.595265 = idf(docFreq=1151, maxDocs=41962)
                0.016987426 = queryNorm
              0.4308061 = fieldWeight in 3549, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.595265 = idf(docFreq=1151, maxDocs=41962)
                0.09375 = fieldNorm(doc=3549)
        0.2 = coord(5/25)
    
  5. Kompakt Brockhaus multimedial : das digitale Lexikon von A bis Z (1996) 0.08
    0.08112974 = sum of:
      0.08112974 = product of:
        1.0141218 = sum of:
          0.2335059 = weight(abstract_txt:texte in 6093) [ClassicSimilarity], result of:
            0.2335059 = score(doc=6093,freq=1.0), product of:
              0.13654321 = queryWeight, product of:
                1.1750458 = boost
                6.8404984 = idf(docFreq=121, maxDocs=41962)
                0.016987426 = queryNorm
              1.7101246 = fieldWeight in 6093, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8404984 = idf(docFreq=121, maxDocs=41962)
                0.25 = fieldNorm(doc=6093)
          0.7806159 = weight(abstract_txt:1.000 in 6093) [ClassicSimilarity], result of:
            0.7806159 = score(doc=6093,freq=1.0), product of:
              0.38462704 = queryWeight, product of:
                2.7890394 = boost
                8.118159 = idf(docFreq=33, maxDocs=41962)
                0.016987426 = queryNorm
              2.0295398 = fieldWeight in 6093, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.118159 = idf(docFreq=33, maxDocs=41962)
                0.25 = fieldNorm(doc=6093)
        0.08 = coord(2/25)