Document (#18793)

Author
Tzeras, K.
Title
Zur Aufwandsabschätzung bei der Entwicklung eines Indexierungswörterbuches
Source
Information retrieval: GI/GMD-Workshop, Darmstadt, 23.-24.6.1991: Proceedings. Ed.: N. Fuhr
Imprint
Berlin : Springer
Year
1991
Pages
pp. 23-37
Series
Informatik-Fachberichte; 289
Abstract
Automatic indexing with a predefined descriptor system requires a dictionary that links as many technical terms of the application domain as possible to descriptors by means of relations. If the relations recorded in such an indexing dictionary are obtained by processing texts, a relationship arises between the number of texts and the size and performance of the dictionary. Describing such relationships is of particular interest before the development of an automatic indexing system begins. H. Hüther has addressed this problem in several papers and derived various estimation methods theoretically. This paper examines the performance and applicability of one of the methods he proposed for estimating the size of an indexing dictionary as a function of the number of underlying texts.
Theme
Automatisches Indexieren
Object
AIR/PHYS

Similar documents (content)

  1. Albrecht, R.: Digitale Auskunft im Verbund : Ein Jahr InfoPoint Rhein-Main (2005) 0.09
    0.08616908 = sum of:
      0.08616908 = product of:
        0.43084538 = sum of:
          0.025724538 = weight(abstract_txt:einem in 4305) [ClassicSimilarity], result of:
            0.025724538 = score(doc=4305,freq=1.0), product of:
              0.09492127 = queryWeight, product of:
                1.1752033 = boost
                4.3361473 = idf(docFreq=1572, maxDocs=44218)
                0.018627154 = queryNorm
              0.2710092 = fieldWeight in 4305, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3361473 = idf(docFreq=1572, maxDocs=44218)
                0.0625 = fieldNorm(doc=4305)
          0.040275622 = weight(abstract_txt:entwicklung in 4305) [ClassicSimilarity], result of:
            0.040275622 = score(doc=4305,freq=1.0), product of:
              0.12798527 = queryWeight, product of:
                1.3646184 = boost
                5.0350323 = idf(docFreq=781, maxDocs=44218)
                0.018627154 = queryNorm
              0.31468952 = fieldWeight in 4305, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0350323 = idf(docFreq=781, maxDocs=44218)
                0.0625 = fieldNorm(doc=4305)
          0.107640155 = weight(abstract_txt:anzahl in 4305) [ClassicSimilarity], result of:
            0.107640155 = score(doc=4305,freq=1.0), product of:
              0.24647981 = queryWeight, product of:
                1.8937469 = boost
                6.987357 = idf(docFreq=110, maxDocs=44218)
                0.018627154 = queryNorm
              0.43670982 = fieldWeight in 4305, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.987357 = idf(docFreq=110, maxDocs=44218)
                0.0625 = fieldNorm(doc=4305)
          0.1524415 = weight(abstract_txt:größe in 4305) [ClassicSimilarity], result of:
            0.1524415 = score(doc=4305,freq=1.0), product of:
              0.3108379 = queryWeight, product of:
                2.1266608 = boost
                7.84674 = idf(docFreq=46, maxDocs=44218)
                0.018627154 = queryNorm
              0.49042124 = fieldWeight in 4305, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.84674 = idf(docFreq=46, maxDocs=44218)
                0.0625 = fieldNorm(doc=4305)
          0.10476355 = weight(abstract_txt:eines in 4305) [ClassicSimilarity], result of:
            0.10476355 = score(doc=4305,freq=3.0), product of:
              0.21146649 = queryWeight, product of:
                2.4806588 = boost
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.018627154 = queryNorm
              0.49541444 = fieldWeight in 4305, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.0625 = fieldNorm(doc=4305)
        0.2 = coord(5/25)
    
  2. Meyer, A.: Begriffsrelationen im Kategoriensystem der Wikipedia : Entwicklung eines Relationeninventars zur kollaborativen Anwendung (2010) 0.08
    0.08254921 = sum of:
      0.08254921 = product of:
        0.51593256 = sum of:
          0.0750128 = weight(abstract_txt:deskriptoren in 4429) [ClassicSimilarity], result of:
            0.0750128 = score(doc=4429,freq=1.0), product of:
              0.15377252 = queryWeight, product of:
                1.0576832 = boost
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.018627154 = queryNorm
              0.4878167 = fieldWeight in 4429, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.0625 = fieldNorm(doc=4429)
          0.040275622 = weight(abstract_txt:entwicklung in 4429) [ClassicSimilarity], result of:
            0.040275622 = score(doc=4429,freq=1.0), product of:
              0.12798527 = queryWeight, product of:
                1.3646184 = boost
                5.0350323 = idf(docFreq=781, maxDocs=44218)
                0.018627154 = queryNorm
              0.31468952 = fieldWeight in 4429, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0350323 = idf(docFreq=781, maxDocs=44218)
                0.0625 = fieldNorm(doc=4429)
          0.26539496 = weight(abstract_txt:relationen in 4429) [ClassicSimilarity], result of:
            0.26539496 = score(doc=4429,freq=4.0), product of:
              0.28338286 = queryWeight, product of:
                2.0305703 = boost
                7.4921947 = idf(docFreq=66, maxDocs=44218)
                0.018627154 = queryNorm
              0.93652433 = fieldWeight in 4429, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.4921947 = idf(docFreq=66, maxDocs=44218)
                0.0625 = fieldNorm(doc=4429)
          0.13524917 = weight(abstract_txt:eines in 4429) [ClassicSimilarity], result of:
            0.13524917 = score(doc=4429,freq=5.0), product of:
              0.21146649 = queryWeight, product of:
                2.4806588 = boost
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.018627154 = queryNorm
              0.6395773 = fieldWeight in 4429, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.0625 = fieldNorm(doc=4429)
        0.16 = coord(4/25)
    
  3. Scholz, O.R.: Bild, Darstellung, Zeichen : Philosophische Theorien bildlicher Darstellung (2004) 0.07
    0.07358607 = sum of:
      0.07358607 = product of:
        0.36793032 = sum of:
          0.079246216 = weight(abstract_txt:theoretische in 1436) [ClassicSimilarity], result of:
            0.079246216 = score(doc=1436,freq=1.0), product of:
              0.1374572 = queryWeight, product of:
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.018627154 = queryNorm
              0.57651556 = fieldWeight in 1436, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.078125 = fieldNorm(doc=1436)
          0.081489764 = weight(abstract_txt:ergibt in 1436) [ClassicSimilarity], result of:
            0.081489764 = score(doc=1436,freq=1.0), product of:
              0.14003949 = queryWeight, product of:
                1.0093493 = boost
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.018627154 = queryNorm
              0.5819056 = fieldWeight in 1436, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.078125 = fieldNorm(doc=1436)
          0.0994321 = weight(abstract_txt:verbindet in 1436) [ClassicSimilarity], result of:
            0.0994321 = score(doc=1436,freq=1.0), product of:
              0.15990654 = queryWeight, product of:
                1.0785725 = boost
                7.9592175 = idf(docFreq=41, maxDocs=44218)
                0.018627154 = queryNorm
              0.6218139 = fieldWeight in 1436, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9592175 = idf(docFreq=41, maxDocs=44218)
                0.078125 = fieldNorm(doc=1436)
          0.032155674 = weight(abstract_txt:einem in 1436) [ClassicSimilarity], result of:
            0.032155674 = score(doc=1436,freq=1.0), product of:
              0.09492127 = queryWeight, product of:
                1.1752033 = boost
                4.3361473 = idf(docFreq=1572, maxDocs=44218)
                0.018627154 = queryNorm
              0.3387615 = fieldWeight in 1436, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3361473 = idf(docFreq=1572, maxDocs=44218)
                0.078125 = fieldNorm(doc=1436)
          0.075606585 = weight(abstract_txt:eines in 1436) [ClassicSimilarity], result of:
            0.075606585 = score(doc=1436,freq=1.0), product of:
              0.21146649 = queryWeight, product of:
                2.4806588 = boost
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.018627154 = queryNorm
              0.3575346 = fieldWeight in 1436, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.078125 = fieldNorm(doc=1436)
        0.2 = coord(5/25)
    
  4. Coulon, C.-H.: Die Rolle des Anpassungswissens im CBR : am Beispiel der Ausnutzung von Struktur im CBR (1996) 0.07
    0.06544335 = sum of:
      0.06544335 = product of:
        0.5453613 = sum of:
          0.13114074 = weight(abstract_txt:benötigt in 5203) [ClassicSimilarity], result of:
            0.13114074 = score(doc=5203,freq=1.0), product of:
              0.14058109 = queryWeight, product of:
                1.0112993 = boost
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.018627154 = queryNorm
              0.9328476 = fieldWeight in 5203, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.125 = fieldNorm(doc=5203)
          0.29325 = weight(abstract_txt:leistungsfähigkeit in 5203) [ClassicSimilarity], result of:
            0.29325 = score(doc=5203,freq=1.0), product of:
              0.30287993 = queryWeight, product of:
                2.0992613 = boost
                7.7456436 = idf(docFreq=51, maxDocs=44218)
                0.018627154 = queryNorm
              0.96820545 = fieldWeight in 5203, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7456436 = idf(docFreq=51, maxDocs=44218)
                0.125 = fieldNorm(doc=5203)
          0.12097053 = weight(abstract_txt:eines in 5203) [ClassicSimilarity], result of:
            0.12097053 = score(doc=5203,freq=1.0), product of:
              0.21146649 = queryWeight, product of:
                2.4806588 = boost
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.018627154 = queryNorm
              0.57205534 = fieldWeight in 5203, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.125 = fieldNorm(doc=5203)
        0.12 = coord(3/25)
    
  5. Schmitz-Esser, W.: Publikumsfragen an Literatur zur Zeitgeschichte (1993) 0.06
    0.06327853 = sum of:
      0.06327853 = product of:
        0.5273211 = sum of:
          0.22503841 = weight(abstract_txt:deskriptoren in 4411) [ClassicSimilarity], result of:
            0.22503841 = score(doc=4411,freq=1.0), product of:
              0.15377252 = queryWeight, product of:
                1.0576832 = boost
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.018627154 = queryNorm
              1.4634501 = fieldWeight in 4411, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.1875 = fieldNorm(doc=4411)
          0.12082687 = weight(abstract_txt:entwicklung in 4411) [ClassicSimilarity], result of:
            0.12082687 = score(doc=4411,freq=1.0), product of:
              0.12798527 = queryWeight, product of:
                1.3646184 = boost
                5.0350323 = idf(docFreq=781, maxDocs=44218)
                0.018627154 = queryNorm
              0.94406855 = fieldWeight in 4411, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0350323 = idf(docFreq=781, maxDocs=44218)
                0.1875 = fieldNorm(doc=4411)
          0.1814558 = weight(abstract_txt:eines in 4411) [ClassicSimilarity], result of:
            0.1814558 = score(doc=4411,freq=1.0), product of:
              0.21146649 = queryWeight, product of:
                2.4806588 = boost
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.018627154 = queryNorm
              0.858083 = fieldWeight in 4411, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.1875 = fieldNorm(doc=4411)
        0.12 = coord(3/25)
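
The score breakdowns above can be reproduced by hand. The following is a minimal sketch, not the retrieval system's actual code: it assumes the ClassicSimilarity conventions that the listed numbers are consistent with, namely tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = boost * idf * queryNorm, fieldWeight = tf * idf * fieldNorm, and a final multiplication by coord (matched terms / query terms). The function and variable names are hypothetical; the input values are copied from the breakdown for document 4305 (entry 1).

```python
import math

def idf(doc_freq, max_docs):
    # Assumed idf form; it matches the idf values printed in the listing.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    # Score of one query term in one document: queryWeight * fieldWeight.
    tf = math.sqrt(freq)
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = tf * term_idf * field_norm
    return query_weight * field_weight

# Values taken from the listing for doc 4305.
MAX_DOCS = 44218
QUERY_NORM = 0.018627154
FIELD_NORM = 0.0625

# (term, freq, docFreq, boost)
matched_terms = [
    ("einem", 1.0, 1572, 1.1752033),
    ("entwicklung", 1.0, 781, 1.3646184),
    ("anzahl", 1.0, 110, 1.8937469),
    ("größe", 1.0, 46, 2.1266608),
    ("eines", 3.0, 1236, 2.4806588),
]

term_scores = {
    term: term_score(freq, doc_freq, MAX_DOCS, boost, QUERY_NORM, FIELD_NORM)
    for term, freq, doc_freq, boost in matched_terms
}

# coord: 5 of the 25 query terms matched this document.
coord = len(matched_terms) / 25
total = coord * sum(term_scores.values())

for term, score in term_scores.items():
    print(f"{term}: {score:.6f}")
print(f"total: {total:.6f}")
```

The listing reports single-precision values, so this double-precision sketch should agree with it only to the first several significant digits. Where a breakdown shows no boost line (as for "theoretische" in entry 3), the listed queryWeight equals idf * queryNorm, which corresponds to a boost of 1.0.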