Document (#18794)

Author
Tzeras, K.
Title
Zur Aufwandsabschätzung bei der Entwicklung eines Indexierungswörterbuches
Source
Information retrieval: GI/GMD-Workshop, Darmstadt, 23.-24.6.1991: Proceedings. Ed.: N. Fuhr
Imprint
Berlin : Springer
Year
1991
Pages
pp. 23-37
Series
Informatik-Fachberichte; 289
Abstract
Automatic indexing with a prescribed descriptor system requires a dictionary that links as many technical terms of the application domain as possible to descriptors through relations. If the relations recorded in such an indexing dictionary are obtained by processing texts, a relationship arises between the number of texts and the size and performance of the dictionary. Describing such relationships is of great interest, particularly before development of an automatic indexing system begins. H. Hüther has addressed this problem in several papers and derived various estimation methods theoretically. This paper examines the performance and applicability of one of his proposed methods, which estimates the size of an indexing dictionary as a function of the number of underlying texts.
Theme
Automatic indexing (Automatisches Indexieren)
Object
AIR/PHYS

Similar documents (content)

  1. Albrecht, R.: Digitale Auskunft im Verbund : Ein Jahr InfoPoint Rhein-Main (2005) 0.09
    0.08617673 = sum of:
      0.08617673 = product of:
        0.43088365 = sum of:
          0.02597569 = weight(abstract_txt:einem in 306) [ClassicSimilarity], result of:
            0.02597569 = score(doc=306,freq=1.0), product of:
              0.09544802 = queryWeight, product of:
                1.1749113 = boost
                4.354318 = idf(docFreq=1510, maxDocs=43254)
                0.018656995 = queryNorm
              0.27214488 = fieldWeight in 306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.354318 = idf(docFreq=1510, maxDocs=43254)
                0.0625 = fieldNorm(doc=306)
          0.040444158 = weight(abstract_txt:entwicklung in 306) [ClassicSimilarity], result of:
            0.040444158 = score(doc=306,freq=1.0), product of:
              0.12822106 = queryWeight, product of:
                1.3617623 = boost
                5.0468035 = idf(docFreq=755, maxDocs=43254)
                0.018656995 = queryNorm
              0.31542522 = fieldWeight in 306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0468035 = idf(docFreq=755, maxDocs=43254)
                0.0625 = fieldNorm(doc=306)
          0.10715822 = weight(abstract_txt:anzahl in 306) [ClassicSimilarity], result of:
            0.10715822 = score(doc=306,freq=1.0), product of:
              0.24551189 = queryWeight, product of:
                1.884334 = boost
                6.983497 = idf(docFreq=108, maxDocs=43254)
                0.018656995 = queryNorm
              0.43646857 = fieldWeight in 306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.983497 = idf(docFreq=108, maxDocs=43254)
                0.0625 = fieldNorm(doc=306)
          0.15197964 = weight(abstract_txt:größe in 306) [ClassicSimilarity], result of:
            0.15197964 = score(doc=306,freq=1.0), product of:
              0.3099173 = queryWeight, product of:
                2.1171153 = boost
                7.846204 = idf(docFreq=45, maxDocs=43254)
                0.018656995 = queryNorm
              0.49038774 = fieldWeight in 306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.846204 = idf(docFreq=45, maxDocs=43254)
                0.0625 = fieldNorm(doc=306)
          0.105325945 = weight(abstract_txt:eines in 306) [ClassicSimilarity], result of:
            0.105325945 = score(doc=306,freq=3.0), product of:
              0.21202253 = queryWeight, product of:
                2.4764388 = boost
                4.5889435 = idf(docFreq=1194, maxDocs=43254)
                0.018656995 = queryNorm
              0.4967677 = fieldWeight in 306, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5889435 = idf(docFreq=1194, maxDocs=43254)
                0.0625 = fieldNorm(doc=306)
        0.2 = coord(5/25)
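
Note on reading the score tree: the breakdown above appears to follow the usual Lucene ClassicSimilarity (TF-IDF) arithmetic, where each matching term contributes queryWeight × fieldWeight, the contributions are summed, and the sum is scaled by coord (matched terms / query terms). The following is a minimal sketch that recomputes the score of the first hit from the values printed in the tree. The function and variable names are illustrative and not part of the record, and the idf formula 1 + ln(maxDocs / (docFreq + 1)) is assumed from the printed idf values, which it reproduces.

```python
import math

def classic_idf(doc_freq: int, max_docs: int) -> float:
    # Assumed ClassicSimilarity idf; reproduces e.g. 4.354318 for docFreq=1510.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm            # "queryWeight" in the tree
    field_weight = math.sqrt(freq) * idf * field_norm  # "fieldWeight": tf = sqrt(freq)
    return query_weight * field_weight

# Constants copied from the explain output for doc=306.
MAX_DOCS = 43254
QUERY_NORM = 0.018656995
FIELD_NORM = 0.0625

# (term, freq, docFreq, boost), one entry per matching abstract_txt term.
terms = [
    ("einem", 1.0, 1510, 1.1749113),
    ("entwicklung", 1.0, 755, 1.3617623),
    ("anzahl", 1.0, 108, 1.884334),
    ("größe", 1.0, 45, 2.1171153),
    ("eines", 3.0, 1194, 2.4764388),
]

coord = 5 / 25  # 5 of the 25 query terms matched this document
raw_sum = sum(
    term_score(freq, doc_freq, MAX_DOCS, boost, QUERY_NORM, FIELD_NORM)
    for _, freq, doc_freq, boost in terms
)
total = raw_sum * coord
print(f"sum={raw_sum:.6f} total={total:.6f}")
```

Up to floating-point rounding, the result should agree with the 0.08617673 shown for this hit. The same arithmetic applies to the other hits with their own fieldNorm and coord values; where a term has no boost line in the tree, a boost of 1.0 is the natural reading.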
    
  2. Meyer, A.: Begriffsrelationen im Kategoriensystem der Wikipedia : Entwicklung eines Relationeninventars zur kollaborativen Anwendung (2010) 0.08
    0.08286893 = sum of:
      0.08286893 = product of:
        0.51793087 = sum of:
          0.07598982 = weight(abstract_txt:deskriptoren in 894) [ClassicSimilarity], result of:
            0.07598982 = score(doc=894,freq=1.0), product of:
              0.15495865 = queryWeight, product of:
                1.0585576 = boost
                7.846204 = idf(docFreq=45, maxDocs=43254)
                0.018656995 = queryNorm
              0.49038774 = fieldWeight in 894, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.846204 = idf(docFreq=45, maxDocs=43254)
                0.0625 = fieldNorm(doc=894)
          0.040444158 = weight(abstract_txt:entwicklung in 894) [ClassicSimilarity], result of:
            0.040444158 = score(doc=894,freq=1.0), product of:
              0.12822106 = queryWeight, product of:
                1.3617623 = boost
                5.0468035 = idf(docFreq=755, maxDocs=43254)
                0.018656995 = queryNorm
              0.31542522 = fieldWeight in 894, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0468035 = idf(docFreq=755, maxDocs=43254)
                0.0625 = fieldNorm(doc=894)
          0.26552165 = weight(abstract_txt:relationen in 894) [ClassicSimilarity], result of:
            0.26552165 = score(doc=894,freq=4.0), product of:
              0.2832058 = queryWeight, product of:
                2.0238237 = boost
                7.500458 = idf(docFreq=64, maxDocs=43254)
                0.018656995 = queryNorm
              0.9375572 = fieldWeight in 894, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.500458 = idf(docFreq=64, maxDocs=43254)
                0.0625 = fieldNorm(doc=894)
          0.13597521 = weight(abstract_txt:eines in 894) [ClassicSimilarity], result of:
            0.13597521 = score(doc=894,freq=5.0), product of:
              0.21202253 = queryWeight, product of:
                2.4764388 = boost
                4.5889435 = idf(docFreq=1194, maxDocs=43254)
                0.018656995 = queryNorm
              0.64132434 = fieldWeight in 894, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.5889435 = idf(docFreq=1194, maxDocs=43254)
                0.0625 = fieldNorm(doc=894)
        0.16 = coord(4/25)
    
  3. Scholz, O.R.: Bild, Darstellung, Zeichen : Philosophische Theorien bildlicher Darstellung (2004) 0.07
    0.07423713 = sum of:
      0.07423713 = product of:
        0.37118566 = sum of:
          0.08007961 = weight(abstract_txt:theoretische in 3437) [ClassicSimilarity], result of:
            0.08007961 = score(doc=3437,freq=1.0), product of:
              0.13828874 = queryWeight, product of:
                7.412165 = idf(docFreq=70, maxDocs=43254)
                0.018656995 = queryNorm
              0.5790754 = fieldWeight in 3437, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.412165 = idf(docFreq=70, maxDocs=43254)
                0.078125 = fieldNorm(doc=3437)
          0.08246984 = weight(abstract_txt:ergibt in 3437) [ClassicSimilarity], result of:
            0.08246984 = score(doc=3437,freq=1.0), product of:
              0.141027 = queryWeight, product of:
                1.009852 = boost
                7.4851904 = idf(docFreq=65, maxDocs=43254)
                0.018656995 = queryNorm
              0.5847805 = fieldWeight in 3437, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4851904 = idf(docFreq=65, maxDocs=43254)
                0.078125 = fieldNorm(doc=3437)
          0.100154154 = weight(abstract_txt:verbindet in 3437) [ClassicSimilarity], result of:
            0.100154154 = score(doc=3437,freq=1.0), product of:
              0.16052826 = queryWeight, product of:
                1.0774133 = boost
                7.9859657 = idf(docFreq=39, maxDocs=43254)
                0.018656995 = queryNorm
              0.6239036 = fieldWeight in 3437, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9859657 = idf(docFreq=39, maxDocs=43254)
                0.078125 = fieldNorm(doc=3437)
          0.03246961 = weight(abstract_txt:einem in 3437) [ClassicSimilarity], result of:
            0.03246961 = score(doc=3437,freq=1.0), product of:
              0.09544802 = queryWeight, product of:
                1.1749113 = boost
                4.354318 = idf(docFreq=1510, maxDocs=43254)
                0.018656995 = queryNorm
              0.3401811 = fieldWeight in 3437, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.354318 = idf(docFreq=1510, maxDocs=43254)
                0.078125 = fieldNorm(doc=3437)
          0.076012455 = weight(abstract_txt:eines in 3437) [ClassicSimilarity], result of:
            0.076012455 = score(doc=3437,freq=1.0), product of:
              0.21202253 = queryWeight, product of:
                2.4764388 = boost
                4.5889435 = idf(docFreq=1194, maxDocs=43254)
                0.018656995 = queryNorm
              0.3585112 = fieldWeight in 3437, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5889435 = idf(docFreq=1194, maxDocs=43254)
                0.078125 = fieldNorm(doc=3437)
        0.2 = coord(5/25)
    
  4. Coulon, C.-H.: Die Rolle des Anpassungswissens im CBR : am Beispiel der Ausnutzung von Struktur im CBR (1996) 0.07
    0.06556446 = sum of:
      0.06556446 = product of:
        0.5463705 = sum of:
          0.13037929 = weight(abstract_txt:benötigt in 6272) [ClassicSimilarity], result of:
            0.13037929 = score(doc=6272,freq=1.0), product of:
              0.13990436 = queryWeight, product of:
                1.0058246 = boost
                7.4553375 = idf(docFreq=67, maxDocs=43254)
                0.018656995 = queryNorm
              0.9319172 = fieldWeight in 6272, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4553375 = idf(docFreq=67, maxDocs=43254)
                0.125 = fieldNorm(doc=6272)
          0.2943713 = weight(abstract_txt:leistungsfähigkeit in 6272) [ClassicSimilarity], result of:
            0.2943713 = score(doc=6272,freq=1.0), product of:
              0.30336526 = queryWeight, product of:
                2.0946167 = boost
                7.762822 = idf(docFreq=49, maxDocs=43254)
                0.018656995 = queryNorm
              0.97035277 = fieldWeight in 6272, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.762822 = idf(docFreq=49, maxDocs=43254)
                0.125 = fieldNorm(doc=6272)
          0.121619925 = weight(abstract_txt:eines in 6272) [ClassicSimilarity], result of:
            0.121619925 = score(doc=6272,freq=1.0), product of:
              0.21202253 = queryWeight, product of:
                2.4764388 = boost
                4.5889435 = idf(docFreq=1194, maxDocs=43254)
                0.018656995 = queryNorm
              0.57361794 = fieldWeight in 6272, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5889435 = idf(docFreq=1194, maxDocs=43254)
                0.125 = fieldNorm(doc=6272)
        0.12 = coord(3/25)
    
  5. Schmitz-Esser, W.: Publikumsfragen an Literatur zur Zeitgeschichte (1993) 0.06
    0.06380782 = sum of:
      0.06380782 = product of:
        0.53173184 = sum of:
          0.22796948 = weight(abstract_txt:deskriptoren in 4411) [ClassicSimilarity], result of:
            0.22796948 = score(doc=4411,freq=1.0), product of:
              0.15495865 = queryWeight, product of:
                1.0585576 = boost
                7.846204 = idf(docFreq=45, maxDocs=43254)
                0.018656995 = queryNorm
              1.4711633 = fieldWeight in 4411, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.846204 = idf(docFreq=45, maxDocs=43254)
                0.1875 = fieldNorm(doc=4411)
          0.121332474 = weight(abstract_txt:entwicklung in 4411) [ClassicSimilarity], result of:
            0.121332474 = score(doc=4411,freq=1.0), product of:
              0.12822106 = queryWeight, product of:
                1.3617623 = boost
                5.0468035 = idf(docFreq=755, maxDocs=43254)
                0.018656995 = queryNorm
              0.94627565 = fieldWeight in 4411, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0468035 = idf(docFreq=755, maxDocs=43254)
                0.1875 = fieldNorm(doc=4411)
          0.18242988 = weight(abstract_txt:eines in 4411) [ClassicSimilarity], result of:
            0.18242988 = score(doc=4411,freq=1.0), product of:
              0.21202253 = queryWeight, product of:
                2.4764388 = boost
                4.5889435 = idf(docFreq=1194, maxDocs=43254)
                0.018656995 = queryNorm
              0.8604269 = fieldWeight in 4411, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5889435 = idf(docFreq=1194, maxDocs=43254)
                0.1875 = fieldNorm(doc=4411)
        0.12 = coord(3/25)