Document (#18794)

Author
Tzeras, K.
Title
Zur Aufwandsabschätzung bei der Entwicklung eines Indexierungswörterbuches
Source
Information retrieval: GI/GMD-Workshop, Darmstadt, 23.-24.6.1991: Proceedings. Ed.: N. Fuhr
Imprint
Berlin : Springer
Year
1991
Pages
S.23-37
Series
Informatik-Fachberichte; 289
Abstract
Für die automatische Indexierung mit einem vorgegebenen Deskriptorensystem wird ein Wörterbuch benötigt, das möglichst viele Fachausdrücke des Anwendungsgebietes durch Relationen mit Deskriptoren verbindet. Werden die in einem solchen Indexierungswörterbuch erfaßten Relationen aus der Verarbeitung von Texten gewonnen, so ergibt sich eine Beziehung zwischen der Anzahl der Texte und der Größe und Leistungsfähigkeit des Wörterbuches. Die beschreibung derartiger Beziehungen ist besonders vor Beginn der Entwicklung eines automatischen Indexierungssystems von großem Interesse. H. Hüther hat sich in mehreren Arbeiten mit diesem Problem beschäftigt und verschiedene Schätzverfahren theoretische hergeleitet. Für eines der von ihm vorgeschlagenen Schätzverfahren zur Abschätzung der Größe eines Indexierungswörterbuches in Abhängigkeit von der Anzahl der zugrundeliegenden Texte werden im vorliegenden beitrag die Leistungsfähigkeit und die Anwendbarkeit untersucht
Theme
Automatisches Indexieren
Object
AIR/PHYS

Similar documents (content)

  1. Albrecht, R.: Digitale Auskunft im Verbund : Ein Jahr InfoPoint Rhein-Main (2005) 0.09
    0.08612799 = sum of:
      0.08612799 = product of:
        0.43063995 = sum of:
          0.025976397 = weight(abstract_txt:einem in 306) [ClassicSimilarity], result of:
            0.025976397 = score(doc=306,freq=1.0), product of:
              0.09544997 = queryWeight, product of:
                1.1745658 = boost
                4.3543477 = idf(docFreq=1492, maxDocs=42740)
                0.018662736 = queryNorm
              0.27214673 = fieldWeight in 306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3543477 = idf(docFreq=1492, maxDocs=42740)
                0.0625 = fieldNorm(doc=306)
          0.040541783 = weight(abstract_txt:entwicklung in 306) [ClassicSimilarity], result of:
            0.040541783 = score(doc=306,freq=1.0), product of:
              0.12842761 = queryWeight, product of:
                1.362444 = boost
                5.0508494 = idf(docFreq=743, maxDocs=42740)
                0.018662736 = queryNorm
              0.3156781 = fieldWeight in 306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0508494 = idf(docFreq=743, maxDocs=42740)
                0.0625 = fieldNorm(doc=306)
          0.10746143 = weight(abstract_txt:anzahl in 306) [ClassicSimilarity], result of:
            0.10746143 = score(doc=306,freq=1.0), product of:
              0.24597535 = queryWeight, product of:
                1.8855379 = boost
                6.9900618 = idf(docFreq=106, maxDocs=42740)
                0.018662736 = queryNorm
              0.43687886 = fieldWeight in 306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9900618 = idf(docFreq=106, maxDocs=42740)
                0.0625 = fieldNorm(doc=306)
          0.15128703 = weight(abstract_txt:größe in 306) [ClassicSimilarity], result of:
            0.15128703 = score(doc=306,freq=1.0), product of:
              0.3089757 = queryWeight, product of:
                2.1132536 = boost
                7.834249 = idf(docFreq=45, maxDocs=42740)
                0.018662736 = queryNorm
              0.48964056 = fieldWeight in 306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.834249 = idf(docFreq=45, maxDocs=42740)
                0.0625 = fieldNorm(doc=306)
          0.105373286 = weight(abstract_txt:eines in 306) [ClassicSimilarity], result of:
            0.105373286 = score(doc=306,freq=3.0), product of:
              0.21208654 = queryWeight, product of:
                2.4760592 = boost
                4.5896206 = idf(docFreq=1179, maxDocs=42740)
                0.018662736 = queryNorm
              0.49684098 = fieldWeight in 306, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5896206 = idf(docFreq=1179, maxDocs=42740)
                0.0625 = fieldNorm(doc=306)
        0.2 = coord(5/25)
    
  2. Pfeifer, W. (Bearb.): Etymologisches Wörterbuch des Deutschen : Erarbeitet unter der Leitung von Wolfgang Pfeifer (2003) 0.09
    0.08529131 = sum of:
      0.08529131 = product of:
        0.71076095 = sum of:
          0.5283671 = weight(subject_txt:wörterbuch in 3789) [ClassicSimilarity], result of:
            0.5283671 = score(doc=3789,freq=2.0), product of:
              0.17780899 = queryWeight, product of:
                1.1335778 = boost
                8.404794 = idf(docFreq=25, maxDocs=42740)
                0.018662736 = queryNorm
              2.9715433 = fieldWeight in 3789, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.404794 = idf(docFreq=25, maxDocs=42740)
                0.25 = fieldNorm(doc=3789)
          0.05067723 = weight(abstract_txt:entwicklung in 3789) [ClassicSimilarity], result of:
            0.05067723 = score(doc=3789,freq=1.0), product of:
              0.12842761 = queryWeight, product of:
                1.362444 = boost
                5.0508494 = idf(docFreq=743, maxDocs=42740)
                0.018662736 = queryNorm
              0.39459762 = fieldWeight in 3789, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0508494 = idf(docFreq=743, maxDocs=42740)
                0.078125 = fieldNorm(doc=3789)
          0.13171661 = weight(abstract_txt:eines in 3789) [ClassicSimilarity], result of:
            0.13171661 = score(doc=3789,freq=3.0), product of:
              0.21208654 = queryWeight, product of:
                2.4760592 = boost
                4.5896206 = idf(docFreq=1179, maxDocs=42740)
                0.018662736 = queryNorm
              0.62105125 = fieldWeight in 3789, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5896206 = idf(docFreq=1179, maxDocs=42740)
                0.078125 = fieldNorm(doc=3789)
        0.12 = coord(3/25)
    
  3. Meyer, A.: Begriffsrelationen im Kategoriensystem der Wikipedia : Entwicklung eines Relationeninventars zur kollaborativen Anwendung (2010) 0.08
    0.083270125 = sum of:
      0.083270125 = product of:
        0.5204383 = sum of:
          0.07628195 = weight(abstract_txt:deskriptoren in 1430) [ClassicSimilarity], result of:
            0.07628195 = score(doc=1430,freq=1.0), product of:
              0.15535589 = queryWeight, product of:
                1.0595912 = boost
                7.856228 = idf(docFreq=44, maxDocs=42740)
                0.018662736 = queryNorm
              0.49101424 = fieldWeight in 1430, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.856228 = idf(docFreq=44, maxDocs=42740)
                0.0625 = fieldNorm(doc=1430)
          0.040541783 = weight(abstract_txt:entwicklung in 1430) [ClassicSimilarity], result of:
            0.040541783 = score(doc=1430,freq=1.0), product of:
              0.12842761 = queryWeight, product of:
                1.362444 = boost
                5.0508494 = idf(docFreq=743, maxDocs=42740)
                0.018662736 = queryNorm
              0.3156781 = fieldWeight in 1430, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0508494 = idf(docFreq=743, maxDocs=42740)
                0.0625 = fieldNorm(doc=1430)
          0.2675782 = weight(abstract_txt:relationen in 1430) [ClassicSimilarity], result of:
            0.2675782 = score(doc=1430,freq=4.0), product of:
              0.28466693 = queryWeight, product of:
                2.0284204 = boost
                7.519756 = idf(docFreq=62, maxDocs=42740)
                0.018662736 = queryNorm
              0.9399695 = fieldWeight in 1430, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.519756 = idf(docFreq=62, maxDocs=42740)
                0.0625 = fieldNorm(doc=1430)
          0.13603634 = weight(abstract_txt:eines in 1430) [ClassicSimilarity], result of:
            0.13603634 = score(doc=1430,freq=5.0), product of:
              0.21208654 = queryWeight, product of:
                2.4760592 = boost
                4.5896206 = idf(docFreq=1179, maxDocs=42740)
                0.018662736 = queryNorm
              0.641419 = fieldWeight in 1430, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.5896206 = idf(docFreq=1179, maxDocs=42740)
                0.0625 = fieldNorm(doc=1430)
        0.16 = coord(4/25)
    
  4. Reck, H.U.: Index Kreativität (2007) 0.08
    0.07710138 = sum of:
      0.07710138 = product of:
        0.6425115 = sum of:
          0.56041795 = weight(subject_txt:wörterbuch in 3819) [ClassicSimilarity], result of:
            0.56041795 = score(doc=3819,freq=1.0), product of:
              0.17780899 = queryWeight, product of:
                1.1335778 = boost
                8.404794 = idf(docFreq=25, maxDocs=42740)
                0.018662736 = queryNorm
              3.1517978 = fieldWeight in 3819, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.404794 = idf(docFreq=25, maxDocs=42740)
                0.375 = fieldNorm(doc=3819)
          0.016235247 = weight(abstract_txt:einem in 3819) [ClassicSimilarity], result of:
            0.016235247 = score(doc=3819,freq=1.0), product of:
              0.09544997 = queryWeight, product of:
                1.1745658 = boost
                4.3543477 = idf(docFreq=1492, maxDocs=42740)
                0.018662736 = queryNorm
              0.1700917 = fieldWeight in 3819, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3543477 = idf(docFreq=1492, maxDocs=42740)
                0.0390625 = fieldNorm(doc=3819)
          0.065858305 = weight(abstract_txt:eines in 3819) [ClassicSimilarity], result of:
            0.065858305 = score(doc=3819,freq=3.0), product of:
              0.21208654 = queryWeight, product of:
                2.4760592 = boost
                4.5896206 = idf(docFreq=1179, maxDocs=42740)
                0.018662736 = queryNorm
              0.31052563 = fieldWeight in 3819, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5896206 = idf(docFreq=1179, maxDocs=42740)
                0.0390625 = fieldNorm(doc=3819)
        0.12 = coord(3/25)
    
  5. Scholz, O.R.: Bild, Darstellung, Zeichen : Philosophische Theorien bildlicher Darstellung (2004) 0.07
    0.0744844 = sum of:
      0.0744844 = product of:
        0.37242198 = sum of:
          0.08015245 = weight(abstract_txt:theoretische in 3437) [ClassicSimilarity], result of:
            0.08015245 = score(doc=3437,freq=1.0), product of:
              0.1383729 = queryWeight, product of:
                7.4143953 = idf(docFreq=69, maxDocs=42740)
                0.018662736 = queryNorm
              0.5792496 = fieldWeight in 3437, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4143953 = idf(docFreq=69, maxDocs=42740)
                0.078125 = fieldNorm(doc=3437)
          0.08309394 = weight(abstract_txt:ergibt in 3437) [ClassicSimilarity], result of:
            0.08309394 = score(doc=3437,freq=1.0), product of:
              0.14173794 = queryWeight, product of:
                1.0120863 = boost
                7.5040073 = idf(docFreq=63, maxDocs=42740)
                0.018662736 = queryNorm
              0.58625054 = fieldWeight in 3437, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5040073 = idf(docFreq=63, maxDocs=42740)
                0.078125 = fieldNorm(doc=3437)
          0.10065847 = weight(abstract_txt:verbindet in 3437) [ClassicSimilarity], result of:
            0.10065847 = score(doc=3437,freq=1.0), product of:
              0.16106705 = queryWeight, product of:
                1.0788916 = boost
                7.999329 = idf(docFreq=38, maxDocs=42740)
                0.018662736 = queryNorm
              0.6249476 = fieldWeight in 3437, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.999329 = idf(docFreq=38, maxDocs=42740)
                0.078125 = fieldNorm(doc=3437)
          0.032470495 = weight(abstract_txt:einem in 3437) [ClassicSimilarity], result of:
            0.032470495 = score(doc=3437,freq=1.0), product of:
              0.09544997 = queryWeight, product of:
                1.1745658 = boost
                4.3543477 = idf(docFreq=1492, maxDocs=42740)
                0.018662736 = queryNorm
              0.3401834 = fieldWeight in 3437, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3543477 = idf(docFreq=1492, maxDocs=42740)
                0.078125 = fieldNorm(doc=3437)
          0.07604662 = weight(abstract_txt:eines in 3437) [ClassicSimilarity], result of:
            0.07604662 = score(doc=3437,freq=1.0), product of:
              0.21208654 = queryWeight, product of:
                2.4760592 = boost
                4.5896206 = idf(docFreq=1179, maxDocs=42740)
                0.018662736 = queryNorm
              0.3585641 = fieldWeight in 3437, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5896206 = idf(docFreq=1179, maxDocs=42740)
                0.078125 = fieldNorm(doc=3437)
        0.2 = coord(5/25)