Document (#26881)

Author
Grummann, M.
Title
Sind Verfahren zur maschinellen Indexierung für Literaturbestände Öffentlicher Bibliotheken geeignet? : Retrievaltests von indexierten ekz-Daten mit der Software IDX
Source
Bibliothek: Forschung und Praxis. 24(2000) H.3, S.297-318
Year
2000
Abstract
Maschinelles Indexieren vereinheitlicht und vermehrt das Suchvokabular eines Bibliothekskatalogs durch verschiedene Methoden (u.a. Ermittlung der Grundform, Kompositazerlegung, Wortableitungen). Ein Retrievaltest mit einem für öffentliche Bibliotheken typischen Sachbuchbestand zeigt, dass dieses Verfahren die Ergebnisse von OPAC-Recherchen verbessert - trotz 'blumiger' Titelformulierungen. Im Vergleich zu herkömmlichen Erschließungsmethoden (Stich- und Schlagwörter) werden mehr relevante Titel gefunden, ohne gleichzeitig den 'Ballast' zu erhöhen. Das maschinelle Indexieren kann die Verschlagwortung jedoch nicht ersetzen, sondern nur ergänzen
Theme
Automatisches Indexieren
Retrievalstudien
Object
MILOS
IDX

Similar documents (content)

  1. Herrmann, J.: Maschinelles Lernen und wissensbasierte Systeme : Systematische Einführung mit praxisorientierten Fallstudien (1997) 0.07
    0.06760348 = sum of:
      0.06760348 = product of:
        0.8450435 = sum of:
          0.22171377 = weight(abstract_txt:maschinellen in 5705) [ClassicSimilarity], result of:
            0.22171377 = score(doc=5705,freq=1.0), product of:
              0.15353926 = queryWeight, product of:
                1.014383 = boost
                7.7014403 = idf(docFreq=51, maxDocs=42306)
                0.019653754 = queryNorm
              1.44402 = fieldWeight in 5705, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7014403 = idf(docFreq=51, maxDocs=42306)
                0.1875 = fieldNorm(doc=5705)
          0.6233297 = weight(title_txt:maschinelles in 5705) [ClassicSimilarity], result of:
            0.6233297 = score(doc=5705,freq=1.0), product of:
              0.21757235 = queryWeight, product of:
                1.2075193 = boost
                9.167778 = idf(docFreq=11, maxDocs=42306)
                0.019653754 = queryNorm
              2.8649306 = fieldWeight in 5705, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.167778 = idf(docFreq=11, maxDocs=42306)
                0.3125 = fieldNorm(doc=5705)
        0.08 = coord(2/25)
    
  2. Scheele, M.: ¬Die automatische Indexierung beliebiger Titel und Schlagwörter auf der Grundlage eines Modells für einen Gesamtthesaurus des Wissens (1983) 0.05
    0.052665334 = sum of:
      0.052665334 = product of:
        0.4388778 = sum of:
          0.112559184 = weight(abstract_txt:schlagwörter in 111) [ClassicSimilarity], result of:
            0.112559184 = score(doc=111,freq=1.0), product of:
              0.15510708 = queryWeight, product of:
                1.0195489 = boost
                7.740661 = idf(docFreq=49, maxDocs=42306)
                0.019653754 = queryNorm
              0.72568697 = fieldWeight in 111, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.740661 = idf(docFreq=49, maxDocs=42306)
                0.09375 = fieldNorm(doc=111)
          0.2327018 = weight(abstract_txt:stich in 111) [ClassicSimilarity], result of:
            0.2327018 = score(doc=111,freq=1.0), product of:
              0.251716 = queryWeight, product of:
                1.2988161 = boost
                9.860925 = idf(docFreq=5, maxDocs=42306)
                0.019653754 = queryNorm
              0.9244617 = fieldWeight in 111, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.860925 = idf(docFreq=5, maxDocs=42306)
                0.09375 = fieldNorm(doc=111)
          0.093616806 = weight(abstract_txt:verfahren in 111) [ClassicSimilarity], result of:
            0.093616806 = score(doc=111,freq=1.0), product of:
              0.17283176 = queryWeight, product of:
                1.5220152 = boost
                5.7777534 = idf(docFreq=355, maxDocs=42306)
                0.019653754 = queryNorm
              0.54166436 = fieldWeight in 111, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7777534 = idf(docFreq=355, maxDocs=42306)
                0.09375 = fieldNorm(doc=111)
        0.12 = coord(3/25)
    
  3. Forschen für die Internet-Gesellschaft : Trends, Technologien, Anwendungen, Trends und Handlungsempfehlungen 2008 des Feldafinger Kreises (2008) 0.05
    0.050477505 = sum of:
      0.050477505 = product of:
        0.3154844 = sum of:
          0.07129297 = weight(abstract_txt:verbessert in 157) [ClassicSimilarity], result of:
            0.07129297 = score(doc=157,freq=1.0), product of:
              0.14990045 = queryWeight, product of:
                1.0022907 = boost
                7.609633 = idf(docFreq=56, maxDocs=42306)
                0.019653754 = queryNorm
              0.47560206 = fieldWeight in 157, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.609633 = idf(docFreq=56, maxDocs=42306)
                0.0625 = fieldNorm(doc=157)
          0.0833071 = weight(abstract_txt:öffentlicher in 157) [ClassicSimilarity], result of:
            0.0833071 = score(doc=157,freq=1.0), product of:
              0.16630036 = queryWeight, product of:
                1.0556959 = boost
                8.015098 = idf(docFreq=37, maxDocs=42306)
                0.019653754 = queryNorm
              0.5009436 = fieldWeight in 157, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.015098 = idf(docFreq=37, maxDocs=42306)
                0.0625 = fieldNorm(doc=157)
          0.098473154 = weight(abstract_txt:ermittlung in 157) [ClassicSimilarity], result of:
            0.098473154 = score(doc=157,freq=1.0), product of:
              0.18591613 = queryWeight, product of:
                1.1162225 = boost
                8.47463 = idf(docFreq=23, maxDocs=42306)
                0.019653754 = queryNorm
              0.5296644 = fieldWeight in 157, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.47463 = idf(docFreq=23, maxDocs=42306)
                0.0625 = fieldNorm(doc=157)
          0.062411204 = weight(abstract_txt:verfahren in 157) [ClassicSimilarity], result of:
            0.062411204 = score(doc=157,freq=1.0), product of:
              0.17283176 = queryWeight, product of:
                1.5220152 = boost
                5.7777534 = idf(docFreq=355, maxDocs=42306)
                0.019653754 = queryNorm
              0.36110958 = fieldWeight in 157, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7777534 = idf(docFreq=355, maxDocs=42306)
                0.0625 = fieldNorm(doc=157)
        0.16 = coord(4/25)
    
  4. Zimmermann, H.H.: Linguistische Verfahren zur Archivierung und zum Wiederfinden unstrukturierter Texte (1983) 0.05
    0.046390243 = sum of:
      0.046390243 = product of:
        0.38658535 = sum of:
          0.10620789 = weight(abstract_txt:herkömmlichen in 558) [ClassicSimilarity], result of:
            0.10620789 = score(doc=558,freq=1.0), product of:
              0.14921604 = queryWeight, product of:
                7.5922413 = idf(docFreq=57, maxDocs=42306)
                0.019653754 = queryNorm
              0.7117726 = fieldWeight in 558, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5922413 = idf(docFreq=57, maxDocs=42306)
                0.09375 = fieldNorm(doc=558)
          0.118228376 = weight(abstract_txt:ersetzen in 558) [ClassicSimilarity], result of:
            0.118228376 = score(doc=558,freq=1.0), product of:
              0.16027242 = queryWeight, product of:
                1.0363863 = boost
                7.8684945 = idf(docFreq=43, maxDocs=42306)
                0.019653754 = queryNorm
              0.7376714 = fieldWeight in 558, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8684945 = idf(docFreq=43, maxDocs=42306)
                0.09375 = fieldNorm(doc=558)
          0.16214907 = weight(abstract_txt:verfahren in 558) [ClassicSimilarity], result of:
            0.16214907 = score(doc=558,freq=3.0), product of:
              0.17283176 = queryWeight, product of:
                1.5220152 = boost
                5.7777534 = idf(docFreq=355, maxDocs=42306)
                0.019653754 = queryNorm
              0.9381902 = fieldWeight in 558, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.7777534 = idf(docFreq=355, maxDocs=42306)
                0.09375 = fieldNorm(doc=558)
        0.12 = coord(3/25)
    
  5. Lepsky, K.: Auf dem Weg zur automatischen Inhaltserschließung? : Das DFG-Projekt MILOS und seine Ergebnisse (1997) 0.05
    0.04575159 = sum of:
      0.04575159 = product of:
        0.38126326 = sum of:
          0.18697587 = weight(abstract_txt:retrievaltests in 1012) [ClassicSimilarity], result of:
            0.18697587 = score(doc=1012,freq=1.0), product of:
              0.19630747 = queryWeight, product of:
                1.1469927 = boost
                8.708245 = idf(docFreq=18, maxDocs=42306)
                0.019653754 = queryNorm
              0.95246434 = fieldWeight in 1012, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.708245 = idf(docFreq=18, maxDocs=42306)
                0.109375 = fieldNorm(doc=1012)
          0.08506779 = weight(abstract_txt:bibliotheken in 1012) [ClassicSimilarity], result of:
            0.08506779 = score(doc=1012,freq=2.0), product of:
              0.11612433 = queryWeight, product of:
                1.2475812 = boost
                4.735969 = idf(docFreq=1008, maxDocs=42306)
                0.019653754 = queryNorm
              0.73255783 = fieldWeight in 1012, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.735969 = idf(docFreq=1008, maxDocs=42306)
                0.109375 = fieldNorm(doc=1012)
          0.10921961 = weight(abstract_txt:verfahren in 1012) [ClassicSimilarity], result of:
            0.10921961 = score(doc=1012,freq=1.0), product of:
              0.17283176 = queryWeight, product of:
                1.5220152 = boost
                5.7777534 = idf(docFreq=355, maxDocs=42306)
                0.019653754 = queryNorm
              0.6319418 = fieldWeight in 1012, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7777534 = idf(docFreq=355, maxDocs=42306)
                0.109375 = fieldNorm(doc=1012)
        0.12 = coord(3/25)