Document (#37600)

Author
Weiner, U.
Title
Vor uns die Dokumentenflut oder Automatische Indexierung als notwendige und sinnvolle Ergänzung zur intellektuellen Sacherschließung
Imprint
Wien : Universität
Year
2012
Pages
xxx Bl
Abstract
Against the backdrop of library users' changed expectations of search facilities (away from the classic online catalogue and towards a "one-stop shop" with features such as thematic browsing, relevance ranking and the like) on the one hand, and the need to process mass data (keyword: document flood) on the other, systems for automatic indexing are once again attracting increased interest. Because the topic has so far been addressed only very selectively in the Austrian library sector, in connection with a few concrete projects, the thesis first gives a general theoretical overview of the different approaches to automatic indexing. Next, it presents the IDX-based indexing software MILOS (with the subprojects MILOS I, MILOS II and KASCADE) and the modular system intelligentCAPTURE (with the integrated indexing software AUTINDEX), which until a few years ago were the only automatic indexing systems in practical use in the German-speaking countries. As the need to break new ground in subject indexing has grown, numerous software developments have been tested over the past five to six years for their suitability for use in libraries. As a representative of these systems for automatic subject indexing that are still under development, the thesis presents the PETRUS project, which was carried out at the DNB from 2009 to 2011 and comprises the components PICA Match&Merge and the Extraction Platform from the company Averbis.
Footnote
Vienna, Univ., Library and Information Studies programme, master's thesis, 2012
Theme
Automatisches Indexieren
Object
MILOS
KASCADE
intelligentCAPTURE
AUTINDEX
PETRUS

Similar documents (author)

  1. Weiner, S.T.: Electronic journals, four part series : an introduction (1997) 5.92
    5.9160414 = sum of:
      5.9160414 = weight(author_txt:weiner in 835) [ClassicSimilarity], result of:
        5.9160414 = fieldWeight in 835, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.465666 = idf(docFreq=8, maxDocs=42740)
          0.625 = fieldNorm(doc=835)
    
  2. Weiner, R.G.: Information access illiterate? (1997) 5.92
    5.9160414 = sum of:
      5.9160414 = weight(author_txt:weiner in 2414) [ClassicSimilarity], result of:
        5.9160414 = fieldWeight in 2414, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.465666 = idf(docFreq=8, maxDocs=42740)
          0.625 = fieldNorm(doc=2414)
    
  3. Weiner, M.: ¬Die Agenten kommen (2002) 5.92
    5.9160414 = sum of:
      5.9160414 = weight(author_txt:weiner in 735) [ClassicSimilarity], result of:
        5.9160414 = fieldWeight in 735, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.465666 = idf(docFreq=8, maxDocs=42740)
          0.625 = fieldNorm(doc=735)
    
  4. Weiner, M.L.; Rusch, P.F.: New searching technologies and interfaces (1996) 4.73
    4.732833 = sum of:
      4.732833 = weight(author_txt:weiner in 110) [ClassicSimilarity], result of:
        4.732833 = fieldWeight in 110, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.465666 = idf(docFreq=8, maxDocs=42740)
          0.5 = fieldNorm(doc=110)
    
  5. Weiner, M.L.; Rusch, P.F.: New searching technologies and interfaces (1997) 4.73
    4.732833 = sum of:
      4.732833 = weight(author_txt:weiner in 322) [ClassicSimilarity], result of:
        4.732833 = fieldWeight in 322, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.465666 = idf(docFreq=8, maxDocs=42740)
          0.5 = fieldNorm(doc=322)
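
The author scores above are each a single fieldWeight, that is, the product tf × idf × fieldNorm. The following is a minimal sketch of that arithmetic in Python, not the search engine's own code. It assumes tf is the square root of the term frequency and that idf is 1 + ln(maxDocs / (docFreq + 1)); both assumptions are inferred from the numbers in this listing, and the function names are hypothetical.

```python
import math

def idf(doc_freq: int, max_docs: int) -> float:
    # Assumed formula; it reproduces the idf values shown in this record.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def tf(freq: float) -> float:
    # Assumed formula: square root of the raw term frequency.
    return math.sqrt(freq)

def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    # fieldWeight = tf * idf * fieldNorm, as in the explain trees above.
    return tf(freq) * idf(doc_freq, max_docs) * field_norm

# Hit 1 (doc 835): one author term, fieldNorm 0.625
hit_1 = field_weight(1.0, 8, 42740, 0.625)
# Hit 4 (doc 110): same term, but fieldNorm 0.5 because the author field is longer
hit_4 = field_weight(1.0, 8, 42740, 0.5)

print(f"{hit_1:.2f} {hit_4:.2f}")
```

The only factor that differs between the first three hits and the last two is fieldNorm, which is why records with two authors rank below records with one.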
    

Similar documents (content)

  1. Lepsky, K.; Zimmermann, H.H.: Katalogerweiterung durch Scanning und automatische Dokumenterschließung : Ergebnisse des DFG-Projekts KASCADE (2000) 0.21
    0.21177068 = sum of:
      0.21177068 = product of:
        1.3235668 = sum of:
          0.39591607 = weight(title_txt:kascade in 5967) [ClassicSimilarity], result of:
            0.39591607 = score(doc=5967,freq=1.0), product of:
              0.16730617 = queryWeight, product of:
                1.136241 = boost
                9.465666 = idf(docFreq=8, maxDocs=42740)
                0.015555728 = queryNorm
              2.3664165 = fieldWeight in 5967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.465666 = idf(docFreq=8, maxDocs=42740)
                0.25 = fieldNorm(doc=5967)
          0.18845414 = weight(abstract_txt:indexierung in 5967) [ClassicSimilarity], result of:
            0.18845414 = score(doc=5967,freq=1.0), product of:
              0.25525174 = queryWeight, product of:
                2.430857 = boost
                6.7502356 = idf(docFreq=135, maxDocs=42740)
                0.015555728 = queryNorm
              0.738307 = fieldWeight in 5967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7502356 = idf(docFreq=135, maxDocs=42740)
                0.109375 = fieldNorm(doc=5967)
          0.26551026 = weight(abstract_txt:automatischen in 5967) [ClassicSimilarity], result of:
            0.26551026 = score(doc=5967,freq=1.0), product of:
              0.3530737 = queryWeight, product of:
                3.3012402 = boost
                6.8753986 = idf(docFreq=119, maxDocs=42740)
                0.015555728 = queryNorm
              0.75199676 = fieldWeight in 5967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8753986 = idf(docFreq=119, maxDocs=42740)
                0.109375 = fieldNorm(doc=5967)
          0.47368634 = weight(abstract_txt:milos in 5967) [ClassicSimilarity], result of:
            0.47368634 = score(doc=5967,freq=1.0), product of:
              0.4718734 = queryWeight, product of:
                3.3051245 = boost
                9.177984 = idf(docFreq=11, maxDocs=42740)
                0.015555728 = queryNorm
              1.003842 = fieldWeight in 5967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.177984 = idf(docFreq=11, maxDocs=42740)
                0.109375 = fieldNorm(doc=5967)
        0.16 = coord(4/25)
    
  2. Schneider, A.: Moderne Retrievalverfahren in klassischen bibliotheksbezogenen Anwendungen : Projekte und Perspektiven (2008) 0.15
    0.15485974 = sum of:
      0.15485974 = product of:
        0.6452489 = sum of:
          0.041179717 = weight(abstract_txt:vorgestellt in 1032) [ClassicSimilarity], result of:
            0.041179717 = score(doc=1032,freq=2.0), product of:
              0.112953044 = queryWeight, product of:
                1.3203176 = boost
                5.4995756 = idf(docFreq=474, maxDocs=42740)
                0.015555728 = queryNorm
              0.36457378 = fieldWeight in 1032, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4995756 = idf(docFreq=474, maxDocs=42740)
                0.046875 = fieldNorm(doc=1032)
          0.036982547 = weight(abstract_txt:erschließung in 1032) [ClassicSimilarity], result of:
            0.036982547 = score(doc=1032,freq=1.0), product of:
              0.1324698 = queryWeight, product of:
                1.4298415 = boost
                5.95578 = idf(docFreq=300, maxDocs=42740)
                0.015555728 = queryNorm
              0.2791772 = fieldWeight in 1032, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.95578 = idf(docFreq=300, maxDocs=42740)
                0.046875 = fieldNorm(doc=1032)
          0.036172986 = weight(abstract_txt:jahren in 1032) [ClassicSimilarity], result of:
            0.036172986 = score(doc=1032,freq=1.0), product of:
              0.14941895 = queryWeight, product of:
                1.85985 = boost
                5.1646085 = idf(docFreq=663, maxDocs=42740)
                0.015555728 = queryNorm
              0.24209103 = fieldWeight in 1032, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1646085 = idf(docFreq=663, maxDocs=42740)
                0.046875 = fieldNorm(doc=1032)
          0.18801427 = weight(abstract_txt:bibliotheksbereich in 1032) [ClassicSimilarity], result of:
            0.18801427 = score(doc=1032,freq=3.0), product of:
              0.27156416 = queryWeight, product of:
                2.0472255 = boost
                8.527396 = idf(docFreq=22, maxDocs=42740)
                0.015555728 = queryNorm
              0.6923383 = fieldWeight in 1032, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.527396 = idf(docFreq=22, maxDocs=42740)
                0.046875 = fieldNorm(doc=1032)
          0.13989092 = weight(abstract_txt:indexierung in 1032) [ClassicSimilarity], result of:
            0.13989092 = score(doc=1032,freq=3.0), product of:
              0.25525174 = queryWeight, product of:
                2.430857 = boost
                6.7502356 = idf(docFreq=135, maxDocs=42740)
                0.015555728 = queryNorm
              0.5480508 = fieldWeight in 1032, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.7502356 = idf(docFreq=135, maxDocs=42740)
                0.046875 = fieldNorm(doc=1032)
          0.20300844 = weight(abstract_txt:milos in 1032) [ClassicSimilarity], result of:
            0.20300844 = score(doc=1032,freq=1.0), product of:
              0.4718734 = queryWeight, product of:
                3.3051245 = boost
                9.177984 = idf(docFreq=11, maxDocs=42740)
                0.015555728 = queryNorm
              0.430218 = fieldWeight in 1032, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.177984 = idf(docFreq=11, maxDocs=42740)
                0.046875 = fieldNorm(doc=1032)
        0.24 = coord(6/25)
    
  3. Scherer, B.: Automatische Indexierung und ihre Anwendung im DFG-Projekt "Gemeinsames Portal für Bibliotheken, Archive und Museen (BAM)" (2003) 0.15
    0.14664432 = sum of:
      0.14664432 = product of:
        0.7332216 = sum of:
          0.048034824 = weight(abstract_txt:systeme in 1284) [ClassicSimilarity], result of:
            0.048034824 = score(doc=1284,freq=1.0), product of:
              0.13017592 = queryWeight, product of:
                1.4174076 = boost
                5.903989 = idf(docFreq=316, maxDocs=42740)
                0.015555728 = queryNorm
              0.3689993 = fieldWeight in 1284, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.903989 = idf(docFreq=316, maxDocs=42740)
                0.0625 = fieldNorm(doc=1284)
          0.06973496 = weight(abstract_txt:erschließung in 1284) [ClassicSimilarity], result of:
            0.06973496 = score(doc=1284,freq=2.0), product of:
              0.1324698 = queryWeight, product of:
                1.4298415 = boost
                5.95578 = idf(docFreq=300, maxDocs=42740)
                0.015555728 = queryNorm
              0.52642155 = fieldWeight in 1284, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.95578 = idf(docFreq=300, maxDocs=42740)
                0.0625 = fieldNorm(doc=1284)
          0.048230648 = weight(abstract_txt:jahren in 1284) [ClassicSimilarity], result of:
            0.048230648 = score(doc=1284,freq=1.0), product of:
              0.14941895 = queryWeight, product of:
                1.85985 = boost
                5.1646085 = idf(docFreq=663, maxDocs=42740)
                0.015555728 = queryNorm
              0.32278803 = fieldWeight in 1284, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1646085 = idf(docFreq=663, maxDocs=42740)
                0.0625 = fieldNorm(doc=1284)
          0.26378086 = weight(abstract_txt:indexierung in 1284) [ClassicSimilarity], result of:
            0.26378086 = score(doc=1284,freq=6.0), product of:
              0.25525174 = queryWeight, product of:
                2.430857 = boost
                6.7502356 = idf(docFreq=135, maxDocs=42740)
                0.015555728 = queryNorm
              1.0334146 = fieldWeight in 1284, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.7502356 = idf(docFreq=135, maxDocs=42740)
                0.0625 = fieldNorm(doc=1284)
          0.3034403 = weight(abstract_txt:automatischen in 1284) [ClassicSimilarity], result of:
            0.3034403 = score(doc=1284,freq=4.0), product of:
              0.3530737 = queryWeight, product of:
                3.3012402 = boost
                6.8753986 = idf(docFreq=119, maxDocs=42740)
                0.015555728 = queryNorm
              0.8594248 = fieldWeight in 1284, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.8753986 = idf(docFreq=119, maxDocs=42740)
                0.0625 = fieldNorm(doc=1284)
        0.2 = coord(5/25)
    
  4. Oberhauser, O.; Labner, J.: OPAC-Erweiterung durch automatische Indexierung : Empirische Untersuchung mit Daten aus dem Österreichischen Verbundkatalog (2002) 0.14
    0.13808705 = sum of:
      0.13808705 = product of:
        0.8630441 = sum of:
          0.06028831 = weight(abstract_txt:jahren in 2884) [ClassicSimilarity], result of:
            0.06028831 = score(doc=2884,freq=1.0), product of:
              0.14941895 = queryWeight, product of:
                1.85985 = boost
                5.1646085 = idf(docFreq=663, maxDocs=42740)
                0.015555728 = queryNorm
              0.40348503 = fieldWeight in 2884, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1646085 = idf(docFreq=663, maxDocs=42740)
                0.078125 = fieldNorm(doc=2884)
          0.1346101 = weight(abstract_txt:indexierung in 2884) [ClassicSimilarity], result of:
            0.1346101 = score(doc=2884,freq=1.0), product of:
              0.25525174 = queryWeight, product of:
                2.430857 = boost
                6.7502356 = idf(docFreq=135, maxDocs=42740)
                0.015555728 = queryNorm
              0.52736217 = fieldWeight in 2884, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7502356 = idf(docFreq=135, maxDocs=42740)
                0.078125 = fieldNorm(doc=2884)
          0.18965018 = weight(abstract_txt:automatischen in 2884) [ClassicSimilarity], result of:
            0.18965018 = score(doc=2884,freq=1.0), product of:
              0.3530737 = queryWeight, product of:
                3.3012402 = boost
                6.8753986 = idf(docFreq=119, maxDocs=42740)
                0.015555728 = queryNorm
              0.5371405 = fieldWeight in 2884, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8753986 = idf(docFreq=119, maxDocs=42740)
                0.078125 = fieldNorm(doc=2884)
          0.47849548 = weight(abstract_txt:milos in 2884) [ClassicSimilarity], result of:
            0.47849548 = score(doc=2884,freq=2.0), product of:
              0.4718734 = queryWeight, product of:
                3.3051245 = boost
                9.177984 = idf(docFreq=11, maxDocs=42740)
                0.015555728 = queryNorm
              1.0140336 = fieldWeight in 2884, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.177984 = idf(docFreq=11, maxDocs=42740)
                0.078125 = fieldNorm(doc=2884)
        0.16 = coord(4/25)
    
  5. Lepsky, K.: Automatische Indexierung und bibliothekarische Inhaltserschließung : Ergebnisse des DFG-Projekts MILOS I (1996) 0.14
    0.13653858 = sum of:
      0.13653858 = product of:
        0.85336614 = sum of:
          0.058236916 = weight(abstract_txt:vorgestellt in 4062) [ClassicSimilarity], result of:
            0.058236916 = score(doc=4062,freq=1.0), product of:
              0.112953044 = queryWeight, product of:
                1.3203176 = boost
                5.4995756 = idf(docFreq=474, maxDocs=42740)
                0.015555728 = queryNorm
              0.5155852 = fieldWeight in 4062, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4995756 = idf(docFreq=474, maxDocs=42740)
                0.09375 = fieldNorm(doc=4062)
          0.16153212 = weight(abstract_txt:indexierung in 4062) [ClassicSimilarity], result of:
            0.16153212 = score(doc=4062,freq=1.0), product of:
              0.25525174 = queryWeight, product of:
                2.430857 = boost
                6.7502356 = idf(docFreq=135, maxDocs=42740)
                0.015555728 = queryNorm
              0.63283455 = fieldWeight in 4062, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7502356 = idf(docFreq=135, maxDocs=42740)
                0.09375 = fieldNorm(doc=4062)
          0.22758022 = weight(abstract_txt:automatischen in 4062) [ClassicSimilarity], result of:
            0.22758022 = score(doc=4062,freq=1.0), product of:
              0.3530737 = queryWeight, product of:
                3.3012402 = boost
                6.8753986 = idf(docFreq=119, maxDocs=42740)
                0.015555728 = queryNorm
              0.6445686 = fieldWeight in 4062, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8753986 = idf(docFreq=119, maxDocs=42740)
                0.09375 = fieldNorm(doc=4062)
          0.4060169 = weight(abstract_txt:milos in 4062) [ClassicSimilarity], result of:
            0.4060169 = score(doc=4062,freq=1.0), product of:
              0.4718734 = queryWeight, product of:
                3.3051245 = boost
                9.177984 = idf(docFreq=11, maxDocs=42740)
                0.015555728 = queryNorm
              0.860436 = fieldWeight in 4062, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.177984 = idf(docFreq=11, maxDocs=42740)
                0.09375 = fieldNorm(doc=4062)
        0.16 = coord(4/25)
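
The content scores combine several term scores: each term contributes queryWeight × fieldWeight, the contributions are summed, and the sum is multiplied by coord (matching terms divided by query terms). The sketch below recomputes the score of the first content hit (doc 5967) from the boost, docFreq, freq and fieldNorm values listed in its explain tree. It is an illustration under the same assumed tf and idf formulas as inferred from this listing, with hypothetical function names, and queryNorm is simply copied from the record.

```python
import math

QUERY_NORM = 0.015555728  # copied from the listing
MAX_DOCS = 42740          # copied from the listing

def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    # Assumed formula; it reproduces the idf values shown in this record.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(boost: float, doc_freq: int, freq: float, field_norm: float) -> float:
    # queryWeight = boost * idf * queryNorm
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    # fieldWeight = sqrt(freq) * idf * fieldNorm
    field_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight

# (boost, docFreq, freq, fieldNorm) for the four matching terms of doc 5967
terms = [
    (1.136241, 8, 1.0, 0.25),          # title_txt:kascade
    (2.430857, 135, 1.0, 0.109375),    # abstract_txt:indexierung
    (3.3012402, 119, 1.0, 0.109375),   # abstract_txt:automatischen
    (3.3051245, 11, 1.0, 0.109375),    # abstract_txt:milos
]

raw = sum(term_score(*t) for t in terms)
coord = 4 / 25  # 4 of the 25 query terms matched
score = raw * coord

print(f"{score:.2f}")
```

Because idf enters both queryWeight and fieldWeight, rare terms such as "milos" and "kascade" dominate the sum even when they occur only once.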