Document (#37599)

Author
Weiner, U.
Title
Vor uns die Dokumentenflut oder Automatische Indexierung als notwendige und sinnvolle Ergänzung zur intellektuellen Sacherschließung
Imprint
Wien : Universität
Year
2012
Pages
xxx Bl
Abstract
Vor dem Hintergrund veränderter Ansprüche der Bibliotheksbenutzer an Recherchemöglichkeiten - weg vom klassischen Online-Katalog hin zum "One-Stop-Shop" mit Funktionalitäten wie thematisches Browsing, Relevanzranking und dergleichen mehr - einerseits und der notwendigen Bearbeitung von Massendaten (Stichwort Dokumentenflut) andererseits rücken Systeme zur automatischen Indexierung wieder verstärkt in den Mittelpunkt des Interesses. Da in Österreich die Beschäftigung mit diesem Thema im Bibliotheksbereich bislang nur sehr selektiv, bezogen auf wenige konkrete Projekte, erfolgte, wird zuerst ein allgemeiner theoretischer Überblick über die unterschiedlichen Verfahrensansätze der automatischen Indexierung geboten. Im nächsten Schritt werden mit der IDX-basierten Indexierungssoftware MILOS (mit den Teilprojekten MILOS I, MILOS II und KASCADE) und dem modularen System intelligentCAPTURE (mit der integrierten Indexierungssoftware AUTINDEX) die bis vor wenigen Jahren im deutschsprachigen Raum einzigen im Praxiseinsatz befindlichen automatischen Indexierungssysteme vorgestellt. Mit zunehmender Notwendigkeit, neue Wege der inhaltlichen Erschließung zu beschreiten, wurden in den vergangenen 5 - 6 Jahren zahlreiche Softwareentwicklungen auf ihre Einsatzmöglichkeit im Bibliotheksbereich hin getestet. Stellvertretend für diese in Entwicklung befindlichen Systeme zur automatischen inhaltlichen Erschließung wird das Projekt PETRUS, welches in den Jahren 2009 - 2011 an der DNB durchgeführt wurde und die Komponenten PICA Match&Merge sowie die Extraction Platform der Firma Averbis beinhaltet, vorgestellt.
Footnote
Wien, Univ., Lehrgang Library and Information Studies, Master-Thesis, 2012
Theme
Automatisches Indexieren
Object
MILOS
KASCADE
intelligentCAPTURE
AUTINDEX
PETRUS

Similar documents (author)

  1. Weiner, S.T.: Electronic journals, four part series : an introduction (1997) 5.94
    5.937289 = sum of:
      5.937289 = weight(author_txt:weiner in 834) [ClassicSimilarity], result of:
        5.937289 = fieldWeight in 834, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.625 = fieldNorm(doc=834)
    
  2. Weiner, R.G.: Information access illiterate? (1997) 5.94
    5.937289 = sum of:
      5.937289 = weight(author_txt:weiner in 1413) [ClassicSimilarity], result of:
        5.937289 = fieldWeight in 1413, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.625 = fieldNorm(doc=1413)
    
  3. Weiner, M.: ¬Die Agenten kommen (2002) 5.94
    5.937289 = sum of:
      5.937289 = weight(author_txt:weiner in 6734) [ClassicSimilarity], result of:
        5.937289 = fieldWeight in 6734, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.625 = fieldNorm(doc=6734)
    
  4. Weiner, M.L.; Rusch, P.F.: New searching technologies and interfaces (1996) 4.75
    4.749831 = sum of:
      4.749831 = weight(author_txt:weiner in 7040) [ClassicSimilarity], result of:
        4.749831 = fieldWeight in 7040, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.5 = fieldNorm(doc=7040)
    
  5. Weiner, M.L.; Rusch, P.F.: New searching technologies and interfaces (1997) 4.75
    4.749831 = sum of:
      4.749831 = weight(author_txt:weiner in 321) [ClassicSimilarity], result of:
        4.749831 = fieldWeight in 321, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.5 = fieldNorm(doc=321)
    

Similar documents (content)

  1. Lepsky, K.; Zimmermann, H.H.: Katalogerweiterung durch Scanning und automatische Dokumenterschließung : Ergebnisse des DFG-Projekts KASCADE (2000) 0.21
    0.21304935 = sum of:
      0.21304935 = product of:
        1.3315585 = sum of:
          0.39955735 = weight(title_txt:kascade in 4966) [ClassicSimilarity], result of:
            0.39955735 = score(doc=4966,freq=1.0), product of:
              0.16824065 = queryWeight, product of:
                1.1356871 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.015594236 = queryNorm
              2.3749156 = fieldWeight in 4966, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.25 = fieldNorm(doc=4966)
          0.18857197 = weight(abstract_txt:indexierung in 4966) [ClassicSimilarity], result of:
            0.18857197 = score(doc=4966,freq=1.0), product of:
              0.25522193 = queryWeight, product of:
                2.4227736 = boost
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.015594236 = queryNorm
              0.7388549 = fieldWeight in 4966, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.109375 = fieldNorm(doc=4966)
          0.26522532 = weight(abstract_txt:automatischen in 4966) [ClassicSimilarity], result of:
            0.26522532 = score(doc=4966,freq=1.0), product of:
              0.35263288 = queryWeight, product of:
                3.2883997 = boost
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.015594236 = queryNorm
              0.7521287 = fieldWeight in 4966, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.109375 = fieldNorm(doc=4966)
          0.47820374 = weight(abstract_txt:milos in 4966) [ClassicSimilarity], result of:
            0.47820374 = score(doc=4966,freq=1.0), product of:
              0.47461548 = queryWeight, product of:
                3.303884 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.015594236 = queryNorm
              1.0075604 = fieldWeight in 4966, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.109375 = fieldNorm(doc=4966)
        0.16 = coord(4/25)
    
  2. Schneider, A.: Moderne Retrievalverfahren in klassischen bibliotheksbezogenen Anwendungen : Projekte und Perspektiven (2008) 0.15
    0.15339527 = sum of:
      0.15339527 = product of:
        0.639147 = sum of:
          0.04068267 = weight(abstract_txt:vorgestellt in 4031) [ClassicSimilarity], result of:
            0.04068267 = score(doc=4031,freq=2.0), product of:
              0.11198254 = queryWeight, product of:
                1.310338 = boost
                5.4802814 = idf(docFreq=500, maxDocs=44218)
                0.015594236 = queryNorm
              0.36329475 = fieldWeight in 4031, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4802814 = idf(docFreq=500, maxDocs=44218)
                0.046875 = fieldNorm(doc=4031)
          0.03619112 = weight(abstract_txt:erschließung in 4031) [ClassicSimilarity], result of:
            0.03619112 = score(doc=4031,freq=1.0), product of:
              0.13050346 = queryWeight, product of:
                1.414553 = boost
                5.916144 = idf(docFreq=323, maxDocs=44218)
                0.015594236 = queryNorm
              0.27731925 = fieldWeight in 4031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.916144 = idf(docFreq=323, maxDocs=44218)
                0.046875 = fieldNorm(doc=4031)
          0.035427555 = weight(abstract_txt:jahren in 4031) [ClassicSimilarity], result of:
            0.035427555 = score(doc=4031,freq=1.0), product of:
              0.14728048 = queryWeight, product of:
                1.8404603 = boost
                5.1316223 = idf(docFreq=709, maxDocs=44218)
                0.015594236 = queryNorm
              0.2405448 = fieldWeight in 4031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1316223 = idf(docFreq=709, maxDocs=44218)
                0.046875 = fieldNorm(doc=4031)
          0.18192276 = weight(abstract_txt:bibliotheksbereich in 4031) [ClassicSimilarity], result of:
            0.18192276 = score(doc=4031,freq=3.0), product of:
              0.26552472 = queryWeight, product of:
                2.017719 = boost
                8.43879 = idf(docFreq=25, maxDocs=44218)
                0.015594236 = queryNorm
              0.68514436 = fieldWeight in 4031, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.43879 = idf(docFreq=25, maxDocs=44218)
                0.046875 = fieldNorm(doc=4031)
          0.13997838 = weight(abstract_txt:indexierung in 4031) [ClassicSimilarity], result of:
            0.13997838 = score(doc=4031,freq=3.0), product of:
              0.25522193 = queryWeight, product of:
                2.4227736 = boost
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.015594236 = queryNorm
              0.5484575 = fieldWeight in 4031, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.046875 = fieldNorm(doc=4031)
          0.20494448 = weight(abstract_txt:milos in 4031) [ClassicSimilarity], result of:
            0.20494448 = score(doc=4031,freq=1.0), product of:
              0.47461548 = queryWeight, product of:
                3.303884 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.015594236 = queryNorm
              0.4318116 = fieldWeight in 4031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.046875 = fieldNorm(doc=4031)
        0.24 = coord(6/25)
    
  3. Scherer, B.: Automatische Indexierung und ihre Anwendung im DFG-Projekt "Gemeinsames Portal für Bibliotheken, Archive und Museen (BAM)" (2003) 0.15
    0.14591081 = sum of:
      0.14591081 = product of:
        0.72955406 = sum of:
          0.047014274 = weight(abstract_txt:systeme in 4283) [ClassicSimilarity], result of:
            0.047014274 = score(doc=4283,freq=1.0), product of:
              0.12825708 = queryWeight, product of:
                1.4023256 = boost
                5.8650045 = idf(docFreq=340, maxDocs=44218)
                0.015594236 = queryNorm
              0.36656278 = fieldWeight in 4283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8650045 = idf(docFreq=340, maxDocs=44218)
                0.0625 = fieldNorm(doc=4283)
          0.06824263 = weight(abstract_txt:erschließung in 4283) [ClassicSimilarity], result of:
            0.06824263 = score(doc=4283,freq=2.0), product of:
              0.13050346 = queryWeight, product of:
                1.414553 = boost
                5.916144 = idf(docFreq=323, maxDocs=44218)
                0.015594236 = queryNorm
              0.52291816 = fieldWeight in 4283, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.916144 = idf(docFreq=323, maxDocs=44218)
                0.0625 = fieldNorm(doc=4283)
          0.04723674 = weight(abstract_txt:jahren in 4283) [ClassicSimilarity], result of:
            0.04723674 = score(doc=4283,freq=1.0), product of:
              0.14728048 = queryWeight, product of:
                1.8404603 = boost
                5.1316223 = idf(docFreq=709, maxDocs=44218)
                0.015594236 = queryNorm
              0.3207264 = fieldWeight in 4283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1316223 = idf(docFreq=709, maxDocs=44218)
                0.0625 = fieldNorm(doc=4283)
          0.2639458 = weight(abstract_txt:indexierung in 4283) [ClassicSimilarity], result of:
            0.2639458 = score(doc=4283,freq=6.0), product of:
              0.25522193 = queryWeight, product of:
                2.4227736 = boost
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.015594236 = queryNorm
              1.0341815 = fieldWeight in 4283, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.0625 = fieldNorm(doc=4283)
          0.30311465 = weight(abstract_txt:automatischen in 4283) [ClassicSimilarity], result of:
            0.30311465 = score(doc=4283,freq=4.0), product of:
              0.35263288 = queryWeight, product of:
                3.2883997 = boost
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.015594236 = queryNorm
              0.8595757 = fieldWeight in 4283, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.0625 = fieldNorm(doc=4283)
        0.2 = coord(5/25)
    
  4. Oberhauser, O.; Labner, J.: OPAC-Erweiterung durch automatische Indexierung : Empirische Untersuchung mit Daten aus dem Österreichischen Verbundkatalog (2002) 0.14
    0.13859929 = sum of:
      0.13859929 = product of:
        0.86624557 = sum of:
          0.059045922 = weight(abstract_txt:jahren in 883) [ClassicSimilarity], result of:
            0.059045922 = score(doc=883,freq=1.0), product of:
              0.14728048 = queryWeight, product of:
                1.8404603 = boost
                5.1316223 = idf(docFreq=709, maxDocs=44218)
                0.015594236 = queryNorm
              0.400908 = fieldWeight in 883, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1316223 = idf(docFreq=709, maxDocs=44218)
                0.078125 = fieldNorm(doc=883)
          0.13469426 = weight(abstract_txt:indexierung in 883) [ClassicSimilarity], result of:
            0.13469426 = score(doc=883,freq=1.0), product of:
              0.25522193 = queryWeight, product of:
                2.4227736 = boost
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.015594236 = queryNorm
              0.5277535 = fieldWeight in 883, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.078125 = fieldNorm(doc=883)
          0.18944664 = weight(abstract_txt:automatischen in 883) [ClassicSimilarity], result of:
            0.18944664 = score(doc=883,freq=1.0), product of:
              0.35263288 = queryWeight, product of:
                3.2883997 = boost
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.015594236 = queryNorm
              0.5372348 = fieldWeight in 883, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.078125 = fieldNorm(doc=883)
          0.48305875 = weight(abstract_txt:milos in 883) [ClassicSimilarity], result of:
            0.48305875 = score(doc=883,freq=2.0), product of:
              0.47461548 = queryWeight, product of:
                3.303884 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.015594236 = queryNorm
              1.0177897 = fieldWeight in 883, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.078125 = fieldNorm(doc=883)
        0.16 = coord(4/25)
    
  5. Lepsky, K.: Automatische Indexierung und bibliothekarische Inhaltserschließung : Ergebnisse des DFG-Projekts MILOS I (1996) 0.14
    0.13702272 = sum of:
      0.13702272 = product of:
        0.856392 = sum of:
          0.057533983 = weight(abstract_txt:vorgestellt in 2061) [ClassicSimilarity], result of:
            0.057533983 = score(doc=2061,freq=1.0), product of:
              0.11198254 = queryWeight, product of:
                1.310338 = boost
                5.4802814 = idf(docFreq=500, maxDocs=44218)
                0.015594236 = queryNorm
              0.51377636 = fieldWeight in 2061, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4802814 = idf(docFreq=500, maxDocs=44218)
                0.09375 = fieldNorm(doc=2061)
          0.16163312 = weight(abstract_txt:indexierung in 2061) [ClassicSimilarity], result of:
            0.16163312 = score(doc=2061,freq=1.0), product of:
              0.25522193 = queryWeight, product of:
                2.4227736 = boost
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.015594236 = queryNorm
              0.6333042 = fieldWeight in 2061, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.09375 = fieldNorm(doc=2061)
          0.22733599 = weight(abstract_txt:automatischen in 2061) [ClassicSimilarity], result of:
            0.22733599 = score(doc=2061,freq=1.0), product of:
              0.35263288 = queryWeight, product of:
                3.2883997 = boost
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.015594236 = queryNorm
              0.64468175 = fieldWeight in 2061, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.09375 = fieldNorm(doc=2061)
          0.40988895 = weight(abstract_txt:milos in 2061) [ClassicSimilarity], result of:
            0.40988895 = score(doc=2061,freq=1.0), product of:
              0.47461548 = queryWeight, product of:
                3.303884 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.015594236 = queryNorm
              0.8636232 = fieldWeight in 2061, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.09375 = fieldNorm(doc=2061)
        0.16 = coord(4/25)