Document (#43374)

Author
Sack, H.
Title
Hybride Künstliche Intelligenz in der automatisierten Inhaltserschließung
Source
Qualität in der Inhaltserschließung. Ed.: M. Franke-Maier et al.
Imprint
München : De Gruyter Saur
Year
2021
Pages
pp. 387-405
Series
Bibliotheks- und Informationspraxis; 70
Abstract
Efficient (online) access to library and archive materials requires subject indexing of these documents at a sufficient level of quality. Precise keyword assignment and categorization of these unstructured documents enable structured access in both the analog and the digital world. In addition, a complete transcription of the documents extends access through full-text search. Given the spectacular successes recently achieved in artificial intelligence, it is tempting to conclude that the problem of automated subject indexing for libraries and archives can also be regarded as more or less solved. However, these successes, often achieved only in thematically narrow subfields, cannot always be readily generalized or transferred to a new context. This chapter discusses the current state of the art in automated subject indexing using selected examples, along with possible advances and forecasts based on current developments in machine learning and artificial intelligence, including criticism of them.
Theme
Automatisches Indexieren

Similar documents (content)

  1. Kasprzik, A.: Automatisierte und semiautomatisierte Klassifizierung : eine Analyse aktueller Projekte (2014) 0.23
    0.23148112 = sum of:
      0.23148112 = product of:
        0.8267183 = sum of:
          0.025296006 = weight(abstract_txt:dieser in 2470) [ClassicSimilarity], result of:
            0.025296006 = score(doc=2470,freq=1.0), product of:
              0.07399707 = queryWeight, product of:
                1.0114458 = boost
                4.3756986 = idf(docFreq=1511, maxDocs=44218)
                0.016719548 = queryNorm
              0.34185144 = fieldWeight in 2470, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3756986 = idf(docFreq=1511, maxDocs=44218)
                0.078125 = fieldNorm(doc=2470)
          0.0705598 = weight(abstract_txt:aktuellen in 2470) [ClassicSimilarity], result of:
            0.0705598 = score(doc=2470,freq=1.0), product of:
              0.14662841 = queryWeight, product of:
                1.4237849 = boost
                6.159553 = idf(docFreq=253, maxDocs=44218)
                0.016719548 = queryNorm
              0.4812151 = fieldWeight in 2470, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.159553 = idf(docFreq=253, maxDocs=44218)
                0.078125 = fieldNorm(doc=2470)
          0.11153042 = weight(abstract_txt:inhaltserschließung in 2470) [ClassicSimilarity], result of:
            0.11153042 = score(doc=2470,freq=1.0), product of:
              0.19896443 = queryWeight, product of:
                1.6585289 = boost
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.016719548 = queryNorm
              0.56055456 = fieldWeight in 2470, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.078125 = fieldNorm(doc=2470)
          0.11361619 = weight(abstract_txt:künstlichen in 2470) [ClassicSimilarity], result of:
            0.11361619 = score(doc=2470,freq=1.0), product of:
              0.20143737 = queryWeight, product of:
                1.668804 = boost
                7.2195506 = idf(docFreq=87, maxDocs=44218)
                0.016719548 = queryNorm
              0.56402737 = fieldWeight in 2470, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2195506 = idf(docFreq=87, maxDocs=44218)
                0.078125 = fieldNorm(doc=2470)
          0.11080834 = weight(abstract_txt:dokumente in 2470) [ClassicSimilarity], result of:
            0.11080834 = score(doc=2470,freq=1.0), product of:
              0.22677332 = queryWeight, product of:
                2.1685874 = boost
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.016719548 = queryNorm
              0.4886304 = fieldWeight in 2470, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.078125 = fieldNorm(doc=2470)
          0.11892414 = weight(abstract_txt:intelligenz in 2470) [ClassicSimilarity], result of:
            0.11892414 = score(doc=2470,freq=1.0), product of:
              0.23771521 = queryWeight, product of:
                2.2202885 = boost
                6.4035826 = idf(docFreq=198, maxDocs=44218)
                0.016719548 = queryNorm
              0.5002799 = fieldWeight in 2470, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4035826 = idf(docFreq=198, maxDocs=44218)
                0.078125 = fieldNorm(doc=2470)
          0.2759834 = weight(abstract_txt:automatisierten in 2470) [ClassicSimilarity], result of:
            0.2759834 = score(doc=2470,freq=1.0), product of:
              0.41667643 = queryWeight, product of:
                2.939547 = boost
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.016719548 = queryNorm
              0.66234463 = fieldWeight in 2470, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.078125 = fieldNorm(doc=2470)
        0.28 = coord(7/25)
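
The breakdown above can be checked by hand. The following is a minimal sketch, not the search engine's own code: it assumes the classic TF-IDF formulas that appear to reproduce the printed values (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), term score = queryWeight * fieldWeight, document score = coord * sum of term scores). All function and variable names are hypothetical; the numbers are copied from the listing for doc 2470.

```python
import math

# Values copied from the explain listing for doc 2470.
MAX_DOCS = 44218
QUERY_NORM = 0.016719548
FIELD_NORM = 0.078125
TOTAL_QUERY_TERMS = 25  # denominator of coord(7/25)

# (term, termFreq, docFreq, boost), one tuple per matching term.
TERMS = [
    ("dieser", 1.0, 1511, 1.0114458),
    ("aktuellen", 1.0, 253, 1.4237849),
    ("inhaltserschließung", 1.0, 91, 1.6585289),
    ("künstlichen", 1.0, 87, 1.668804),
    ("dokumente", 1.0, 230, 2.1685874),
    ("intelligenz", 1.0, 198, 2.2202885),
    ("automatisierten", 1.0, 24, 2.939547),
]


def idf(doc_freq, max_docs):
    """Inverse document frequency (assumed formula; matches the listing)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, boost):
    """Score of one term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq, MAX_DOCS)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * FIELD_NORM
    return query_weight * field_weight


def document_score(terms):
    """coord (matched terms / query terms) times the sum of term scores."""
    coord = len(terms) / TOTAL_QUERY_TERMS
    return coord * sum(term_score(f, df, b) for _, f, df, b in terms)


score = document_score(TERMS)
print(f"{score:.4f}")  # compare with the 0.23148112 shown for entry 1
```

Because idf enters both queryWeight and fieldWeight, it is effectively squared, which is why the rare term "automatisierten" (docFreq=24) contributes far more than the common term "dieser" (docFreq=1511).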
    
  2. Groß, T.; Faden, M.: Automatische Indexierung elektronischer Dokumente an der Deutschen Zentralbibliothek für Wirtschaftswissenschaften : Bericht über die Jahrestagung der Internationalen Buchwissenschaftlichen Gesellschaft (2010) 0.12
    0.12013538 = sum of:
      0.12013538 = product of:
        0.42905495 = sum of:
          0.015177605 = weight(abstract_txt:dieser in 4051) [ClassicSimilarity], result of:
            0.015177605 = score(doc=4051,freq=1.0), product of:
              0.07399707 = queryWeight, product of:
                1.0114458 = boost
                4.3756986 = idf(docFreq=1511, maxDocs=44218)
                0.016719548 = queryNorm
              0.20511088 = fieldWeight in 4051, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3756986 = idf(docFreq=1511, maxDocs=44218)
                0.046875 = fieldNorm(doc=4051)
          0.037512604 = weight(abstract_txt:erschließung in 4051) [ClassicSimilarity], result of:
            0.037512604 = score(doc=4051,freq=1.0), product of:
              0.13526866 = queryWeight, product of:
                1.3675207 = boost
                5.916144 = idf(docFreq=323, maxDocs=44218)
                0.016719548 = queryNorm
              0.27731925 = fieldWeight in 4051, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.916144 = idf(docFreq=323, maxDocs=44218)
                0.046875 = fieldNorm(doc=4051)
          0.04233588 = weight(abstract_txt:aktuellen in 4051) [ClassicSimilarity], result of:
            0.04233588 = score(doc=4051,freq=1.0), product of:
              0.14662841 = queryWeight, product of:
                1.4237849 = boost
                6.159553 = idf(docFreq=253, maxDocs=44218)
                0.016719548 = queryNorm
              0.28872904 = fieldWeight in 4051, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.159553 = idf(docFreq=253, maxDocs=44218)
                0.046875 = fieldNorm(doc=4051)
          0.066918254 = weight(abstract_txt:inhaltserschließung in 4051) [ClassicSimilarity], result of:
            0.066918254 = score(doc=4051,freq=1.0), product of:
              0.19896443 = queryWeight, product of:
                1.6585289 = boost
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.016719548 = queryNorm
              0.33633274 = fieldWeight in 4051, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.046875 = fieldNorm(doc=4051)
          0.13795893 = weight(abstract_txt:erzielten in 4051) [ClassicSimilarity], result of:
            0.13795893 = score(doc=4051,freq=1.0), product of:
              0.32228908 = queryWeight, product of:
                2.1108537 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.016719548 = queryNorm
              0.42805958 = fieldWeight in 4051, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.046875 = fieldNorm(doc=4051)
          0.06266667 = weight(abstract_txt:zugang in 4051) [ClassicSimilarity], result of:
            0.06266667 = score(doc=4051,freq=1.0), product of:
              0.21800539 = queryWeight, product of:
                2.1262512 = boost
                6.1323667 = idf(docFreq=260, maxDocs=44218)
                0.016719548 = queryNorm
              0.2874547 = fieldWeight in 4051, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1323667 = idf(docFreq=260, maxDocs=44218)
                0.046875 = fieldNorm(doc=4051)
          0.06648501 = weight(abstract_txt:dokumente in 4051) [ClassicSimilarity], result of:
            0.06648501 = score(doc=4051,freq=1.0), product of:
              0.22677332 = queryWeight, product of:
                2.1685874 = boost
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.016719548 = queryNorm
              0.29317826 = fieldWeight in 4051, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.046875 = fieldNorm(doc=4051)
        0.28 = coord(7/25)
    
  3. Boltzendahl, S.: Ontologien in digitalen Bibliotheken unter dem Schwerpunkt Inhaltserschliessung und Recherche (2004) 0.11
    0.10759485 = sum of:
      0.10759485 = product of:
        0.44831187 = sum of:
          0.03393816 = weight(abstract_txt:dieser in 1414) [ClassicSimilarity], result of:
            0.03393816 = score(doc=1414,freq=5.0), product of:
              0.07399707 = queryWeight, product of:
                1.0114458 = boost
                4.3756986 = idf(docFreq=1511, maxDocs=44218)
                0.016719548 = queryNorm
              0.4586419 = fieldWeight in 1414, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.3756986 = idf(docFreq=1511, maxDocs=44218)
                0.046875 = fieldNorm(doc=1414)
          0.03848422 = weight(abstract_txt:liegt in 1414) [ClassicSimilarity], result of:
            0.03848422 = score(doc=1414,freq=1.0), product of:
              0.13759443 = queryWeight, product of:
                1.3792269 = boost
                5.9667873 = idf(docFreq=307, maxDocs=44218)
                0.016719548 = queryNorm
              0.27969316 = fieldWeight in 1414, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9667873 = idf(docFreq=307, maxDocs=44218)
                0.046875 = fieldNorm(doc=1414)
          0.115905814 = weight(abstract_txt:inhaltserschließung in 1414) [ClassicSimilarity], result of:
            0.115905814 = score(doc=1414,freq=3.0), product of:
              0.19896443 = queryWeight, product of:
                1.6585289 = boost
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.016719548 = queryNorm
              0.5825454 = fieldWeight in 1414, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.046875 = fieldNorm(doc=1414)
          0.096406534 = weight(abstract_txt:künstlichen in 1414) [ClassicSimilarity], result of:
            0.096406534 = score(doc=1414,freq=2.0), product of:
              0.20143737 = queryWeight, product of:
                1.668804 = boost
                7.2195506 = idf(docFreq=87, maxDocs=44218)
                0.016719548 = queryNorm
              0.4785931 = fieldWeight in 1414, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.2195506 = idf(docFreq=87, maxDocs=44218)
                0.046875 = fieldNorm(doc=1414)
          0.06266667 = weight(abstract_txt:zugang in 1414) [ClassicSimilarity], result of:
            0.06266667 = score(doc=1414,freq=1.0), product of:
              0.21800539 = queryWeight, product of:
                2.1262512 = boost
                6.1323667 = idf(docFreq=260, maxDocs=44218)
                0.016719548 = queryNorm
              0.2874547 = fieldWeight in 1414, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1323667 = idf(docFreq=260, maxDocs=44218)
                0.046875 = fieldNorm(doc=1414)
          0.10091048 = weight(abstract_txt:intelligenz in 1414) [ClassicSimilarity], result of:
            0.10091048 = score(doc=1414,freq=2.0), product of:
              0.23771521 = queryWeight, product of:
                2.2202885 = boost
                6.4035826 = idf(docFreq=198, maxDocs=44218)
                0.016719548 = queryNorm
              0.42450154 = fieldWeight in 1414, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.4035826 = idf(docFreq=198, maxDocs=44218)
                0.046875 = fieldNorm(doc=1414)
        0.24 = coord(6/25)
    
  4. Gabler, S.: Vergabe von DDC-Sachgruppen mittels eines Schlagwort-Thesaurus (2021) 0.08
    0.08127901 = sum of:
      0.08127901 = product of:
        0.5079938 = sum of:
          0.020236805 = weight(abstract_txt:dieser in 1000) [ClassicSimilarity], result of:
            0.020236805 = score(doc=1000,freq=1.0), product of:
              0.07399707 = queryWeight, product of:
                1.0114458 = boost
                4.3756986 = idf(docFreq=1511, maxDocs=44218)
                0.016719548 = queryNorm
              0.27348116 = fieldWeight in 1000, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3756986 = idf(docFreq=1511, maxDocs=44218)
                0.0625 = fieldNorm(doc=1000)
          0.14160492 = weight(abstract_txt:kategorisierung in 1000) [ClassicSimilarity], result of:
            0.14160492 = score(doc=1000,freq=2.0), product of:
              0.17053707 = queryWeight, product of:
                1.0857497 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.016719548 = queryNorm
              0.8303468 = fieldWeight in 1000, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.0625 = fieldNorm(doc=1000)
          0.12536533 = weight(abstract_txt:dokumente in 1000) [ClassicSimilarity], result of:
            0.12536533 = score(doc=1000,freq=2.0), product of:
              0.22677332 = queryWeight, product of:
                2.1685874 = boost
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.016719548 = queryNorm
              0.55282223 = fieldWeight in 1000, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.0625 = fieldNorm(doc=1000)
          0.22078672 = weight(abstract_txt:automatisierten in 1000) [ClassicSimilarity], result of:
            0.22078672 = score(doc=1000,freq=1.0), product of:
              0.41667643 = queryWeight, product of:
                2.939547 = boost
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.016719548 = queryNorm
              0.5298757 = fieldWeight in 1000, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.0625 = fieldNorm(doc=1000)
        0.16 = coord(4/25)
    
  5. Giesselbach, S.; Estler-Ziegler, T.: Dokumente schneller analysieren mit Künstlicher Intelligenz (2021) 0.07
    0.07488962 = sum of:
      0.07488962 = product of:
        0.6240802 = sum of:
          0.17729335 = weight(abstract_txt:dokumente in 128) [ClassicSimilarity], result of:
            0.17729335 = score(doc=128,freq=4.0), product of:
              0.22677332 = queryWeight, product of:
                2.1685874 = boost
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.016719548 = queryNorm
              0.7818087 = fieldWeight in 128, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.0625 = fieldNorm(doc=128)
          0.13454731 = weight(abstract_txt:intelligenz in 128) [ClassicSimilarity], result of:
            0.13454731 = score(doc=128,freq=2.0), product of:
              0.23771521 = queryWeight, product of:
                2.2202885 = boost
                6.4035826 = idf(docFreq=198, maxDocs=44218)
                0.016719548 = queryNorm
              0.5660021 = fieldWeight in 128, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.4035826 = idf(docFreq=198, maxDocs=44218)
                0.0625 = fieldNorm(doc=128)
          0.31223956 = weight(abstract_txt:automatisierten in 128) [ClassicSimilarity], result of:
            0.31223956 = score(doc=128,freq=2.0), product of:
              0.41667643 = queryWeight, product of:
                2.939547 = boost
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.016719548 = queryNorm
              0.7493574 = fieldWeight in 128, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.0625 = fieldNorm(doc=128)
        0.12 = coord(3/25)