Document (#43130)

Author
Giesselbach, S.
Estler-Ziegler, T.
Title
Dokumente schneller analysieren mit Künstlicher Intelligenz
Source
E-mail to Inetbib, 06.02.2021, [from Tania Estler-Ziegler]
Year
2021
Abstract
Artificial intelligence (AI) and natural language understanding (NLU) are changing many aspects of our everyday lives and the way we work. NLU gained particular prominence through voice assistants such as Siri, Alexa, and Google Now. NLU offers companies and institutions the potential to make processes more efficient and to derive added value from textual content. NLU solutions are thus able to analyze and index complex, unstructured documents by content. For semantic text analysis, the NLU team at IAIS has developed language models that are trained with deep learning methods. The NLU suite analyzes documents, extracts key data and, if required, even produces a structured summary. These results, as well as the content of the documents themselves, can be used to compare documents or to find texts containing similar information. AI-based language models are clearly superior to classical keyword indexing, because they not only find texts containing predefined keywords but also search intelligently for terms that occur in a similar context or are used as synonyms. The talk puts the terms "Künstliche Intelligenz" (artificial intelligence) and "Natural Language Understanding" into context and outlines possibilities, limitations, current research directions, and methods. Practical examples then demonstrate how NLU can be used for automated voucher processing, for cataloguing large data collections such as news items and patents, and for the automated thematic grouping of social media posts and publications.
Content
Cf.: https://www.iais.fraunhofer.de/.
Footnote
Talk given at the Berliner Arbeitskreis Information (BAK) on 25.02.2021.
Theme
Computational linguistics
Automatic indexing
Field
Computer science
Linguistics

Similar documents (author)

  1. Ziegler, R.A.; Ziegler, R.S.: ¬The National Film Registry : a videography (1995) 6.36
    6.3560677 = sum of:
      6.3560677 = weight(author_txt:ziegler in 3445) [ClassicSimilarity], result of:
        6.3560677 = fieldWeight in 3445, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.5 = fieldNorm(doc=3445)
    
  2. Ziegler, J.: ¬Der Auskunftsbibliothekar : ein Zauberlehrling? (1991) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:ziegler in 4325) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 4325, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=4325)
    
  3. Ziegler, B.: ESS: ein schneller Algorithmus zur Mustersuche in Zeichenfolgen (1996) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:ziegler in 7543) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 7543, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=7543)
    
  4. Ziegler, C.: Smartes Chaos : Web 2.0 versus Semantic Web (2006) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:ziegler in 4868) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 4868, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=4868)
    
  5. Ziegler, C.: Weltendämmerung : XML und Datenbanken: Einblick in Tamino (2001) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:ziegler in 5802) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 5802, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=5802)
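
The author-match breakdowns above come from Lucene's ClassicSimilarity (a TF-IDF scheme). As a minimal sketch, assuming the standard ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), the listed field weights can be recomputed as shown below. The function names are illustrative and are not Lucene's API; the fieldNorm values are copied from the listing rather than derived from field lengths.

```python
import math

def tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)

def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as used by ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the explain output."""
    return tf(freq) * idf(doc_freq, max_docs) * field_norm

# Entry 1: "ziegler" occurs twice in the author field (fieldNorm 0.5).
score_1 = field_weight(freq=2.0, doc_freq=14, max_docs=44218, field_norm=0.5)

# Entries 2 to 5: "ziegler" occurs once (fieldNorm 0.625).
score_2 = field_weight(freq=1.0, doc_freq=14, max_docs=44218, field_norm=0.625)

print(f"{score_1:.4f} {score_2:.4f}")
```

Entry 1 outranks the others only because the term occurs twice; its smaller fieldNorm (a longer author field) partly offsets that gain.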
    

Similar documents (content)

  1. Sack, H.: Hybride Künstliche Intelligenz in der automatisierten Inhaltserschließung (2021) 0.14
    0.13845336 = sum of:
      0.13845336 = product of:
        0.86533356 = sum of:
          0.095962875 = weight(abstract_txt:verschlagwortung in 372) [ClassicSimilarity], result of:
            0.095962875 = score(doc=372,freq=1.0), product of:
              0.14418933 = queryWeight, product of:
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.016925948 = queryNorm
              0.66553384 = fieldWeight in 372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.078125 = fieldNorm(doc=372)
          0.2675407 = weight(abstract_txt:automatisierten in 372) [ClassicSimilarity], result of:
            0.2675407 = score(doc=372,freq=2.0), product of:
              0.28562146 = queryWeight, product of:
                1.990416 = boost
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.016925948 = queryNorm
              0.93669677 = fieldWeight in 372, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.078125 = fieldNorm(doc=372)
          0.17292915 = weight(abstract_txt:intelligenz in 372) [ClassicSimilarity], result of:
            0.17292915 = score(doc=372,freq=2.0), product of:
              0.24442193 = queryWeight, product of:
                2.2550914 = boost
                6.4035826 = idf(docFreq=198, maxDocs=44218)
                0.016925948 = queryNorm
              0.7075026 = fieldWeight in 372, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.4035826 = idf(docFreq=198, maxDocs=44218)
                0.078125 = fieldNorm(doc=372)
          0.32890078 = weight(abstract_txt:dokumente in 372) [ClassicSimilarity], result of:
            0.32890078 = score(doc=372,freq=3.0), product of:
              0.3886188 = queryWeight, product of:
                3.670966 = boost
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.016925948 = queryNorm
              0.84633267 = fieldWeight in 372, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.078125 = fieldNorm(doc=372)
        0.16 = coord(4/25)
    
  2. Nohr, H.: Theorie des Information Retrieval II : Automatische Indexierung (2004) 0.08
    0.07765681 = sum of:
      0.07765681 = product of:
        0.48535508 = sum of:
          0.094567455 = weight(abstract_txt:unstrukturierte in 8) [ClassicSimilarity], result of:
            0.094567455 = score(doc=8,freq=1.0), product of:
              0.16569093 = queryWeight, product of:
                1.0719705 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.016925948 = queryNorm
              0.5707461 = fieldWeight in 8, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.0625 = fieldNorm(doc=8)
          0.09985287 = weight(abstract_txt:textanalyse in 8) [ClassicSimilarity], result of:
            0.09985287 = score(doc=8,freq=1.0), product of:
              0.1718085 = queryWeight, product of:
                1.0915805 = boost
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.016925948 = queryNorm
              0.581187 = fieldWeight in 8, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.0625 = fieldNorm(doc=8)
          0.027814133 = weight(abstract_txt:werden in 8) [ClassicSimilarity], result of:
            0.027814133 = score(doc=8,freq=3.0), product of:
              0.0732793 = queryWeight, product of:
                1.2347661 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.016925948 = queryNorm
              0.3795633 = fieldWeight in 8, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.0625 = fieldNorm(doc=8)
          0.26312062 = weight(abstract_txt:dokumente in 8) [ClassicSimilarity], result of:
            0.26312062 = score(doc=8,freq=3.0), product of:
              0.3886188 = queryWeight, product of:
                3.670966 = boost
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.016925948 = queryNorm
              0.67706615 = fieldWeight in 8, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.0625 = fieldNorm(doc=8)
        0.16 = coord(4/25)
    
  3. Zilm, G.: "KI ist ein glorifizierter Taschenrechner" (2023) 0.07
    0.0687769 = sum of:
      0.0687769 = product of:
        0.57314086 = sum of:
          0.032116994 = weight(abstract_txt:werden in 1129) [ClassicSimilarity], result of:
            0.032116994 = score(doc=1129,freq=1.0), product of:
              0.0732793 = queryWeight, product of:
                1.2347661 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.016925948 = queryNorm
              0.43828195 = fieldWeight in 1129, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.125 = fieldNorm(doc=1129)
          0.26433727 = weight(abstract_txt:künstliche in 1129) [ClassicSimilarity], result of:
            0.26433727 = score(doc=1129,freq=2.0), product of:
              0.20712055 = queryWeight, product of:
                1.6949623 = boost
                7.2195506 = idf(docFreq=87, maxDocs=44218)
                0.016925948 = queryNorm
              1.2762483 = fieldWeight in 1129, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.2195506 = idf(docFreq=87, maxDocs=44218)
                0.125 = fieldNorm(doc=1129)
          0.27668664 = weight(abstract_txt:intelligenz in 1129) [ClassicSimilarity], result of:
            0.27668664 = score(doc=1129,freq=2.0), product of:
              0.24442193 = queryWeight, product of:
                2.2550914 = boost
                6.4035826 = idf(docFreq=198, maxDocs=44218)
                0.016925948 = queryNorm
              1.1320041 = fieldWeight in 1129, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.4035826 = idf(docFreq=198, maxDocs=44218)
                0.125 = fieldNorm(doc=1129)
        0.12 = coord(3/25)
    
  4. Ehrmann, S.: ¬Die Nadel im Bytehaufen : Finden statt suchen: Text Retrieval, Multimediadatenbanken, Dokumentenmanagement (2000) 0.07
    0.06570535 = sum of:
      0.06570535 = product of:
        0.5475446 = sum of:
          0.08911978 = weight(abstract_txt:finden in 5317) [ClassicSimilarity], result of:
            0.08911978 = score(doc=5317,freq=1.0), product of:
              0.12640871 = queryWeight, product of:
                1.3241493 = boost
                5.6401033 = idf(docFreq=426, maxDocs=44218)
                0.016925948 = queryNorm
              0.7050129 = fieldWeight in 5317, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6401033 = idf(docFreq=426, maxDocs=44218)
                0.125 = fieldNorm(doc=5317)
          0.15459925 = weight(abstract_txt:texte in 5317) [ClassicSimilarity], result of:
            0.15459925 = score(doc=5317,freq=1.0), product of:
              0.18250126 = queryWeight, product of:
                1.591041 = boost
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.016925948 = queryNorm
              0.8471133 = fieldWeight in 5317, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.125 = fieldNorm(doc=5317)
          0.30382556 = weight(abstract_txt:dokumente in 5317) [ClassicSimilarity], result of:
            0.30382556 = score(doc=5317,freq=1.0), product of:
              0.3886188 = queryWeight, product of:
                3.670966 = boost
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.016925948 = queryNorm
              0.7818087 = fieldWeight in 5317, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.125 = fieldNorm(doc=5317)
        0.12 = coord(3/25)
    
  5. Henn, W.: Wehe, die Computer sagen einmal "ich" : Gefahr durch KI (2018) 0.06
    0.062201798 = sum of:
      0.062201798 = product of:
        0.51834834 = sum of:
          0.040146243 = weight(abstract_txt:werden in 4330) [ClassicSimilarity], result of:
            0.040146243 = score(doc=4330,freq=1.0), product of:
              0.0732793 = queryWeight, product of:
                1.2347661 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.016925948 = queryNorm
              0.54785246 = fieldWeight in 4330, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.15625 = fieldNorm(doc=4330)
          0.23364332 = weight(abstract_txt:künstliche in 4330) [ClassicSimilarity], result of:
            0.23364332 = score(doc=4330,freq=1.0), product of:
              0.20712055 = queryWeight, product of:
                1.6949623 = boost
                7.2195506 = idf(docFreq=87, maxDocs=44218)
                0.016925948 = queryNorm
              1.1280547 = fieldWeight in 4330, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2195506 = idf(docFreq=87, maxDocs=44218)
                0.15625 = fieldNorm(doc=4330)
          0.24455875 = weight(abstract_txt:intelligenz in 4330) [ClassicSimilarity], result of:
            0.24455875 = score(doc=4330,freq=1.0), product of:
              0.24442193 = queryWeight, product of:
                2.2550914 = boost
                6.4035826 = idf(docFreq=198, maxDocs=44218)
                0.016925948 = queryNorm
              1.0005598 = fieldWeight in 4330, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4035826 = idf(docFreq=198, maxDocs=44218)
                0.15625 = fieldNorm(doc=4330)
        0.12 = coord(3/25)
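
The content-match breakdowns above combine a per-term score (queryWeight times fieldWeight) with a coordination factor for the share of query terms matched. The following is a rough sketch that recomputes the first entry (Sack, doc 372) under the same ClassicSimilarity assumptions as before. The `Term` type and the helper names are hypothetical; queryNorm, the boosts, and fieldNorm are copied from the listing, and a term with no boost line is assumed to have boost 1.0.

```python
import math
from typing import NamedTuple

MAX_DOCS = 44218          # maxDocs from the listing
QUERY_NORM = 0.016925948  # queryNorm from the listing
QUERY_TERMS = 25          # denominator of coord(n/25)

class Term(NamedTuple):
    boost: float   # query-side boost (assumed 1.0 when not listed)
    freq: float    # term frequency in the document's abstract
    doc_freq: int  # number of documents containing the term

def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(term: Term, field_norm: float) -> float:
    """score = queryWeight * fieldWeight for a single matching term."""
    i = idf(term.doc_freq)
    query_weight = term.boost * i * QUERY_NORM
    field_weight = math.sqrt(term.freq) * i * field_norm
    return query_weight * field_weight

def document_score(terms: list[Term], field_norm: float) -> float:
    """Sum of term scores scaled by coord = matched terms / query terms."""
    coord = len(terms) / QUERY_TERMS
    return coord * sum(term_score(t, field_norm) for t in terms)

# Matching terms for doc 372, values taken from the breakdown above.
sack_terms = [
    Term(boost=1.0,       freq=1.0, doc_freq=23),   # verschlagwortung
    Term(boost=1.990416,  freq=2.0, doc_freq=24),   # automatisierten
    Term(boost=2.2550914, freq=2.0, doc_freq=198),  # intelligenz
    Term(boost=3.670966,  freq=3.0, doc_freq=230),  # dokumente
]
sack_score = document_score(sack_terms, field_norm=0.078125)
print(f"{sack_score:.4f}")
```

The coord factor explains the low absolute scores: each listed document matches only 3 or 4 of the 25 query terms, so the summed term scores are scaled by 0.12 or 0.16.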