Document (#38751)

Editor
Carstensen, K.U.
Title
Sprachtechnologie : ein Überblick
Issue
Version 2.1, 5. Oktober 2012
Source
http://www.kai-uwe-carstensen.de/Publikationen/Sprachtechnologie.pdf
Imprint
o.O. : K.U. Carstensen
Year
2012
Pages
VIII, 239 S
Abstract
Seit mehr als einem halben Jahrhundert existieren ernsthafte und ernst zu nehmende Versuche, menschliche Sprache maschinell zu verarbeiten. Maschinelle Übersetzung oder "natürliche" Dialoge mit Computern gehören zu den ersten Ideen, die den Bereich der späteren Computerlinguistik oder Sprachtechnologie abgesteckt und deren Vorhaben geleitet haben. Heute ist dieser auch maschinelle Sprachverarbeitung (natural language processing, NLP) genannte Bereich stark ausdiversifiziert: Durch die rapide Entwicklung der Informatik ist vieles vorher Unvorstellbare Realität (z. B. automatische Telefonauskunft), einiges früher Unmögliche immerhin möglich geworden (z. B. Handhelds mit Sprachein- und -ausgabe als digitale persönliche (Informations-)Assistenten). Es gibt verschiedene Anwendungen der Computerlinguistik, von denen einige den Sprung in die kommerzielle Nutzung geschafft haben (z. B. Diktiersysteme, Textklassifikation, maschinelle Übersetzung). Immer noch wird an natürlichsprachlichen Systemen (natural language systems, NLS) verschiedenster Funktionalität (z. B. zur Beantwortung beliebiger Fragen oder zur Generierung komplexer Texte) intensiv geforscht, auch wenn die hoch gesteckten Ziele von einst längst nicht erreicht sind (und deshalb entsprechend "heruntergefahren" wurden). Wo die maschinelle Sprachverarbeitung heute steht, ist allerdings angesichts der vielfältigen Aktivitäten in der Computerlinguistik und Sprachtechnologie weder offensichtlich noch leicht in Erfahrung zu bringen (für Studierende des Fachs und erst recht für Laien). Ein Ziel dieses Buches ist, es, die aktuelle Literaturlage in dieser Hinsicht zu verbessern, indem spezifisch systembezogene Aspekte der Computerlinguistik als Überblick über die Sprachtechnologie zusammengetragen werden.
Footnote
Volltext unter: ..\voltlexte\Carstensen_Sprachtechnologie.pdf.
Theme
Computerlinguistik
Grundlagen u. Einführungen: Allgemeine Literatur
Field
Sprachwissenschaft

Similar documents (content)

  1. Computerlinguistik und Sprachtechnologie : Eine Einführung (2010) 0.44
    0.43564704 = sum of:
      0.43564704 = product of:
        2.722794 = sum of:
          0.037942063 = weight(abstract_txt:überblick in 1735) [ClassicSimilarity], result of:
            0.037942063 = score(doc=1735,freq=1.0), product of:
              0.10467257 = queryWeight, product of:
                1.3086522 = boost
                5.799733 = idf(docFreq=363, maxDocs=44218)
                0.013791156 = queryNorm
              0.36248332 = fieldWeight in 1735, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.799733 = idf(docFreq=363, maxDocs=44218)
                0.0625 = fieldNorm(doc=1735)
          0.075354554 = weight(abstract_txt:übersetzung in 1735) [ClassicSimilarity], result of:
            0.075354554 = score(doc=1735,freq=1.0), product of:
              0.16538341 = queryWeight, product of:
                1.644954 = boost
                7.290168 = idf(docFreq=81, maxDocs=44218)
                0.013791156 = queryNorm
              0.4556355 = fieldWeight in 1735, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.290168 = idf(docFreq=81, maxDocs=44218)
                0.0625 = fieldNorm(doc=1735)
          0.48094702 = weight(abstract_txt:computerlinguistik in 1735) [ClassicSimilarity], result of:
            0.48094702 = score(doc=1735,freq=4.0), product of:
              0.4516553 = queryWeight, product of:
                3.8443801 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.013791156 = queryNorm
              1.0648541 = fieldWeight in 1735, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.0625 = fieldNorm(doc=1735)
          2.1285505 = weight(title_txt:sprachtechnologie in 1735) [ClassicSimilarity], result of:
            2.1285505 = score(doc=1735,freq=1.0), product of:
              0.52814466 = queryWeight, product of:
                4.1571836 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.013791156 = queryNorm
              4.0302415 = fieldWeight in 1735, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.4375 = fieldNorm(doc=1735)
        0.16 = coord(4/25)
    
  2. Computerlinguistik und Sprachtechnologie : Eine Einführung (2001) 0.43
    0.4278521 = sum of:
      0.4278521 = product of:
        2.6740756 = sum of:
          0.053658176 = weight(abstract_txt:überblick in 1749) [ClassicSimilarity], result of:
            0.053658176 = score(doc=1749,freq=2.0), product of:
              0.10467257 = queryWeight, product of:
                1.3086522 = boost
                5.799733 = idf(docFreq=363, maxDocs=44218)
                0.013791156 = queryNorm
              0.5126288 = fieldWeight in 1749, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.799733 = idf(docFreq=363, maxDocs=44218)
                0.0625 = fieldNorm(doc=1749)
          0.075354554 = weight(abstract_txt:übersetzung in 1749) [ClassicSimilarity], result of:
            0.075354554 = score(doc=1749,freq=1.0), product of:
              0.16538341 = queryWeight, product of:
                1.644954 = boost
                7.290168 = idf(docFreq=81, maxDocs=44218)
                0.013791156 = queryNorm
              0.4556355 = fieldWeight in 1749, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.290168 = idf(docFreq=81, maxDocs=44218)
                0.0625 = fieldNorm(doc=1749)
          0.41651234 = weight(abstract_txt:computerlinguistik in 1749) [ClassicSimilarity], result of:
            0.41651234 = score(doc=1749,freq=3.0), product of:
              0.4516553 = queryWeight, product of:
                3.8443801 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.013791156 = queryNorm
              0.9221907 = fieldWeight in 1749, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.0625 = fieldNorm(doc=1749)
          2.1285505 = weight(title_txt:sprachtechnologie in 1749) [ClassicSimilarity], result of:
            2.1285505 = score(doc=1749,freq=1.0), product of:
              0.52814466 = queryWeight, product of:
                4.1571836 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.013791156 = queryNorm
              4.0302415 = fieldWeight in 1749, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.4375 = fieldNorm(doc=1749)
        0.16 = coord(4/25)
    
  3. Kreissig, B.: ¬Der neue Brockhaus : Einsatz von Sprachtechnologie und Wissensnetz (2006) 0.21
    0.2080493 = sum of:
      0.2080493 = product of:
        1.7337441 = sum of:
          0.020642338 = weight(abstract_txt:noch in 6015) [ClassicSimilarity], result of:
            0.020642338 = score(doc=6015,freq=1.0), product of:
              0.0697576 = queryWeight, product of:
                1.0683254 = boost
                4.734644 = idf(docFreq=1055, maxDocs=44218)
                0.013791156 = queryNorm
              0.29591525 = fieldWeight in 6015, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.734644 = idf(docFreq=1055, maxDocs=44218)
                0.0625 = fieldNorm(doc=6015)
          0.19270839 = weight(abstract_txt:maschinelle in 6015) [ClassicSimilarity], result of:
            0.19270839 = score(doc=6015,freq=1.0), product of:
              0.38966915 = queryWeight, product of:
                3.5708432 = boost
                7.912698 = idf(docFreq=43, maxDocs=44218)
                0.013791156 = queryNorm
              0.4945436 = fieldWeight in 6015, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.912698 = idf(docFreq=43, maxDocs=44218)
                0.0625 = fieldNorm(doc=6015)
          1.5203934 = weight(title_txt:sprachtechnologie in 6015) [ClassicSimilarity], result of:
            1.5203934 = score(doc=6015,freq=1.0), product of:
              0.52814466 = queryWeight, product of:
                4.1571836 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.013791156 = queryNorm
              2.8787441 = fieldWeight in 6015, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.3125 = fieldNorm(doc=6015)
        0.12 = coord(3/25)
    
  4. Hahn, U.: Automatische Sprachverarbeitung (2023) 0.19
    0.1857727 = sum of:
      0.1857727 = product of:
        0.9288635 = sum of:
          0.054070707 = weight(abstract_txt:natural in 790) [ClassicSimilarity], result of:
            0.054070707 = score(doc=790,freq=2.0), product of:
              0.08028902 = queryWeight, product of:
                1.1461352 = boost
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.013791156 = queryNorm
              0.6734508 = fieldWeight in 790, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.09375 = fieldNorm(doc=790)
          0.047245212 = weight(abstract_txt:oder in 790) [ClassicSimilarity], result of:
            0.047245212 = score(doc=790,freq=2.0), product of:
              0.08400086 = queryWeight, product of:
                1.4358044 = boost
                4.2421675 = idf(docFreq=1727, maxDocs=44218)
                0.013791156 = queryNorm
              0.56243724 = fieldWeight in 790, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2421675 = idf(docFreq=1727, maxDocs=44218)
                0.09375 = fieldNorm(doc=790)
          0.17777477 = weight(abstract_txt:sprachverarbeitung in 790) [ClassicSimilarity], result of:
            0.17777477 = score(doc=790,freq=1.0), product of:
              0.22366853 = queryWeight, product of:
                1.912979 = boost
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.013791156 = queryNorm
              0.7948135 = fieldWeight in 790, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.09375 = fieldNorm(doc=790)
          0.2890626 = weight(abstract_txt:maschinelle in 790) [ClassicSimilarity], result of:
            0.2890626 = score(doc=790,freq=1.0), product of:
              0.38966915 = queryWeight, product of:
                3.5708432 = boost
                7.912698 = idf(docFreq=43, maxDocs=44218)
                0.013791156 = queryNorm
              0.74181545 = fieldWeight in 790, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.912698 = idf(docFreq=43, maxDocs=44218)
                0.09375 = fieldNorm(doc=790)
          0.36071026 = weight(abstract_txt:computerlinguistik in 790) [ClassicSimilarity], result of:
            0.36071026 = score(doc=790,freq=1.0), product of:
              0.4516553 = queryWeight, product of:
                3.8443801 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.013791156 = queryNorm
              0.7986406 = fieldWeight in 790, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.09375 = fieldNorm(doc=790)
        0.2 = coord(5/25)
    
  5. Ludwig, B.; Reischer, J.: Informationslinguistik in Regensburg (2012) 0.17
    0.16509934 = sum of:
      0.16509934 = product of:
        1.0318708 = sum of:
          0.06639861 = weight(abstract_txt:überblick in 555) [ClassicSimilarity], result of:
            0.06639861 = score(doc=555,freq=1.0), product of:
              0.10467257 = queryWeight, product of:
                1.3086522 = boost
                5.799733 = idf(docFreq=363, maxDocs=44218)
                0.013791156 = queryNorm
              0.6343458 = fieldWeight in 555, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.799733 = idf(docFreq=363, maxDocs=44218)
                0.109375 = fieldNorm(doc=555)
          0.2074039 = weight(abstract_txt:sprachverarbeitung in 555) [ClassicSimilarity], result of:
            0.2074039 = score(doc=555,freq=1.0), product of:
              0.22366853 = queryWeight, product of:
                1.912979 = boost
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.013791156 = queryNorm
              0.92728245 = fieldWeight in 555, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.109375 = fieldNorm(doc=555)
          0.33723968 = weight(abstract_txt:maschinelle in 555) [ClassicSimilarity], result of:
            0.33723968 = score(doc=555,freq=1.0), product of:
              0.38966915 = queryWeight, product of:
                3.5708432 = boost
                7.912698 = idf(docFreq=43, maxDocs=44218)
                0.013791156 = queryNorm
              0.86545134 = fieldWeight in 555, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.912698 = idf(docFreq=43, maxDocs=44218)
                0.109375 = fieldNorm(doc=555)
          0.42082864 = weight(abstract_txt:computerlinguistik in 555) [ClassicSimilarity], result of:
            0.42082864 = score(doc=555,freq=1.0), product of:
              0.4516553 = queryWeight, product of:
                3.8443801 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.013791156 = queryNorm
              0.9317474 = fieldWeight in 555, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.109375 = fieldNorm(doc=555)
        0.16 = coord(4/25)