Search (7 results, page 1 of 1)

Boleda, G.; Evert, S.: Multiword expressions : a pain in the neck of lexical semantics (2009) 0.02

0.016933288 = product of:
  0.08466644 = sum of:
    0.08466644 = weight(_text_:22 in 4888) [ClassicSimilarity], result of:
      0.08466644 = score(doc=4888,freq=2.0), product of:
        0.18236019 = queryWeight, product of:
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.052075688 = queryNorm
        0.46428138 = fieldWeight in 4888, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.09375 = fieldNorm(doc=4888)
  0.2 = coord(1/5)

Date: 1. 3.2013 14:56:22

Lezius, W.: Morphy - Morphologie und Tagging für das Deutsche (2013) 0.01

0.011288859 = product of:
  0.056444295 = sum of:
    0.056444295 = weight(_text_:22 in 1490) [ClassicSimilarity], result of:
      0.056444295 = score(doc=1490,freq=2.0), product of:
        0.18236019 = queryWeight, product of:
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.052075688 = queryNorm
        0.30952093 = fieldWeight in 1490, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.0625 = fieldNorm(doc=1490)
  0.2 = coord(1/5)

Date: 22. 3.2015 9:30:24

Bager, J.: ¬Die Text-KI ChatGPT schreibt Fachtexte, Prosa, Gedichte und Programmcode (2023) 0.01

0.011288859 = product of:
  0.056444295 = sum of:
    0.056444295 = weight(_text_:22 in 835) [ClassicSimilarity], result of:
      0.056444295 = score(doc=835,freq=2.0), product of:
        0.18236019 = queryWeight, product of:
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.052075688 = queryNorm
        0.30952093 = fieldWeight in 835, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.0625 = fieldNorm(doc=835)
  0.2 = coord(1/5)

Date: 29.12.2022 18:22:55

Rieger, F.: Lügende Computer (2023) 0.01

0.011288859 = product of:
  0.056444295 = sum of:
    0.056444295 = weight(_text_:22 in 912) [ClassicSimilarity], result of:
      0.056444295 = score(doc=912,freq=2.0), product of:
        0.18236019 = queryWeight, product of:
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.052075688 = queryNorm
        0.30952093 = fieldWeight in 912, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.0625 = fieldNorm(doc=912)
  0.2 = coord(1/5)

Date: 16. 3.2023 19:22:55

Radford, A.; Wu, J.; Child, R.; Luan, D.; Amode, D.; Sutskever, I.: Language models are unsupervised multitask learners 0.01
```
0.0075771073 = product of:
  0.037885536 = sum of:
    0.037885536 = weight(_text_:7 in 871) [ClassicSimilarity], result of:
      0.037885536 = score(doc=871,freq=2.0), product of:
        0.17251469 = queryWeight, product of:
          3.3127685 = idf(docFreq=4376, maxDocs=44218)
          0.052075688 = queryNorm
        0.21960759 = fieldWeight in 871, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.3127685 = idf(docFreq=4376, maxDocs=44218)
          0.046875 = fieldNorm(doc=871)
  0.2 = coord(1/5)
```
Abstract

Natural language processing tasks, such as question answering, machine translation, reading comprehension, and summarization, are typically approached with supervised learning on task-specific datasets. We demonstrate that language models begin to learn these tasks without any explicit supervision when trained on a new dataset of millions of webpages called WebText. When conditioned on a document plus questions, the answers generated by the language model reach 55 F1 on the CoQA dataset - matching or exceeding the performance of 3 out of 4 baseline systems without using the 127,000+ training examples. The capacity of the language model is essential to the success of zero-shot task transfer and increasing it improves performance in a log-linear fashion across tasks. Our largest model, GPT-2, is a 1.5B parameter Transformer that achieves state of the art results on 7 out of 8 tested language modeling datasets in a zero-shot setting but still underfits WebText. Samples from the model reflect these improvements and contain coherent paragraphs of text. These findings suggest a promising path towards building language processing systems which learn to perform tasks from their naturally occurring demonstrations.

Rötzer, F.: KI-Programm besser als Menschen im Verständnis natürlicher Sprache (2018) 0.01

0.0056444295 = product of:
  0.028222147 = sum of:
    0.028222147 = weight(_text_:22 in 4217) [ClassicSimilarity], result of:
      0.028222147 = score(doc=4217,freq=2.0), product of:
        0.18236019 = queryWeight, product of:
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.052075688 = queryNorm
        0.15476047 = fieldWeight in 4217, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.03125 = fieldNorm(doc=4217)
  0.2 = coord(1/5)

Date: 22. 1.2018 11:32:44

Artemenko, O.; Shramko, M.: Entwicklung eines Werkzeugs zur Sprachidentifikation in mono- und multilingualen Texten (2005) 0.00
```
0.0044199787 = product of:
  0.022099894 = sum of:
    0.022099894 = weight(_text_:7 in 572) [ClassicSimilarity], result of:
      0.022099894 = score(doc=572,freq=2.0), product of:
        0.17251469 = queryWeight, product of:
          3.3127685 = idf(docFreq=4376, maxDocs=44218)
          0.052075688 = queryNorm
        0.12810442 = fieldWeight in 572, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.3127685 = idf(docFreq=4376, maxDocs=44218)
          0.02734375 = fieldNorm(doc=572)
  0.2 = coord(1/5)
```
Abstract

Die Arbeit wird in zwei Hauptteile gegliedert. Der erste Teil besteht aus Kapiteln 1-5, in denen theoretische Grundlagen zum Thema Sprachidentifikation dargelegt werden. Das erste Kapitel beschreibt den Sprachidentifikationsprozess und definiert grundlegende Begriffe. Im zweiten und dritten Kapitel werden vorherrschende Ansätze zur Sprachidentifikation von monolingualen Dokumenten dargestellt und miteinander verglichen, indem deren Vor- und Nachteile diskutiert werden. Das vierte Kapitel stellt einige Arbeiten vor, die sich mit der Sprachidentifikation von multilingualen Texten befasst haben. Der erste Teil der Arbeit wird mit einem Überblick über die bereits entwickelten und im Internet verfügbaren Sprachidentifikationswerkzeuge abgeschlossen. Der zweite Teil der Arbeit stellt die Entwicklung des Sprachidentifikationssystems LangIdent dar. In den Kapiteln 6 und 7 werden die an das System gestellten Anforderungen zusammengefasst und die wichtigsten Phasen des Projekts definiert. In den weiterführenden Kapiteln 8 und 9 werden die Systemarchitektur und eine detaillierte Beschreibung ihrer Kernkomponenten gegeben. Das Kapitel 10 liefert ein statisches UML-Klassendiagramm mit einer ausführlichen Erklärung von Attributen und Methoden der im Diagramm vorgestellten Klassen. Das nächste Kapitel befasst sich mit den im Prozess der Systementwicklung aufgetretenen Problemen. Die Bedienung des Programms wird im Kapitel 12 beschrieben. Im letzten Kapitel der Arbeit wird die Systemevaluierung vorgestellt, in der der Aufbau und Umfang von Trainingskorpora sowie die wichtigsten Ergebnisse mit der anschließenden Diskussion präsentiert werden.

Search (7 results, page 1 of 1)

Authors

Years

Languages

Types