Search (55 results, page 1 of 3)

  • × theme_ss:"Computerlinguistik"
  1. Hotho, A.; Bloehdorn, S.: Data Mining 2004 : Text classification by boosting weak learners based on terms and concepts (2004) 0.06
    0.06278193 = product of:
      0.094172895 = sum of:
        0.07498358 = product of:
          0.22495073 = sum of:
            0.22495073 = weight(_text_:3a in 562) [ClassicSimilarity], result of:
              0.22495073 = score(doc=562,freq=2.0), product of:
                0.40025535 = queryWeight, product of:
                  8.478011 = idf(docFreq=24, maxDocs=44218)
                  0.047210995 = queryNorm
                0.56201804 = fieldWeight in 562, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  8.478011 = idf(docFreq=24, maxDocs=44218)
                  0.046875 = fieldNorm(doc=562)
          0.33333334 = coord(1/3)
        0.019189315 = product of:
          0.03837863 = sum of:
            0.03837863 = weight(_text_:22 in 562) [ClassicSimilarity], result of:
              0.03837863 = score(doc=562,freq=2.0), product of:
                0.16532487 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.047210995 = queryNorm
                0.23214069 = fieldWeight in 562, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=562)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Content
    Vgl.: http://www.google.de/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&ved=0CEAQFjAA&url=http%3A%2F%2Fciteseerx.ist.psu.edu%2Fviewdoc%2Fdownload%3Fdoi%3D10.1.1.91.4940%26rep%3Drep1%26type%3Dpdf&ei=dOXrUMeIDYHDtQahsIGACg&usg=AFQjCNHFWVh6gNPvnOrOS9R3rkrXCNVD-A&sig2=5I2F5evRfMnsttSgFF9g7Q&bvm=bv.1357316858,d.Yms.
    Date
    8. 1.2013 10:22:32
  2. Lawrie, D.; Mayfield, J.; McNamee, P.; Oard, P.W.: Cross-language person-entity linking from 20 languages (2015) 0.05
    0.05252579 = product of:
      0.15757737 = sum of:
        0.15757737 = sum of:
          0.11919874 = weight(_text_:plus in 1848) [ClassicSimilarity], result of:
            0.11919874 = score(doc=1848,freq=2.0), product of:
              0.29135957 = queryWeight, product of:
                6.1714344 = idf(docFreq=250, maxDocs=44218)
                0.047210995 = queryNorm
              0.40911216 = fieldWeight in 1848, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1714344 = idf(docFreq=250, maxDocs=44218)
                0.046875 = fieldNorm(doc=1848)
          0.03837863 = weight(_text_:22 in 1848) [ClassicSimilarity], result of:
            0.03837863 = score(doc=1848,freq=2.0), product of:
              0.16532487 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.047210995 = queryNorm
              0.23214069 = fieldWeight in 1848, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.046875 = fieldNorm(doc=1848)
      0.33333334 = coord(1/3)
    
    Abstract
    The goal of entity linking is to associate references to an entity that is found in unstructured natural language content to an authoritative inventory of known entities. This article describes the construction of 6 test collections for cross-language person-entity linking that together span 22 languages. Fully automated components were used together with 2 crowdsourced validation stages to affordably generate ground-truth annotations with an accuracy comparable to that of a completely manual process. The resulting test collections each contain between 642 (Arabic) and 2,361 (Romanian) person references in non-English texts for which the correct resolution in English Wikipedia is known, plus a similar number of references for which no correct resolution into English Wikipedia is believed to exist. Fully automated cross-language person-name linking experiments with 20 non-English languages yielded a resolution accuracy of between 0.84 (Serbian) and 0.98 (Romanian), which compares favorably with previously reported cross-language entity linking results for Spanish.
  3. Jha, A.: Why GPT-4 isn't all it's cracked up to be (2023) 0.03
    0.032128777 = product of:
      0.09638633 = sum of:
        0.09638633 = weight(_text_:me in 923) [ClassicSimilarity], result of:
          0.09638633 = score(doc=923,freq=2.0), product of:
            0.3430384 = queryWeight, product of:
              7.2660704 = idf(docFreq=83, maxDocs=44218)
              0.047210995 = queryNorm
            0.28097823 = fieldWeight in 923, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              7.2660704 = idf(docFreq=83, maxDocs=44218)
              0.02734375 = fieldNorm(doc=923)
      0.33333334 = coord(1/3)
    
    Abstract
    He doesn't dismiss the potential of LLMs to become useful assistants in all sorts of ways-Google and Microsoft have already announced that they will be integrating LLMs into their search and office productivity software. But he talked me through some of his criticisms of the technology's apparent capabilities. At the heart of Dr Marcus's thoughtful critique is an attempt to put LLMs into proper context. Deep learning, the underlying technology that makes LLMs work, is only one piece of the puzzle in the quest for machine intelligence. To reach the level of artificial general intelligence (AGI) that many tech companies strive for-i.e. machines that can plan, reason and solve problems in the way human brains can-they will need to deploy a suite of other AI techniques. These include, for example, the kind of "symbolic AI" that was popular before artificial neural networks and deep learning became all the rage.
  4. Noever, D.; Ciolino, M.: ¬The Turing deception (2022) 0.02
    0.024994528 = product of:
      0.07498358 = sum of:
        0.07498358 = product of:
          0.22495073 = sum of:
            0.22495073 = weight(_text_:3a in 862) [ClassicSimilarity], result of:
              0.22495073 = score(doc=862,freq=2.0), product of:
                0.40025535 = queryWeight, product of:
                  8.478011 = idf(docFreq=24, maxDocs=44218)
                  0.047210995 = queryNorm
                0.56201804 = fieldWeight in 862, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  8.478011 = idf(docFreq=24, maxDocs=44218)
                  0.046875 = fieldNorm(doc=862)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Source
    https%3A%2F%2Farxiv.org%2Fabs%2F2212.06721&usg=AOvVaw3i_9pZm9y_dQWoHi6uv0EN
  5. Radford, A.; Wu, J.; Child, R.; Luan, D.; Amode, D.; Sutskever, I.: Language models are unsupervised multitask learners 0.02
    0.019866457 = product of:
      0.05959937 = sum of:
        0.05959937 = product of:
          0.11919874 = sum of:
            0.11919874 = weight(_text_:plus in 871) [ClassicSimilarity], result of:
              0.11919874 = score(doc=871,freq=2.0), product of:
                0.29135957 = queryWeight, product of:
                  6.1714344 = idf(docFreq=250, maxDocs=44218)
                  0.047210995 = queryNorm
                0.40911216 = fieldWeight in 871, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  6.1714344 = idf(docFreq=250, maxDocs=44218)
                  0.046875 = fieldNorm(doc=871)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Abstract
    Natural language processing tasks, such as question answering, machine translation, reading comprehension, and summarization, are typically approached with supervised learning on task-specific datasets. We demonstrate that language models begin to learn these tasks without any explicit supervision when trained on a new dataset of millions of webpages called WebText. When conditioned on a document plus questions, the answers generated by the language model reach 55 F1 on the CoQA dataset - matching or exceeding the performance of 3 out of 4 baseline systems without using the 127,000+ training examples. The capacity of the language model is essential to the success of zero-shot task transfer and increasing it improves performance in a log-linear fashion across tasks. Our largest model, GPT-2, is a 1.5B parameter Transformer that achieves state of the art results on 7 out of 8 tested language modeling datasets in a zero-shot setting but still underfits WebText. Samples from the model reflect these improvements and contain coherent paragraphs of text. These findings suggest a promising path towards building language processing systems which learn to perform tasks from their naturally occurring demonstrations.
  6. Warner, A.J.: Natural language processing (1987) 0.02
    0.01705717 = product of:
      0.051171508 = sum of:
        0.051171508 = product of:
          0.102343015 = sum of:
            0.102343015 = weight(_text_:22 in 337) [ClassicSimilarity], result of:
              0.102343015 = score(doc=337,freq=2.0), product of:
                0.16532487 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.047210995 = queryNorm
                0.61904186 = fieldWeight in 337, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.125 = fieldNorm(doc=337)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Source
    Annual review of information science and technology. 22(1987), S.79-108
  7. Muneer, I.; Sharjeel, M.; Iqbal, M.; Adeel Nawab, R.M.; Rayson, P.: CLEU - A Cross-language english-urdu corpus and benchmark for text reuse experiments (2019) 0.02
    0.016555382 = product of:
      0.049666144 = sum of:
        0.049666144 = product of:
          0.09933229 = sum of:
            0.09933229 = weight(_text_:plus in 5299) [ClassicSimilarity], result of:
              0.09933229 = score(doc=5299,freq=2.0), product of:
                0.29135957 = queryWeight, product of:
                  6.1714344 = idf(docFreq=250, maxDocs=44218)
                  0.047210995 = queryNorm
                0.3409268 = fieldWeight in 5299, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  6.1714344 = idf(docFreq=250, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5299)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Abstract
    Text reuse is becoming a serious issue in many fields and research shows that it is much harder to detect when it occurs across languages. The recent rise in multi-lingual content on the Web has increased cross-language text reuse to an unprecedented scale. Although researchers have proposed methods to detect it, one major drawback is the unavailability of large-scale gold standard evaluation resources built on real cases. To overcome this problem, we propose a cross-language sentence/passage level text reuse corpus for the English-Urdu language pair. The Cross-Language English-Urdu Corpus (CLEU) has source text in English whereas the derived text is in Urdu. It contains in total 3,235 sentence/passage pairs manually tagged into three categories that is near copy, paraphrased copy, and independently written. Further, as a second contribution, we evaluate the Translation plus Mono-lingual Analysis method using three sets of experiments on the proposed dataset to highlight its usefulness. Evaluation results (f1=0.732 binary, f1=0.552 ternary classification) indicate that it is harder to detect cross-language real cases of text reuse, especially when the language pairs have unrelated scripts. The corpus is a useful benchmark resource for the future development and assessment of cross-language text reuse detection systems for the English-Urdu language pair.
  8. McMahon, J.G.; Smith, F.J.: Improved statistical language model performance with automatic generated word hierarchies (1996) 0.01
    0.014925024 = product of:
      0.04477507 = sum of:
        0.04477507 = product of:
          0.08955014 = sum of:
            0.08955014 = weight(_text_:22 in 3164) [ClassicSimilarity], result of:
              0.08955014 = score(doc=3164,freq=2.0), product of:
                0.16532487 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.047210995 = queryNorm
                0.5416616 = fieldWeight in 3164, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=3164)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Source
    Computational linguistics. 22(1996) no.2, S.217-248
  9. Ruge, G.: ¬A spreading activation network for automatic generation of thesaurus relationships (1991) 0.01
    0.014925024 = product of:
      0.04477507 = sum of:
        0.04477507 = product of:
          0.08955014 = sum of:
            0.08955014 = weight(_text_:22 in 4506) [ClassicSimilarity], result of:
              0.08955014 = score(doc=4506,freq=2.0), product of:
                0.16532487 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.047210995 = queryNorm
                0.5416616 = fieldWeight in 4506, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=4506)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Date
    8.10.2000 11:52:22
  10. Somers, H.: Example-based machine translation : Review article (1999) 0.01
    0.014925024 = product of:
      0.04477507 = sum of:
        0.04477507 = product of:
          0.08955014 = sum of:
            0.08955014 = weight(_text_:22 in 6672) [ClassicSimilarity], result of:
              0.08955014 = score(doc=6672,freq=2.0), product of:
                0.16532487 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.047210995 = queryNorm
                0.5416616 = fieldWeight in 6672, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=6672)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Date
    31. 7.1996 9:22:19
  11. New tools for human translators (1997) 0.01
    0.014925024 = product of:
      0.04477507 = sum of:
        0.04477507 = product of:
          0.08955014 = sum of:
            0.08955014 = weight(_text_:22 in 1179) [ClassicSimilarity], result of:
              0.08955014 = score(doc=1179,freq=2.0), product of:
                0.16532487 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.047210995 = queryNorm
                0.5416616 = fieldWeight in 1179, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=1179)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Date
    31. 7.1996 9:22:19
  12. Baayen, R.H.; Lieber, H.: Word frequency distributions and lexical semantics (1997) 0.01
    0.014925024 = product of:
      0.04477507 = sum of:
        0.04477507 = product of:
          0.08955014 = sum of:
            0.08955014 = weight(_text_:22 in 3117) [ClassicSimilarity], result of:
              0.08955014 = score(doc=3117,freq=2.0), product of:
                0.16532487 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.047210995 = queryNorm
                0.5416616 = fieldWeight in 3117, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=3117)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Date
    28. 2.1999 10:48:22
  13. ¬Der Student aus dem Computer (2023) 0.01
    0.014925024 = product of:
      0.04477507 = sum of:
        0.04477507 = product of:
          0.08955014 = sum of:
            0.08955014 = weight(_text_:22 in 1079) [ClassicSimilarity], result of:
              0.08955014 = score(doc=1079,freq=2.0), product of:
                0.16532487 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.047210995 = queryNorm
                0.5416616 = fieldWeight in 1079, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=1079)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Date
    27. 1.2023 16:22:55
  14. Byrne, C.C.; McCracken, S.A.: ¬An adaptive thesaurus employing semantic distance, relational inheritance and nominal compound interpretation for linguistic support of information retrieval (1999) 0.01
    0.012792877 = product of:
      0.03837863 = sum of:
        0.03837863 = product of:
          0.07675726 = sum of:
            0.07675726 = weight(_text_:22 in 4483) [ClassicSimilarity], result of:
              0.07675726 = score(doc=4483,freq=2.0), product of:
                0.16532487 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.047210995 = queryNorm
                0.46428138 = fieldWeight in 4483, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=4483)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Date
    15. 3.2000 10:22:37
  15. Boleda, G.; Evert, S.: Multiword expressions : a pain in the neck of lexical semantics (2009) 0.01
    0.012792877 = product of:
      0.03837863 = sum of:
        0.03837863 = product of:
          0.07675726 = sum of:
            0.07675726 = weight(_text_:22 in 4888) [ClassicSimilarity], result of:
              0.07675726 = score(doc=4888,freq=2.0), product of:
                0.16532487 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.047210995 = queryNorm
                0.46428138 = fieldWeight in 4888, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=4888)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Date
    1. 3.2013 14:56:22
  16. Monnerjahn, P.: Vorsprung ohne Technik : Übersetzen: Computer und Qualität (2000) 0.01
    0.012792877 = product of:
      0.03837863 = sum of:
        0.03837863 = product of:
          0.07675726 = sum of:
            0.07675726 = weight(_text_:22 in 5429) [ClassicSimilarity], result of:
              0.07675726 = score(doc=5429,freq=2.0), product of:
                0.16532487 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.047210995 = queryNorm
                0.46428138 = fieldWeight in 5429, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=5429)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Source
    c't. 2000, H.22, S.230-231
  17. Hutchins, J.: From first conception to first demonstration : the nascent years of machine translation, 1947-1954. A chronology (1997) 0.01
    0.01066073 = product of:
      0.03198219 = sum of:
        0.03198219 = product of:
          0.06396438 = sum of:
            0.06396438 = weight(_text_:22 in 1463) [ClassicSimilarity], result of:
              0.06396438 = score(doc=1463,freq=2.0), product of:
                0.16532487 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.047210995 = queryNorm
                0.38690117 = fieldWeight in 1463, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=1463)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Date
    31. 7.1996 9:22:19
  18. Kuhlmann, U.; Monnerjahn, P.: Sprache auf Knopfdruck : Sieben automatische Übersetzungsprogramme im Test (2000) 0.01
    0.01066073 = product of:
      0.03198219 = sum of:
        0.03198219 = product of:
          0.06396438 = sum of:
            0.06396438 = weight(_text_:22 in 5428) [ClassicSimilarity], result of:
              0.06396438 = score(doc=5428,freq=2.0), product of:
                0.16532487 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.047210995 = queryNorm
                0.38690117 = fieldWeight in 5428, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=5428)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Source
    c't. 2000, H.22, S.220-229
  19. Lezius, W.; Rapp, R.; Wettler, M.: ¬A morphology-system and part-of-speech tagger for German (1996) 0.01
    0.01066073 = product of:
      0.03198219 = sum of:
        0.03198219 = product of:
          0.06396438 = sum of:
            0.06396438 = weight(_text_:22 in 1693) [ClassicSimilarity], result of:
              0.06396438 = score(doc=1693,freq=2.0), product of:
                0.16532487 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.047210995 = queryNorm
                0.38690117 = fieldWeight in 1693, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=1693)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Date
    22. 3.2015 9:37:18
  20. Wanner, L.: Lexical choice in text generation and machine translation (1996) 0.01
    0.008528585 = product of:
      0.025585754 = sum of:
        0.025585754 = product of:
          0.051171508 = sum of:
            0.051171508 = weight(_text_:22 in 8521) [ClassicSimilarity], result of:
              0.051171508 = score(doc=8521,freq=2.0), product of:
                0.16532487 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.047210995 = queryNorm
                0.30952093 = fieldWeight in 8521, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=8521)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Date
    31. 7.1996 9:22:19

Years

Languages

  • e 39
  • d 16

Types

  • a 42
  • el 7
  • m 5
  • s 3
  • p 2
  • x 2
  • d 1
  • More… Less…