Search (20 results, page 1 of 1)

  • × theme_ss:"Computerlinguistik"
  • × year_i:[2010 TO 2020}
  1. Fóris, A.: Network theory and terminology (2013) 0.04
    0.03759983 = product of:
      0.09399958 = sum of:
        0.058891464 = weight(_text_:40 in 1365) [ClassicSimilarity], result of:
          0.058891464 = score(doc=1365,freq=4.0), product of:
            0.19765252 = queryWeight, product of:
              3.813818 = idf(docFreq=2651, maxDocs=44218)
              0.051825367 = queryNorm
            0.29795453 = fieldWeight in 1365, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.813818 = idf(docFreq=2651, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1365)
        0.03510811 = weight(_text_:22 in 1365) [ClassicSimilarity], result of:
          0.03510811 = score(doc=1365,freq=2.0), product of:
            0.18148361 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.051825367 = queryNorm
            0.19345059 = fieldWeight in 1365, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1365)
      0.4 = coord(2/5)
    
    Date
    2. 9.2014 19:19:40
    2. 9.2014 21:22:48
    Source
    Knowledge organization. 40(2013) no.6, S.424-429
  2. Rötzer, F.: Kann KI mit KI generierte Texte erkennen? (2019) 0.01
    0.011659914 = product of:
      0.058299568 = sum of:
        0.058299568 = weight(_text_:40 in 3977) [ClassicSimilarity], result of:
          0.058299568 = score(doc=3977,freq=2.0), product of:
            0.19765252 = queryWeight, product of:
              3.813818 = idf(docFreq=2651, maxDocs=44218)
              0.051825367 = queryNorm
            0.2949599 = fieldWeight in 3977, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.813818 = idf(docFreq=2651, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3977)
      0.2 = coord(1/5)
    
    Abstract
    OpenAI hat einen Algorithmus zur Textgenerierung angeblich nicht vollständig veröffentlicht, weil er so gut sei und Missbrauch und Täuschung ermöglicht. Das u.a. von Elon Musk und Peter Thiel gegründete KI-Unternehmen OpenAI hatte im Februar erklärt, man habe den angeblich am weitesten fortgeschrittenen Algorithmus zur Sprachverarbeitung entwickelt. Der Algorithmus wurde lediglich anhand von 40 Gigabyte an Texten oder an 8 Millionen Webseiten trainiert, das nächste Wort in einem vorgegebenen Textausschnitt vorherzusagen. Damit könne man zusammenhängende, sinnvolle Texte erzeugen, die vielen Anforderungen genügen, zudem könne damit rudimentär Leseverständnis, Antworten auf Fragen, Zusammenfassungen und Übersetzungen erzeugt werden, ohne dies trainiert zu haben.
  3. Lezius, W.: Morphy - Morphologie und Tagging für das Deutsche (2013) 0.01
    0.011234595 = product of:
      0.056172974 = sum of:
        0.056172974 = weight(_text_:22 in 1490) [ClassicSimilarity], result of:
          0.056172974 = score(doc=1490,freq=2.0), product of:
            0.18148361 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.051825367 = queryNorm
            0.30952093 = fieldWeight in 1490, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=1490)
      0.2 = coord(1/5)
    
    Date
    22. 3.2015 9:30:24
  4. Altmann, E.G.; Cristadoro, G.; Esposti, M.D.: On the origin of long-range correlations in texts (2012) 0.01
    0.009994212 = product of:
      0.04997106 = sum of:
        0.04997106 = weight(_text_:40 in 330) [ClassicSimilarity], result of:
          0.04997106 = score(doc=330,freq=2.0), product of:
            0.19765252 = queryWeight, product of:
              3.813818 = idf(docFreq=2651, maxDocs=44218)
              0.051825367 = queryNorm
            0.2528228 = fieldWeight in 330, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.813818 = idf(docFreq=2651, maxDocs=44218)
              0.046875 = fieldNorm(doc=330)
      0.2 = coord(1/5)
    
    Date
    24. 7.2012 11:40:06
  5. Rosemblat, G.; Resnick, M.P.; Auston, I.; Shin, D.; Sneiderman, C.; Fizsman, M.; Rindflesch, T.C.: Extending SemRep to the public health domain (2013) 0.01
    0.009994212 = product of:
      0.04997106 = sum of:
        0.04997106 = weight(_text_:40 in 2096) [ClassicSimilarity], result of:
          0.04997106 = score(doc=2096,freq=2.0), product of:
            0.19765252 = queryWeight, product of:
              3.813818 = idf(docFreq=2651, maxDocs=44218)
              0.051825367 = queryNorm
            0.2528228 = fieldWeight in 2096, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.813818 = idf(docFreq=2651, maxDocs=44218)
              0.046875 = fieldNorm(doc=2096)
      0.2 = coord(1/5)
    
    Date
    22.10.2013 19:54:40
  6. Anguiano Peña, G.; Naumis Peña, C.: Method for selecting specialized terms from a general language corpus (2015) 0.01
    0.009994212 = product of:
      0.04997106 = sum of:
        0.04997106 = weight(_text_:40 in 2196) [ClassicSimilarity], result of:
          0.04997106 = score(doc=2196,freq=2.0), product of:
            0.19765252 = queryWeight, product of:
              3.813818 = idf(docFreq=2651, maxDocs=44218)
              0.051825367 = queryNorm
            0.2528228 = fieldWeight in 2196, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.813818 = idf(docFreq=2651, maxDocs=44218)
              0.046875 = fieldNorm(doc=2196)
      0.2 = coord(1/5)
    
    Date
    2. 9.2014 19:19:40
  7. Li, N.; Sun, J.: Improving Chinese term association from the linguistic perspective (2017) 0.01
    0.009994212 = product of:
      0.04997106 = sum of:
        0.04997106 = weight(_text_:40 in 3381) [ClassicSimilarity], result of:
          0.04997106 = score(doc=3381,freq=2.0), product of:
            0.19765252 = queryWeight, product of:
              3.813818 = idf(docFreq=2651, maxDocs=44218)
              0.051825367 = queryNorm
            0.2528228 = fieldWeight in 3381, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.813818 = idf(docFreq=2651, maxDocs=44218)
              0.046875 = fieldNorm(doc=3381)
      0.2 = coord(1/5)
    
    Date
    2. 9.2014 19:19:40
  8. Lu, K.; Cai, X.; Ajiferuke, I.; Wolfram, D.: Vocabulary size and its effect on topic representation (2017) 0.01
    0.009994212 = product of:
      0.04997106 = sum of:
        0.04997106 = weight(_text_:40 in 3414) [ClassicSimilarity], result of:
          0.04997106 = score(doc=3414,freq=2.0), product of:
            0.19765252 = queryWeight, product of:
              3.813818 = idf(docFreq=2651, maxDocs=44218)
              0.051825367 = queryNorm
            0.2528228 = fieldWeight in 3414, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.813818 = idf(docFreq=2651, maxDocs=44218)
              0.046875 = fieldNorm(doc=3414)
      0.2 = coord(1/5)
    
    Abstract
    This study investigates how computational overhead for topic model training may be reduced by selectively removing terms from the vocabulary of text corpora being modeled. We compare the impact of removing singly occurring terms, the top 0.5%, 1% and 5% most frequently occurring terms and both top 0.5% most frequent and singly occurring terms, along with changes in the number of topics modeled (10, 20, 30, 40, 50, 100) using three datasets. Four outcome measures are compared. The removal of singly occurring terms has little impact on outcomes for all of the measures tested. Document discriminative capacity, as measured by the document space density, is reduced by the removal of frequently occurring terms, but increases with higher numbers of topics. Vocabulary size does not greatly influence entropy, but entropy is affected by the number of topics. Finally, topic similarity, as measured by pairwise topic similarity and Jensen-Shannon divergence, decreases with the removal of frequent terms. The findings have implications for information science research in information retrieval and informetrics that makes use of topic modeling.
  9. Huo, W.: Automatic multi-word term extraction and its application to Web-page summarization (2012) 0.01
    0.008425946 = product of:
      0.04212973 = sum of:
        0.04212973 = weight(_text_:22 in 563) [ClassicSimilarity], result of:
          0.04212973 = score(doc=563,freq=2.0), product of:
            0.18148361 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.051825367 = queryNorm
            0.23214069 = fieldWeight in 563, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=563)
      0.2 = coord(1/5)
    
    Date
    10. 1.2013 19:22:47
  10. Lawrie, D.; Mayfield, J.; McNamee, P.; Oard, P.W.: Cross-language person-entity linking from 20 languages (2015) 0.01
    0.008425946 = product of:
      0.04212973 = sum of:
        0.04212973 = weight(_text_:22 in 1848) [ClassicSimilarity], result of:
          0.04212973 = score(doc=1848,freq=2.0), product of:
            0.18148361 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.051825367 = queryNorm
            0.23214069 = fieldWeight in 1848, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=1848)
      0.2 = coord(1/5)
    
    Abstract
    The goal of entity linking is to associate references to an entity that is found in unstructured natural language content to an authoritative inventory of known entities. This article describes the construction of 6 test collections for cross-language person-entity linking that together span 22 languages. Fully automated components were used together with 2 crowdsourced validation stages to affordably generate ground-truth annotations with an accuracy comparable to that of a completely manual process. The resulting test collections each contain between 642 (Arabic) and 2,361 (Romanian) person references in non-English texts for which the correct resolution in English Wikipedia is known, plus a similar number of references for which no correct resolution into English Wikipedia is believed to exist. Fully automated cross-language person-name linking experiments with 20 non-English languages yielded a resolution accuracy of between 0.84 (Serbian) and 0.98 (Romanian), which compares favorably with previously reported cross-language entity linking results for Spanish.
  11. Gencosman, B.C.; Ozmutlu, H.C.; Ozmutlu, S.: Character n-gram application for automatic new topic identification (2014) 0.01
    0.00832851 = product of:
      0.04164255 = sum of:
        0.04164255 = weight(_text_:40 in 2688) [ClassicSimilarity], result of:
          0.04164255 = score(doc=2688,freq=2.0), product of:
            0.19765252 = queryWeight, product of:
              3.813818 = idf(docFreq=2651, maxDocs=44218)
              0.051825367 = queryNorm
            0.21068566 = fieldWeight in 2688, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.813818 = idf(docFreq=2651, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2688)
      0.2 = coord(1/5)
    
    Date
    25. 1.2016 18:44:40
  12. Lian, T.; Yu, C.; Wang, W.; Yuan, Q.; Hou, Z.: Doctoral dissertations on tourism in China : a co-word analysis (2016) 0.01
    0.00832851 = product of:
      0.04164255 = sum of:
        0.04164255 = weight(_text_:40 in 3178) [ClassicSimilarity], result of:
          0.04164255 = score(doc=3178,freq=2.0), product of:
            0.19765252 = queryWeight, product of:
              3.813818 = idf(docFreq=2651, maxDocs=44218)
              0.051825367 = queryNorm
            0.21068566 = fieldWeight in 3178, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.813818 = idf(docFreq=2651, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3178)
      0.2 = coord(1/5)
    
    Date
    2. 9.2014 19:19:40
  13. Collovini de Abreu, S.; Vieira, R.: RelP: Portuguese open relation extraction (2017) 0.01
    0.00832851 = product of:
      0.04164255 = sum of:
        0.04164255 = weight(_text_:40 in 3621) [ClassicSimilarity], result of:
          0.04164255 = score(doc=3621,freq=2.0), product of:
            0.19765252 = queryWeight, product of:
              3.813818 = idf(docFreq=2651, maxDocs=44218)
              0.051825367 = queryNorm
            0.21068566 = fieldWeight in 3621, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.813818 = idf(docFreq=2651, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3621)
      0.2 = coord(1/5)
    
    Date
    2. 9.2014 19:19:40
  14. Chen, L.; Fang, H.: ¬An automatic method for ex-tracting innovative ideas based on the Scopus® database (2019) 0.01
    0.00832851 = product of:
      0.04164255 = sum of:
        0.04164255 = weight(_text_:40 in 5310) [ClassicSimilarity], result of:
          0.04164255 = score(doc=5310,freq=2.0), product of:
            0.19765252 = queryWeight, product of:
              3.813818 = idf(docFreq=2651, maxDocs=44218)
              0.051825367 = queryNorm
            0.21068566 = fieldWeight in 5310, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.813818 = idf(docFreq=2651, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5310)
      0.2 = coord(1/5)
    
    Date
    2. 9.2014 19:19:40
  15. Vlachidis, A.; Binding, C.; Tudhope, D.; May, K.: Excavating grey literature : a case study on the rich indexing of archaeological documents via natural language-processing techniques and knowledge-based resources (2010) 0.01
    0.0066628084 = product of:
      0.03331404 = sum of:
        0.03331404 = weight(_text_:40 in 3948) [ClassicSimilarity], result of:
          0.03331404 = score(doc=3948,freq=2.0), product of:
            0.19765252 = queryWeight, product of:
              3.813818 = idf(docFreq=2651, maxDocs=44218)
              0.051825367 = queryNorm
            0.16854852 = fieldWeight in 3948, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.813818 = idf(docFreq=2651, maxDocs=44218)
              0.03125 = fieldNorm(doc=3948)
      0.2 = coord(1/5)
    
    Date
    29. 8.2010 12:03:40
  16. Bowker, L.; Ciro, J.B.: Machine translation and global research : towards improved machine translation literacy in the scholarly community (2019) 0.01
    0.0066628084 = product of:
      0.03331404 = sum of:
        0.03331404 = weight(_text_:40 in 5970) [ClassicSimilarity], result of:
          0.03331404 = score(doc=5970,freq=2.0), product of:
            0.19765252 = queryWeight, product of:
              3.813818 = idf(docFreq=2651, maxDocs=44218)
              0.051825367 = queryNorm
            0.16854852 = fieldWeight in 5970, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.813818 = idf(docFreq=2651, maxDocs=44218)
              0.03125 = fieldNorm(doc=5970)
      0.2 = coord(1/5)
    
    Date
    12. 9.2020 20:40:49
  17. Nagy T., I.: Detecting multiword expressions and named entities in natural language texts (2014) 0.01
    0.005829957 = product of:
      0.029149784 = sum of:
        0.029149784 = weight(_text_:40 in 1536) [ClassicSimilarity], result of:
          0.029149784 = score(doc=1536,freq=2.0), product of:
            0.19765252 = queryWeight, product of:
              3.813818 = idf(docFreq=2651, maxDocs=44218)
              0.051825367 = queryNorm
            0.14747995 = fieldWeight in 1536, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.813818 = idf(docFreq=2651, maxDocs=44218)
              0.02734375 = fieldNorm(doc=1536)
      0.2 = coord(1/5)
    
    Date
    30.10.2014 18:40:33
  18. Rötzer, F.: KI-Programm besser als Menschen im Verständnis natürlicher Sprache (2018) 0.01
    0.0056172977 = product of:
      0.028086487 = sum of:
        0.028086487 = weight(_text_:22 in 4217) [ClassicSimilarity], result of:
          0.028086487 = score(doc=4217,freq=2.0), product of:
            0.18148361 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.051825367 = queryNorm
            0.15476047 = fieldWeight in 4217, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03125 = fieldNorm(doc=4217)
      0.2 = coord(1/5)
    
    Date
    22. 1.2018 11:32:44
  19. RWI/PH: Auf der Suche nach dem entscheidenden Wort : die Häufung bestimmter Wörter innerhalb eines Textes macht diese zu Schlüsselwörtern (2012) 0.00
    0.004997106 = product of:
      0.02498553 = sum of:
        0.02498553 = weight(_text_:40 in 331) [ClassicSimilarity], result of:
          0.02498553 = score(doc=331,freq=2.0), product of:
            0.19765252 = queryWeight, product of:
              3.813818 = idf(docFreq=2651, maxDocs=44218)
              0.051825367 = queryNorm
            0.1264114 = fieldWeight in 331, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.813818 = idf(docFreq=2651, maxDocs=44218)
              0.0234375 = fieldNorm(doc=331)
      0.2 = coord(1/5)
    
    Date
    24. 7.2012 11:40:06
  20. Deventer, J.P. van; Kruger, C.J.; Johnson, R.D.: Delineating knowledge management through lexical analysis : a retrospective (2015) 0.00
    0.0049151354 = product of:
      0.024575677 = sum of:
        0.024575677 = weight(_text_:22 in 3807) [ClassicSimilarity], result of:
          0.024575677 = score(doc=3807,freq=2.0), product of:
            0.18148361 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.051825367 = queryNorm
            0.1354154 = fieldWeight in 3807, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.02734375 = fieldNorm(doc=3807)
      0.2 = coord(1/5)
    
    Date
    20. 1.2015 18:30:22