Search (7 results, page 1 of 1)

  • × theme_ss:"Computerlinguistik"
  • × year_i:[2010 TO 2020}
  1. Lezius, W.: Morphy - Morphologie und Tagging für das Deutsche (2013) 0.01
    0.008724097 = product of:
      0.02617229 = sum of:
        0.02617229 = product of:
          0.05234458 = sum of:
            0.05234458 = weight(_text_:22 in 1490) [ClassicSimilarity], result of:
              0.05234458 = score(doc=1490,freq=2.0), product of:
                0.16911483 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.048293278 = queryNorm
                0.30952093 = fieldWeight in 1490, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1490)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Date
    22. 3.2015 9:30:24
  2. Nagy T., I.: Detecting multiword expressions and named entities in natural language texts (2014) 0.01
    0.008087928 = product of:
      0.024263784 = sum of:
        0.024263784 = product of:
          0.04852757 = sum of:
            0.04852757 = weight(_text_:2002 in 1536) [ClassicSimilarity], result of:
              0.04852757 = score(doc=1536,freq=4.0), product of:
                0.20701107 = queryWeight, product of:
                  4.28654 = idf(docFreq=1652, maxDocs=44218)
                  0.048293278 = queryNorm
                0.23442015 = fieldWeight in 1536, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.28654 = idf(docFreq=1652, maxDocs=44218)
                  0.02734375 = fieldNorm(doc=1536)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Abstract
    Multiword expressions (MWEs) are lexical items that can be decomposed into single words and display lexical, syntactic, semantic, pragmatic and/or statistical idiosyncrasy (Sag et al., 2002; Kim, 2008; Calzolari et al., 2002). The proper treatment of multiword expressions such as rock 'n' roll and make a decision is essential for many natural language processing (NLP) applications like information extraction and retrieval, terminology extraction and machine translation, and it is important to identify multiword expressions in context. For example, in machine translation we must know that MWEs form one semantic unit, hence their parts should not be translated separately. For this, multiword expressions should be identified first in the text to be translated. The chief aim of this thesis is to develop machine learning-based approaches for the automatic detection of different types of multiword expressions in English and Hungarian natural language texts. In our investigations, we pay attention to the characteristics of different types of multiword expressions such as nominal compounds, multiword named entities and light verb constructions, and we apply novel methods to identify MWEs in raw texts. In the thesis it will be demonstrated that nominal compounds and multiword amed entities may require a similar approach for their automatic detection as they behave in the same way from a linguistic point of view. Furthermore, it will be shown that the automatic detection of light verb constructions can be carried out using two effective machine learning-based approaches.
  3. Huo, W.: Automatic multi-word term extraction and its application to Web-page summarization (2012) 0.01
    0.006543072 = product of:
      0.019629216 = sum of:
        0.019629216 = product of:
          0.03925843 = sum of:
            0.03925843 = weight(_text_:22 in 563) [ClassicSimilarity], result of:
              0.03925843 = score(doc=563,freq=2.0), product of:
                0.16911483 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.048293278 = queryNorm
                0.23214069 = fieldWeight in 563, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=563)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Date
    10. 1.2013 19:22:47
  4. Lawrie, D.; Mayfield, J.; McNamee, P.; Oard, P.W.: Cross-language person-entity linking from 20 languages (2015) 0.01
    0.006543072 = product of:
      0.019629216 = sum of:
        0.019629216 = product of:
          0.03925843 = sum of:
            0.03925843 = weight(_text_:22 in 1848) [ClassicSimilarity], result of:
              0.03925843 = score(doc=1848,freq=2.0), product of:
                0.16911483 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.048293278 = queryNorm
                0.23214069 = fieldWeight in 1848, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1848)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Abstract
    The goal of entity linking is to associate references to an entity that is found in unstructured natural language content to an authoritative inventory of known entities. This article describes the construction of 6 test collections for cross-language person-entity linking that together span 22 languages. Fully automated components were used together with 2 crowdsourced validation stages to affordably generate ground-truth annotations with an accuracy comparable to that of a completely manual process. The resulting test collections each contain between 642 (Arabic) and 2,361 (Romanian) person references in non-English texts for which the correct resolution in English Wikipedia is known, plus a similar number of references for which no correct resolution into English Wikipedia is believed to exist. Fully automated cross-language person-name linking experiments with 20 non-English languages yielded a resolution accuracy of between 0.84 (Serbian) and 0.98 (Romanian), which compares favorably with previously reported cross-language entity linking results for Spanish.
  5. Fóris, A.: Network theory and terminology (2013) 0.01
    0.0054525603 = product of:
      0.01635768 = sum of:
        0.01635768 = product of:
          0.03271536 = sum of:
            0.03271536 = weight(_text_:22 in 1365) [ClassicSimilarity], result of:
              0.03271536 = score(doc=1365,freq=2.0), product of:
                0.16911483 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.048293278 = queryNorm
                0.19345059 = fieldWeight in 1365, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1365)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Date
    2. 9.2014 21:22:48
  6. Rötzer, F.: KI-Programm besser als Menschen im Verständnis natürlicher Sprache (2018) 0.00
    0.0043620486 = product of:
      0.013086145 = sum of:
        0.013086145 = product of:
          0.02617229 = sum of:
            0.02617229 = weight(_text_:22 in 4217) [ClassicSimilarity], result of:
              0.02617229 = score(doc=4217,freq=2.0), product of:
                0.16911483 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.048293278 = queryNorm
                0.15476047 = fieldWeight in 4217, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03125 = fieldNorm(doc=4217)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Date
    22. 1.2018 11:32:44
  7. Deventer, J.P. van; Kruger, C.J.; Johnson, R.D.: Delineating knowledge management through lexical analysis : a retrospective (2015) 0.00
    0.0038167923 = product of:
      0.011450376 = sum of:
        0.011450376 = product of:
          0.022900753 = sum of:
            0.022900753 = weight(_text_:22 in 3807) [ClassicSimilarity], result of:
              0.022900753 = score(doc=3807,freq=2.0), product of:
                0.16911483 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.048293278 = queryNorm
                0.1354154 = fieldWeight in 3807, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.02734375 = fieldNorm(doc=3807)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Date
    20. 1.2015 18:30:22