Search (8 results, page 1 of 1)

Huo, W.: Automatic multi-word term extraction and its application to Web-page summarization (2012) 0.11

0.114436716 = product of:
  0.22887343 = sum of:
    0.21088406 = weight(_text_:2f in 563) [ClassicSimilarity], result of:
      0.21088406 = score(doc=563,freq=2.0), product of:
        0.3752265 = queryWeight, product of:
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.04425879 = queryNorm
        0.56201804 = fieldWeight in 563, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.046875 = fieldNorm(doc=563)
    0.017989364 = product of:
      0.035978727 = sum of:
        0.035978727 = weight(_text_:22 in 563) [ClassicSimilarity], result of:
          0.035978727 = score(doc=563,freq=2.0), product of:
            0.15498674 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04425879 = queryNorm
            0.23214069 = fieldWeight in 563, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=563)
      0.5 = coord(1/2)
  0.5 = coord(2/4)
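
The arithmetic in this explain tree can be reproduced by hand. The sketch below assumes the classic Lucene TF-IDF formulas, tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which match the figures shown. The function and variable names are illustrative only; queryNorm and fieldNorm are copied from the output rather than derived.

```python
import math

def idf(doc_freq: int, max_docs: int) -> float:
    # Assumed classic formula; reproduces 8.478011 for docFreq=24.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq: float, doc_freq: int, max_docs: int,
               query_norm: float, field_norm: float) -> float:
    tf = math.sqrt(freq)                      # 1.4142135 for freq=2.0
    term_idf = idf(doc_freq, max_docs)
    query_weight = term_idf * query_norm      # idf * queryNorm
    field_weight = tf * term_idf * field_norm # tf * idf * fieldNorm
    return query_weight * field_weight

# Values copied from the explain output for doc 563.
MAX_DOCS = 44218
QUERY_NORM = 0.04425879
FIELD_NORM = 0.046875

score_2f = term_score(2.0, 24, MAX_DOCS, QUERY_NORM, FIELD_NORM)
score_22 = term_score(2.0, 3622, MAX_DOCS, QUERY_NORM, FIELD_NORM)

# The "22" clause sits in a nested query with coord(1/2);
# the outer query then applies coord(2/4).
total = (score_2f + score_22 * 0.5) * 0.5

print(f"2f: {score_2f:.6f}  22: {score_22:.6f}  total: {total:.6f}")
```

The rare term "2f" (docFreq=24) contributes far more than the common term "22" (docFreq=3622) because idf enters the product twice, once in queryWeight and once in fieldWeight.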

Content: A thesis presented to the University of Guelph in partial fulfilment of requirements for the degree of Master of Science in Computer Science. See: http://www.inf.ufrgs.br%2F~ceramisch%2Fdownload_files%2Fpublications%2F2009%2Fp01.pdf.
Date: 10. 1.2013 19:22:47

Anguiano Peña, G.; Naumis Peña, C.: Method for selecting specialized terms from a general language corpus (2015) 0.03
```
0.033471715 = product of:
  0.13388686 = sum of:
    0.13388686 = weight(_text_:assisted in 2196) [ClassicSimilarity], result of:
      0.13388686 = score(doc=2196,freq=2.0), product of:
        0.29897895 = queryWeight, product of:
          6.7552447 = idf(docFreq=139, maxDocs=44218)
          0.04425879 = queryNorm
        0.44781366 = fieldWeight in 2196, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          6.7552447 = idf(docFreq=139, maxDocs=44218)
          0.046875 = fieldNorm(doc=2196)
  0.25 = coord(1/4)
```
Abstract

Among the many aspects studied by library and information science are linguistic phenomena associated with document content analysis, for purposes of both information organization and retrieval. To this end, terms used in scientific and technical language must be recovered and their area of domain and behavior studied. Through language, society controls the knowledge available to people. Document content analysis, in this case of scientific texts, facilitates gathering knowledge of lexical units and their major applications and separating such specialized terms from the general language, to create indexing languages. The model presented here or other lexicographic resources with similar characteristics may be useful in the near future, in computer-assisted indexing or as corpora monitors, with respect to new text analyses or specialized corpora. Thus, using techniques for document content analysis of a lexicographically labeled general language corpus proposed herein, components which enable the extraction of lexical units from specialized language may be obtained and characterized.

Ramisch, C.: Multiword expressions acquisition : a generic and open framework (2015) 0.02
```
0.022314476 = product of:
  0.0892579 = sum of:
    0.0892579 = weight(_text_:assisted in 1649) [ClassicSimilarity], result of:
      0.0892579 = score(doc=1649,freq=2.0), product of:
        0.29897895 = queryWeight, product of:
          6.7552447 = idf(docFreq=139, maxDocs=44218)
          0.04425879 = queryNorm
        0.29854244 = fieldWeight in 1649, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          6.7552447 = idf(docFreq=139, maxDocs=44218)
          0.03125 = fieldNorm(doc=1649)
  0.25 = coord(1/4)
```
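
The two results matching "assisted" (docs 2196 and 1649) have identical tf, idf and queryNorm, so their scores differ only through fieldNorm. A small check, using values copied from the two explain trees (variable names are illustrative; that fieldNorm reflects field length is an assumption about the index, not something this output states):

```python
# Term scores and fieldNorms copied from the explain output.
score_2196, norm_2196 = 0.13388686, 0.046875
score_1649, norm_1649 = 0.0892579, 0.03125

# With every other factor equal, the score scales linearly with fieldNorm.
norm_ratio = norm_1649 / norm_2196
predicted_1649 = score_2196 * norm_ratio

print(f"fieldNorm ratio: {norm_ratio:.4f}")
print(f"predicted: {predicted_1649:.7f}  reported: {score_1649:.7f}")
```

The same ratio carries through coord(1/4) to the final scores, 0.033471715 and 0.022314476.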
Abstract

This book is an excellent introduction to multiword expressions. It provides a unique, comprehensive and up-to-date overview of this exciting topic in computational linguistics. The first part describes the diversity and richness of multiword expressions, including many examples in several languages. These constructions are not only complex and arbitrary, but also much more frequent than one would guess, making them a real nightmare for natural language processing applications. The second part introduces a new generic framework for automatic acquisition of multiword expressions from texts. Furthermore, it describes the accompanying free software tool, the mwetoolkit, which comes in handy when looking for expressions in texts (regardless of the language). Evaluation is greatly emphasized, underlining the fact that results depend on parameters like corpus size, language, MWE type, etc. The last part contains solid experimental results and evaluates the mwetoolkit, demonstrating its usefulness for computer-assisted lexicography and machine translation. This is the first book to cover the whole pipeline of multiword expression acquisition in a single volume. It addresses the needs of students and researchers in computational and theoretical linguistics, cognitive sciences, artificial intelligence and computer science. Its good balance between computational and linguistic views makes it the perfect starting point for anyone interested in multiword expressions, language and text processing in general.

Lezius, W.: Morphy - Morphologie und Tagging für das Deutsche (2013) 0.01

0.005996455 = product of:
  0.02398582 = sum of:
    0.02398582 = product of:
      0.04797164 = sum of:
        0.04797164 = weight(_text_:22 in 1490) [ClassicSimilarity], result of:
          0.04797164 = score(doc=1490,freq=2.0), product of:
            0.15498674 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04425879 = queryNorm
            0.30952093 = fieldWeight in 1490, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=1490)
      0.5 = coord(1/2)
  0.25 = coord(1/4)

Date: 22. 3.2015 9:30:24

Lawrie, D.; Mayfield, J.; McNamee, P.; Oard, P.W.: Cross-language person-entity linking from 20 languages (2015) 0.00
```
0.004497341 = product of:
  0.017989364 = sum of:
    0.017989364 = product of:
      0.035978727 = sum of:
        0.035978727 = weight(_text_:22 in 1848) [ClassicSimilarity], result of:
          0.035978727 = score(doc=1848,freq=2.0), product of:
            0.15498674 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04425879 = queryNorm
            0.23214069 = fieldWeight in 1848, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=1848)
      0.5 = coord(1/2)
  0.25 = coord(1/4)
```
Abstract

The goal of entity linking is to associate references to an entity that is found in unstructured natural language content to an authoritative inventory of known entities. This article describes the construction of 6 test collections for cross-language person-entity linking that together span 22 languages. Fully automated components were used together with 2 crowdsourced validation stages to affordably generate ground-truth annotations with an accuracy comparable to that of a completely manual process. The resulting test collections each contain between 642 (Arabic) and 2,361 (Romanian) person references in non-English texts for which the correct resolution in English Wikipedia is known, plus a similar number of references for which no correct resolution into English Wikipedia is believed to exist. Fully automated cross-language person-name linking experiments with 20 non-English languages yielded a resolution accuracy of between 0.84 (Serbian) and 0.98 (Romanian), which compares favorably with previously reported cross-language entity linking results for Spanish.

Fóris, A.: Network theory and terminology (2013) 0.00

0.0037477845 = product of:
  0.014991138 = sum of:
    0.014991138 = product of:
      0.029982276 = sum of:
        0.029982276 = weight(_text_:22 in 1365) [ClassicSimilarity], result of:
          0.029982276 = score(doc=1365,freq=2.0), product of:
            0.15498674 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04425879 = queryNorm
            0.19345059 = fieldWeight in 1365, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1365)
      0.5 = coord(1/2)
  0.25 = coord(1/4)

Date: 2. 9.2014 21:22:48

Rötzer, F.: KI-Programm besser als Menschen im Verständnis natürlicher Sprache (2018) 0.00

0.0029982275 = product of:
  0.01199291 = sum of:
    0.01199291 = product of:
      0.02398582 = sum of:
        0.02398582 = weight(_text_:22 in 4217) [ClassicSimilarity], result of:
          0.02398582 = score(doc=4217,freq=2.0), product of:
            0.15498674 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04425879 = queryNorm
            0.15476047 = fieldWeight in 4217, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03125 = fieldNorm(doc=4217)
      0.5 = coord(1/2)
  0.25 = coord(1/4)

Date: 22. 1.2018 11:32:44

Deventer, J.P. van; Kruger, C.J.; Johnson, R.D.: Delineating knowledge management through lexical analysis : a retrospective (2015) 0.00

0.002623449 = product of:
  0.010493796 = sum of:
    0.010493796 = product of:
      0.020987593 = sum of:
        0.020987593 = weight(_text_:22 in 3807) [ClassicSimilarity], result of:
          0.020987593 = score(doc=3807,freq=2.0), product of:
            0.15498674 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04425879 = queryNorm
            0.1354154 = fieldWeight in 3807, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.02734375 = fieldNorm(doc=3807)
      0.5 = coord(1/2)
  0.25 = coord(1/4)

Date: 20. 1.2015 18:30:22
