Search (500 results, page 3 of 25)

  • × language_ss:"e"
  • × theme_ss:"Computerlinguistik"
  1. Stede, M.: Lexicalization in natural language generation : a survey (1994/95) 0.00
    Score breakdown (Lucene ClassicSimilarity, TF-IDF) for the term "a" in doc 1913:
    0.003827074 = coord(1/2) · coord(1/2) · queryWeight · fieldWeight, where
    queryWeight = idf(1.153047) · queryNorm(0.046056706) = 0.053105544 and
    fieldWeight = tf(√16.0 = 4.0) · idf(1.153047) · fieldNorm(0.0625) = 0.28826174.
    [The remaining results carry analogous ClassicSimilarity breakdowns, omitted below; a short sketch reproducing this arithmetic follows this record.]
    
    Abstract
    In natural language generation, a meaning representation of some kind is successively transformed into a sentence or a text. Naturally, a central subtask of this problem is the choice of words, or lexicalization. Proposes 4 major issues that determine how a generator tackles lexicalization, and surveys the contributions that research has made to them. Identifies open problems, and sketches a possible direction for research.
    Type
    a
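
    The breakdown above is Lucene ClassicSimilarity, i.e. TF-IDF with coordination and normalization factors. A minimal Python sketch reproducing the arithmetic, with every constant copied from the explain output (nothing below comes from Lucene itself):

        import math

        # Constants taken verbatim from the explain output for result 1.
        tf_raw, idf = 16.0, 1.153047
        field_norm, query_norm = 0.0625, 0.046056706
        coord = 0.5 * 0.5                                     # the two coord(1/2) factors

        query_weight = idf * query_norm                       # ~0.053105544
        field_weight = math.sqrt(tf_raw) * idf * field_norm   # ~0.28826174
        score = coord * query_weight * field_weight
        print(score)                                          # ~0.003827074, as listed
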
  2. Barriere, C.: Building a concept hierarchy from corpus analysis (2004) 0.00
    
    Type
    a
  3. Chung, T.M.: A corpus comparison approach for terminology extraction (2003) 0.00
    
    Type
    a
  4. Bernth, A.; McCord, M.; Warburton, K.: Terminology extraction for global content management (2003) 0.00
    
    Type
    a
  5. Czejdo, B.D.; Tucci, R.P.: A dataflow graphical language for database applications (1994) 0.00
    
    Abstract
    Discusses a graphical language for information retrieval and processing. A lot of recent activity has occurred in the area of improving access to database systems. However, current results are restricted to simple interfacing of database systems. Proposes a graphical language for specifying complex applications.
    Type
    a
  6. Koppel, M.; Akiva, N.; Dagan, I.: Feature instability as a criterion for selecting potential style markers (2006) 0.00
    
    Abstract
    We introduce a new measure on linguistic features, called stability, which captures the extent to which a language element such as a word or a syntactic construct is replaceable by semantically equivalent elements. This measure may be perceived as quantifying the degree of available "synonymy" for a language item. We show that frequent, but unstable, features are especially useful as discriminators of an author's writing style. [A toy proxy of this idea follows this record.]
    Type
    a
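
    The stability measure above invites a small demonstration. The sketch below is only a toy proxy: it approximates "replaceability" by counting WordNet synonyms, whereas the paper defines (in)stability differently; min_freq and top_n are invented knobs. Assumes nltk with the wordnet data installed.

        from collections import Counter
        from nltk.corpus import wordnet as wn   # needs nltk.download("wordnet")

        def synonym_count(word: str) -> int:
            """Distinct WordNet lemmas sharing a synset with `word`."""
            lemmas = {l.name() for s in wn.synsets(word) for l in s.lemmas()}
            lemmas.discard(word)
            return len(lemmas)

        def candidate_style_markers(tokens, top_n=20, min_freq=5):
            freq = Counter(t.lower() for t in tokens)
            frequent = [w for w, c in freq.items() if c >= min_freq]
            # Frequent but highly replaceable ("unstable") words rank first.
            return sorted(frequent, key=synonym_count, reverse=True)[:top_n]
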
  7. Lawson, V.; Vasconcellos, M.: Forty ways to skin a cat : users report on machine translation (1994) 0.00
    
    Abstract
    In the most extensive survey of machine translation (MT) use ever performed, explores the responses to a questionnaire survey of 40 MT users concerning their experiences.
    Type
    a
  8. McKelvie, D.; Brew, C.; Thompson, H.S.: Using SGML as a basis for data-intensive natural language processing (1998) 0.00
    
    Abstract
    Addresses the advantages and disadvantages of the SGML approach compared with a non-SGML database approach.
    Type
    a
  9. Rodriguez, H.; Climent, S.; Vossen, P.; Bloksma, L.; Peters, W.; Alonge, A.; Bertagna, F.; Roventini, A.: The top-down strategy for building EuroWordNet : vocabulary coverage, base concept and top ontology (1998) 0.00
    
    Type
    a
  10. Vechtomova, O.: A method for automatic extraction of multiword units representing business aspects from user reviews (2014) 0.00
    
    Abstract
    The article describes a semi-supervised approach to extracting multiword aspects of user-written reviews that belong to a given category. The method starts with a small set of seed words, representing the target category, and calculates distributional similarity between the candidate and seed words. We compare 3 distributional similarity measures (Lin's, Weeds's, and balAPinc), and a document retrieval function, BM25, adapted as a word similarity measure. We then introduce a method for identifying multiword aspects by using a combination of syntactic rules and a co-occurrence association measure. Finally, we describe a method for ranking multiword aspects by the likelihood of belonging to the target aspect category. The task used for evaluation is extraction of restaurant dish names from a corpus of restaurant reviews. [A minimal sketch of the seed-based first stage follows this record.]
    Type
    a
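
    A minimal sketch of the seed-driven first stage described above: build co-occurrence context vectors and rank candidate words by similarity to the seed set. Plain cosine stands in for the measures the paper actually compares (Lin's, Weeds's, balAPinc, adapted BM25), and window is an assumed parameter.

        import math
        from collections import Counter, defaultdict

        def context_vectors(sentences, window=2):
            """Co-occurrence counts within +/- `window` tokens."""
            vecs = defaultdict(Counter)
            for sent in sentences:
                for i, w in enumerate(sent):
                    lo, hi = max(0, i - window), min(len(sent), i + window + 1)
                    for j in range(lo, hi):
                        if j != i:
                            vecs[w][sent[j]] += 1
            return vecs

        def cosine(a, b):
            num = sum(a[k] * b[k] for k in set(a) & set(b))
            den = (math.sqrt(sum(v * v for v in a.values()))
                   * math.sqrt(sum(v * v for v in b.values())))
            return num / den if den else 0.0

        def rank_candidates(vecs, seeds):
            centroid = Counter()
            for s in seeds:                      # centroid of the seed vectors
                centroid.update(vecs.get(s, {}))
            scored = {w: cosine(v, centroid) for w, v in vecs.items() if w not in seeds}
            return sorted(scored.items(), key=lambda kv: -kv[1])
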
  11. Ruge, G.; Schwarz, C.: Term association and computational linguistics (1991) 0.00
    
    Abstract
    Most systems for term associations are statistically based. In general they exploit term co-occurrences. A critical overview of statistical approaches in this field is given. A new approach on the basis of a linguistic analysis for large amounts of textual data is outlined. [A sketch of the statistical baseline follows this record.]
    Type
    a
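
    A sketch of the statistical baseline the abstract refers to: scoring sentence-level term co-occurrence by pointwise mutual information (PMI). This is a generic stand-in, not the authors' linguistic method; min_count is an invented threshold.

        import math
        from collections import Counter
        from itertools import combinations

        def pmi_pairs(sentences, min_count=3):
            word_freq, pair_freq, n = Counter(), Counter(), 0
            for sent in sentences:
                n += 1
                uniq = sorted(set(sent))         # canonical order for pair keys
                word_freq.update(uniq)
                pair_freq.update(combinations(uniq, 2))
            scores = {}
            for (a, b), c in pair_freq.items():
                if c >= min_count:
                    # log P(a,b) / (P(a) * P(b)), probabilities over sentences
                    scores[(a, b)] = math.log((c / n) / ((word_freq[a] / n) * (word_freq[b] / n)))
            return sorted(scores.items(), key=lambda kv: -kv[1])
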
  12. Kettunen, K.; Kunttu, T.; Järvelin, K.: To stem or lemmatize a highly inflectional language in a probabilistic IR environment? (2005) 0.00
    
    Abstract
    Purpose - To show that stem generation compares well with lemmatization as a morphological tool for a highly inflectional language for IR purposes in a best-match retrieval system.
    Design/methodology/approach - Effects of three different morphological methods - lemmatization, stemming and stem production - for Finnish are compared in a probabilistic IR environment (INQUERY). Evaluation is done using a four-point relevance scale which is partitioned differently in different test settings.
    Findings - Results show that stem production, a lighter method than morphological lemmatization, compares well with lemmatization in a best-match IR environment. Differences in performance between stem production and lemmatization are small and they are not statistically significant in most of the tested settings. It is also shown that a hitherto rather neglected method of morphological processing for Finnish, stemming, performs reasonably well, although the stemmer used - a Porter stemmer implementation - is far from optimal for a morphologically complex language like Finnish. In another series of tests, the effects of compound splitting and derivational expansion of queries are tested.
    Practical implications - The usefulness of morphological lemmatization and stem generation for IR purposes can be estimated by many factors. On the average P-R level they seem to behave very close to each other in a probabilistic IR system. Thus, the choice of method for highly inflectional languages needs to be weighed along other dimensions too.
    Originality/value - Results are achieved using Finnish as an example of a highly inflectional language. The results are of interest for anyone who is interested in processing morphological variation of a highly inflected language for IR purposes. [A stemming illustration follows this record.]
    Type
    a
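
    For a feel of the lighter end of the morphology spectrum discussed above, the sketch below runs NLTK's Snowball stemmer for Finnish over a few inflected forms. It is a stand-in only: the paper's baselines were a Porter-style stemmer and lemmatization inside the INQUERY system, neither of which is reproduced here.

        from nltk.stem.snowball import SnowballStemmer

        stemmer = SnowballStemmer("finnish")
        # Inflected forms of "talo" (house): inessive, elative, illative plural.
        words = ["taloissa", "taloista", "taloihin"]
        print([stemmer.stem(w) for w in words])   # should collapse toward one stem
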
  13. Rettinger, A.; Schumilin, A.; Thoma, S.; Ell, B.: Learning a cross-lingual semantic representation of relations expressed in text (2015) 0.00
    
    Type
    a
  14. Comeau, D.C.; Wilbur, W.J.: Non-Word Identification or Spell Checking Without a Dictionary (2004) 0.00
    
    Abstract
    MEDLINE is a collection of more than 12 million references and abstracts covering recent life science literature. With its continued growth and cutting-edge terminology, spell-checking with a traditional lexicon-based approach requires significant additional manual follow-up. In this work, an internal corpus-based context quality rating a, frequency, and simple misspelling transformations are used to rank words from most likely to be misspellings to least likely. Eleven-point average precisions of 0.891 have been achieved within a class of 42,340 all-alphabetic words having an a score less than 10. Our models predict that 16,274 or 38% of these words are misspellings. Based on test data, this result has a recall of 79% and a precision of 86%. In other words, spell checking can be done by statistics instead of with a dictionary. As an application we examine the time history of low-a words in MEDLINE titles and abstracts. [A toy sketch of the frequency/edit-distance idea follows this record.]
    Type
    a
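
    A toy rendering of the dictionary-free idea above: flag a word as a likely misspelling when some one-edit variant of it is far more frequent in the corpus. The paper's actual ranking combines the context-quality score a with frequency; only the frequency/edit-transformation part is shown, and ratio is an invented threshold.

        import string
        from collections import Counter

        def one_edit_variants(word):
            letters = string.ascii_lowercase
            for i in range(len(word)):
                yield word[:i] + word[i + 1:]                 # deletion
                for c in letters:
                    yield word[:i] + c + word[i + 1:]         # substitution
            for i in range(len(word) + 1):
                for c in letters:
                    yield word[:i] + c + word[i:]             # insertion

        def likely_misspellings(tokens, ratio=100):
            freq = Counter(tokens)
            flagged = []
            for w, c in freq.items():
                best = max((freq[v] for v in one_edit_variants(w) if v in freq), default=0)
                if best >= ratio * c:     # a far more frequent near-neighbour exists
                    flagged.append((w, c, best))
            return sorted(flagged, key=lambda t: -t[2])
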
  15. Zhou, L.; Zhang, D.: NLPIR: a theoretical framework for applying Natural Language Processing to information retrieval (2003) 0.00
    
    Abstract
    Zhou and Zhang believe that for the potential of natural language processing (NLP) to be realized in information retrieval, a framework for guiding the effort should be in place. They provide a graphic model that identifies different levels of NLP effort during the query-document matching process. A direct matching approach uses little NLP; an expansion approach with thesauri, a little more; an extraction approach will often use a variety of NLP techniques, as well as statistical methods. A transformation approach, which creates intermediate representations of documents and queries, is a step higher in NLP usage, and a uniform approach, which relies on a body of knowledge beyond that of the documents and queries to provide inference and sense-making prior to matching, would require a maximum NLP effort. [The two lowest rungs are illustrated after this record.]
    Type
    a
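
    The two lowest rungs of the framework above, direct matching versus thesaurus expansion, contrast in a few lines. The thesaurus below is a made-up toy; the framework itself prescribes no particular resource.

        def direct_match(query_terms, doc_terms):
            return len(set(query_terms) & set(doc_terms))

        def expanded_match(query_terms, doc_terms, thesaurus):
            expanded = set(query_terms)
            for t in query_terms:
                expanded.update(thesaurus.get(t, []))   # add synonyms before matching
            return len(expanded & set(doc_terms))

        thesaurus = {"car": ["automobile", "vehicle"]}  # toy resource
        doc = "the automobile was parked".split()
        print(direct_match(["car"], doc), expanded_match(["car"], doc, thesaurus))  # 0 1
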
  16. Driscoll, J.R.; Rajala, D.A.; Shaffer, W.H.: The operation and performance of an artificially intelligent keywording system (1991) 0.00
    
    Abstract
    Presents a new approach to text analysis for automating the key phrase indexing process, using artificial intelligence techniques. This mimics the behaviour of human experts by using a rule base consisting of insertion and deletion rules generated by subject-matter experts. The insertion rules are based on the idea that some phrases found in a text imply or trigger other phrases. The deletion rules apply to semantically ambiguous phrases where text presence alone does not determine appropriateness as a key phrase. The insertion and deletion rules are used to transform a list of found phrases into a list of key phrases for indexing a document. Statistical data are provided to demonstrate the performance of this expert rule-based system. [A sketch of the rule machinery follows this record.]
    Type
    a
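
    A sketch of the insertion/deletion rule machinery the abstract describes. The rule contents are invented examples; in the real system they were authored by subject-matter experts, and deletion was conditioned on context rather than unconditional as simplified here.

        INSERTION_RULES = {                    # found phrase -> implied key phrases
            "neural network": ["machine learning"],
            "stemming": ["morphological analysis"],
        }
        DELETION_RULES = {"bank"}              # semantically ambiguous phrases

        def keyphrases(found_phrases):
            keys = set(found_phrases)
            for p in found_phrases:
                keys.update(INSERTION_RULES.get(p, []))   # insertion rules fire
            return sorted(keys - DELETION_RULES)          # deletion rules prune

        print(keyphrases(["stemming", "bank"]))   # ['morphological analysis', 'stemming']
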
  17. Haas, S.W.: A feasibility study of the case hierarchy model for the construction and porting of natural language interfaces (1990) 0.00
    
    Type
    a
  18. Sager, J.A.: A practical course in terminology processing : with a bibliography by Blaise Nkwenti-Azeh (1990) 0.00
    
  19. Fellbaum, C.: A semantic network of English : the mother of all WordNets (1998) 0.00
    
    Type
    a
  20. Sharada, B.A.: Identification and interpretation of metaphors in document titles (1999) 0.00
    
    Source
    Library science with a slant to documentation and information studies. 36(1999) no.1, pp.27-33
    Type
    a

Types

  • a 433
  • el 45
  • m 33
  • s 19
  • p 7
  • x 5
  • pat 1
  • r 1
