Search (29 results, page 1 of 2)

Godby, J.: WordSmith research project bridges gap between tokens and indexes (1998) 0.04

0.041952223 = product of:
  0.12585667 = sum of:
    0.12585667 = sum of:
      0.07843939 = weight(_text_:reports in 4729) [ClassicSimilarity], result of:
        0.07843939 = score(doc=4729,freq=2.0), product of:
          0.2251839 = queryWeight, product of:
            4.503953 = idf(docFreq=1329, maxDocs=44218)
            0.04999695 = queryNorm
          0.34833482 = fieldWeight in 4729, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            4.503953 = idf(docFreq=1329, maxDocs=44218)
            0.0546875 = fieldNorm(doc=4729)
      0.047417276 = weight(_text_:22 in 4729) [ClassicSimilarity], result of:
        0.047417276 = score(doc=4729,freq=2.0), product of:
          0.1750808 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.04999695 = queryNorm
          0.2708308 = fieldWeight in 4729, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0546875 = fieldNorm(doc=4729)
  0.33333334 = coord(1/3)

Abstract: Reports on an OCLC natural language processing research project to develop methods for identifying terminology in unstructured electronic text, especially material associated with new cultural trends and emerging subjects. Current OCLC production software can only identify single words as indexable terms in full text documents, thus a major goal of the WordSmith project is to develop software that can automatically identify and intelligently organize phrases for uses in database indexes. By analyzing user terminology from local newspapers in the USA, the latest cultural trends and technical developments as well as personal and geographic names have been drawm out. Notes that this new vocabulary can also be mapped into reference works
Source: OCLC newsletter. 1998, no.234, Jul/Aug, S.22-24

Garfield, E.: ¬The relationship between mechanical indexing, structural linguistics and information retrieval (1992) 0.03
```
0.032391485 = product of:
  0.09717445 = sum of:
    0.09717445 = weight(_text_:citation in 3632) [ClassicSimilarity], result of:
      0.09717445 = score(doc=3632,freq=2.0), product of:
        0.23445003 = queryWeight, product of:
          4.6892867 = idf(docFreq=1104, maxDocs=44218)
          0.04999695 = queryNorm
        0.4144783 = fieldWeight in 3632, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.6892867 = idf(docFreq=1104, maxDocs=44218)
          0.0625 = fieldNorm(doc=3632)
  0.33333334 = coord(1/3)
```
Abstract

It is possible to locate over 60% of indexing terms used in the Current List of Medical Literature by analysing the titles of the articles. Citation indexes contain 'noise' and lack many pertinent citations. Mechanical indexing or analysis of text must begin with some linguistic technique. Discusses Harris' methods of structural linguistics, discourse analysis and transformational analysis. Provides 3 examples with references, abstracts and index entries

McMahon, J.G.; Smith, F.J.: Improved statistical language model performance with automatic generated word hierarchies (1996) 0.02

0.015805759 = product of:
  0.047417276 = sum of:
    0.047417276 = product of:
      0.09483455 = sum of:
        0.09483455 = weight(_text_:22 in 3164) [ClassicSimilarity], result of:
          0.09483455 = score(doc=3164,freq=2.0), product of:
            0.1750808 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04999695 = queryNorm
            0.5416616 = fieldWeight in 3164, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=3164)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Source: Computational linguistics. 22(1996) no.2, S.217-248

Ruge, G.: ¬A spreading activation network for automatic generation of thesaurus relationships (1991) 0.02

0.015805759 = product of:
  0.047417276 = sum of:
    0.047417276 = product of:
      0.09483455 = sum of:
        0.09483455 = weight(_text_:22 in 4506) [ClassicSimilarity], result of:
          0.09483455 = score(doc=4506,freq=2.0), product of:
            0.1750808 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04999695 = queryNorm
            0.5416616 = fieldWeight in 4506, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=4506)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Date: 8.10.2000 11:52:22

Somers, H.: Example-based machine translation : Review article (1999) 0.02

0.015805759 = product of:
  0.047417276 = sum of:
    0.047417276 = product of:
      0.09483455 = sum of:
        0.09483455 = weight(_text_:22 in 6672) [ClassicSimilarity], result of:
          0.09483455 = score(doc=6672,freq=2.0), product of:
            0.1750808 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04999695 = queryNorm
            0.5416616 = fieldWeight in 6672, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=6672)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Date: 31. 7.1996 9:22:19

New tools for human translators (1997) 0.02

0.015805759 = product of:
  0.047417276 = sum of:
    0.047417276 = product of:
      0.09483455 = sum of:
        0.09483455 = weight(_text_:22 in 1179) [ClassicSimilarity], result of:
          0.09483455 = score(doc=1179,freq=2.0), product of:
            0.1750808 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04999695 = queryNorm
            0.5416616 = fieldWeight in 1179, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=1179)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Date: 31. 7.1996 9:22:19

Baayen, R.H.; Lieber, H.: Word frequency distributions and lexical semantics (1997) 0.02

0.015805759 = product of:
  0.047417276 = sum of:
    0.047417276 = product of:
      0.09483455 = sum of:
        0.09483455 = weight(_text_:22 in 3117) [ClassicSimilarity], result of:
          0.09483455 = score(doc=3117,freq=2.0), product of:
            0.1750808 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04999695 = queryNorm
            0.5416616 = fieldWeight in 3117, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=3117)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Date: 28. 2.1999 10:48:22

Sumbatyan, M.A.; Khazagerov, G.G.: Tipy ruskikh omoform i ikx avtomaticheskoe razvedenie (1997) 0.01

0.014940838 = product of:
  0.044822514 = sum of:
    0.044822514 = product of:
      0.08964503 = sum of:
        0.08964503 = weight(_text_:reports in 2259) [ClassicSimilarity], result of:
          0.08964503 = score(doc=2259,freq=2.0), product of:
            0.2251839 = queryWeight, product of:
              4.503953 = idf(docFreq=1329, maxDocs=44218)
              0.04999695 = queryNorm
            0.39809695 = fieldWeight in 2259, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.503953 = idf(docFreq=1329, maxDocs=44218)
              0.0625 = fieldNorm(doc=2259)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Abstract: Reports on the development of an algorithm which has been used to compile a comprehensive dictionary of Russian homonyms i.e. words with several meanings. The word 'lay' can serve as an example of an English homonym: it is either a verb in its own right (to lay) or the preterite of the verb 'to lie'. The compiled dictionary has been used to identify the existing individual types of homonyms

Pritchard-Schoch, T.: Comparing natural language retrieval : Win & Freestyle (1995) 0.01

0.014940838 = product of:
  0.044822514 = sum of:
    0.044822514 = product of:
      0.08964503 = sum of:
        0.08964503 = weight(_text_:reports in 2546) [ClassicSimilarity], result of:
          0.08964503 = score(doc=2546,freq=2.0), product of:
            0.2251839 = queryWeight, product of:
              4.503953 = idf(docFreq=1329, maxDocs=44218)
              0.04999695 = queryNorm
            0.39809695 = fieldWeight in 2546, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.503953 = idf(docFreq=1329, maxDocs=44218)
              0.0625 = fieldNorm(doc=2546)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Abstract: Reports on a comparison of 2 natural language interfaces to full text legal databases: WIN for access to WESTLAW databases and FREESTYLE for access to the LEXIS database. 30 legal issues in natural langugae queries were presented to identical libraries in both systems. The top 20 ranked documents from each search were analyzed and reviewed for relevance to the legal issue

Conceptual structures : theory, tools and applications. 6th International Conference on Conceptual Structures, ICCS'98, Montpellier, France, August, 10-12, 1998, Proceedings (1998) 0.01
```
0.014940838 = product of:
  0.044822514 = sum of:
    0.044822514 = product of:
      0.08964503 = sum of:
        0.08964503 = weight(_text_:reports in 1378) [ClassicSimilarity], result of:
          0.08964503 = score(doc=1378,freq=2.0), product of:
            0.2251839 = queryWeight, product of:
              4.503953 = idf(docFreq=1329, maxDocs=44218)
              0.04999695 = queryNorm
            0.39809695 = fieldWeight in 1378, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.503953 = idf(docFreq=1329, maxDocs=44218)
              0.0625 = fieldNorm(doc=1378)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)
```
Abstract

This book constitutes the refereed proceedings of the 6th International Conference on Conceptual Structures, ICCS'98, held in Montpellier, France, in August 1998. The 20 revised full papers and 10 research reports presented were carefully selected from a total of 66 submissions; also included are three invited contributions. The volume is divided in topical sections on knowledge representation and knowledge engineering, tools, conceptual graphs and other models, relationships with logics, algorithms and complexity, natural language processing, and applications.

Byrne, C.C.; McCracken, S.A.: ¬An adaptive thesaurus employing semantic distance, relational inheritance and nominal compound interpretation for linguistic support of information retrieval (1999) 0.01

0.013547793 = product of:
  0.04064338 = sum of:
    0.04064338 = product of:
      0.08128676 = sum of:
        0.08128676 = weight(_text_:22 in 4483) [ClassicSimilarity], result of:
          0.08128676 = score(doc=4483,freq=2.0), product of:
            0.1750808 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04999695 = queryNorm
            0.46428138 = fieldWeight in 4483, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.09375 = fieldNorm(doc=4483)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Date: 15. 3.2000 10:22:37

Ekmekcioglu, F.C.; Lynch, M.F.; Willet, P.: Development and evaluation of conflation techniques for the implementation of a document retrieval system for Turkish text databases (1995) 0.01
```
0.013073232 = product of:
  0.039219696 = sum of:
    0.039219696 = product of:
      0.07843939 = sum of:
        0.07843939 = weight(_text_:reports in 5797) [ClassicSimilarity], result of:
          0.07843939 = score(doc=5797,freq=2.0), product of:
            0.2251839 = queryWeight, product of:
              4.503953 = idf(docFreq=1329, maxDocs=44218)
              0.04999695 = queryNorm
            0.34833482 = fieldWeight in 5797, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.503953 = idf(docFreq=1329, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5797)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)
```
Abstract

Considers language processing techniques necessary for the implementation of a document retrieval system for Turkish text databases. Introduces the main characteristics of the Turkish language. Discusses the development of a stopword list and the evaluation of a stemming algorithm that takes account of the language's morphological structure. A 2 level description of Turkish morphology developed in Bilkent University, Ankara, is incorporated into a morphological parser, PC-KIMMO, to carry out stemming in Turkish databases. Describes the evaluation of string similarity measures - n-gram matching techniques - for Turkish. Reports experiments on 6 different Turkish text corpora
Greengrass, M.: Conflation methods for searching databases of Latin text (1996) 0.01
```
0.013073232 = product of:
  0.039219696 = sum of:
    0.039219696 = product of:
      0.07843939 = sum of:
        0.07843939 = weight(_text_:reports in 6987) [ClassicSimilarity], result of:
          0.07843939 = score(doc=6987,freq=2.0), product of:
            0.2251839 = queryWeight, product of:
              4.503953 = idf(docFreq=1329, maxDocs=44218)
              0.04999695 = queryNorm
            0.34833482 = fieldWeight in 6987, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.503953 = idf(docFreq=1329, maxDocs=44218)
              0.0546875 = fieldNorm(doc=6987)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)
```
Abstract

Describes the results of a project to develop conflation tools for searching databases of Latin text. Reports on the results of a questionnaire sent to 64 users of Latin text retrieval systems. Describes a Latin stemming algorithm that uses a simple longest match with some recoding but differs from most stemmers in its use of 2 separate suffix dictionaries for processing query and database words. Describes a retrieval system in which a user inputs the principal component of their search term, these components are stemmed and the resulting stems matched against the noun based and verb based stem dictionaries. Evaluates the system, describing its limitations, and a more complex system

Yeap, W.K.: Computing rich semantic models of text in legal domains (1998) 0.01

0.013073232 = product of:
  0.039219696 = sum of:
    0.039219696 = product of:
      0.07843939 = sum of:
        0.07843939 = weight(_text_:reports in 2675) [ClassicSimilarity], result of:
          0.07843939 = score(doc=2675,freq=2.0), product of:
            0.2251839 = queryWeight, product of:
              4.503953 = idf(docFreq=1329, maxDocs=44218)
              0.04999695 = queryNorm
            0.34833482 = fieldWeight in 2675, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.503953 = idf(docFreq=1329, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2675)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Abstract: The richness provided in a deep demantic model of text is appealing and yet few such models have been developed. Considers the problems with existing practical natural language processing (NLP) systems and the difficulties in developing such a model. Argues that a possible solution must focus on the reasoning process using knowledge of words rather than the use of other mechanisms and especially those that speed up the pre processing stage. Suggests also that computing representations of text that are transcripts of judges' oral reports on Family Law cases is a challenging text area for these techniques

Hutchins, J.: From first conception to first demonstration : the nascent years of machine translation, 1947-1954. A chronology (1997) 0.01

0.0112898275 = product of:
  0.033869483 = sum of:
    0.033869483 = product of:
      0.067738965 = sum of:
        0.067738965 = weight(_text_:22 in 1463) [ClassicSimilarity], result of:
          0.067738965 = score(doc=1463,freq=2.0), product of:
            0.1750808 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04999695 = queryNorm
            0.38690117 = fieldWeight in 1463, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=1463)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Date: 31. 7.1996 9:22:19

Lezius, W.; Rapp, R.; Wettler, M.: ¬A morphology-system and part-of-speech tagger for German (1996) 0.01

0.0112898275 = product of:
  0.033869483 = sum of:
    0.033869483 = product of:
      0.067738965 = sum of:
        0.067738965 = weight(_text_:22 in 1693) [ClassicSimilarity], result of:
          0.067738965 = score(doc=1693,freq=2.0), product of:
            0.1750808 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04999695 = queryNorm
            0.38690117 = fieldWeight in 1693, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=1693)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Date: 22. 3.2015 9:37:18

Wanner, L.: Lexical choice in text generation and machine translation (1996) 0.01

0.009031862 = product of:
  0.027095586 = sum of:
    0.027095586 = product of:
      0.054191172 = sum of:
        0.054191172 = weight(_text_:22 in 8521) [ClassicSimilarity], result of:
          0.054191172 = score(doc=8521,freq=2.0), product of:
            0.1750808 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04999695 = queryNorm
            0.30952093 = fieldWeight in 8521, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=8521)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Date: 31. 7.1996 9:22:19

Riloff, E.: ¬An empirical study of automated dictionary construction for information extraction in three domains (1996) 0.01

0.009031862 = product of:
  0.027095586 = sum of:
    0.027095586 = product of:
      0.054191172 = sum of:
        0.054191172 = weight(_text_:22 in 6752) [ClassicSimilarity], result of:
          0.054191172 = score(doc=6752,freq=2.0), product of:
            0.1750808 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04999695 = queryNorm
            0.30952093 = fieldWeight in 6752, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=6752)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Date: 6. 3.1997 16:22:15

Basili, R.; Pazienza, M.T.; Velardi, P.: ¬An empirical symbolic approach to natural language processing (1996) 0.01

0.009031862 = product of:
  0.027095586 = sum of:
    0.027095586 = product of:
      0.054191172 = sum of:
        0.054191172 = weight(_text_:22 in 6753) [ClassicSimilarity], result of:
          0.054191172 = score(doc=6753,freq=2.0), product of:
            0.1750808 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04999695 = queryNorm
            0.30952093 = fieldWeight in 6753, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=6753)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Date: 6. 3.1997 16:22:15

Haas, S.W.: Natural language processing : toward large-scale, robust systems (1996) 0.01
```
0.009031862 = product of:
  0.027095586 = sum of:
    0.027095586 = product of:
      0.054191172 = sum of:
        0.054191172 = weight(_text_:22 in 7415) [ClassicSimilarity], result of:
          0.054191172 = score(doc=7415,freq=2.0), product of:
            0.1750808 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04999695 = queryNorm
            0.30952093 = fieldWeight in 7415, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=7415)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)
```
Abstract

State of the art review of natural language processing updating an earlier review published in ARIST 22(1987). Discusses important developments that have allowed for significant advances in the field of natural language processing: materials and resources; knowledge based systems and statistical approaches; and a strong emphasis on evaluation. Reviews some natural language processing applications and common problems still awaiting solution. Considers closely related applications such as language generation and th egeneration phase of machine translation which face the same problems as natural language processing. Covers natural language methodologies for information retrieval only briefly

Search (29 results, page 1 of 2)

Authors

Languages

Types

Themes

Subjects

Classifications