Search (41 results, page 1 of 3)

Rahmstorf, G.: Concept structures for large vocabularies (1998) 0.03

0.029550051 = product of:
  0.08865015 = sum of:
    0.08865015 = sum of:
      0.048260607 = weight(_text_:indexing in 75) [ClassicSimilarity], result of:
        0.048260607 = score(doc=75,freq=2.0), product of:
          0.19018644 = queryWeight, product of:
            3.8278677 = idf(docFreq=2614, maxDocs=44218)
            0.049684696 = queryNorm
          0.2537542 = fieldWeight in 75, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.8278677 = idf(docFreq=2614, maxDocs=44218)
            0.046875 = fieldNorm(doc=75)
      0.04038954 = weight(_text_:22 in 75) [ClassicSimilarity], result of:
        0.04038954 = score(doc=75,freq=2.0), product of:
          0.17398734 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.049684696 = queryNorm
          0.23214069 = fieldWeight in 75, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.046875 = fieldNorm(doc=75)
  0.33333334 = coord(1/3)

Abstract: A technology is described which supports the acquisition, visualisation and manipulation of large vocabularies with associated structures. It is used for dictionary production, terminology data bases, thesauri, library classification systems etc. Essential features of the technology are a lexicographic user interface, variable word description, unlimited list of word readings, a concept language, automatic transformations of formulas into graphic structures, structure manipulation operations and retransformation into formulas. The concept language includes notations for undefined concepts. The structure of defined concepts can be constructed interactively. The technology supports the generation of large vocabularies with structures representing word senses. Concept structures and ordering systems for indexing and retrieval can be constructed separately and connected by associating relations.
Date: 30.12.2001 19:01:22

Warner, A.J.: ¬The role of linguistic analysis in full-text retrieval (1994) 0.02

0.018768014 = product of:
  0.05630404 = sum of:
    0.05630404 = product of:
      0.11260808 = sum of:
        0.11260808 = weight(_text_:indexing in 2992) [ClassicSimilarity], result of:
          0.11260808 = score(doc=2992,freq=2.0), product of:
            0.19018644 = queryWeight, product of:
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.049684696 = queryNorm
            0.5920931 = fieldWeight in 2992, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.109375 = fieldNorm(doc=2992)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Source: Challenges in indexing electronic text and images. Ed.: R. Fidel et al

Garfield, E.: ¬The relationship between mechanical indexing, structural linguistics and information retrieval (1992) 0.02

0.018575516 = product of:
  0.055726547 = sum of:
    0.055726547 = product of:
      0.11145309 = sum of:
        0.11145309 = weight(_text_:indexing in 3632) [ClassicSimilarity], result of:
          0.11145309 = score(doc=3632,freq=6.0), product of:
            0.19018644 = queryWeight, product of:
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.049684696 = queryNorm
            0.5860202 = fieldWeight in 3632, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.0625 = fieldNorm(doc=3632)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Abstract: It is possible to locate over 60% of indexing terms used in the Current List of Medical Literature by analysing the titles of the articles. Citation indexes contain 'noise' and lack many pertinent citations. Mechanical indexing or analysis of text must begin with some linguistic technique. Discusses Harris' methods of structural linguistics, discourse analysis and transformational analysis. Provides 3 examples with references, abstracts and index entries

Smeaton, A.F.: Progress in the application of natural language processing to information retrieval tasks (1992) 0.02

0.016086869 = product of:
  0.048260607 = sum of:
    0.048260607 = product of:
      0.09652121 = sum of:
        0.09652121 = weight(_text_:indexing in 7080) [ClassicSimilarity], result of:
          0.09652121 = score(doc=7080,freq=2.0), product of:
            0.19018644 = queryWeight, product of:
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.049684696 = queryNorm
            0.5075084 = fieldWeight in 7080, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.09375 = fieldNorm(doc=7080)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Abstract: Account of recent developments in automatic and semi-automatic text indexing as well as in the generation of thesauri, text retrieval, abstracting and summarization

Hagn-Meincke, L.L.: Sprogspil pa tvaers : sprogfilosofiske teoriers betydning for indeksering og emnesogning (1999) 0.02

0.016086869 = product of:
  0.048260607 = sum of:
    0.048260607 = product of:
      0.09652121 = sum of:
        0.09652121 = weight(_text_:indexing in 4643) [ClassicSimilarity], result of:
          0.09652121 = score(doc=4643,freq=2.0), product of:
            0.19018644 = queryWeight, product of:
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.049684696 = queryNorm
            0.5075084 = fieldWeight in 4643, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.09375 = fieldNorm(doc=4643)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Footnote: Übers. d. Titels: Language-game interferences: the importance of linguistic theories for indexing and subject searching

Wright, L.W.; Nardini, H.K.G.; Aronson, A.R.; Rindflesch, T.C.: Hierarchical concept indexing of full-text documents in the Unified Medical Language System Information sources Map (1999) 0.02
```
0.016086869 = product of:
  0.048260607 = sum of:
    0.048260607 = product of:
      0.09652121 = sum of:
        0.09652121 = weight(_text_:indexing in 2111) [ClassicSimilarity], result of:
          0.09652121 = score(doc=2111,freq=8.0), product of:
            0.19018644 = queryWeight, product of:
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.049684696 = queryNorm
            0.5075084 = fieldWeight in 2111, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.046875 = fieldNorm(doc=2111)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)
```
Abstract

Full-text documents are a vital and rapidly growing part of online biomedical information. A single large document can contain as much information as a small database, but normally lacks the tight structure and consistent indexing of a database. Retrieval systems will often miss highly relevant parts of a document if the document as a whole appears irrelevant. Access to full-text information is further complicated by the need to search separately many disparate information resources. This research explores how these problems can be addressed by the combined use of 2 techniques: 1) natural language processing for automatic concept-based indexing of full text, and 2) methods for exploiting the structure and hierarchy of full-text documents. We describe methods for applying these techniques to a large collection of full-text documents drawn from the Health Services / Technology Assessment Text (HSTAT) database at the NLM and examine how this hierarchical concept indexing can assist both document- and source-level retrieval in the context of NLM's Information Source Map project

McMahon, J.G.; Smith, F.J.: Improved statistical language model performance with automatic generated word hierarchies (1996) 0.02

0.015707046 = product of:
  0.047121134 = sum of:
    0.047121134 = product of:
      0.09424227 = sum of:
        0.09424227 = weight(_text_:22 in 3164) [ClassicSimilarity], result of:
          0.09424227 = score(doc=3164,freq=2.0), product of:
            0.17398734 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049684696 = queryNorm
            0.5416616 = fieldWeight in 3164, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=3164)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Source: Computational linguistics. 22(1996) no.2, S.217-248

Ruge, G.: ¬A spreading activation network for automatic generation of thesaurus relationships (1991) 0.02

0.015707046 = product of:
  0.047121134 = sum of:
    0.047121134 = product of:
      0.09424227 = sum of:
        0.09424227 = weight(_text_:22 in 4506) [ClassicSimilarity], result of:
          0.09424227 = score(doc=4506,freq=2.0), product of:
            0.17398734 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049684696 = queryNorm
            0.5416616 = fieldWeight in 4506, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=4506)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Date: 8.10.2000 11:52:22

Somers, H.: Example-based machine translation : Review article (1999) 0.02

0.015707046 = product of:
  0.047121134 = sum of:
    0.047121134 = product of:
      0.09424227 = sum of:
        0.09424227 = weight(_text_:22 in 6672) [ClassicSimilarity], result of:
          0.09424227 = score(doc=6672,freq=2.0), product of:
            0.17398734 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049684696 = queryNorm
            0.5416616 = fieldWeight in 6672, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=6672)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Date: 31. 7.1996 9:22:19

New tools for human translators (1997) 0.02

0.015707046 = product of:
  0.047121134 = sum of:
    0.047121134 = product of:
      0.09424227 = sum of:
        0.09424227 = weight(_text_:22 in 1179) [ClassicSimilarity], result of:
          0.09424227 = score(doc=1179,freq=2.0), product of:
            0.17398734 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049684696 = queryNorm
            0.5416616 = fieldWeight in 1179, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=1179)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Date: 31. 7.1996 9:22:19

Baayen, R.H.; Lieber, H.: Word frequency distributions and lexical semantics (1997) 0.02

0.015707046 = product of:
  0.047121134 = sum of:
    0.047121134 = product of:
      0.09424227 = sum of:
        0.09424227 = weight(_text_:22 in 3117) [ClassicSimilarity], result of:
          0.09424227 = score(doc=3117,freq=2.0), product of:
            0.17398734 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049684696 = queryNorm
            0.5416616 = fieldWeight in 3117, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=3117)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Date: 28. 2.1999 10:48:22

Fox, C.: Lexical analysis and stoplists (1992) 0.02

0.015166845 = product of:
  0.045500536 = sum of:
    0.045500536 = product of:
      0.09100107 = sum of:
        0.09100107 = weight(_text_:indexing in 3502) [ClassicSimilarity], result of:
          0.09100107 = score(doc=3502,freq=4.0), product of:
            0.19018644 = queryWeight, product of:
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.049684696 = queryNorm
            0.47848347 = fieldWeight in 3502, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.0625 = fieldNorm(doc=3502)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Abstract: Lexical analysis is a fundamental operation in both query processing and automatic indexing, and filtering stoplist words is an important step in the automatic indexing process. Presents basic algorithms and data structures for lexical analysis, and shows how stoplist word removal can be efficiently incorporated into lexical analysis

Frakes, W.B.: Stemming algorithms (1992) 0.02

0.015166845 = product of:
  0.045500536 = sum of:
    0.045500536 = product of:
      0.09100107 = sum of:
        0.09100107 = weight(_text_:indexing in 3503) [ClassicSimilarity], result of:
          0.09100107 = score(doc=3503,freq=4.0), product of:
            0.19018644 = queryWeight, product of:
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.049684696 = queryNorm
            0.47848347 = fieldWeight in 3503, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.0625 = fieldNorm(doc=3503)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Abstract: Desribes stemming algorithms - programs that relate morphologically similar indexing and search terms. Stemming is used to improve retrieval effectiveness and to reduce the size of indexing files. Several approaches to stemming are describes - table lookup, affix removal, successor variety, and n-gram. empirical studies of stemming are summarized. The Porter stemmer is described in detail, and a full implementation in C is presented

Sharada, B.A.: Rules derivation for Kannada based indexing language using transformational grammar (1998) 0.02

0.015166845 = product of:
  0.045500536 = sum of:
    0.045500536 = product of:
      0.09100107 = sum of:
        0.09100107 = weight(_text_:indexing in 3533) [ClassicSimilarity], result of:
          0.09100107 = score(doc=3533,freq=4.0), product of:
            0.19018644 = queryWeight, product of:
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.049684696 = queryNorm
            0.47848347 = fieldWeight in 3533, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.0625 = fieldNorm(doc=3533)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Abstract: Discusses the importance of syntax in analysing document titles. In an natural language processing environment, suggests a rule for analysing the indexing language based on Kannada, one of the major Indian languages, using the principles of transformational grammar

Mustafa el Hadi, W.: ¬The contribution of terminology to the theoretical conception of classificatory languages and document indexing (1990) 0.02

0.015166845 = product of:
  0.045500536 = sum of:
    0.045500536 = product of:
      0.09100107 = sum of:
        0.09100107 = weight(_text_:indexing in 5273) [ClassicSimilarity], result of:
          0.09100107 = score(doc=5273,freq=4.0), product of:
            0.19018644 = queryWeight, product of:
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.049684696 = queryNorm
            0.47848347 = fieldWeight in 5273, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.0625 = fieldNorm(doc=5273)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Abstract: Demonstrates the contribution of indexing languages to the analysis of certain linguistic phenomena, and reciprocally, the contribution of linguistics to the analysis of semantic relationships used by documentalists in conceiving thesauri

Byrne, C.C.; McCracken, S.A.: ¬An adaptive thesaurus employing semantic distance, relational inheritance and nominal compound interpretation for linguistic support of information retrieval (1999) 0.01

0.0134631805 = product of:
  0.04038954 = sum of:
    0.04038954 = product of:
      0.08077908 = sum of:
        0.08077908 = weight(_text_:22 in 4483) [ClassicSimilarity], result of:
          0.08077908 = score(doc=4483,freq=2.0), product of:
            0.17398734 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049684696 = queryNorm
            0.46428138 = fieldWeight in 4483, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.09375 = fieldNorm(doc=4483)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Date: 15. 3.2000 10:22:37

Sabourin, C.F. (Bearb.): Computational linguistics in information science : bibliography (1994) 0.01

0.0134057235 = product of:
  0.04021717 = sum of:
    0.04021717 = product of:
      0.08043434 = sum of:
        0.08043434 = weight(_text_:indexing in 8280) [ClassicSimilarity], result of:
          0.08043434 = score(doc=8280,freq=2.0), product of:
            0.19018644 = queryWeight, product of:
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.049684696 = queryNorm
            0.42292362 = fieldWeight in 8280, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.078125 = fieldNorm(doc=8280)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Abstract: The bibliography covers information retrieval (2100 refs.), fulltext (890) or conceptual (60), automatic indexing (930), information extraction (520), query languages (1090), etc.; altogether 6390 references, fully indexed

Zimmermann, H.H.: Language and language technology (1991) 0.01

0.0134057235 = product of:
  0.04021717 = sum of:
    0.04021717 = product of:
      0.08043434 = sum of:
        0.08043434 = weight(_text_:indexing in 2568) [ClassicSimilarity], result of:
          0.08043434 = score(doc=2568,freq=2.0), product of:
            0.19018644 = queryWeight, product of:
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.049684696 = queryNorm
            0.42292362 = fieldWeight in 2568, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.078125 = fieldNorm(doc=2568)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Abstract: Considers aspects of language and linguistic studies that directly affect information handling including: electronic word processing (hyphenation, spelling correction, dictionary-based synonym provision); man-machine communication; machine understanding of spoken language; automatic indexing; and machine translation

Driscoll, J.R.; Rajala, D.A.; Shaffer, W.H.: ¬The operation and performance of an artificially intelligent keywording system (1991) 0.01
```
0.013270989 = product of:
  0.039812967 = sum of:
    0.039812967 = product of:
      0.079625934 = sum of:
        0.079625934 = weight(_text_:indexing in 6681) [ClassicSimilarity], result of:
          0.079625934 = score(doc=6681,freq=4.0), product of:
            0.19018644 = queryWeight, product of:
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.049684696 = queryNorm
            0.41867304 = fieldWeight in 6681, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.0546875 = fieldNorm(doc=6681)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)
```
Abstract

Presents a new approach to text analysis for automating the key phrase indexing process, using artificial intelligence techniques. This mimics the behaviour of human experts by using a rule base consisting of insertion and deletion rules generated by subject-matter experts. The insertion rules are based on the idea that some phrases found in a text imply or trigger other phrases. The deletion rules apply to semantically ambiguous phrases where text presence alone does not determine appropriateness as a key phrase. The insertion and deletion rules are used to transform a list of found phrases to a list of key phrases for indexing a document. Statistical data are provided to demonstrate the performance of this expert rule based system

Hutchins, J.: From first conception to first demonstration : the nascent years of machine translation, 1947-1954. A chronology (1997) 0.01

0.011219318 = product of:
  0.033657953 = sum of:
    0.033657953 = product of:
      0.06731591 = sum of:
        0.06731591 = weight(_text_:22 in 1463) [ClassicSimilarity], result of:
          0.06731591 = score(doc=1463,freq=2.0), product of:
            0.17398734 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049684696 = queryNorm
            0.38690117 = fieldWeight in 1463, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=1463)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Date: 31. 7.1996 9:22:19

Search (41 results, page 1 of 3)

Authors

Languages

Types

Themes

Subjects

Classifications