Search (72 results, page 1 of 4)

Bordoni, L.; Pazienza, M.T.: Documents automatic indexing in an environmental domain (1997) 0.13

0.13130128 = product of:
  0.19695193 = sum of:
    0.06772823 = weight(_text_:semantic in 530) [ClassicSimilarity], result of:
      0.06772823 = score(doc=530,freq=2.0), product of:
        0.21061863 = queryWeight, product of:
          4.1578603 = idf(docFreq=1879, maxDocs=44218)
          0.050655533 = queryNorm
        0.32156807 = fieldWeight in 530, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.1578603 = idf(docFreq=1879, maxDocs=44218)
          0.0546875 = fieldNorm(doc=530)
    0.1292237 = sum of:
      0.081181824 = weight(_text_:indexing in 530) [ClassicSimilarity], result of:
        0.081181824 = score(doc=530,freq=4.0), product of:
          0.19390269 = queryWeight, product of:
            3.8278677 = idf(docFreq=2614, maxDocs=44218)
            0.050655533 = queryNorm
          0.41867304 = fieldWeight in 530, product of:
            2.0 = tf(freq=4.0), with freq of:
              4.0 = termFreq=4.0
            3.8278677 = idf(docFreq=2614, maxDocs=44218)
            0.0546875 = fieldNorm(doc=530)
      0.048041876 = weight(_text_:22 in 530) [ClassicSimilarity], result of:
        0.048041876 = score(doc=530,freq=2.0), product of:
          0.17738704 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.050655533 = queryNorm
          0.2708308 = fieldWeight in 530, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0546875 = fieldNorm(doc=530)
  0.6666667 = coord(2/3)

Abstract: Describes an application of Natural Language Processing (NLP) techniques, in HIRMA (Hypertextual Information Retrieval Managed by ARIOSTO), to the problem of document indexing by referring to a system which incorporates natural language processing techniques to determine the subject of the text of documents and to associate them with relevant semantic indexes. Describes briefly the overall system, details of its implementation on a corpus of scientific abstracts related to environmental topics and experimental evidence of the system's behaviour. Analyzes in detail an experiment designed to evaluate the system's retrieval ability in terms of recall and precision
Source: International forum on information and documentation. 22(1997) no.1, S.17-28
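
The score breakdown for this record can be reproduced by hand. The following is a minimal sketch, assuming the standard Lucene ClassicSimilarity formulas (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))); the function and variable names are illustrative and not part of the listing, and all input numbers are copied from the explain tree for doc 530.

```python
import math


def idf(doc_freq, max_docs):
    # ClassicSimilarity inverse document frequency.
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))


def term_score(freq, doc_freq, max_docs, query_norm, field_norm):
    # score = queryWeight * fieldWeight, as in each "weight(...)" node.
    tf = math.sqrt(freq)
    term_idf = idf(doc_freq, max_docs)
    query_weight = term_idf * query_norm
    field_weight = tf * term_idf * field_norm
    return query_weight * field_weight


# Values taken from the explain output for doc 530.
MAX_DOCS = 44218
QUERY_NORM = 0.050655533
FIELD_NORM = 0.0546875

semantic = term_score(2.0, 1879, MAX_DOCS, QUERY_NORM, FIELD_NORM)
indexing = term_score(4.0, 2614, MAX_DOCS, QUERY_NORM, FIELD_NORM)
term_22 = term_score(2.0, 3622, MAX_DOCS, QUERY_NORM, FIELD_NORM)

# Two of the three top-level clauses matched, hence coord(2/3).
total = (semantic + (indexing + term_22)) * (2.0 / 3.0)
print(round(total, 6))
```

Lucene computes these values in single precision, so the result agrees with the listed 0.13130128 only up to small rounding differences.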

Prasad, A.R.D.: PROMETHEUS: an automatic indexing system (1996) 0.12

0.11671345 = product of:
  0.17507017 = sum of:
    0.10946535 = weight(_text_:semantic in 5189) [ClassicSimilarity], result of:
      0.10946535 = score(doc=5189,freq=4.0), product of:
        0.21061863 = queryWeight, product of:
          4.1578603 = idf(docFreq=1879, maxDocs=44218)
          0.050655533 = queryNorm
        0.51973253 = fieldWeight in 5189, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          4.1578603 = idf(docFreq=1879, maxDocs=44218)
          0.0625 = fieldNorm(doc=5189)
    0.06560482 = product of:
      0.13120964 = sum of:
        0.13120964 = weight(_text_:indexing in 5189) [ClassicSimilarity], result of:
          0.13120964 = score(doc=5189,freq=8.0), product of:
            0.19390269 = queryWeight, product of:
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.050655533 = queryNorm
            0.6766778 = fieldWeight in 5189, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.0625 = fieldNorm(doc=5189)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: An automatic indexing system using the tools and techniques of artificial intelligence is described. The paper presents the various components of the system, such as the parser, grammar formalism, lexicon, and the frame-based knowledge representation for semantic representation. The semantic representation is based on the Ranganathan school of thought, especially that of Deep Structure of Subject Indexing Languages enunciated by Bhattacharyya. The paper attempts to demonstrate the various steps in indexing by providing an illustration
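
In the explain tree for this record, a coord(1/2) factor is nested inside the outer coord(2/3). The sketch below, with a hypothetical helper name, combines the two leaf scores reported for doc 5189 to show how the nested coordination factors lead to the final value; it assumes coord is simply the fraction of matching clauses, as the listing indicates.

```python
def coord(overlap, max_overlap):
    # Fraction of query clauses that matched.
    return overlap / max_overlap


# Leaf scores copied from the explain output for doc 5189.
semantic = 0.10946535
indexing = 0.13120964

# Inner sub-query: one of two clauses matched.
inner = indexing * coord(1, 2)

# Outer query: two of three clauses matched.
total = (semantic + inner) * coord(2, 3)
print(round(total, 6))
```

The inner product corresponds to the 0.06560482 node and the outer product to the record score 0.11671345 in the listing.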

Liu, G.Z.: Semantic vector space model : implementation and evaluation (1997) 0.10

0.10294118 = product of:
  0.15441176 = sum of:
    0.12980995 = weight(_text_:semantic in 161) [ClassicSimilarity], result of:
      0.12980995 = score(doc=161,freq=10.0), product of:
        0.21061863 = queryWeight, product of:
          4.1578603 = idf(docFreq=1879, maxDocs=44218)
          0.050655533 = queryNorm
        0.616327 = fieldWeight in 161, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          4.1578603 = idf(docFreq=1879, maxDocs=44218)
          0.046875 = fieldNorm(doc=161)
    0.02460181 = product of:
      0.04920362 = sum of:
        0.04920362 = weight(_text_:indexing in 161) [ClassicSimilarity], result of:
          0.04920362 = score(doc=161,freq=2.0), product of:
            0.19390269 = queryWeight, product of:
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.050655533 = queryNorm
            0.2537542 = fieldWeight in 161, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.046875 = fieldNorm(doc=161)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: Presents the Semantic Vector Space Model (SVSM), a text representation and searching technique based on the combination of the Vector Space Model (VSM) with heuristic syntax parsing and distributed representation of semantic case structures. Both documents and queries are represented as semantic matrices. A search mechanism is designed to compute the similarity between 2 semantic matrices to predict relevancy. A prototype system was built to implement this model by modifying the SMART system and using the Xerox Part of Speech tagger as the pre-processor for the indexing. The prototype system was used in an experimental study to evaluate this technique in terms of precision, recall, and effectiveness of relevance ranking. Results show that if documents and queries were too short, the technique was less effective than VSM. But with longer documents and queries, especially when original documents were used as queries, the system based on this technique was found to perform better than SMART

Hlava, M.M.K.: Machine-Aided Indexing (MAI) in a multilingual environment (1992) 0.09

0.08947943 = product of:
  0.13421914 = sum of:
    0.077403694 = weight(_text_:semantic in 2378) [ClassicSimilarity], result of:
      0.077403694 = score(doc=2378,freq=2.0), product of:
        0.21061863 = queryWeight, product of:
          4.1578603 = idf(docFreq=1879, maxDocs=44218)
          0.050655533 = queryNorm
        0.36750638 = fieldWeight in 2378, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.1578603 = idf(docFreq=1879, maxDocs=44218)
          0.0625 = fieldNorm(doc=2378)
    0.05681544 = product of:
      0.11363088 = sum of:
        0.11363088 = weight(_text_:indexing in 2378) [ClassicSimilarity], result of:
          0.11363088 = score(doc=2378,freq=6.0), product of:
            0.19390269 = queryWeight, product of:
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.050655533 = queryNorm
            0.5860202 = fieldWeight in 2378, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.0625 = fieldNorm(doc=2378)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: The Machine-Aided Indexing (MAI) program, developed by Access Innovations, Inc., is a semantic based, Boolean statement, rule interpreting application designed to operate in a multilingual environment. Use of MAI across several languages with controlled vocabularies for each language provides a consistency in indexing not available through any other mechanism

Leyva, I.G.; Munoz, J.V.R.: Tendencias en los sistemas de indizacion automatica : estudio evolutivo (1996) 0.09

0.08947943 = product of:
  0.13421914 = sum of:
    0.077403694 = weight(_text_:semantic in 1462) [ClassicSimilarity], result of:
      0.077403694 = score(doc=1462,freq=2.0), product of:
        0.21061863 = queryWeight, product of:
          4.1578603 = idf(docFreq=1879, maxDocs=44218)
          0.050655533 = queryNorm
        0.36750638 = fieldWeight in 1462, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.1578603 = idf(docFreq=1879, maxDocs=44218)
          0.0625 = fieldNorm(doc=1462)
    0.05681544 = product of:
      0.11363088 = sum of:
        0.11363088 = weight(_text_:indexing in 1462) [ClassicSimilarity], result of:
          0.11363088 = score(doc=1462,freq=6.0), product of:
            0.19390269 = queryWeight, product of:
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.050655533 = queryNorm
            0.5860202 = fieldWeight in 1462, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.0625 = fieldNorm(doc=1462)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: Early research at the end of the 1950s on computerized indexing used statistical methods based on e.g. frequency, probability, clustering, and relevance. In the 1960s interest began to focus on linguistic analysis and natural language processing e.g. morphological, morphosyntactical, syntactical and semantic analysis. Since the 1980s computerized indexing research has widened to include images, graphics and sound. Examples are given of notable systems developed within each line of approach
Footnote: Translation of the title: Tendencies in computerized indexing systems: an evolutionary study

Hlava, M.M.K.: Machine aided indexing (MAI) in a multilingual environment (1993) 0.08

0.0782945 = product of:
  0.11744174 = sum of:
    0.06772823 = weight(_text_:semantic in 7405) [ClassicSimilarity], result of:
      0.06772823 = score(doc=7405,freq=2.0), product of:
        0.21061863 = queryWeight, product of:
          4.1578603 = idf(docFreq=1879, maxDocs=44218)
          0.050655533 = queryNorm
        0.32156807 = fieldWeight in 7405, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.1578603 = idf(docFreq=1879, maxDocs=44218)
          0.0546875 = fieldNorm(doc=7405)
    0.049713515 = product of:
      0.09942703 = sum of:
        0.09942703 = weight(_text_:indexing in 7405) [ClassicSimilarity], result of:
          0.09942703 = score(doc=7405,freq=6.0), product of:
            0.19390269 = queryWeight, product of:
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.050655533 = queryNorm
            0.5127677 = fieldWeight in 7405, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.0546875 = fieldNorm(doc=7405)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: The machine aided indexing (MAI) software developed by Access Innovations, Inc., is a semantic based, Boolean statement, rule interpreting application with 3 modules: the MA engine which accepts input files, matches terms in the knowledge base, interprets rules, and outputs a text file with suggested indexing terms; a rule building application allowing each Boolean style rule in the knowledge base to be created or modified; and a statistical computation module which analyzes performance of the MA software against text manually indexed by professional human indexers. The MA software can be applied across multiple languages and can be used where the text to be searched is in one language and the indexes to be output are in another

Gödert, W.; Liebig, M.: Maschinelle Indexierung auf dem Prüfstand : Ergebnisse eines Retrievaltests zum MILOS II Projekt (1997) 0.08

0.0782945 = product of:
  0.11744174 = sum of:
    0.06772823 = weight(_text_:semantic in 1174) [ClassicSimilarity], result of:
      0.06772823 = score(doc=1174,freq=2.0), product of:
        0.21061863 = queryWeight, product of:
          4.1578603 = idf(docFreq=1879, maxDocs=44218)
          0.050655533 = queryNorm
        0.32156807 = fieldWeight in 1174, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.1578603 = idf(docFreq=1879, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1174)
    0.049713515 = product of:
      0.09942703 = sum of:
        0.09942703 = weight(_text_:indexing in 1174) [ClassicSimilarity], result of:
          0.09942703 = score(doc=1174,freq=6.0), product of:
            0.19390269 = queryWeight, product of:
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.050655533 = queryNorm
            0.5127677 = fieldWeight in 1174, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1174)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: The test ran from Nov 95 to Aug 96 at the Cologne Fachhochschule für Bibliothekswesen (College of Librarianship). The test basis was a database of 190,000 book titles published between 1990 and 1995. MILOS II mechanized indexing methods proved helpful in avoiding or reducing the number of unsatisfactory or zero-result retrieval searches. Retrieval from mechanized indexing is 3 times more successful than from title keyword data. MILOS II also used a standardized semantic vocabulary. Mechanized indexing demands high quality software and output data

Tsujii, J.-I.: Automatic acquisition of semantic collocation from corpora (1995) 0.07

0.06990413 = product of:
  0.10485619 = sum of:
    0.077403694 = weight(_text_:semantic in 4709) [ClassicSimilarity], result of:
      0.077403694 = score(doc=4709,freq=2.0), product of:
        0.21061863 = queryWeight, product of:
          4.1578603 = idf(docFreq=1879, maxDocs=44218)
          0.050655533 = queryNorm
        0.36750638 = fieldWeight in 4709, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.1578603 = idf(docFreq=1879, maxDocs=44218)
          0.0625 = fieldNorm(doc=4709)
    0.0274525 = product of:
      0.054905 = sum of:
        0.054905 = weight(_text_:22 in 4709) [ClassicSimilarity], result of:
          0.054905 = score(doc=4709,freq=2.0), product of:
            0.17738704 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050655533 = queryNorm
            0.30952093 = fieldWeight in 4709, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=4709)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Date: 31. 7.1996 9:22:19

Lassalle, E.: Text retrieval : from a monolingual system to a multilingual system (1993) 0.06

0.064286895 = product of:
  0.09643034 = sum of:
    0.06772823 = weight(_text_:semantic in 7403) [ClassicSimilarity], result of:
      0.06772823 = score(doc=7403,freq=2.0), product of:
        0.21061863 = queryWeight, product of:
          4.1578603 = idf(docFreq=1879, maxDocs=44218)
          0.050655533 = queryNorm
        0.32156807 = fieldWeight in 7403, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.1578603 = idf(docFreq=1879, maxDocs=44218)
          0.0546875 = fieldNorm(doc=7403)
    0.028702112 = product of:
      0.057404224 = sum of:
        0.057404224 = weight(_text_:indexing in 7403) [ClassicSimilarity], result of:
          0.057404224 = score(doc=7403,freq=2.0), product of:
            0.19390269 = queryWeight, product of:
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.050655533 = queryNorm
            0.29604656 = fieldWeight in 7403, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.0546875 = fieldNorm(doc=7403)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: Describes the TELMI monolingual text retrieval system and its future extension, a multilingual system. TELMI is designed for medium sized databases containing short texts. The characteristics of the system are fine-grained natural language processing (NLP); an open domain and a large scale knowledge base; automated indexing based on conceptual representation of texts and reusability of the NLP tools. Discusses the French MINITEL service, the MGS information service and the TELMI research system covering the full text system; NLP architecture; the lexical level; the syntactic level; the semantic level and an example of the use of a generic system

Ward, M.L.: The future of the human indexer (1996) 0.05
```
0.053900834 = product of:
  0.1617025 = sum of:
    0.1617025 = sum of:
      0.12052374 = weight(_text_:indexing in 7244) [ClassicSimilarity], result of:
        0.12052374 = score(doc=7244,freq=12.0), product of:
          0.19390269 = queryWeight, product of:
            3.8278677 = idf(docFreq=2614, maxDocs=44218)
            0.050655533 = queryNorm
          0.6215682 = fieldWeight in 7244, product of:
            3.4641016 = tf(freq=12.0), with freq of:
              12.0 = termFreq=12.0
            3.8278677 = idf(docFreq=2614, maxDocs=44218)
            0.046875 = fieldNorm(doc=7244)
      0.04117875 = weight(_text_:22 in 7244) [ClassicSimilarity], result of:
        0.04117875 = score(doc=7244,freq=2.0), product of:
          0.17738704 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.050655533 = queryNorm
          0.23214069 = fieldWeight in 7244, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.046875 = fieldNorm(doc=7244)
  0.33333334 = coord(1/3)
```
Abstract: Considers the principles of indexing and the intellectual skills involved in order to determine what automatic indexing systems would be required in order to supplant or complement the human indexer. Good indexing requires: considerable prior knowledge of the literature; judgement as to what to index and what depth to index; reading skills; abstracting skills; and classification skills. Illustrates these features with a detailed description of abstracting and indexing processes involved in generating entries for the mechanical engineering database POWERLINK. Briefly assesses the possibility of replacing human indexers with specialist indexing software, with particular reference to the Object Analyzer from the InTEXT automatic indexing system and using the criteria described for human indexers. At present, it is unlikely that the automatic indexer will replace the human indexer, but when more primary texts are available in electronic form, it may be a useful productivity tool for dealing with large quantities of low grade texts (should they be wanted in the database)

Date: 9. 2.1997 18:44:22

Kutschekmanesch, S.; Lutes, B.; Moelle, K.; Thiel, U.; Tzeras, K.: Automated multilingual indexing : a synthesis of rule-based and thesaurus-based methods (1998) 0.05

0.050212428 = product of:
  0.15063728 = sum of:
    0.15063728 = sum of:
      0.08200603 = weight(_text_:indexing in 4157) [ClassicSimilarity], result of:
        0.08200603 = score(doc=4157,freq=2.0), product of:
          0.19390269 = queryWeight, product of:
            3.8278677 = idf(docFreq=2614, maxDocs=44218)
            0.050655533 = queryNorm
          0.42292362 = fieldWeight in 4157, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.8278677 = idf(docFreq=2614, maxDocs=44218)
            0.078125 = fieldNorm(doc=4157)
      0.068631254 = weight(_text_:22 in 4157) [ClassicSimilarity], result of:
        0.068631254 = score(doc=4157,freq=2.0), product of:
          0.17738704 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.050655533 = queryNorm
          0.38690117 = fieldWeight in 4157, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.078125 = fieldNorm(doc=4157)
  0.33333334 = coord(1/3)

Source: Information und Märkte: 50. Deutscher Dokumentartag 1998, Kongreß der Deutschen Gesellschaft für Dokumentation e.V. (DGD), Rheinische Friedrich-Wilhelms-Universität Bonn, 22.-24. September 1998. Ed. by Marlies Ockenfeld and Gerhard J. Mantwill

Tsareva, P.V.: Algoritmy dlya raspoznavaniya pozitivnykh i negativnykh vkhozdenii deskriptorov v tekst i protsedura avtomaticheskoi klassifikatsii tekstov (1999) 0.05

0.050212428 = product of:
  0.15063728 = sum of:
    0.15063728 = sum of:
      0.08200603 = weight(_text_:indexing in 374) [ClassicSimilarity], result of:
        0.08200603 = score(doc=374,freq=2.0), product of:
          0.19390269 = queryWeight, product of:
            3.8278677 = idf(docFreq=2614, maxDocs=44218)
            0.050655533 = queryNorm
          0.42292362 = fieldWeight in 374, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.8278677 = idf(docFreq=2614, maxDocs=44218)
            0.078125 = fieldNorm(doc=374)
      0.068631254 = weight(_text_:22 in 374) [ClassicSimilarity], result of:
        0.068631254 = score(doc=374,freq=2.0), product of:
          0.17738704 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.050655533 = queryNorm
          0.38690117 = fieldWeight in 374, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.078125 = fieldNorm(doc=374)
  0.33333334 = coord(1/3)

Date: 1. 4.2002 10:22:41
Footnote: Translation of the title: Algorithms for selection of positive and negative descriptors from text and automated text indexing

SIGIR'92 : Proceedings of the 15th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (1992) 0.04
```
0.041494764 = product of:
  0.062242147 = sum of:
    0.04789109 = weight(_text_:semantic in 6671) [ClassicSimilarity], result of:
      0.04789109 = score(doc=6671,freq=4.0), product of:
        0.21061863 = queryWeight, product of:
          4.1578603 = idf(docFreq=1879, maxDocs=44218)
          0.050655533 = queryNorm
        0.22738299 = fieldWeight in 6671, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          4.1578603 = idf(docFreq=1879, maxDocs=44218)
          0.02734375 = fieldNorm(doc=6671)
    0.014351056 = product of:
      0.028702112 = sum of:
        0.028702112 = weight(_text_:indexing in 6671) [ClassicSimilarity], result of:
          0.028702112 = score(doc=6671,freq=2.0), product of:
            0.19390269 = queryWeight, product of:
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.050655533 = queryNorm
            0.14802328 = fieldWeight in 6671, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.02734375 = fieldNorm(doc=6671)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)
```
Content: HARMAN, D.: Relevance feedback revisited; AALBERSBERG, I.J.: Incremental relevance feedback; TAGUE-SUTCLIFFE, J.: Measuring the informativeness of a retrieval process; LEWIS, D.D.: An evaluation of phrasal and clustered representations on a text categorization task; BLOSSEVILLE, M.J., G. HÉBRAIL, M.G. MONTEIL and N. PÉNOT: Automatic document classification: natural language processing, statistical analysis, and expert system techniques used together; MASAND, B., G. LINOFF and D. WALTZ: Classifying news stories using memory based reasoning; KEEN, E.M.: Term position ranking: some new test results; CROUCH, C.J. and B. YANG: Experiments in automatic statistical thesaurus construction; GREFENSTETTE, G.: Use of syntactic context to produce term association lists for text retrieval; ANICK, P.G. and R.A. FLYNN: Versioning of full-text information retrieval system; BURKOWSKI, F.J.: Retrieval activities in a database consisting of heterogeneous collections; DEERWESTER, S.C., K. WACLENA and M. LaMAR: A textual object management system; NIE, J.-Y.: Towards a probabilistic modal logic for semantic-based information retrieval; WANG, A.W., S.K.M. WONG and Y.Y. YAO: An analysis of vector space models based on computational geometry; BARTELL, B.T., G.W. COTTRELL and R.K. BELEW: Latent semantic indexing is an optimal special case of multidimensional scaling; GLAVITSCH, U. and P. SCHÄUBLE: A system for retrieving speech documents; MARGULIS, E.L.: N-Poisson document modelling; HESS, M.: An incrementally extensible document retrieval system based on linguistics and logical principles; COOPER, W.S., F.C. GEY and D.P. DABNEY: Probabilistic retrieval based on staged logistic regression; FUHR, N.: Integration of probabilistic fact and text retrieval; CROFT, B., L.A. SMITH and H. TURTLE: A loosely-coupled integration of a text retrieval system and an object-oriented database system; DUMAIS, S.T. and J. NIELSEN: Automating the assignment of submitted manuscripts to reviewers; GOST, M.A. and M. MASOTTI: Design of an OPAC database to permit different subject searching accesses; ROBERTSON, A.M. and P. WILLETT: Searching for historical word forms in a database of 17th century English text using spelling correction methods; FOX, E.A., Q.F. CHEN and L.S. HEATH: A faster algorithm for constructing minimal perfect hash functions; MOFFAT, A. and J. ZOBEL: Parameterised compression for sparse bitmaps; GRANDI, F., P. TIBERIO and P. ZEZULA: Frame-sliced partitioned parallel signature files; ALLEN, B.: Cognitive differences in end user searching of a CD-ROM index; SONNENWALD, D.H.: Developing a theory to guide the process of designing information retrieval systems; CUTTING, D.R., J.O. PEDERSEN, D. KARGER and J.W. TUKEY: Scatter/Gather: a cluster-based approach to browsing large document collections; CHALMERS, M. and P. CHITSON: Bead: Explorations in information visualization; WILLIAMSON, C. and B. SHNEIDERMAN: The dynamic HomeFinder: evaluating dynamic queries in a real-estate information exploring system

Chowdhury, G.G.: Natural language processing and information retrieval : pt.1: basic issues; pt.2: major applications (1991) 0.03

0.032251537 = product of:
  0.09675461 = sum of:
    0.09675461 = weight(_text_:semantic in 3313) [ClassicSimilarity], result of:
      0.09675461 = score(doc=3313,freq=2.0), product of:
        0.21061863 = queryWeight, product of:
          4.1578603 = idf(docFreq=1879, maxDocs=44218)
          0.050655533 = queryNorm
        0.45938298 = fieldWeight in 3313, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.1578603 = idf(docFreq=1879, maxDocs=44218)
          0.078125 = fieldNorm(doc=3313)
  0.33333334 = coord(1/3)

Abstract: Reviews the basic issues and procedures involved in natural language processing of textual material for final use in information retrieval. Covers: natural language processing; natural language understanding; syntactic and semantic analysis; parsing; knowledge bases and knowledge representation

Milstead, J.L.: Thesauri in a full-text world (1998) 0.03

0.03076755 = product of:
  0.09230265 = sum of:
    0.09230265 = sum of:
      0.05798702 = weight(_text_:indexing in 2337) [ClassicSimilarity], result of:
        0.05798702 = score(doc=2337,freq=4.0), product of:
          0.19390269 = queryWeight, product of:
            3.8278677 = idf(docFreq=2614, maxDocs=44218)
            0.050655533 = queryNorm
          0.29905218 = fieldWeight in 2337, product of:
            2.0 = tf(freq=4.0), with freq of:
              4.0 = termFreq=4.0
            3.8278677 = idf(docFreq=2614, maxDocs=44218)
            0.0390625 = fieldNorm(doc=2337)
      0.034315627 = weight(_text_:22 in 2337) [ClassicSimilarity], result of:
        0.034315627 = score(doc=2337,freq=2.0), product of:
          0.17738704 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.050655533 = queryNorm
          0.19345059 = fieldWeight in 2337, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0390625 = fieldNorm(doc=2337)
  0.33333334 = coord(1/3)

Abstract: Despite early claims to the contrary, thesauri continue to find use as access tools for information in the full-text environment. Their mode of use is changing, but this change actually represents an expansion rather than a contraction of their utility. Thesauri and similar vocabulary tools can complement full-text access by aiding users in focusing their searches, by supplementing the linguistic analysis of the text search engine, and even by serving as one of the tools used by the linguistic engine for its analysis. While human indexing continues to be used for many databases, the trend is to increase the use of machine aids for this purpose. All machine-aided indexing (MAI) systems rely on thesauri as the basis for term selection. In the 21st century, the balance of effort between human and machine will change at both input and output, but thesauri will continue to play an important role for the foreseeable future
Date: 22. 9.1997 19:16:05

Humphrey, S.M.: Automatic indexing of documents from journal descriptors : a preliminary investigation (1999) 0.03
```
0.025932586 = product of:
  0.077797756 = sum of:
    0.077797756 = product of:
      0.15559551 = sum of:
        0.15559551 = weight(_text_:indexing in 3769) [ClassicSimilarity], result of:
          0.15559551 = score(doc=3769,freq=20.0), product of:
            0.19390269 = queryWeight, product of:
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.050655533 = queryNorm
            0.80244124 = fieldWeight in 3769, product of:
              4.472136 = tf(freq=20.0), with freq of:
                20.0 = termFreq=20.0
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.046875 = fieldNorm(doc=3769)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)
```
Abstract: A new, fully automated approach for indexing documents is presented based on associating textwords in a training set of bibliographic citations with the indexing of journals. This journal-level indexing is in the form of a consistent, timely set of journal descriptors (JDs) indexing the individual journals themselves. This indexing is maintained in journal records in a serials authority database. The advantage of this novel approach is that the training set does not depend on previous manual indexing of thousands of documents (i.e., any such indexing already in the training set is not used), but rather the relatively small intellectual effort of indexing at the journal level, usually a matter of a few thousand unique journals for which retrospective indexing to maintain consistency and currency may be feasible. If successful, JD indexing would provide topical categorization of documents outside the training set, i.e., journal articles, monographs, Web documents, reports from the grey literature, etc., and therefore be applied in searching. Because JDs are quite general, corresponding to subject domains, their most probable use would be for improving or refining search results

Wan, T.-L.; Evens, M.; Wan, Y.-W.; Pao, Y.-Y.: Experiments with automatic indexing and a relational thesaurus in a Chinese information retrieval system (1997) 0.03
```
0.025312882 = product of:
  0.07593864 = sum of:
    0.07593864 = product of:
      0.15187728 = sum of:
        0.15187728 = weight(_text_:indexing in 956) [ClassicSimilarity], result of:
          0.15187728 = score(doc=956,freq=14.0), product of:
            0.19390269 = queryWeight, product of:
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.050655533 = queryNorm
            0.78326553 = fieldWeight in 956, product of:
              3.7416575 = tf(freq=14.0), with freq of:
                14.0 = termFreq=14.0
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.0546875 = fieldNorm(doc=956)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)
```
Abstract

This article describes a series of experiments with an interactive Chinese information retrieval system named CIRS and an interactive relational thesaurus. Two important issues have been explored: whether thesauri enhance the retrieval effectiveness of Chinese documents, and whether automatic indexing can compete with manual indexing in a Chinese information retrieval system. Recall and precision are used to measure and evaluate the effectiveness of the system. Statistical analysis of the recall and precision measures suggests that the use of the relational thesaurus does improve the retrieval effectiveness both in the automatic indexing environment and in the manual indexing environment and that automatic indexing is at least as good as manual indexing

Plaunt, C.; Norgard, B.A.: An association-based method for automatic indexing with a controlled vocabulary (1998) 0.03

0.025106214 = product of:
  0.07531864 = sum of:
    0.07531864 = sum of:
      0.041003015 = weight(_text_:indexing in 1794) [ClassicSimilarity], result of:
        0.041003015 = score(doc=1794,freq=2.0), product of:
          0.19390269 = queryWeight, product of:
            3.8278677 = idf(docFreq=2614, maxDocs=44218)
            0.050655533 = queryNorm
          0.21146181 = fieldWeight in 1794, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.8278677 = idf(docFreq=2614, maxDocs=44218)
            0.0390625 = fieldNorm(doc=1794)
      0.034315627 = weight(_text_:22 in 1794) [ClassicSimilarity], result of:
        0.034315627 = score(doc=1794,freq=2.0), product of:
          0.17738704 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.050655533 = queryNorm
          0.19345059 = fieldWeight in 1794, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0390625 = fieldNorm(doc=1794)
  0.33333334 = coord(1/3)

Date: 11. 9.2000 19:53:22
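
When two query terms match, as in this entry, the per-term scores are summed before the coordination factor is applied. The following is a minimal sketch under the same assumption of the standard Lucene ClassicSimilarity formulas. The helper name is illustrative; the inputs are copied from the explain tree above.

```python
import math

def classic_term_score(freq: float, doc_freq: int, max_docs: int,
                       query_norm: float, field_norm: float) -> float:
    # ClassicSimilarity: tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)).
    tf = math.sqrt(freq)
    idf = 1.0 + math.log(max_docs / (doc_freq + 1))
    return (idf * query_norm) * (tf * idf * field_norm)

MAX_DOCS = 44218
QUERY_NORM = 0.050655533
FIELD_NORM = 0.0390625  # fieldNorm(doc=1794)

# Term "indexing": freq=2.0, docFreq=2614.
indexing_score = classic_term_score(2.0, 2614, MAX_DOCS, QUERY_NORM, FIELD_NORM)
# Term "22": freq=2.0, docFreq=3622.
term22_score = classic_term_score(2.0, 3622, MAX_DOCS, QUERY_NORM, FIELD_NORM)

# Sum the matching clauses, then apply coord(1/3).
total = (indexing_score + term22_score) * (1.0 / 3.0)
print(round(indexing_score, 6), round(term22_score, 6), round(total, 6))
```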

Kim, P.K.: An automatic indexing of compound words based on mutual information for Korean text retrieval (1995) 0.02
```
0.024449475 = product of:
  0.073348425 = sum of:
    0.073348425 = product of:
      0.14669685 = sum of:
        0.14669685 = weight(_text_:indexing in 620) [ClassicSimilarity], result of:
          0.14669685 = score(doc=620,freq=10.0), product of:
            0.19390269 = queryWeight, product of:
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.050655533 = queryNorm
            0.7565488 = fieldWeight in 620, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.0625 = fieldNorm(doc=620)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)
```
Abstract

Presents an automatic indexing technique for compound words suitable for an agglutinative language, specifically Korean. Discusses some construction conditions for compound words and the rules for decomposing compound words to enhance the exhaustivity of indexing, demonstrating that this system, based on mutual information, enhances both the exhaustivity of indexing and the specificity of terms. Suggests that the construction conditions and rules for decomposition presented may be used in multilingual information retrieval systems to translate the indexing terms of the specific language into those of the language required

Li, Z.: Research on dynamic morphological indexing (1998) 0.02

0.024449475 = product of:
  0.073348425 = sum of:
    0.073348425 = product of:
      0.14669685 = sum of:
        0.14669685 = weight(_text_:indexing in 3242) [ClassicSimilarity], result of:
          0.14669685 = score(doc=3242,freq=10.0), product of:
            0.19390269 = queryWeight, product of:
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.050655533 = queryNorm
            0.7565488 = fieldWeight in 3242, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              3.8278677 = idf(docFreq=2614, maxDocs=44218)
              0.0625 = fieldNorm(doc=3242)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Abstract: Notes that in automatic indexing of Chinese words using dictionary matching methods, there is some difficulty in the indexing of proper nouns. Presents a solution called dynamic morphological indexing, based on work using automatic indexing of archive documents. Presents the algorithm for this solution
