Search (99 results, page 1 of 5)

  • theme_ss:"Automatisches Indexieren"
  • year_i:[1990 TO 2000}
  1. Ward, M.L.: The future of the human indexer (1996) 0.06
    0.05920849 = product of:
      0.08881273 = sum of:
        0.022382967 = weight(_text_:of in 7244) [ClassicSimilarity], result of:
          0.022382967 = score(doc=7244,freq=14.0), product of:
            0.08160993 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.05218836 = queryNorm
            0.2742677 = fieldWeight in 7244, product of:
              3.7416575 = tf(freq=14.0), with freq of:
                14.0 = termFreq=14.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.046875 = fieldNorm(doc=7244)
        0.066429764 = sum of:
          0.02400495 = weight(_text_:science in 7244) [ClassicSimilarity], result of:
            0.02400495 = score(doc=7244,freq=2.0), product of:
              0.13747036 = queryWeight, product of:
                2.6341193 = idf(docFreq=8627, maxDocs=44218)
                0.05218836 = queryNorm
              0.17461908 = fieldWeight in 7244, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.6341193 = idf(docFreq=8627, maxDocs=44218)
                0.046875 = fieldNorm(doc=7244)
          0.042424813 = weight(_text_:22 in 7244) [ClassicSimilarity], result of:
            0.042424813 = score(doc=7244,freq=2.0), product of:
              0.18275474 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.05218836 = queryNorm
              0.23214069 = fieldWeight in 7244, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.046875 = fieldNorm(doc=7244)
      0.6666667 = coord(2/3)
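     The score breakdown above (repeated for each entry below) is a standard Lucene ClassicSimilarity explanation: for every matching query term, tf (the square root of the term frequency) is multiplied by idf and by the field norm to give the fieldWeight; idf times queryNorm gives the queryWeight; and their product is that term's contribution. The contributions are summed and scaled by the coordination factor coord. A minimal sketch in Python that recomputes the first partial score of this entry (the figures are copied from the explanation above; the variable names are illustrative, not part of the search system):

       # Term "of" in doc 7244, values taken from the explanation above.
       freq       = 14.0        # termFreq of "of" in the field
       idf        = 1.5637573   # idf(docFreq=25162, maxDocs=44218)
       query_norm = 0.05218836  # queryNorm
       field_norm = 0.046875    # fieldNorm(doc=7244)

       tf           = freq ** 0.5                  # 3.7416575
       query_weight = idf * query_norm             # 0.08160993
       field_weight = tf * idf * field_norm        # 0.2742677
       score        = query_weight * field_weight  # 0.022382967

       print(score)  # ~0.0224, matching the first summand above

     The per-term scores of an entry are summed and multiplied by coord(2/3) (two of the three query clauses matched) to give the figure shown next to the title.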
    
    Abstract
     Considers the principles of indexing and the intellectual skills involved in order to determine what automatic indexing systems would be required in order to supplant or complement the human indexer. Good indexing requires: considerable prior knowledge of the literature; judgement as to what to index and to what depth; reading skills; abstracting skills; and classification skills. Illustrates these features with a detailed description of the abstracting and indexing processes involved in generating entries for the mechanical engineering database POWERLINK. Briefly assesses the possibility of replacing human indexers with specialist indexing software, with particular reference to the Object Analyzer from the InTEXT automatic indexing system, using the criteria described for human indexers. At present it is unlikely that the automatic indexer will replace the human indexer, but when more primary texts are available in electronic form it may be a useful productivity tool for dealing with large quantities of low-grade texts (should they be wanted in the database).
    Date
    9. 2.1997 18:44:22
    Source
    Journal of librarianship and information science. 28(1996) no.4, S.217-225
  2. Milstead, J.L.: Thesauri in a full-text world (1998) 0.06
    0.057292055 = product of:
      0.08593808 = sum of:
        0.022293966 = weight(_text_:of in 2337) [ClassicSimilarity], result of:
          0.022293966 = score(doc=2337,freq=20.0), product of:
            0.08160993 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.05218836 = queryNorm
            0.27317715 = fieldWeight in 2337, product of:
              4.472136 = tf(freq=20.0), with freq of:
                20.0 = termFreq=20.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2337)
        0.06364411 = sum of:
          0.028290104 = weight(_text_:science in 2337) [ClassicSimilarity], result of:
            0.028290104 = score(doc=2337,freq=4.0), product of:
              0.13747036 = queryWeight, product of:
                2.6341193 = idf(docFreq=8627, maxDocs=44218)
                0.05218836 = queryNorm
              0.20579056 = fieldWeight in 2337, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.6341193 = idf(docFreq=8627, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2337)
          0.03535401 = weight(_text_:22 in 2337) [ClassicSimilarity], result of:
            0.03535401 = score(doc=2337,freq=2.0), product of:
              0.18275474 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.05218836 = queryNorm
              0.19345059 = fieldWeight in 2337, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2337)
      0.6666667 = coord(2/3)
    
    Abstract
     Despite early claims to the contrary, thesauri continue to find use as access tools for information in the full-text environment. Their mode of use is changing, but this change actually represents an expansion rather than a contradiction of their utility. Thesauri and similar vocabulary tools can complement full-text access by aiding users in focusing their searches, by supplementing the linguistic analysis of the text search engine, and even by serving as one of the tools used by the linguistic engine for its analysis. While human indexing continues to be used for many databases, the trend is to increase the use of machine aids for this purpose. All machine-aided indexing (MAI) systems rely on thesauri as the basis for term selection. In the 21st century, the balance of effort between human and machine will change at both input and output, but thesauri will continue to play an important role for the foreseeable future.
    Date
    22. 9.1997 19:16:05
    Imprint
    Urbana-Champaign, IL : Illinois University at Urbana-Champaign, Graduate School of Library and Information Science
    Source
    Visualizing subject access for 21st century information resources: Papers presented at the 1997 Clinic on Library Applications of Data Processing, 2-4 Mar 1997, Graduate School of Library and Information Science, University of Illinois at Urbana-Champaign. Ed.: P.A. Cochrane et al
  3. Plaunt, C.; Norgard, B.A.: An association-based method for automatic indexing with a controlled vocabulary (1998) 0.05
    0.05019898 = product of:
      0.075298466 = sum of:
        0.019940332 = weight(_text_:of in 1794) [ClassicSimilarity], result of:
          0.019940332 = score(doc=1794,freq=16.0), product of:
            0.08160993 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.05218836 = queryNorm
            0.24433708 = fieldWeight in 1794, product of:
              4.0 = tf(freq=16.0), with freq of:
                16.0 = termFreq=16.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1794)
        0.055358134 = sum of:
          0.020004123 = weight(_text_:science in 1794) [ClassicSimilarity], result of:
            0.020004123 = score(doc=1794,freq=2.0), product of:
              0.13747036 = queryWeight, product of:
                2.6341193 = idf(docFreq=8627, maxDocs=44218)
                0.05218836 = queryNorm
              0.1455159 = fieldWeight in 1794, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.6341193 = idf(docFreq=8627, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1794)
          0.03535401 = weight(_text_:22 in 1794) [ClassicSimilarity], result of:
            0.03535401 = score(doc=1794,freq=2.0), product of:
              0.18275474 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.05218836 = queryNorm
              0.19345059 = fieldWeight in 1794, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1794)
      0.6666667 = coord(2/3)
    
    Abstract
     In this article, we describe and test a two-stage algorithm based on a lexical collocation technique which maps from the lexical clues contained in a document representation into a controlled vocabulary list of subject headings. Using a collection of 4,626 INSPEC documents, we create a 'dictionary' of associations between the lexical items contained in the titles, authors, and abstracts, and the controlled vocabulary subject headings assigned to those records by human indexers, using a likelihood ratio statistic as the measure of association. In the deployment stage, we use the dictionary to predict which of the controlled vocabulary subject headings best describe new documents when they are presented to the system. Our evaluation of this algorithm, in which we compare the automatically assigned subject headings to the subject headings assigned to the test documents by human catalogers, shows that we can obtain results comparable to, and consistent with, human cataloging. In effect we have cast this as a classic partial-match information retrieval problem. We consider the problem to be one of 'retrieving' (or assigning) the most probably 'relevant' (or correct) controlled vocabulary subject headings to a document based on the clues contained in that document.
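     The abstract describes a likelihood-ratio association between lexical items and subject headings without reproducing the statistic itself. As an illustration only (the contingency counts and function name below are invented, not taken from Plaunt and Norgard), a log-likelihood ratio for one item/heading pair can be computed from a 2x2 table of document counts:

       import math

       def llr(k11, k12, k21, k22):
           # Log-likelihood ratio (G^2) for a 2x2 contingency table of document counts:
           # k11 = docs with both the lexical item and the heading,
           # k12 = item without heading, k21 = heading without item, k22 = neither.
           def h(*ks):  # sum of k*ln(k) over the non-zero cells
               return sum(k * math.log(k) for k in ks if k > 0)
           n = k11 + k12 + k21 + k22
           return 2 * (h(k11, k12, k21, k22) + h(n)
                       - h(k11 + k12, k21 + k22)    # row totals
                       - h(k11 + k21, k12 + k22))   # column totals

       # Hypothetical counts for one (item, heading) pair in a 4,626-document collection.
       print(llr(k11=30, k12=70, k21=120, k22=4406))  # larger values = stronger association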
    Date
    11. 9.2000 19:53:22
    Source
    Journal of the American Society for Information Science. 49(1998) no.10, S.888-902
  4. Thiel, T.J.: Automated indexing of information stored on optical disk electronic document image management systems (1994) 0.04
    0.037281495 = product of:
      0.05592224 = sum of:
        0.027916465 = weight(_text_:of in 1260) [ClassicSimilarity], result of:
          0.027916465 = score(doc=1260,freq=4.0), product of:
            0.08160993 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.05218836 = queryNorm
            0.34207192 = fieldWeight in 1260, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.109375 = fieldNorm(doc=1260)
        0.028005775 = product of:
          0.05601155 = sum of:
            0.05601155 = weight(_text_:science in 1260) [ClassicSimilarity], result of:
              0.05601155 = score(doc=1260,freq=2.0), product of:
                0.13747036 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.05218836 = queryNorm
                0.40744454 = fieldWeight in 1260, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.109375 = fieldNorm(doc=1260)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Source
    Encyclopedia of library and information science. Vol.54, [=Suppl.17]
  5. Silvester, J.P.: Computer supported indexing : a history and evaluation of NASA's MAI system (1998) 0.04
    0.037281495 = product of:
      0.05592224 = sum of:
        0.027916465 = weight(_text_:of in 1302) [ClassicSimilarity], result of:
          0.027916465 = score(doc=1302,freq=4.0), product of:
            0.08160993 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.05218836 = queryNorm
            0.34207192 = fieldWeight in 1302, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.109375 = fieldNorm(doc=1302)
        0.028005775 = product of:
          0.05601155 = sum of:
            0.05601155 = weight(_text_:science in 1302) [ClassicSimilarity], result of:
              0.05601155 = score(doc=1302,freq=2.0), product of:
                0.13747036 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.05218836 = queryNorm
                0.40744454 = fieldWeight in 1302, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.109375 = fieldNorm(doc=1302)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Source
    Encyclopedia of library and information science. Vol.61, [=Suppl.24]
  6. Garfield, E.; Sher, I.H.: KeyWords Plus: algorithmic derivative indexing (1993) 0.04
    0.036377676 = product of:
      0.05456651 = sum of:
        0.02255991 = weight(_text_:of in 4341) [ClassicSimilarity], result of:
          0.02255991 = score(doc=4341,freq=2.0), product of:
            0.08160993 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.05218836 = queryNorm
            0.27643585 = fieldWeight in 4341, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.125 = fieldNorm(doc=4341)
        0.0320066 = product of:
          0.0640132 = sum of:
            0.0640132 = weight(_text_:science in 4341) [ClassicSimilarity], result of:
              0.0640132 = score(doc=4341,freq=2.0), product of:
                0.13747036 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.05218836 = queryNorm
                0.4656509 = fieldWeight in 4341, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.125 = fieldNorm(doc=4341)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Source
    Journal of the American Society for Information Science. 44(1993) no.5, S.298-299
  7. Garfield, E.; Sager, N.: Mechanical indexing, structural linguistics and information retrieval (1993) 0.04
    0.036377676 = product of:
      0.05456651 = sum of:
        0.02255991 = weight(_text_:of in 5900) [ClassicSimilarity], result of:
          0.02255991 = score(doc=5900,freq=2.0), product of:
            0.08160993 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.05218836 = queryNorm
            0.27643585 = fieldWeight in 5900, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.125 = fieldNorm(doc=5900)
        0.0320066 = product of:
          0.0640132 = sum of:
            0.0640132 = weight(_text_:science in 5900) [ClassicSimilarity], result of:
              0.0640132 = score(doc=5900,freq=2.0), product of:
                0.13747036 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.05218836 = queryNorm
                0.4656509 = fieldWeight in 5900, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.125 = fieldNorm(doc=5900)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Source
    Journal of information science. 19(1993) no.2, S.164-165
  8. Bordoni, L.; Pazienza, M.T.: Documents automatic indexing in an environmental domain (1997) 0.04
    0.035109516 = product of:
      0.052664272 = sum of:
        0.027916465 = weight(_text_:of in 530) [ClassicSimilarity], result of:
          0.027916465 = score(doc=530,freq=16.0), product of:
            0.08160993 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.05218836 = queryNorm
            0.34207192 = fieldWeight in 530, product of:
              4.0 = tf(freq=16.0), with freq of:
                16.0 = termFreq=16.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.0546875 = fieldNorm(doc=530)
        0.024747808 = product of:
          0.049495615 = sum of:
            0.049495615 = weight(_text_:22 in 530) [ClassicSimilarity], result of:
              0.049495615 = score(doc=530,freq=2.0), product of:
                0.18275474 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.05218836 = queryNorm
                0.2708308 = fieldWeight in 530, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=530)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Abstract
     Describes an application of Natural Language Processing (NLP) techniques, in HIRMA (Hypertextual Information Retrieval Managed by ARIOSTO), to the problem of document indexing, by reference to a system which incorporates NLP techniques to determine the subject of the text of documents and to associate them with relevant semantic indexes. Describes briefly the overall system, details of its implementation on a corpus of scientific abstracts related to environmental topics, and experimental evidence of the system's behaviour. Analyzes in detail an experiment designed to evaluate the system's retrieval ability in terms of recall and precision.
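     Recall and precision, the evaluation measures named at the end of the abstract, are computed from the sets of retrieved and relevant documents. A minimal sketch (the function name and the example sets are illustrative):

       def recall_precision(retrieved, relevant):
           # retrieved, relevant: sets of document identifiers for one query
           hits = len(retrieved & relevant)
           recall = hits / len(relevant) if relevant else 0.0
           precision = hits / len(retrieved) if retrieved else 0.0
           return recall, precision

       # Hypothetical query: 8 of 10 retrieved documents are relevant,
       # out of 20 relevant documents in the collection.
       print(recall_precision(set(range(10)), set(range(2, 22))))  # (0.4, 0.8)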
    Source
    International forum on information and documentation. 22(1997) no.1, S.17-28
  9. Kutschekmanesch, S.; Lutes, B.; Moelle, K.; Thiel, U.; Tzeras, K.: Automated multilingual indexing : a synthesis of rule-based and thesaurus-based methods (1998) 0.03
    0.032969303 = product of:
      0.049453955 = sum of:
        0.014099943 = weight(_text_:of in 4157) [ClassicSimilarity], result of:
          0.014099943 = score(doc=4157,freq=2.0), product of:
            0.08160993 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.05218836 = queryNorm
            0.17277241 = fieldWeight in 4157, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.078125 = fieldNorm(doc=4157)
        0.03535401 = product of:
          0.07070802 = sum of:
            0.07070802 = weight(_text_:22 in 4157) [ClassicSimilarity], result of:
              0.07070802 = score(doc=4157,freq=2.0), product of:
                0.18275474 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.05218836 = queryNorm
                0.38690117 = fieldWeight in 4157, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=4157)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Source
     Information und Märkte: 50. Deutscher Dokumentartag 1998, Kongreß der Deutschen Gesellschaft für Dokumentation e.V. (DGD), Rheinische Friedrich-Wilhelms-Universität Bonn, 22.-24. September 1998. Ed.: Marlies Ockenfeld and Gerhard J. Mantwill
  10. Tsareva, P.V.: Algoritmy dlya raspoznavaniya pozitivnykh i negativnykh vkhozdenii deskriptorov v tekst i protsedura avtomaticheskoi klassifikatsii tekstov (1999) 0.03
    0.032969303 = product of:
      0.049453955 = sum of:
        0.014099943 = weight(_text_:of in 374) [ClassicSimilarity], result of:
          0.014099943 = score(doc=374,freq=2.0), product of:
            0.08160993 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.05218836 = queryNorm
            0.17277241 = fieldWeight in 374, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.078125 = fieldNorm(doc=374)
        0.03535401 = product of:
          0.07070802 = sum of:
            0.07070802 = weight(_text_:22 in 374) [ClassicSimilarity], result of:
              0.07070802 = score(doc=374,freq=2.0), product of:
                0.18275474 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.05218836 = queryNorm
                0.38690117 = fieldWeight in 374, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=374)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Date
    1. 4.2002 10:22:41
    Footnote
     Translation of the title: Algorithms for selection of positive and negative descriptors from text and automated text indexing
  11. Tsujii, J.-I.: Automatic acquisition of semantic collocation from corpora (1995) 0.03
    0.031880446 = product of:
      0.047820665 = sum of:
        0.019537456 = weight(_text_:of in 4709) [ClassicSimilarity], result of:
          0.019537456 = score(doc=4709,freq=6.0), product of:
            0.08160993 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.05218836 = queryNorm
            0.23940048 = fieldWeight in 4709, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.0625 = fieldNorm(doc=4709)
        0.028283209 = product of:
          0.056566417 = sum of:
            0.056566417 = weight(_text_:22 in 4709) [ClassicSimilarity], result of:
              0.056566417 = score(doc=4709,freq=2.0), product of:
                0.18275474 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.05218836 = queryNorm
                0.30952093 = fieldWeight in 4709, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=4709)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Abstract
     Proposes automatic linguistic knowledge acquisition from sublanguage corpora. The system combines existing linguistic knowledge and human intervention with corpus-based techniques. The algorithm involves a gradual approximation which works to converge linguistic knowledge gradually towards desirable results. The first experiment revealed the characteristics of this algorithm, and the others proved its effectiveness on a real corpus.
    Date
    31. 7.1996 9:22:19
  12. Salton, G.; Allen, J.; Buckley, C.; Singhal, A.: Automatic analysis, theme generation, and summarization of machine-readable data (1994) 0.03
    0.031830467 = product of:
      0.047745697 = sum of:
        0.01973992 = weight(_text_:of in 1168) [ClassicSimilarity], result of:
          0.01973992 = score(doc=1168,freq=2.0), product of:
            0.08160993 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.05218836 = queryNorm
            0.24188137 = fieldWeight in 1168, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.109375 = fieldNorm(doc=1168)
        0.028005775 = product of:
          0.05601155 = sum of:
            0.05601155 = weight(_text_:science in 1168) [ClassicSimilarity], result of:
              0.05601155 = score(doc=1168,freq=2.0), product of:
                0.13747036 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.05218836 = queryNorm
                0.40744454 = fieldWeight in 1168, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.109375 = fieldNorm(doc=1168)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Source
    Science. 264(1994) no.5164, S.1421-1426
  13. Zeng, L.: Automatic indexing for Chinese text : problems and progress (1992) 0.03
    0.031830467 = product of:
      0.047745697 = sum of:
        0.01973992 = weight(_text_:of in 1289) [ClassicSimilarity], result of:
          0.01973992 = score(doc=1289,freq=2.0), product of:
            0.08160993 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.05218836 = queryNorm
            0.24188137 = fieldWeight in 1289, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.109375 = fieldNorm(doc=1289)
        0.028005775 = product of:
          0.05601155 = sum of:
            0.05601155 = weight(_text_:science in 1289) [ClassicSimilarity], result of:
              0.05601155 = score(doc=1289,freq=2.0), product of:
                0.13747036 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.05218836 = queryNorm
                0.40744454 = fieldWeight in 1289, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.109375 = fieldNorm(doc=1289)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Source
    Encyclopedia of library and information science. Vol.49, [=Suppl.12]
  14. Schuegraf, E.J.; Bommel, M.F.van: ¬An automatic document indexing system based on cooperating expert systems : design and development (1993) 0.03
    0.030564837 = product of:
      0.045847256 = sum of:
        0.029843956 = weight(_text_:of in 6504) [ClassicSimilarity], result of:
          0.029843956 = score(doc=6504,freq=14.0), product of:
            0.08160993 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.05218836 = queryNorm
            0.36569026 = fieldWeight in 6504, product of:
              3.7416575 = tf(freq=14.0), with freq of:
                14.0 = termFreq=14.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.0625 = fieldNorm(doc=6504)
        0.0160033 = product of:
          0.0320066 = sum of:
            0.0320066 = weight(_text_:science in 6504) [ClassicSimilarity], result of:
              0.0320066 = score(doc=6504,freq=2.0), product of:
                0.13747036 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.05218836 = queryNorm
                0.23282544 = fieldWeight in 6504, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.0625 = fieldNorm(doc=6504)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Abstract
     Discusses the design of an automatic indexing system based on two cooperating expert systems and the investigation related to its development. The design combines statistical and artificial intelligence techniques. Examines the choice of content indicators, the effect of stemming, and the identification of characteristic vocabularies for given subject areas. Presents experimental results. Discusses the application of machine learning algorithms to the identification of vocabularies.
    Source
    Canadian journal of information and library science. 18(1993) no.2, S.32-50
  15. Riloff, E.: An empirical study of automated dictionary construction for information extraction in three domains (1996) 0.03
    0.029490318 = product of:
      0.044235475 = sum of:
        0.015952265 = weight(_text_:of in 6752) [ClassicSimilarity], result of:
          0.015952265 = score(doc=6752,freq=4.0), product of:
            0.08160993 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.05218836 = queryNorm
            0.19546966 = fieldWeight in 6752, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.0625 = fieldNorm(doc=6752)
        0.028283209 = product of:
          0.056566417 = sum of:
            0.056566417 = weight(_text_:22 in 6752) [ClassicSimilarity], result of:
              0.056566417 = score(doc=6752,freq=2.0), product of:
                0.18275474 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.05218836 = queryNorm
                0.30952093 = fieldWeight in 6752, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=6752)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Abstract
     AutoSlog is a system that addresses the knowledge engineering bottleneck for information extraction. AutoSlog automatically creates domain-specific dictionaries for information extraction, given an appropriate training corpus. Describes experiments with AutoSlog in the terrorism, joint ventures, and microelectronics domains. Compares the performance of AutoSlog across the three domains, discusses the lessons learned, and presents results from two experiments which demonstrate that novice users can generate effective dictionaries using AutoSlog.
    Date
    6. 3.1997 16:22:15
  16. Garfield, E.: The relationship between mechanical indexing, structural linguistics and information retrieval (1992) 0.03
    0.029088955 = product of:
      0.04363343 = sum of:
        0.027630134 = weight(_text_:of in 3632) [ClassicSimilarity], result of:
          0.027630134 = score(doc=3632,freq=12.0), product of:
            0.08160993 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.05218836 = queryNorm
            0.33856338 = fieldWeight in 3632, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.0625 = fieldNorm(doc=3632)
        0.0160033 = product of:
          0.0320066 = sum of:
            0.0320066 = weight(_text_:science in 3632) [ClassicSimilarity], result of:
              0.0320066 = score(doc=3632,freq=2.0), product of:
                0.13747036 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.05218836 = queryNorm
                0.23282544 = fieldWeight in 3632, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.0625 = fieldNorm(doc=3632)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Abstract
     It is possible to locate over 60% of the indexing terms used in the Current List of Medical Literature by analysing the titles of the articles. Citation indexes contain 'noise' and lack many pertinent citations. Mechanical indexing or analysis of text must begin with some linguistic technique. Discusses Harris' methods of structural linguistics, discourse analysis and transformational analysis. Provides three examples with references, abstracts and index entries.
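     As a very rough illustration of locating indexing terms by analysing titles (this is a naive word-matching baseline, not Garfield's procedure; the vocabulary is invented and the example title is borrowed from entry 7 above):

       import string

       def terms_found_in_title(title, vocabulary):
           # Return controlled-vocabulary terms all of whose words occur in the title.
           words = {w.strip(string.punctuation) for w in title.lower().split()}
           return [t for t in vocabulary if set(t.lower().split()) <= words]

       vocabulary = ["information retrieval", "indexing", "structural linguistics"]
       title = "Mechanical indexing, structural linguistics and information retrieval"
       print(terms_found_in_title(title, vocabulary))  # all three terms are found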
    Source
    Journal of information science. 18(1992) no.5, S.343-354
  17. Kim, P.K.: An automatic indexing of compound words based on mutual information for Korean text retrieval (1995) 0.03
    0.029088955 = product of:
      0.04363343 = sum of:
        0.027630134 = weight(_text_:of in 620) [ClassicSimilarity], result of:
          0.027630134 = score(doc=620,freq=12.0), product of:
            0.08160993 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.05218836 = queryNorm
            0.33856338 = fieldWeight in 620, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.0625 = fieldNorm(doc=620)
        0.0160033 = product of:
          0.0320066 = sum of:
            0.0320066 = weight(_text_:science in 620) [ClassicSimilarity], result of:
              0.0320066 = score(doc=620,freq=2.0), product of:
                0.13747036 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.05218836 = queryNorm
                0.23282544 = fieldWeight in 620, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.0625 = fieldNorm(doc=620)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Abstract
     Presents an automatic indexing technique for compound words suitable for an agglutinative language, specifically Korean. Discusses some construction conditions for compound words and the rules for decomposing compound words to enhance the exhaustivity of indexing, demonstrating that this system, based on mutual information, enhances both the exhaustivity of indexing and the specificity of terms. Suggests that the construction conditions and rules for decomposition presented may be used in multilingual information retrieval systems to translate the indexing terms of the specific language into those of the language required.
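     The abstract does not give the exact formulation, but mutual information between the components of a candidate compound is conventionally estimated from corpus frequencies, as in the sketch below (the counts and names are invented for illustration):

       import math

       def pmi(freq_xy, freq_x, freq_y, n):
           # Pointwise mutual information of components x and y of a candidate
           # compound: log2( p(x,y) / (p(x) * p(y)) ), estimated from corpus counts.
           return math.log2((freq_xy / n) / ((freq_x / n) * (freq_y / n)))

       # Hypothetical counts: the compound occurs 50 times, its components
       # 400 and 300 times, in a corpus of 100,000 tokens.
       print(pmi(freq_xy=50, freq_x=400, freq_y=300, n=100_000))  # ~5.4 bits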
    Source
    Library and information science. 1995, no.34, S.29-38
  18. Bonzi, S.: Representation of concepts in text : a comparison of within-document frequency, anaphora, and synonymy (1991) 0.03
    0.02907518 = product of:
      0.043612767 = sum of:
        0.02960988 = weight(_text_:of in 4933) [ClassicSimilarity], result of:
          0.02960988 = score(doc=4933,freq=18.0), product of:
            0.08160993 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.05218836 = queryNorm
            0.36282203 = fieldWeight in 4933, product of:
              4.2426405 = tf(freq=18.0), with freq of:
                18.0 = termFreq=18.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.0546875 = fieldNorm(doc=4933)
        0.0140028875 = product of:
          0.028005775 = sum of:
            0.028005775 = weight(_text_:science in 4933) [ClassicSimilarity], result of:
              0.028005775 = score(doc=4933,freq=2.0), product of:
                0.13747036 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.05218836 = queryNorm
                0.20372227 = fieldWeight in 4933, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=4933)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Abstract
     Investigates the three major ways by which a concept may be represented in text: within-document frequency, anaphoric reference, and synonyms, in order to determine which provides the optimal means of representation. Analyses a sample of 60 abstracts, drawn at random from the abstracting journals of four disciplines. Results show that, in general, initial within-document frequency is higher for keyword terms. Additionally, the frequency of keyword terms referenced anaphorically or with intellectually related terms is higher than that of other keyword terms. It appears that initial document length influences both the number and the impact of anaphoric resolutions and intellectually related terms.
    Source
    Canadian journal of information science. 16(1991) no.3, S.21-31
  19. Bookstein, A.; Klein, S.T.; Raita, T.: Clumping properties of content-bearing words (1998) 0.03
    0.02907518 = product of:
      0.043612767 = sum of:
        0.02960988 = weight(_text_:of in 442) [ClassicSimilarity], result of:
          0.02960988 = score(doc=442,freq=18.0), product of:
            0.08160993 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.05218836 = queryNorm
            0.36282203 = fieldWeight in 442, product of:
              4.2426405 = tf(freq=18.0), with freq of:
                18.0 = termFreq=18.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.0546875 = fieldNorm(doc=442)
        0.0140028875 = product of:
          0.028005775 = sum of:
            0.028005775 = weight(_text_:science in 442) [ClassicSimilarity], result of:
              0.028005775 = score(doc=442,freq=2.0), product of:
                0.13747036 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.05218836 = queryNorm
                0.20372227 = fieldWeight in 442, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=442)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Abstract
     Information retrieval systems identify content-bearing words, and possibly also assign weights, as part of the process of formulating requests. For optimal retrieval efficiency, it is desirable that this be done automatically. This article defines the notion of serial clustering of words in text, and explores the value of such clustering as an indicator of a word's content-bearing status. This approach is flexible in the sense that it is sensitive to context: a term may be assessed as content-bearing within one collection, but not another. Our approach, being numerical, may also be of value in assigning weights to terms in requests. Experimental support is obtained from natural-text databases in three different languages.
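     One simple way to quantify the serial clumping the abstract refers to is to compare the dispersion of a word's occurrences across fixed-size text blocks with the even spread expected of a non-content-bearing word. The variance-to-mean ratio below is only a stand-in for the authors' measure, which is not reproduced here; the toy text and names are invented:

       def clumping_index(tokens, word, block_size=100):
           # Variance-to-mean ratio of the word's counts over fixed-size blocks.
           # Values well above 1 suggest the word clumps (content-bearing);
           # values near or below 1 suggest it is spread evenly.
           blocks = [tokens[i:i + block_size] for i in range(0, len(tokens), block_size)]
           counts = [blk.count(word) for blk in blocks]
           mean = sum(counts) / len(counts)
           if mean == 0:
               return 0.0
           var = sum((c - mean) ** 2 for c in counts) / len(counts)
           return var / mean

       filler = "the indexing of text and the retrieval of text".split()
       burst = ["clumping"] * 20
       tokens = filler * 30 + burst + filler * 30
       print(clumping_index(tokens, "the"))       # low: spread evenly through the text
       print(clumping_index(tokens, "clumping"))  # high: all occurrences fall in one block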
    Source
    Journal of the American Society for Information Science. 49(1998) no.2, S.102-114
  20. Haas, S.; He, S.: Toward the automatic identification of sublanguage vocabulary (1993) 0.03
    0.028113028 = product of:
      0.04216954 = sum of:
        0.019537456 = weight(_text_:of in 4891) [ClassicSimilarity], result of:
          0.019537456 = score(doc=4891,freq=6.0), product of:
            0.08160993 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.05218836 = queryNorm
            0.23940048 = fieldWeight in 4891, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.0625 = fieldNorm(doc=4891)
        0.022632083 = product of:
          0.045264166 = sum of:
            0.045264166 = weight(_text_:science in 4891) [ClassicSimilarity], result of:
              0.045264166 = score(doc=4891,freq=4.0), product of:
                0.13747036 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.05218836 = queryNorm
                0.3292649 = fieldWeight in 4891, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.0625 = fieldNorm(doc=4891)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Abstract
     Describes a method developed for the automatic identification of sublanguage vocabulary words as they occur in abstracts. Describes the sublanguage vocabulary identification procedures using abstracts from computer science and library and information science as sublanguage sources. Evaluates the results using three criteria. Discusses the practical and theoretical significance of this research and plans for further experiments.

Types

  • a 97
  • el 1
  • s 1