Document (#3093)

Editor
Knowledge-based systems development
Author
Milstead, J.L.
Title
Methodologies for subject analysis in bibliographic databases
Source
IEEE proceedings. Pt.A. 28(1992) no.3, S.407-432
Year
1992
Abstract
Report on a subject analysis review undertaken to aid managers of databases in determining if new and little-known capabilities would improve the cost-effectiveness of subject analysis operations. Operational machine-aided and automatic indexing systems were found to form a continuum. Commercial automatic indexing packages were also reviewed. The primary obstacle to development of automatic indexing is the lack of machine understanding of natural language. Recommendations for action include: increasing the power of the indexer interface, studying indexing policies, enrichment of thesauri, and considering the development of machine-aided indexing

Similar documents (author)

  1. Milstead, J.L.: Database design : Indexing applications (1989) 5.38
    5.384371 = sum of:
      5.384371 = weight(author_txt:milstead in 867) [ClassicSimilarity], result of:
        5.384371 = fieldWeight in 867, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.614993 = idf(docFreq=20, maxDocs=42596)
          0.625 = fieldNorm(doc=867)
    
  2. Milstead, J.L.: Specifications for thesaurus software (1991) 5.38
    5.384371 = sum of:
      5.384371 = weight(author_txt:milstead in 2291) [ClassicSimilarity], result of:
        5.384371 = fieldWeight in 2291, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.614993 = idf(docFreq=20, maxDocs=42596)
          0.625 = fieldNorm(doc=2291)
    
  3. Milstead, J.L.: Methodologies for subject analysis in bibliographic databases (1992) 5.38
    5.384371 = sum of:
      5.384371 = weight(author_txt:milstead in 2311) [ClassicSimilarity], result of:
        5.384371 = fieldWeight in 2311, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.614993 = idf(docFreq=20, maxDocs=42596)
          0.625 = fieldNorm(doc=2311)
    
  4. Milstead, J.L.: Natural versus inverted word order in subject headings (1980) 5.38
    5.384371 = sum of:
      5.384371 = weight(author_txt:milstead in 2867) [ClassicSimilarity], result of:
        5.384371 = fieldWeight in 2867, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.614993 = idf(docFreq=20, maxDocs=42596)
          0.625 = fieldNorm(doc=2867)
    
  5. Milstead, J.L.: Thesaurus software packages (1990) 5.38
    5.384371 = sum of:
      5.384371 = weight(author_txt:milstead in 4868) [ClassicSimilarity], result of:
        5.384371 = fieldWeight in 4868, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.614993 = idf(docFreq=20, maxDocs=42596)
          0.625 = fieldNorm(doc=4868)
    

Similar documents (content)

  1. Milstead, J.L.: Methodologies for subject analysis in bibliographic databases (1992) 0.43
    0.42761707 = sum of:
      0.42761707 = product of:
        1.0690427 = sum of:
          0.07639486 = weight(abstract_txt:reviewed in 2311) [ClassicSimilarity], result of:
            0.07639486 = score(doc=2311,freq=1.0), product of:
              0.13275842 = queryWeight, product of:
                1.0038788 = boost
                6.138055 = idf(docFreq=249, maxDocs=42596)
                0.021545175 = queryNorm
              0.5754427 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.138055 = idf(docFreq=249, maxDocs=42596)
                0.09375 = fieldNorm(doc=2311)
          0.08290414 = weight(abstract_txt:policies in 2311) [ClassicSimilarity], result of:
            0.08290414 = score(doc=2311,freq=1.0), product of:
              0.14019638 = queryWeight, product of:
                1.0316174 = boost
                6.3076577 = idf(docFreq=210, maxDocs=42596)
                0.021545175 = queryNorm
              0.5913429 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3076577 = idf(docFreq=210, maxDocs=42596)
                0.09375 = fieldNorm(doc=2311)
          0.111770615 = weight(abstract_txt:packages in 2311) [ClassicSimilarity], result of:
            0.111770615 = score(doc=2311,freq=1.0), product of:
              0.17109518 = queryWeight, product of:
                1.1396438 = boost
                6.968168 = idf(docFreq=108, maxDocs=42596)
                0.021545175 = queryNorm
              0.6532657 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.968168 = idf(docFreq=108, maxDocs=42596)
                0.09375 = fieldNorm(doc=2311)
          0.03330289 = weight(abstract_txt:were in 2311) [ClassicSimilarity], result of:
            0.03330289 = score(doc=2311,freq=1.0), product of:
              0.09616505 = queryWeight, product of:
                1.2082975 = boost
                3.69397 = idf(docFreq=2879, maxDocs=42596)
                0.021545175 = queryNorm
              0.3463097 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.69397 = idf(docFreq=2879, maxDocs=42596)
                0.09375 = fieldNorm(doc=2311)
          0.037930824 = weight(abstract_txt:development in 2311) [ClassicSimilarity], result of:
            0.037930824 = score(doc=2311,freq=1.0), product of:
              0.104879566 = queryWeight, product of:
                1.2618586 = boost
                3.8577151 = idf(docFreq=2444, maxDocs=42596)
                0.021545175 = queryNorm
              0.36166078 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8577151 = idf(docFreq=2444, maxDocs=42596)
                0.09375 = fieldNorm(doc=2311)
          0.04977184 = weight(abstract_txt:analysis in 2311) [ClassicSimilarity], result of:
            0.04977184 = score(doc=2311,freq=1.0), product of:
              0.14389606 = queryWeight, product of:
                1.8102365 = boost
                3.6894662 = idf(docFreq=2892, maxDocs=42596)
                0.021545175 = queryNorm
              0.34588745 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6894662 = idf(docFreq=2892, maxDocs=42596)
                0.09375 = fieldNorm(doc=2311)
          0.059152104 = weight(abstract_txt:subject in 2311) [ClassicSimilarity], result of:
            0.059152104 = score(doc=2311,freq=1.0), product of:
              0.16145068 = queryWeight, product of:
                1.9174799 = boost
                3.9080403 = idf(docFreq=2324, maxDocs=42596)
                0.021545175 = queryNorm
              0.36637878 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9080403 = idf(docFreq=2324, maxDocs=42596)
                0.09375 = fieldNorm(doc=2311)
          0.196847 = weight(abstract_txt:automatic in 2311) [ClassicSimilarity], result of:
            0.196847 = score(doc=2311,freq=2.0), product of:
              0.285629 = queryWeight, product of:
                2.550422 = boost
                5.1980476 = idf(docFreq=639, maxDocs=42596)
                0.021545175 = queryNorm
              0.68917024 = fieldWeight in 2311, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1980476 = idf(docFreq=639, maxDocs=42596)
                0.09375 = fieldNorm(doc=2311)
          0.15110902 = weight(abstract_txt:machine in 2311) [ClassicSimilarity], result of:
            0.15110902 = score(doc=2311,freq=1.0), product of:
              0.3017079 = queryWeight, product of:
                2.6212246 = boost
                5.342351 = idf(docFreq=553, maxDocs=42596)
                0.021545175 = queryNorm
              0.50084543 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.342351 = idf(docFreq=553, maxDocs=42596)
                0.09375 = fieldNorm(doc=2311)
          0.2698593 = weight(abstract_txt:indexing in 2311) [ClassicSimilarity], result of:
            0.2698593 = score(doc=2311,freq=4.0), product of:
              0.33170164 = queryWeight, product of:
                3.548208 = boost
                4.338989 = idf(docFreq=1510, maxDocs=42596)
                0.021545175 = queryNorm
              0.81356037 = fieldWeight in 2311, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.338989 = idf(docFreq=1510, maxDocs=42596)
                0.09375 = fieldNorm(doc=2311)
        0.4 = coord(10/25)
    
  2. White, H.; Willis, C.; Greenberg, J.: HIVEing : the effect of a semantic web technology on inter-indexer consistency (2014) 0.24
    0.23640057 = sum of:
      0.23640057 = product of:
        0.84428775 = sum of:
          0.061145708 = weight(abstract_txt:studying in 2782) [ClassicSimilarity], result of:
            0.061145708 = score(doc=2782,freq=1.0), product of:
              0.14996532 = queryWeight, product of:
                1.0669539 = boost
                6.5237174 = idf(docFreq=169, maxDocs=42596)
                0.021545175 = queryNorm
              0.40773234 = fieldWeight in 2782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5237174 = idf(docFreq=169, maxDocs=42596)
                0.0625 = fieldNorm(doc=2782)
          0.13061854 = weight(abstract_txt:indexer in 2782) [ClassicSimilarity], result of:
            0.13061854 = score(doc=2782,freq=3.0), product of:
              0.17246845 = queryWeight, product of:
                1.1442083 = boost
                6.9960766 = idf(docFreq=105, maxDocs=42596)
                0.021545175 = queryNorm
              0.75734746 = fieldWeight in 2782, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.9960766 = idf(docFreq=105, maxDocs=42596)
                0.0625 = fieldNorm(doc=2782)
          0.031398267 = weight(abstract_txt:were in 2782) [ClassicSimilarity], result of:
            0.031398267 = score(doc=2782,freq=2.0), product of:
              0.09616505 = queryWeight, product of:
                1.2082975 = boost
                3.69397 = idf(docFreq=2879, maxDocs=42596)
                0.021545175 = queryNorm
              0.3265039 = fieldWeight in 2782, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.69397 = idf(docFreq=2879, maxDocs=42596)
                0.0625 = fieldNorm(doc=2782)
          0.033181228 = weight(abstract_txt:analysis in 2782) [ClassicSimilarity], result of:
            0.033181228 = score(doc=2782,freq=1.0), product of:
              0.14389606 = queryWeight, product of:
                1.8102365 = boost
                3.6894662 = idf(docFreq=2892, maxDocs=42596)
                0.021545175 = queryNorm
              0.23059164 = fieldWeight in 2782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6894662 = idf(docFreq=2892, maxDocs=42596)
                0.0625 = fieldNorm(doc=2782)
          0.28967372 = weight(abstract_txt:aided in 2782) [ClassicSimilarity], result of:
            0.28967372 = score(doc=2782,freq=2.0), product of:
              0.42301223 = queryWeight, product of:
                2.5342047 = boost
                7.747493 = idf(docFreq=49, maxDocs=42596)
                0.021545175 = queryNorm
              0.68478805 = fieldWeight in 2782, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.747493 = idf(docFreq=49, maxDocs=42596)
                0.0625 = fieldNorm(doc=2782)
          0.14246693 = weight(abstract_txt:machine in 2782) [ClassicSimilarity], result of:
            0.14246693 = score(doc=2782,freq=2.0), product of:
              0.3017079 = queryWeight, product of:
                2.6212246 = boost
                5.342351 = idf(docFreq=553, maxDocs=42596)
                0.021545175 = queryNorm
              0.47220156 = fieldWeight in 2782, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.342351 = idf(docFreq=553, maxDocs=42596)
                0.0625 = fieldNorm(doc=2782)
          0.15580335 = weight(abstract_txt:indexing in 2782) [ClassicSimilarity], result of:
            0.15580335 = score(doc=2782,freq=3.0), product of:
              0.33170164 = queryWeight, product of:
                3.548208 = boost
                4.338989 = idf(docFreq=1510, maxDocs=42596)
                0.021545175 = queryNorm
              0.4697093 = fieldWeight in 2782, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.338989 = idf(docFreq=1510, maxDocs=42596)
                0.0625 = fieldNorm(doc=2782)
        0.28 = coord(7/25)
    
  3. Greenrich, E.: CD-ROM data preparation enhancements (1993) 0.23
    0.22515011 = sum of:
      0.22515011 = product of:
        1.1257505 = sum of:
          0.07459637 = weight(abstract_txt:databases in 7843) [ClassicSimilarity], result of:
            0.07459637 = score(doc=7843,freq=1.0), product of:
              0.1358987 = queryWeight, product of:
                1.4363917 = boost
                4.3912926 = idf(docFreq=1433, maxDocs=42596)
                0.021545175 = queryNorm
              0.5489116 = fieldWeight in 7843, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3912926 = idf(docFreq=1433, maxDocs=42596)
                0.125 = fieldNorm(doc=7843)
          0.40966052 = weight(abstract_txt:aided in 7843) [ClassicSimilarity], result of:
            0.40966052 = score(doc=7843,freq=1.0), product of:
              0.42301223 = queryWeight, product of:
                2.5342047 = boost
                7.747493 = idf(docFreq=49, maxDocs=42596)
                0.021545175 = queryNorm
              0.9684366 = fieldWeight in 7843, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.747493 = idf(docFreq=49, maxDocs=42596)
                0.125 = fieldNorm(doc=7843)
          0.18558915 = weight(abstract_txt:automatic in 7843) [ClassicSimilarity], result of:
            0.18558915 = score(doc=7843,freq=1.0), product of:
              0.285629 = queryWeight, product of:
                2.550422 = boost
                5.1980476 = idf(docFreq=639, maxDocs=42596)
                0.021545175 = queryNorm
              0.64975595 = fieldWeight in 7843, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1980476 = idf(docFreq=639, maxDocs=42596)
                0.125 = fieldNorm(doc=7843)
          0.20147868 = weight(abstract_txt:machine in 7843) [ClassicSimilarity], result of:
            0.20147868 = score(doc=7843,freq=1.0), product of:
              0.3017079 = queryWeight, product of:
                2.6212246 = boost
                5.342351 = idf(docFreq=553, maxDocs=42596)
                0.021545175 = queryNorm
              0.66779387 = fieldWeight in 7843, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.342351 = idf(docFreq=553, maxDocs=42596)
                0.125 = fieldNorm(doc=7843)
          0.2544258 = weight(abstract_txt:indexing in 7843) [ClassicSimilarity], result of:
            0.2544258 = score(doc=7843,freq=2.0), product of:
              0.33170164 = queryWeight, product of:
                3.548208 = boost
                4.338989 = idf(docFreq=1510, maxDocs=42596)
                0.021545175 = queryNorm
              0.7670321 = fieldWeight in 7843, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.338989 = idf(docFreq=1510, maxDocs=42596)
                0.125 = fieldNorm(doc=7843)
        0.2 = coord(5/25)
    
  4. Lancaster, F.W.: Trends in subject indexing from 1957 to 2000 (1980) 0.15
    0.14848882 = sum of:
      0.14848882 = product of:
        0.9280551 = sum of:
          0.30724537 = weight(abstract_txt:aided in 209) [ClassicSimilarity], result of:
            0.30724537 = score(doc=209,freq=1.0), product of:
              0.42301223 = queryWeight, product of:
                2.5342047 = boost
                7.747493 = idf(docFreq=49, maxDocs=42596)
                0.021545175 = queryNorm
              0.7263274 = fieldWeight in 209, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.747493 = idf(docFreq=49, maxDocs=42596)
                0.09375 = fieldNorm(doc=209)
          0.13919187 = weight(abstract_txt:automatic in 209) [ClassicSimilarity], result of:
            0.13919187 = score(doc=209,freq=1.0), product of:
              0.285629 = queryWeight, product of:
                2.550422 = boost
                5.1980476 = idf(docFreq=639, maxDocs=42596)
                0.021545175 = queryNorm
              0.48731697 = fieldWeight in 209, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1980476 = idf(docFreq=639, maxDocs=42596)
                0.09375 = fieldNorm(doc=209)
          0.15110902 = weight(abstract_txt:machine in 209) [ClassicSimilarity], result of:
            0.15110902 = score(doc=209,freq=1.0), product of:
              0.3017079 = queryWeight, product of:
                2.6212246 = boost
                5.342351 = idf(docFreq=553, maxDocs=42596)
                0.021545175 = queryNorm
              0.50084543 = fieldWeight in 209, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.342351 = idf(docFreq=553, maxDocs=42596)
                0.09375 = fieldNorm(doc=209)
          0.33050883 = weight(abstract_txt:indexing in 209) [ClassicSimilarity], result of:
            0.33050883 = score(doc=209,freq=6.0), product of:
              0.33170164 = queryWeight, product of:
                3.548208 = boost
                4.338989 = idf(docFreq=1510, maxDocs=42596)
                0.021545175 = queryNorm
              0.996404 = fieldWeight in 209, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.338989 = idf(docFreq=1510, maxDocs=42596)
                0.09375 = fieldNorm(doc=209)
        0.16 = coord(4/25)
    
  5. Milstead, J.L.: Thesauri in a full-text world (1998) 0.15
    0.14768809 = sum of:
      0.14768809 = product of:
        0.73844045 = sum of:
          0.046622727 = weight(abstract_txt:databases in 3338) [ClassicSimilarity], result of:
            0.046622727 = score(doc=3338,freq=1.0), product of:
              0.1358987 = queryWeight, product of:
                1.4363917 = boost
                4.3912926 = idf(docFreq=1433, maxDocs=42596)
                0.021545175 = queryNorm
              0.34306973 = fieldWeight in 3338, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3912926 = idf(docFreq=1433, maxDocs=42596)
                0.078125 = fieldNorm(doc=3338)
          0.05865668 = weight(abstract_txt:analysis in 3338) [ClassicSimilarity], result of:
            0.05865668 = score(doc=3338,freq=2.0), product of:
              0.14389606 = queryWeight, product of:
                1.8102365 = boost
                3.6894662 = idf(docFreq=2892, maxDocs=42596)
                0.021545175 = queryNorm
              0.4076323 = fieldWeight in 3338, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6894662 = idf(docFreq=2892, maxDocs=42596)
                0.078125 = fieldNorm(doc=3338)
          0.25603783 = weight(abstract_txt:aided in 3338) [ClassicSimilarity], result of:
            0.25603783 = score(doc=3338,freq=1.0), product of:
              0.42301223 = queryWeight, product of:
                2.5342047 = boost
                7.747493 = idf(docFreq=49, maxDocs=42596)
                0.021545175 = queryNorm
              0.6052729 = fieldWeight in 3338, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.747493 = idf(docFreq=49, maxDocs=42596)
                0.078125 = fieldNorm(doc=3338)
          0.21810707 = weight(abstract_txt:machine in 3338) [ClassicSimilarity], result of:
            0.21810707 = score(doc=3338,freq=3.0), product of:
              0.3017079 = queryWeight, product of:
                2.6212246 = boost
                5.342351 = idf(docFreq=553, maxDocs=42596)
                0.021545175 = queryNorm
              0.7229081 = fieldWeight in 3338, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.342351 = idf(docFreq=553, maxDocs=42596)
                0.078125 = fieldNorm(doc=3338)
          0.15901613 = weight(abstract_txt:indexing in 3338) [ClassicSimilarity], result of:
            0.15901613 = score(doc=3338,freq=2.0), product of:
              0.33170164 = queryWeight, product of:
                3.548208 = boost
                4.338989 = idf(docFreq=1510, maxDocs=42596)
                0.021545175 = queryNorm
              0.47939506 = fieldWeight in 3338, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.338989 = idf(docFreq=1510, maxDocs=42596)
                0.078125 = fieldNorm(doc=3338)
        0.2 = coord(5/25)