Document (#3093)

Editor
Knowledge-based systems development
Author
Milstead, J.L.
Title
Methodologies for subject analysis in bibliographic databases
Source
IEEE proceedings. Pt.A. 28(1992) no.3, S.407-432
Year
1992
Abstract
Report on a subject analysis review undertaken to aid managers of databases in determining if new and little-known capabilities would improve the cost-effectiveness of subject analysis operations. Operational machine-aided and automatic indexing systems were found to form a continuum. Commercial automatic indexing packages were also reviewed. The primary obstacle to development of automatic indexing is the lack of machine understanding of natural language. Recommendations for action include: increasing the power of the indexer interface, studying indexing policies, enrichment of thesauri, and considering the development of machine-aided indexing

Similar documents (author)

  1. Milstead, J.L.: Database design : Indexing applications (1989) 5.39
    5.3864803 = sum of:
      5.3864803 = weight(author_txt:milstead in 867) [ClassicSimilarity], result of:
        5.3864803 = fieldWeight in 867, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.618368 = idf(docFreq=20, maxDocs=42740)
          0.625 = fieldNorm(doc=867)
    
  2. Milstead, J.L.: Specifications for thesaurus software (1991) 5.39
    5.3864803 = sum of:
      5.3864803 = weight(author_txt:milstead in 2291) [ClassicSimilarity], result of:
        5.3864803 = fieldWeight in 2291, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.618368 = idf(docFreq=20, maxDocs=42740)
          0.625 = fieldNorm(doc=2291)
    
  3. Milstead, J.L.: Methodologies for subject analysis in bibliographic databases (1992) 5.39
    5.3864803 = sum of:
      5.3864803 = weight(author_txt:milstead in 2311) [ClassicSimilarity], result of:
        5.3864803 = fieldWeight in 2311, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.618368 = idf(docFreq=20, maxDocs=42740)
          0.625 = fieldNorm(doc=2311)
    
  4. Milstead, J.L.: Natural versus inverted word order in subject headings (1980) 5.39
    5.3864803 = sum of:
      5.3864803 = weight(author_txt:milstead in 2867) [ClassicSimilarity], result of:
        5.3864803 = fieldWeight in 2867, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.618368 = idf(docFreq=20, maxDocs=42740)
          0.625 = fieldNorm(doc=2867)
    
  5. Milstead, J.L.: Thesaurus software packages (1990) 5.39
    5.3864803 = sum of:
      5.3864803 = weight(author_txt:milstead in 4868) [ClassicSimilarity], result of:
        5.3864803 = fieldWeight in 4868, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.618368 = idf(docFreq=20, maxDocs=42740)
          0.625 = fieldNorm(doc=4868)
    

Similar documents (content)

  1. Milstead, J.L.: Methodologies for subject analysis in bibliographic databases (1992) 0.43
    0.42777902 = sum of:
      0.42777902 = product of:
        1.0694475 = sum of:
          0.076535895 = weight(abstract_txt:reviewed in 2311) [ClassicSimilarity], result of:
            0.076535895 = score(doc=2311,freq=1.0), product of:
              0.13293044 = queryWeight, product of:
                1.0038767 = boost
                6.1414294 = idf(docFreq=249, maxDocs=42740)
                0.021561284 = queryNorm
              0.575759 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1414294 = idf(docFreq=249, maxDocs=42740)
                0.09375 = fieldNorm(doc=2311)
          0.082681626 = weight(abstract_txt:policies in 2311) [ClassicSimilarity], result of:
            0.082681626 = score(doc=2311,freq=1.0), product of:
              0.13995454 = queryWeight, product of:
                1.0300579 = boost
                6.3015985 = idf(docFreq=212, maxDocs=42740)
                0.021561284 = queryNorm
              0.5907749 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3015985 = idf(docFreq=212, maxDocs=42740)
                0.09375 = fieldNorm(doc=2311)
          0.11195501 = weight(abstract_txt:packages in 2311) [ClassicSimilarity], result of:
            0.11195501 = score(doc=2311,freq=1.0), product of:
              0.17129447 = queryWeight, product of:
                1.1395669 = boost
                6.971543 = idf(docFreq=108, maxDocs=42740)
                0.021561284 = queryNorm
              0.65358216 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.971543 = idf(docFreq=108, maxDocs=42740)
                0.09375 = fieldNorm(doc=2311)
          0.033260196 = weight(abstract_txt:were in 2311) [ClassicSimilarity], result of:
            0.033260196 = score(doc=2311,freq=1.0), product of:
              0.09608912 = queryWeight, product of:
                1.207036 = boost
                3.69215 = idf(docFreq=2894, maxDocs=42740)
                0.021561284 = queryNorm
              0.34613907 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.69215 = idf(docFreq=2894, maxDocs=42740)
                0.09375 = fieldNorm(doc=2311)
          0.03794144 = weight(abstract_txt:development in 2311) [ClassicSimilarity], result of:
            0.03794144 = score(doc=2311,freq=1.0), product of:
              0.10490597 = queryWeight, product of:
                1.2611979 = boost
                3.8578234 = idf(docFreq=2452, maxDocs=42740)
                0.021561284 = queryNorm
              0.36167094 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8578234 = idf(docFreq=2452, maxDocs=42740)
                0.09375 = fieldNorm(doc=2311)
          0.04973672 = weight(abstract_txt:analysis in 2311) [ClassicSimilarity], result of:
            0.04973672 = score(doc=2311,freq=1.0), product of:
              0.14383774 = queryWeight, product of:
                1.8086944 = boost
                3.6883576 = idf(docFreq=2905, maxDocs=42740)
                0.021561284 = queryNorm
              0.34578353 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6883576 = idf(docFreq=2905, maxDocs=42740)
                0.09375 = fieldNorm(doc=2311)
          0.05925844 = weight(abstract_txt:subject in 2311) [ClassicSimilarity], result of:
            0.05925844 = score(doc=2311,freq=1.0), product of:
              0.16165465 = queryWeight, product of:
                1.9174448 = boost
                3.9101257 = idf(docFreq=2327, maxDocs=42740)
                0.021561284 = queryNorm
              0.3665743 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9101257 = idf(docFreq=2327, maxDocs=42740)
                0.09375 = fieldNorm(doc=2311)
          0.19709167 = weight(abstract_txt:automatic in 2311) [ClassicSimilarity], result of:
            0.19709167 = score(doc=2311,freq=2.0), product of:
              0.28588426 = queryWeight, product of:
                2.5499043 = boost
                5.199861 = idf(docFreq=640, maxDocs=42740)
                0.021561284 = queryNorm
              0.6894107 = fieldWeight in 2311, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.199861 = idf(docFreq=640, maxDocs=42740)
                0.09375 = fieldNorm(doc=2311)
          0.15081464 = weight(abstract_txt:machine in 2311) [ClassicSimilarity], result of:
            0.15081464 = score(doc=2311,freq=1.0), product of:
              0.30133557 = queryWeight, product of:
                2.6179056 = boost
                5.3385315 = idf(docFreq=557, maxDocs=42740)
                0.021561284 = queryNorm
              0.5004873 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3385315 = idf(docFreq=557, maxDocs=42740)
                0.09375 = fieldNorm(doc=2311)
          0.27017185 = weight(abstract_txt:indexing in 2311) [ClassicSimilarity], result of:
            0.27017185 = score(doc=2311,freq=4.0), product of:
              0.33197933 = queryWeight, product of:
                3.5473878 = boost
                4.34038 = idf(docFreq=1513, maxDocs=42740)
                0.021561284 = queryNorm
              0.8138213 = fieldWeight in 2311, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.34038 = idf(docFreq=1513, maxDocs=42740)
                0.09375 = fieldNorm(doc=2311)
        0.4 = coord(10/25)
    
  2. White, H.; Willis, C.; Greenberg, J.: HIVEing : the effect of a semantic web technology on inter-indexer consistency (2014) 0.24
    0.23652135 = sum of:
      0.23652135 = product of:
        0.8447191 = sum of:
          0.06108765 = weight(abstract_txt:studying in 3782) [ClassicSimilarity], result of:
            0.06108765 = score(doc=3782,freq=1.0), product of:
              0.14988014 = queryWeight, product of:
                1.0659583 = boost
                6.521227 = idf(docFreq=170, maxDocs=42740)
                0.021561284 = queryNorm
              0.40757668 = fieldWeight in 3782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.521227 = idf(docFreq=170, maxDocs=42740)
                0.0625 = fieldNorm(doc=3782)
          0.13083327 = weight(abstract_txt:indexer in 3782) [ClassicSimilarity], result of:
            0.13083327 = score(doc=3782,freq=3.0), product of:
              0.17266867 = queryWeight, product of:
                1.1441288 = boost
                6.9994516 = idf(docFreq=105, maxDocs=42740)
                0.021561284 = queryNorm
              0.75771284 = fieldWeight in 3782, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.9994516 = idf(docFreq=105, maxDocs=42740)
                0.0625 = fieldNorm(doc=3782)
          0.031358015 = weight(abstract_txt:were in 3782) [ClassicSimilarity], result of:
            0.031358015 = score(doc=3782,freq=2.0), product of:
              0.09608912 = queryWeight, product of:
                1.207036 = boost
                3.69215 = idf(docFreq=2894, maxDocs=42740)
                0.021561284 = queryNorm
              0.32634303 = fieldWeight in 3782, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.69215 = idf(docFreq=2894, maxDocs=42740)
                0.0625 = fieldNorm(doc=3782)
          0.033157814 = weight(abstract_txt:analysis in 3782) [ClassicSimilarity], result of:
            0.033157814 = score(doc=3782,freq=1.0), product of:
              0.14383774 = queryWeight, product of:
                1.8086944 = boost
                3.6883576 = idf(docFreq=2905, maxDocs=42740)
                0.021561284 = queryNorm
              0.23052235 = fieldWeight in 3782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6883576 = idf(docFreq=2905, maxDocs=42740)
                0.0625 = fieldNorm(doc=3782)
          0.29010916 = weight(abstract_txt:aided in 3782) [ClassicSimilarity], result of:
            0.29010916 = score(doc=3782,freq=2.0), product of:
              0.42346364 = queryWeight, product of:
                2.53391 = boost
                7.7508674 = idf(docFreq=49, maxDocs=42740)
                0.021561284 = queryNorm
              0.68508637 = fieldWeight in 3782, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.7508674 = idf(docFreq=49, maxDocs=42740)
                0.0625 = fieldNorm(doc=3782)
          0.1421894 = weight(abstract_txt:machine in 3782) [ClassicSimilarity], result of:
            0.1421894 = score(doc=3782,freq=2.0), product of:
              0.30133557 = queryWeight, product of:
                2.6179056 = boost
                5.3385315 = idf(docFreq=557, maxDocs=42740)
                0.021561284 = queryNorm
              0.47186396 = fieldWeight in 3782, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.3385315 = idf(docFreq=557, maxDocs=42740)
                0.0625 = fieldNorm(doc=3782)
          0.15598379 = weight(abstract_txt:indexing in 3782) [ClassicSimilarity], result of:
            0.15598379 = score(doc=3782,freq=3.0), product of:
              0.33197933 = queryWeight, product of:
                3.5473878 = boost
                4.34038 = idf(docFreq=1513, maxDocs=42740)
                0.021561284 = queryNorm
              0.46985993 = fieldWeight in 3782, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.34038 = idf(docFreq=1513, maxDocs=42740)
                0.0625 = fieldNorm(doc=3782)
        0.28 = coord(7/25)
    
  3. Greenrich, E.: CD-ROM data preparation enhancements (1993) 0.23
    0.22533007 = sum of:
      0.22533007 = product of:
        1.1266503 = sum of:
          0.07474755 = weight(abstract_txt:databases in 7843) [ClassicSimilarity], result of:
            0.07474755 = score(doc=7843,freq=1.0), product of:
              0.13609113 = queryWeight, product of:
                1.4364749 = boost
                4.3939705 = idf(docFreq=1434, maxDocs=42740)
                0.021561284 = queryNorm
              0.5492463 = fieldWeight in 7843, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3939705 = idf(docFreq=1434, maxDocs=42740)
                0.125 = fieldNorm(doc=7843)
          0.41027632 = weight(abstract_txt:aided in 7843) [ClassicSimilarity], result of:
            0.41027632 = score(doc=7843,freq=1.0), product of:
              0.42346364 = queryWeight, product of:
                2.53391 = boost
                7.7508674 = idf(docFreq=49, maxDocs=42740)
                0.021561284 = queryNorm
              0.9688584 = fieldWeight in 7843, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7508674 = idf(docFreq=49, maxDocs=42740)
                0.125 = fieldNorm(doc=7843)
          0.1858198 = weight(abstract_txt:automatic in 7843) [ClassicSimilarity], result of:
            0.1858198 = score(doc=7843,freq=1.0), product of:
              0.28588426 = queryWeight, product of:
                2.5499043 = boost
                5.199861 = idf(docFreq=640, maxDocs=42740)
                0.021561284 = queryNorm
              0.64998263 = fieldWeight in 7843, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.199861 = idf(docFreq=640, maxDocs=42740)
                0.125 = fieldNorm(doc=7843)
          0.20108618 = weight(abstract_txt:machine in 7843) [ClassicSimilarity], result of:
            0.20108618 = score(doc=7843,freq=1.0), product of:
              0.30133557 = queryWeight, product of:
                2.6179056 = boost
                5.3385315 = idf(docFreq=557, maxDocs=42740)
                0.021561284 = queryNorm
              0.66731644 = fieldWeight in 7843, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3385315 = idf(docFreq=557, maxDocs=42740)
                0.125 = fieldNorm(doc=7843)
          0.25472048 = weight(abstract_txt:indexing in 7843) [ClassicSimilarity], result of:
            0.25472048 = score(doc=7843,freq=2.0), product of:
              0.33197933 = queryWeight, product of:
                3.5473878 = boost
                4.34038 = idf(docFreq=1513, maxDocs=42740)
                0.021561284 = queryNorm
              0.7672781 = fieldWeight in 7843, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.34038 = idf(docFreq=1513, maxDocs=42740)
                0.125 = fieldNorm(doc=7843)
        0.2 = coord(5/25)
    
  4. Lancaster, F.W.: Trends in subject indexing from 1957 to 2000 (1980) 0.15
    0.14860453 = sum of:
      0.14860453 = product of:
        0.9287783 = sum of:
          0.30770722 = weight(abstract_txt:aided in 209) [ClassicSimilarity], result of:
            0.30770722 = score(doc=209,freq=1.0), product of:
              0.42346364 = queryWeight, product of:
                2.53391 = boost
                7.7508674 = idf(docFreq=49, maxDocs=42740)
                0.021561284 = queryNorm
              0.7266438 = fieldWeight in 209, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7508674 = idf(docFreq=49, maxDocs=42740)
                0.09375 = fieldNorm(doc=209)
          0.13936485 = weight(abstract_txt:automatic in 209) [ClassicSimilarity], result of:
            0.13936485 = score(doc=209,freq=1.0), product of:
              0.28588426 = queryWeight, product of:
                2.5499043 = boost
                5.199861 = idf(docFreq=640, maxDocs=42740)
                0.021561284 = queryNorm
              0.48748696 = fieldWeight in 209, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.199861 = idf(docFreq=640, maxDocs=42740)
                0.09375 = fieldNorm(doc=209)
          0.15081464 = weight(abstract_txt:machine in 209) [ClassicSimilarity], result of:
            0.15081464 = score(doc=209,freq=1.0), product of:
              0.30133557 = queryWeight, product of:
                2.6179056 = boost
                5.3385315 = idf(docFreq=557, maxDocs=42740)
                0.021561284 = queryNorm
              0.5004873 = fieldWeight in 209, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3385315 = idf(docFreq=557, maxDocs=42740)
                0.09375 = fieldNorm(doc=209)
          0.33089158 = weight(abstract_txt:indexing in 209) [ClassicSimilarity], result of:
            0.33089158 = score(doc=209,freq=6.0), product of:
              0.33197933 = queryWeight, product of:
                3.5473878 = boost
                4.34038 = idf(docFreq=1513, maxDocs=42740)
                0.021561284 = queryNorm
              0.9967234 = fieldWeight in 209, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.34038 = idf(docFreq=1513, maxDocs=42740)
                0.09375 = fieldNorm(doc=209)
        0.16 = coord(4/25)
    
  5. Milstead, J.L.: Thesauri in a full-text world (1998) 0.15
    0.14772753 = sum of:
      0.14772753 = product of:
        0.7386377 = sum of:
          0.04671722 = weight(abstract_txt:databases in 3338) [ClassicSimilarity], result of:
            0.04671722 = score(doc=3338,freq=1.0), product of:
              0.13609113 = queryWeight, product of:
                1.4364749 = boost
                4.3939705 = idf(docFreq=1434, maxDocs=42740)
                0.021561284 = queryNorm
              0.34327894 = fieldWeight in 3338, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3939705 = idf(docFreq=1434, maxDocs=42740)
                0.078125 = fieldNorm(doc=3338)
          0.058615282 = weight(abstract_txt:analysis in 3338) [ClassicSimilarity], result of:
            0.058615282 = score(doc=3338,freq=2.0), product of:
              0.14383774 = queryWeight, product of:
                1.8086944 = boost
                3.6883576 = idf(docFreq=2905, maxDocs=42740)
                0.021561284 = queryNorm
              0.40750977 = fieldWeight in 3338, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6883576 = idf(docFreq=2905, maxDocs=42740)
                0.078125 = fieldNorm(doc=3338)
          0.2564227 = weight(abstract_txt:aided in 3338) [ClassicSimilarity], result of:
            0.2564227 = score(doc=3338,freq=1.0), product of:
              0.42346364 = queryWeight, product of:
                2.53391 = boost
                7.7508674 = idf(docFreq=49, maxDocs=42740)
                0.021561284 = queryNorm
              0.6055365 = fieldWeight in 3338, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7508674 = idf(docFreq=49, maxDocs=42740)
                0.078125 = fieldNorm(doc=3338)
          0.21768218 = weight(abstract_txt:machine in 3338) [ClassicSimilarity], result of:
            0.21768218 = score(doc=3338,freq=3.0), product of:
              0.30133557 = queryWeight, product of:
                2.6179056 = boost
                5.3385315 = idf(docFreq=557, maxDocs=42740)
                0.021561284 = queryNorm
              0.72239125 = fieldWeight in 3338, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.3385315 = idf(docFreq=557, maxDocs=42740)
                0.078125 = fieldNorm(doc=3338)
          0.1592003 = weight(abstract_txt:indexing in 3338) [ClassicSimilarity], result of:
            0.1592003 = score(doc=3338,freq=2.0), product of:
              0.33197933 = queryWeight, product of:
                3.5473878 = boost
                4.34038 = idf(docFreq=1513, maxDocs=42740)
                0.021561284 = queryNorm
              0.4795488 = fieldWeight in 3338, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.34038 = idf(docFreq=1513, maxDocs=42740)
                0.078125 = fieldNorm(doc=3338)
        0.2 = coord(5/25)