Document (#3093)

Editor
Knowledge-based systems development
Author
Milstead, J.L.
Title
Methodologies for subject analysis in bibliographic databases
Source
IEEE proceedings. Pt.A. 28(1992) no.3, S.407-432
Year
1992
Abstract
Report on a subject analysis review undertaken to aid managers of databases in determining if new and little-known capabilities would improve the cost-effectiveness of subject analysis operations. Operational machine-aided and automatic indexing systems were found to form a continuum. Commercial automatic indexing packages were also reviewed. The primary obstacle to development of automatic indexing is the lack of machine understanding of natural language. Recommendations for action include: increasing the power of the indexer interface, studying indexing policies, enrichment of thesauri, and considering the development of machine-aided indexing

Similar documents (author)

  1. Milstead, J.L.: Database design : Indexing applications (1989) 5.41
    5.4077277 = sum of:
      5.4077277 = weight(author_txt:milstead in 867) [ClassicSimilarity], result of:
        5.4077277 = score(doc=867,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.115575336 = queryNorm
          5.407728 = fieldWeight in 867, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.625 = fieldNorm(doc=867)
    
  2. Milstead, J.L.: Specifications for thesaurus software (1991) 5.41
    5.4077277 = sum of:
      5.4077277 = weight(author_txt:milstead in 2291) [ClassicSimilarity], result of:
        5.4077277 = score(doc=2291,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.115575336 = queryNorm
          5.407728 = fieldWeight in 2291, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.625 = fieldNorm(doc=2291)
    
  3. Milstead, J.L.: Methodologies for subject analysis in bibliographic databases (1992) 5.41
    5.4077277 = sum of:
      5.4077277 = weight(author_txt:milstead in 2311) [ClassicSimilarity], result of:
        5.4077277 = score(doc=2311,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.115575336 = queryNorm
          5.407728 = fieldWeight in 2311, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.625 = fieldNorm(doc=2311)
    
  4. Milstead, J.L.: Natural versus inverted word order in subject headings (1980) 5.41
    5.4077277 = sum of:
      5.4077277 = weight(author_txt:milstead in 2867) [ClassicSimilarity], result of:
        5.4077277 = score(doc=2867,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.115575336 = queryNorm
          5.407728 = fieldWeight in 2867, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.625 = fieldNorm(doc=2867)
    
  5. Milstead, J.L.: Thesaurus software packages (1990) 5.41
    5.4077277 = sum of:
      5.4077277 = weight(author_txt:milstead in 4868) [ClassicSimilarity], result of:
        5.4077277 = score(doc=4868,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.115575336 = queryNorm
          5.407728 = fieldWeight in 4868, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.625 = fieldNorm(doc=4868)
    

Similar documents (content)

  1. Milstead, J.L.: Methodologies for subject analysis in bibliographic databases (1992) 0.43
    0.42588025 = sum of:
      0.42588025 = product of:
        1.0647006 = sum of:
          0.07572091 = weight(abstract_txt:reviewed in 2311) [ClassicSimilarity], result of:
            0.07572091 = score(doc=2311,freq=1.0), product of:
              0.13211814 = queryWeight, product of:
                6.113391 = idf(docFreq=265, maxDocs=44218)
                0.02161127 = queryNorm
              0.57313037 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.113391 = idf(docFreq=265, maxDocs=44218)
                0.09375 = fieldNorm(doc=2311)
          0.08058424 = weight(abstract_txt:policies in 2311) [ClassicSimilarity], result of:
            0.08058424 = score(doc=2311,freq=1.0), product of:
              0.13771628 = queryWeight, product of:
                1.0209663 = boost
                6.241566 = idf(docFreq=233, maxDocs=44218)
                0.02161127 = queryNorm
              0.58514684 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.241566 = idf(docFreq=233, maxDocs=44218)
                0.09375 = fieldNorm(doc=2311)
          0.1139447 = weight(abstract_txt:packages in 2311) [ClassicSimilarity], result of:
            0.1139447 = score(doc=2311,freq=1.0), product of:
              0.17349273 = queryWeight, product of:
                1.1459335 = boost
                7.0055394 = idf(docFreq=108, maxDocs=44218)
                0.02161127 = queryNorm
              0.65676934 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0055394 = idf(docFreq=108, maxDocs=44218)
                0.09375 = fieldNorm(doc=2311)
          0.032765754 = weight(abstract_txt:were in 2311) [ClassicSimilarity], result of:
            0.032765754 = score(doc=2311,freq=1.0), product of:
              0.09523033 = queryWeight, product of:
                1.2006638 = boost
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.02161127 = queryNorm
              0.34406847 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.09375 = fieldNorm(doc=2311)
          0.03775346 = weight(abstract_txt:development in 2311) [ClassicSimilarity], result of:
            0.03775346 = score(doc=2311,freq=1.0), product of:
              0.10466457 = queryWeight, product of:
                1.258733 = boost
                3.8475635 = idf(docFreq=2563, maxDocs=44218)
                0.02161127 = queryNorm
              0.36070907 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8475635 = idf(docFreq=2563, maxDocs=44218)
                0.09375 = fieldNorm(doc=2311)
          0.048487972 = weight(abstract_txt:analysis in 2311) [ClassicSimilarity], result of:
            0.048487972 = score(doc=2311,freq=1.0), product of:
              0.14156252 = queryWeight, product of:
                1.7928896 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.02161127 = queryNorm
              0.34251985 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.09375 = fieldNorm(doc=2311)
          0.059296228 = weight(abstract_txt:subject in 2311) [ClassicSimilarity], result of:
            0.059296228 = score(doc=2311,freq=1.0), product of:
              0.16188638 = queryWeight, product of:
                1.9172757 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.02161127 = queryNorm
              0.366283 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.09375 = fieldNorm(doc=2311)
          0.19720237 = weight(abstract_txt:automatic in 2311) [ClassicSimilarity], result of:
            0.19720237 = score(doc=2311,freq=2.0), product of:
              0.2862796 = queryWeight, product of:
                2.549615 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.02161127 = queryNorm
              0.6888454 = fieldWeight in 2311, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.09375 = fieldNorm(doc=2311)
          0.14622708 = weight(abstract_txt:machine in 2311) [ClassicSimilarity], result of:
            0.14622708 = score(doc=2311,freq=1.0), product of:
              0.29549092 = queryWeight, product of:
                2.5903084 = boost
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.02161127 = queryNorm
              0.49486148 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.09375 = fieldNorm(doc=2311)
          0.27271783 = weight(abstract_txt:indexing in 2311) [ClassicSimilarity], result of:
            0.27271783 = score(doc=2311,freq=4.0), product of:
              0.33439842 = queryWeight, product of:
                3.557426 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.02161127 = queryNorm
              0.81554765 = fieldWeight in 2311, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.09375 = fieldNorm(doc=2311)
        0.4 = coord(10/25)
    
  2. White, H.; Willis, C.; Greenberg, J.: HIVEing : the effect of a semantic web technology on inter-indexer consistency (2014) 0.24
    0.23602287 = sum of:
      0.23602287 = product of:
        0.84293884 = sum of:
          0.059722867 = weight(abstract_txt:studying in 1781) [ClassicSimilarity], result of:
            0.059722867 = score(doc=1781,freq=1.0), product of:
              0.14778821 = queryWeight, product of:
                1.057642 = boost
                6.465779 = idf(docFreq=186, maxDocs=44218)
                0.02161127 = queryNorm
              0.40411118 = fieldWeight in 1781, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.465779 = idf(docFreq=186, maxDocs=44218)
                0.0625 = fieldNorm(doc=1781)
          0.132092 = weight(abstract_txt:indexer in 1781) [ClassicSimilarity], result of:
            0.132092 = score(doc=1781,freq=3.0), product of:
              0.17394954 = queryWeight, product of:
                1.1474411 = boost
                7.014756 = idf(docFreq=107, maxDocs=44218)
                0.02161127 = queryNorm
              0.7593696 = fieldWeight in 1781, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.014756 = idf(docFreq=107, maxDocs=44218)
                0.0625 = fieldNorm(doc=1781)
          0.030891849 = weight(abstract_txt:were in 1781) [ClassicSimilarity], result of:
            0.030891849 = score(doc=1781,freq=2.0), product of:
              0.09523033 = queryWeight, product of:
                1.2006638 = boost
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.02161127 = queryNorm
              0.32439086 = fieldWeight in 1781, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.0625 = fieldNorm(doc=1781)
          0.032325316 = weight(abstract_txt:analysis in 1781) [ClassicSimilarity], result of:
            0.032325316 = score(doc=1781,freq=1.0), product of:
              0.14156252 = queryWeight, product of:
                1.7928896 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.02161127 = queryNorm
              0.22834657 = fieldWeight in 1781, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0625 = fieldNorm(doc=1781)
          0.2925889 = weight(abstract_txt:aided in 1781) [ClassicSimilarity], result of:
            0.2925889 = score(doc=1781,freq=2.0), product of:
              0.42630255 = queryWeight, product of:
                2.5403452 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.02161127 = queryNorm
              0.6863409 = fieldWeight in 1781, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.0625 = fieldNorm(doc=1781)
          0.13786422 = weight(abstract_txt:machine in 1781) [ClassicSimilarity], result of:
            0.13786422 = score(doc=1781,freq=2.0), product of:
              0.29549092 = queryWeight, product of:
                2.5903084 = boost
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.02161127 = queryNorm
              0.4665599 = fieldWeight in 1781, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.0625 = fieldNorm(doc=1781)
          0.15745372 = weight(abstract_txt:indexing in 1781) [ClassicSimilarity], result of:
            0.15745372 = score(doc=1781,freq=3.0), product of:
              0.33439842 = queryWeight, product of:
                3.557426 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.02161127 = queryNorm
              0.47085664 = fieldWeight in 1781, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.0625 = fieldNorm(doc=1781)
        0.28 = coord(7/25)
    
  3. Greenrich, E.: CD-ROM data preparation enhancements (1993) 0.23
    0.22554742 = sum of:
      0.22554742 = product of:
        1.127737 = sum of:
          0.07593935 = weight(abstract_txt:databases in 7843) [ClassicSimilarity], result of:
            0.07593935 = score(doc=7843,freq=1.0), product of:
              0.13767253 = queryWeight, product of:
                1.443635 = boost
                4.4127526 = idf(docFreq=1456, maxDocs=44218)
                0.02161127 = queryNorm
              0.5515941 = fieldWeight in 7843, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4127526 = idf(docFreq=1456, maxDocs=44218)
                0.125 = fieldNorm(doc=7843)
          0.4137832 = weight(abstract_txt:aided in 7843) [ClassicSimilarity], result of:
            0.4137832 = score(doc=7843,freq=1.0), product of:
              0.42630255 = queryWeight, product of:
                2.5403452 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.02161127 = queryNorm
              0.9706327 = fieldWeight in 7843, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.125 = fieldNorm(doc=7843)
          0.1859242 = weight(abstract_txt:automatic in 7843) [ClassicSimilarity], result of:
            0.1859242 = score(doc=7843,freq=1.0), product of:
              0.2862796 = queryWeight, product of:
                2.549615 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.02161127 = queryNorm
              0.6494497 = fieldWeight in 7843, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.125 = fieldNorm(doc=7843)
          0.19496943 = weight(abstract_txt:machine in 7843) [ClassicSimilarity], result of:
            0.19496943 = score(doc=7843,freq=1.0), product of:
              0.29549092 = queryWeight, product of:
                2.5903084 = boost
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.02161127 = queryNorm
              0.6598153 = fieldWeight in 7843, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.125 = fieldNorm(doc=7843)
          0.25712085 = weight(abstract_txt:indexing in 7843) [ClassicSimilarity], result of:
            0.25712085 = score(doc=7843,freq=2.0), product of:
              0.33439842 = queryWeight, product of:
                3.557426 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.02161127 = queryNorm
              0.7689057 = fieldWeight in 7843, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.125 = fieldNorm(doc=7843)
        0.2 = coord(5/25)
    
  4. Lancaster, F.W.: Trends in subject indexing from 1957 to 2000 (1980) 0.15
    0.14880277 = sum of:
      0.14880277 = product of:
        0.93001735 = sum of:
          0.3103374 = weight(abstract_txt:aided in 208) [ClassicSimilarity], result of:
            0.3103374 = score(doc=208,freq=1.0), product of:
              0.42630255 = queryWeight, product of:
                2.5403452 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.02161127 = queryNorm
              0.72797453 = fieldWeight in 208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.09375 = fieldNorm(doc=208)
          0.13944314 = weight(abstract_txt:automatic in 208) [ClassicSimilarity], result of:
            0.13944314 = score(doc=208,freq=1.0), product of:
              0.2862796 = queryWeight, product of:
                2.549615 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.02161127 = queryNorm
              0.48708728 = fieldWeight in 208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.09375 = fieldNorm(doc=208)
          0.14622708 = weight(abstract_txt:machine in 208) [ClassicSimilarity], result of:
            0.14622708 = score(doc=208,freq=1.0), product of:
              0.29549092 = queryWeight, product of:
                2.5903084 = boost
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.02161127 = queryNorm
              0.49486148 = fieldWeight in 208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.09375 = fieldNorm(doc=208)
          0.3340098 = weight(abstract_txt:indexing in 208) [ClassicSimilarity], result of:
            0.3340098 = score(doc=208,freq=6.0), product of:
              0.33439842 = queryWeight, product of:
                3.557426 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.02161127 = queryNorm
              0.9988378 = fieldWeight in 208, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.09375 = fieldNorm(doc=208)
        0.16 = coord(4/25)
    
  5. Milstead, J.L.: Thesauri in a full-text world (1998) 0.15
    0.14699626 = sum of:
      0.14699626 = product of:
        0.7349813 = sum of:
          0.047462095 = weight(abstract_txt:databases in 2337) [ClassicSimilarity], result of:
            0.047462095 = score(doc=2337,freq=1.0), product of:
              0.13767253 = queryWeight, product of:
                1.443635 = boost
                4.4127526 = idf(docFreq=1456, maxDocs=44218)
                0.02161127 = queryNorm
              0.3447463 = fieldWeight in 2337, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4127526 = idf(docFreq=1456, maxDocs=44218)
                0.078125 = fieldNorm(doc=2337)
          0.057143625 = weight(abstract_txt:analysis in 2337) [ClassicSimilarity], result of:
            0.057143625 = score(doc=2337,freq=2.0), product of:
              0.14156252 = queryWeight, product of:
                1.7928896 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.02161127 = queryNorm
              0.40366352 = fieldWeight in 2337, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.078125 = fieldNorm(doc=2337)
          0.25861448 = weight(abstract_txt:aided in 2337) [ClassicSimilarity], result of:
            0.25861448 = score(doc=2337,freq=1.0), product of:
              0.42630255 = queryWeight, product of:
                2.5403452 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.02161127 = queryNorm
              0.6066454 = fieldWeight in 2337, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.078125 = fieldNorm(doc=2337)
          0.2110606 = weight(abstract_txt:machine in 2337) [ClassicSimilarity], result of:
            0.2110606 = score(doc=2337,freq=3.0), product of:
              0.29549092 = queryWeight, product of:
                2.5903084 = boost
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.02161127 = queryNorm
              0.714271 = fieldWeight in 2337, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.078125 = fieldNorm(doc=2337)
          0.16070053 = weight(abstract_txt:indexing in 2337) [ClassicSimilarity], result of:
            0.16070053 = score(doc=2337,freq=2.0), product of:
              0.33439842 = queryWeight, product of:
                3.557426 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.02161127 = queryNorm
              0.48056605 = fieldWeight in 2337, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.078125 = fieldNorm(doc=2337)
        0.2 = coord(5/25)