Document (#3093)

Editor
Knowledge-based systems development
Author
Milstead, J.L.
Title
Methodologies for subject analysis in bibliographic databases
Source
IEEE proceedings. Pt.A. 28(1992) no.3, S.407-432
Year
1992
Abstract
Report on a subject analysis review undertaken to aid managers of databases in determining if new and little-known capabilities would improve the cost-effectiveness of subject analysis operations. Operational machine-aided and automatic indexing systems were found to form a continuum. Commercial automatic indexing packages were also reviewed. The primary obstacle to development of automatic indexing is the lack of machine understanding of natural language. Recommendations for action include: increasing the power of the indexer interface, studying indexing policies, enrichment of thesauri, and considering the development of machine-aided indexing

Similar documents (author)

  1. Milstead, J.L.: Database design : Indexing applications (1989) 5.39
    5.393951 = sum of:
      5.393951 = weight(author_txt:milstead in 867) [ClassicSimilarity], result of:
        5.393951 = score(doc=867,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.630322 = idf(docFreq=20, maxDocs=43254)
            0.11587052 = queryNorm
          5.3939514 = fieldWeight in 867, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.630322 = idf(docFreq=20, maxDocs=43254)
            0.625 = fieldNorm(doc=867)
    
  2. Milstead, J.L.: Specifications for thesaurus software (1991) 5.39
    5.393951 = sum of:
      5.393951 = weight(author_txt:milstead in 2291) [ClassicSimilarity], result of:
        5.393951 = score(doc=2291,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.630322 = idf(docFreq=20, maxDocs=43254)
            0.11587052 = queryNorm
          5.3939514 = fieldWeight in 2291, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.630322 = idf(docFreq=20, maxDocs=43254)
            0.625 = fieldNorm(doc=2291)
    
  3. Milstead, J.L.: Methodologies for subject analysis in bibliographic databases (1992) 5.39
    5.393951 = sum of:
      5.393951 = weight(author_txt:milstead in 2311) [ClassicSimilarity], result of:
        5.393951 = score(doc=2311,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.630322 = idf(docFreq=20, maxDocs=43254)
            0.11587052 = queryNorm
          5.3939514 = fieldWeight in 2311, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.630322 = idf(docFreq=20, maxDocs=43254)
            0.625 = fieldNorm(doc=2311)
    
  4. Milstead, J.L.: Natural versus inverted word order in subject headings (1980) 5.39
    5.393951 = sum of:
      5.393951 = weight(author_txt:milstead in 2867) [ClassicSimilarity], result of:
        5.393951 = score(doc=2867,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.630322 = idf(docFreq=20, maxDocs=43254)
            0.11587052 = queryNorm
          5.3939514 = fieldWeight in 2867, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.630322 = idf(docFreq=20, maxDocs=43254)
            0.625 = fieldNorm(doc=2867)
    
  5. Milstead, J.L.: Thesaurus software packages (1990) 5.39
    5.393951 = sum of:
      5.393951 = weight(author_txt:milstead in 4868) [ClassicSimilarity], result of:
        5.393951 = score(doc=4868,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.630322 = idf(docFreq=20, maxDocs=43254)
            0.11587052 = queryNorm
          5.3939514 = fieldWeight in 4868, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.630322 = idf(docFreq=20, maxDocs=43254)
            0.625 = fieldNorm(doc=4868)
    

Similar documents (content)

  1. Milstead, J.L.: Methodologies for subject analysis in bibliographic databases (1992) 0.43
    0.42647967 = sum of:
      0.42647967 = product of:
        1.0661992 = sum of:
          0.075717226 = weight(abstract_txt:reviewed in 2311) [ClassicSimilarity], result of:
            0.075717226 = score(doc=2311,freq=1.0), product of:
              0.1320118 = queryWeight, product of:
                1.0006303 = boost
                6.1180167 = idf(docFreq=258, maxDocs=43254)
                0.021563958 = queryNorm
              0.57356405 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1180167 = idf(docFreq=258, maxDocs=43254)
                0.09375 = fieldNorm(doc=2311)
          0.08106332 = weight(abstract_txt:policies in 2311) [ClassicSimilarity], result of:
            0.08106332 = score(doc=2311,freq=1.0), product of:
              0.13815477 = queryWeight, product of:
                1.023647 = boost
                6.2587447 = idf(docFreq=224, maxDocs=43254)
                0.021563958 = queryNorm
              0.5867573 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2587447 = idf(docFreq=224, maxDocs=43254)
                0.09375 = fieldNorm(doc=2311)
          0.1126112 = weight(abstract_txt:packages in 2311) [ClassicSimilarity], result of:
            0.1126112 = score(doc=2311,freq=1.0), product of:
              0.17200352 = queryWeight, product of:
                1.1421835 = boost
                6.983497 = idf(docFreq=108, maxDocs=43254)
                0.021563958 = queryNorm
              0.65470284 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.983497 = idf(docFreq=108, maxDocs=43254)
                0.09375 = fieldNorm(doc=2311)
          0.03303449 = weight(abstract_txt:were in 2311) [ClassicSimilarity], result of:
            0.03303449 = score(doc=2311,freq=1.0), product of:
              0.09567637 = queryWeight, product of:
                1.2047157 = boost
                3.6829145 = idf(docFreq=2956, maxDocs=43254)
                0.021563958 = queryNorm
              0.34527323 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6829145 = idf(docFreq=2956, maxDocs=43254)
                0.09375 = fieldNorm(doc=2311)
          0.037831884 = weight(abstract_txt:development in 2311) [ClassicSimilarity], result of:
            0.037831884 = score(doc=2311,freq=1.0), product of:
              0.10472851 = queryWeight, product of:
                1.2604183 = boost
                3.8532019 = idf(docFreq=2493, maxDocs=43254)
                0.021563958 = queryNorm
              0.36123767 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8532019 = idf(docFreq=2493, maxDocs=43254)
                0.09375 = fieldNorm(doc=2311)
          0.049172305 = weight(abstract_txt:analysis in 2311) [ClassicSimilarity], result of:
            0.049172305 = score(doc=2311,freq=1.0), product of:
              0.142781 = queryWeight, product of:
                1.8024493 = boost
                3.67349 = idf(docFreq=2984, maxDocs=43254)
                0.021563958 = queryNorm
              0.34438968 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.67349 = idf(docFreq=2984, maxDocs=43254)
                0.09375 = fieldNorm(doc=2311)
          0.05930014 = weight(abstract_txt:subject in 2311) [ClassicSimilarity], result of:
            0.05930014 = score(doc=2311,freq=1.0), product of:
              0.16176847 = queryWeight, product of:
                1.9185574 = boost
                3.9101245 = idf(docFreq=2355, maxDocs=43254)
                0.021563958 = queryNorm
              0.36657417 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9101245 = idf(docFreq=2355, maxDocs=43254)
                0.09375 = fieldNorm(doc=2311)
          0.19682963 = weight(abstract_txt:automatic in 2311) [ClassicSimilarity], result of:
            0.19682963 = score(doc=2311,freq=2.0), product of:
              0.2856979 = queryWeight, product of:
                2.549655 = boost
                5.1963353 = idf(docFreq=650, maxDocs=43254)
                0.021563958 = queryNorm
              0.6889432 = fieldWeight in 2311, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1963353 = idf(docFreq=650, maxDocs=43254)
                0.09375 = fieldNorm(doc=2311)
          0.14939474 = weight(abstract_txt:machine in 2311) [ClassicSimilarity], result of:
            0.14939474 = score(doc=2311,freq=1.0), product of:
              0.29951155 = queryWeight, product of:
                2.610566 = boost
                5.320475 = idf(docFreq=574, maxDocs=43254)
                0.021563958 = queryNorm
              0.49879456 = fieldWeight in 2311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.320475 = idf(docFreq=574, maxDocs=43254)
                0.09375 = fieldNorm(doc=2311)
          0.2712443 = weight(abstract_txt:indexing in 2311) [ClassicSimilarity], result of:
            0.2712443 = score(doc=2311,freq=4.0), product of:
              0.33293542 = queryWeight, product of:
                3.553303 = boost
                4.345095 = idf(docFreq=1524, maxDocs=43254)
                0.021563958 = queryNorm
              0.8147054 = fieldWeight in 2311, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.345095 = idf(docFreq=1524, maxDocs=43254)
                0.09375 = fieldNorm(doc=2311)
        0.4 = coord(10/25)
    
  2. White, H.; Willis, C.; Greenberg, J.: HIVEing : the effect of a semantic web technology on inter-indexer consistency (2014) 0.24
    0.2366381 = sum of:
      0.2366381 = product of:
        0.84513605 = sum of:
          0.06049925 = weight(abstract_txt:studying in 3246) [ClassicSimilarity], result of:
            0.06049925 = score(doc=3246,freq=1.0), product of:
              0.14895113 = queryWeight, product of:
                1.062892 = boost
                6.4986954 = idf(docFreq=176, maxDocs=43254)
                0.021563958 = queryNorm
              0.40616846 = fieldWeight in 3246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4986954 = idf(docFreq=176, maxDocs=43254)
                0.0625 = fieldNorm(doc=3246)
          0.13159741 = weight(abstract_txt:indexer in 3246) [ClassicSimilarity], result of:
            0.13159741 = score(doc=3246,freq=3.0), product of:
              0.17338105 = queryWeight, product of:
                1.1467482 = boost
                7.011406 = idf(docFreq=105, maxDocs=43254)
                0.021563958 = queryNorm
              0.7590069 = fieldWeight in 3246, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.011406 = idf(docFreq=105, maxDocs=43254)
                0.0625 = fieldNorm(doc=3246)
          0.031145215 = weight(abstract_txt:were in 3246) [ClassicSimilarity], result of:
            0.031145215 = score(doc=3246,freq=2.0), product of:
              0.09567637 = queryWeight, product of:
                1.2047157 = boost
                3.6829145 = idf(docFreq=2956, maxDocs=43254)
                0.021563958 = queryNorm
              0.3255267 = fieldWeight in 3246, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6829145 = idf(docFreq=2956, maxDocs=43254)
                0.0625 = fieldNorm(doc=3246)
          0.032781538 = weight(abstract_txt:analysis in 3246) [ClassicSimilarity], result of:
            0.032781538 = score(doc=3246,freq=1.0), product of:
              0.142781 = queryWeight, product of:
                1.8024493 = boost
                3.67349 = idf(docFreq=2984, maxDocs=43254)
                0.021563958 = queryNorm
              0.22959313 = fieldWeight in 3246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.67349 = idf(docFreq=2984, maxDocs=43254)
                0.0625 = fieldNorm(doc=3246)
          0.29165897 = weight(abstract_txt:aided in 3246) [ClassicSimilarity], result of:
            0.29165897 = score(doc=3246,freq=2.0), product of:
              0.42507023 = queryWeight, product of:
                2.5392916 = boost
                7.762822 = idf(docFreq=49, maxDocs=43254)
                0.021563958 = queryNorm
              0.68614304 = fieldWeight in 3246, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.762822 = idf(docFreq=49, maxDocs=43254)
                0.0625 = fieldNorm(doc=3246)
          0.1408507 = weight(abstract_txt:machine in 3246) [ClassicSimilarity], result of:
            0.1408507 = score(doc=3246,freq=2.0), product of:
              0.29951155 = queryWeight, product of:
                2.610566 = boost
                5.320475 = idf(docFreq=574, maxDocs=43254)
                0.021563958 = queryNorm
              0.47026798 = fieldWeight in 3246, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.320475 = idf(docFreq=574, maxDocs=43254)
                0.0625 = fieldNorm(doc=3246)
          0.15660295 = weight(abstract_txt:indexing in 3246) [ClassicSimilarity], result of:
            0.15660295 = score(doc=3246,freq=3.0), product of:
              0.33293542 = queryWeight, product of:
                3.553303 = boost
                4.345095 = idf(docFreq=1524, maxDocs=43254)
                0.021563958 = queryNorm
              0.47037035 = fieldWeight in 3246, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.345095 = idf(docFreq=1524, maxDocs=43254)
                0.0625 = fieldNorm(doc=3246)
        0.28 = coord(7/25)
    
  3. Greenrich, E.: CD-ROM data preparation enhancements (1993) 0.23
    0.22564697 = sum of:
      0.22564697 = product of:
        1.1282349 = sum of:
          0.075269565 = weight(abstract_txt:databases in 843) [ClassicSimilarity], result of:
            0.075269565 = score(doc=843,freq=1.0), product of:
              0.1367561 = queryWeight, product of:
                1.4403087 = boost
                4.4031415 = idf(docFreq=1438, maxDocs=43254)
                0.021563958 = queryNorm
              0.5503927 = fieldWeight in 843, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4031415 = idf(docFreq=1438, maxDocs=43254)
                0.125 = fieldNorm(doc=843)
          0.41246808 = weight(abstract_txt:aided in 843) [ClassicSimilarity], result of:
            0.41246808 = score(doc=843,freq=1.0), product of:
              0.42507023 = queryWeight, product of:
                2.5392916 = boost
                7.762822 = idf(docFreq=49, maxDocs=43254)
                0.021563958 = queryNorm
              0.97035277 = fieldWeight in 843, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.762822 = idf(docFreq=49, maxDocs=43254)
                0.125 = fieldNorm(doc=843)
          0.18557276 = weight(abstract_txt:automatic in 843) [ClassicSimilarity], result of:
            0.18557276 = score(doc=843,freq=1.0), product of:
              0.2856979 = queryWeight, product of:
                2.549655 = boost
                5.1963353 = idf(docFreq=650, maxDocs=43254)
                0.021563958 = queryNorm
              0.6495419 = fieldWeight in 843, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1963353 = idf(docFreq=650, maxDocs=43254)
                0.125 = fieldNorm(doc=843)
          0.19919297 = weight(abstract_txt:machine in 843) [ClassicSimilarity], result of:
            0.19919297 = score(doc=843,freq=1.0), product of:
              0.29951155 = queryWeight, product of:
                2.610566 = boost
                5.320475 = idf(docFreq=574, maxDocs=43254)
                0.021563958 = queryNorm
              0.6650594 = fieldWeight in 843, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.320475 = idf(docFreq=574, maxDocs=43254)
                0.125 = fieldNorm(doc=843)
          0.25573152 = weight(abstract_txt:indexing in 843) [ClassicSimilarity], result of:
            0.25573152 = score(doc=843,freq=2.0), product of:
              0.33293542 = queryWeight, product of:
                3.553303 = boost
                4.345095 = idf(docFreq=1524, maxDocs=43254)
                0.021563958 = queryNorm
              0.7681115 = fieldWeight in 843, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.345095 = idf(docFreq=1524, maxDocs=43254)
                0.125 = fieldNorm(doc=843)
        0.2 = coord(5/25)
    
  4. Lancaster, F.W.: Trends in subject indexing from 1957 to 2000 (1980) 0.15
    0.14882086 = sum of:
      0.14882086 = product of:
        0.9301304 = sum of:
          0.30935106 = weight(abstract_txt:aided in 1209) [ClassicSimilarity], result of:
            0.30935106 = score(doc=1209,freq=1.0), product of:
              0.42507023 = queryWeight, product of:
                2.5392916 = boost
                7.762822 = idf(docFreq=49, maxDocs=43254)
                0.021563958 = queryNorm
              0.7277646 = fieldWeight in 1209, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.762822 = idf(docFreq=49, maxDocs=43254)
                0.09375 = fieldNorm(doc=1209)
          0.13917957 = weight(abstract_txt:automatic in 1209) [ClassicSimilarity], result of:
            0.13917957 = score(doc=1209,freq=1.0), product of:
              0.2856979 = queryWeight, product of:
                2.549655 = boost
                5.1963353 = idf(docFreq=650, maxDocs=43254)
                0.021563958 = queryNorm
              0.48715645 = fieldWeight in 1209, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1963353 = idf(docFreq=650, maxDocs=43254)
                0.09375 = fieldNorm(doc=1209)
          0.14939474 = weight(abstract_txt:machine in 1209) [ClassicSimilarity], result of:
            0.14939474 = score(doc=1209,freq=1.0), product of:
              0.29951155 = queryWeight, product of:
                2.610566 = boost
                5.320475 = idf(docFreq=574, maxDocs=43254)
                0.021563958 = queryNorm
              0.49879456 = fieldWeight in 1209, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.320475 = idf(docFreq=574, maxDocs=43254)
                0.09375 = fieldNorm(doc=1209)
          0.33220506 = weight(abstract_txt:indexing in 1209) [ClassicSimilarity], result of:
            0.33220506 = score(doc=1209,freq=6.0), product of:
              0.33293542 = queryWeight, product of:
                3.553303 = boost
                4.345095 = idf(docFreq=1524, maxDocs=43254)
                0.021563958 = queryNorm
              0.99780625 = fieldWeight in 1209, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.345095 = idf(docFreq=1524, maxDocs=43254)
                0.09375 = fieldNorm(doc=1209)
        0.16 = coord(4/25)
    
  5. Milstead, J.L.: Thesauri in a full-text world (1998) 0.15
    0.14765023 = sum of:
      0.14765023 = product of:
        0.7382511 = sum of:
          0.047043476 = weight(abstract_txt:databases in 4338) [ClassicSimilarity], result of:
            0.047043476 = score(doc=4338,freq=1.0), product of:
              0.1367561 = queryWeight, product of:
                1.4403087 = boost
                4.4031415 = idf(docFreq=1438, maxDocs=43254)
                0.021563958 = queryNorm
              0.34399542 = fieldWeight in 4338, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4031415 = idf(docFreq=1438, maxDocs=43254)
                0.078125 = fieldNorm(doc=4338)
          0.057950117 = weight(abstract_txt:analysis in 4338) [ClassicSimilarity], result of:
            0.057950117 = score(doc=4338,freq=2.0), product of:
              0.142781 = queryWeight, product of:
                1.8024493 = boost
                3.67349 = idf(docFreq=2984, maxDocs=43254)
                0.021563958 = queryNorm
              0.40586713 = fieldWeight in 4338, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.67349 = idf(docFreq=2984, maxDocs=43254)
                0.078125 = fieldNorm(doc=4338)
          0.25779253 = weight(abstract_txt:aided in 4338) [ClassicSimilarity], result of:
            0.25779253 = score(doc=4338,freq=1.0), product of:
              0.42507023 = queryWeight, product of:
                2.5392916 = boost
                7.762822 = idf(docFreq=49, maxDocs=43254)
                0.021563958 = queryNorm
              0.60647047 = fieldWeight in 4338, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.762822 = idf(docFreq=49, maxDocs=43254)
                0.078125 = fieldNorm(doc=4338)
          0.2156327 = weight(abstract_txt:machine in 4338) [ClassicSimilarity], result of:
            0.2156327 = score(doc=4338,freq=3.0), product of:
              0.29951155 = queryWeight, product of:
                2.610566 = boost
                5.320475 = idf(docFreq=574, maxDocs=43254)
                0.021563958 = queryNorm
              0.7199479 = fieldWeight in 4338, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.320475 = idf(docFreq=574, maxDocs=43254)
                0.078125 = fieldNorm(doc=4338)
          0.15983221 = weight(abstract_txt:indexing in 4338) [ClassicSimilarity], result of:
            0.15983221 = score(doc=4338,freq=2.0), product of:
              0.33293542 = queryWeight, product of:
                3.553303 = boost
                4.345095 = idf(docFreq=1524, maxDocs=43254)
                0.021563958 = queryNorm
              0.4800697 = fieldWeight in 4338, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.345095 = idf(docFreq=1524, maxDocs=43254)
                0.078125 = fieldNorm(doc=4338)
        0.2 = coord(5/25)