Document (#2312)

Author
Milstead, J.L.
Title
Methodologies for subject analysis in bibliographic databases
Source
Information processing and management. 28(1992) no.3, pp.407-431
Year
1992
Abstract
The goal of the study was to determine the state of the art of subject analysis as applied to large bibliographic data bases. The intent was to gather and evaluate information, casting it in a form that could be applied by management. There was no attempt to determine actual costs or trade-offs among costs and possible benefits. Commercial automatic indexing packages were also reviewed. The overall conclusion was that data base producers should begin working seriously on upgrading their thesauri and codifying their indexing policies as a means of moving toward development of machine aids to indexing, but that fully automatic indexing is not yet ready for wholesale implementation.
Theme
Automatic indexing

Similar documents (author)

  1. Milstead, J.L.: Database design : Indexing applications (1989) 5.41
    5.4077277 = sum of:
      5.4077277 = weight(author_txt:milstead in 867) [ClassicSimilarity], result of:
        5.4077277 = score(doc=867,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.115575336 = queryNorm
          5.407728 = fieldWeight in 867, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.625 = fieldNorm(doc=867)
    
  2. Milstead, J.L.: Specifications for thesaurus software (1991) 5.41
    5.4077277 = sum of:
      5.4077277 = weight(author_txt:milstead in 2291) [ClassicSimilarity], result of:
        5.4077277 = score(doc=2291,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.115575336 = queryNorm
          5.407728 = fieldWeight in 2291, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.625 = fieldNorm(doc=2291)
    
  3. Milstead, J.L.: Natural versus inverted word order in subject headings (1980) 5.41
    5.4077277 = sum of:
      5.4077277 = weight(author_txt:milstead in 2867) [ClassicSimilarity], result of:
        5.4077277 = score(doc=2867,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.115575336 = queryNorm
          5.407728 = fieldWeight in 2867, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.625 = fieldNorm(doc=2867)
    
  4. Milstead, J.L.: Methodologies for subject analysis in bibliographic databases (1992) 5.41
    5.4077277 = sum of:
      5.4077277 = weight(author_txt:milstead in 3092) [ClassicSimilarity], result of:
        5.4077277 = score(doc=3092,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.115575336 = queryNorm
          5.407728 = fieldWeight in 3092, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.625 = fieldNorm(doc=3092)
    
  5. Milstead, J.L.: Thesaurus software packages (1990) 5.41
    5.4077277 = sum of:
      5.4077277 = weight(author_txt:milstead in 4868) [ClassicSimilarity], result of:
        5.4077277 = score(doc=4868,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.115575336 = queryNorm
          5.407728 = fieldWeight in 4868, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.625 = fieldNorm(doc=4868)
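
The five author matches above all share the same score because each tree multiplies the same factors. A minimal sketch of that arithmetic follows, assuming the usual ClassicSimilarity definitions (idf = 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq)). The function names are illustrative and not part of Lucene; the numeric inputs are copied from the listing.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as ClassicSimilarity is assumed to compute it."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq: float) -> float:
    """Term frequency factor: square root of the raw in-field frequency."""
    return math.sqrt(freq)


def term_score(freq, doc_freq, max_docs, field_norm, query_norm, boost=1.0):
    """score = queryWeight * fieldWeight, mirroring the explain tree."""
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm            # "queryWeight, product of"
    field_weight = classic_tf(freq) * idf * field_norm  # "fieldWeight in <doc>"
    return query_weight * field_weight


# Values taken from the author_txt:milstead trees (docFreq=20, maxDocs=44218).
author_idf = classic_idf(20, 44218)
author_score = term_score(
    freq=1.0,
    doc_freq=20,
    max_docs=44218,
    field_norm=0.625,
    query_norm=0.115575336,
)
print(f"idf={author_idf:.6f} score={author_score:.6f}")
```

Because the query has a single term, queryNorm is simply 1/idf, so queryWeight comes out at about 1 and the score reduces to the fieldWeight. Small differences in the last digits are expected, since Lucene works in single precision.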
    

Similar documents (content)

  1. Keyser, P. de: Indexing : from thesauri to the Semantic Web (2012) 0.17
    0.17169058 = sum of:
      0.17169058 = product of:
        0.71537745 = sum of:
          0.0821336 = weight(abstract_txt:moving in 3197) [ClassicSimilarity], result of:
            0.0821336 = score(doc=3197,freq=1.0), product of:
              0.15876707 = queryWeight, product of:
                1.0172383 = boost
                6.6217136 = idf(docFreq=159, maxDocs=44218)
                0.023570422 = queryNorm
              0.51732135 = fieldWeight in 3197, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6217136 = idf(docFreq=159, maxDocs=44218)
                0.078125 = fieldNorm(doc=3197)
          0.011289829 = weight(abstract_txt:that in 3197) [ClassicSimilarity], result of:
            0.011289829 = score(doc=3197,freq=1.0), product of:
              0.06098811 = queryWeight, product of:
                1.0920076 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.023570422 = queryNorm
              0.18511525 = fieldWeight in 3197, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=3197)
          0.033742413 = weight(abstract_txt:subject in 3197) [ClassicSimilarity], result of:
            0.033742413 = score(doc=3197,freq=1.0), product of:
              0.11054538 = queryWeight, product of:
                1.2004049 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.023570422 = queryNorm
              0.30523583 = fieldWeight in 3197, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.078125 = fieldNorm(doc=3197)
          0.062380116 = weight(abstract_txt:applied in 3197) [ClassicSimilarity], result of:
            0.062380116 = score(doc=3197,freq=1.0), product of:
              0.16651523 = queryWeight, product of:
                1.4732771 = boost
                4.79515 = idf(docFreq=993, maxDocs=44218)
                0.023570422 = queryNorm
              0.3746211 = fieldWeight in 3197, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.79515 = idf(docFreq=993, maxDocs=44218)
                0.078125 = fieldNorm(doc=3197)
          0.17743172 = weight(abstract_txt:automatic in 3197) [ClassicSimilarity], result of:
            0.17743172 = score(doc=3197,freq=5.0), product of:
              0.19548826 = queryWeight, product of:
                1.596312 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.023570422 = queryNorm
              0.9076336 = fieldWeight in 3197, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.078125 = fieldNorm(doc=3197)
          0.3483998 = weight(abstract_txt:indexing in 3197) [ClassicSimilarity], result of:
            0.3483998 = score(doc=3197,freq=14.0), product of:
              0.2740159 = queryWeight, product of:
                2.6727622 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.023570422 = queryNorm
              1.2714583 = fieldWeight in 3197, product of:
                3.7416575 = tf(freq=14.0), with freq of:
                  14.0 = termFreq=14.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.078125 = fieldNorm(doc=3197)
        0.24 = coord(6/25)
    
  2. Milstead, J.L.: Methodologies for subject analysis in bibliographic databases (1992) 0.13
    0.1271152 = sum of:
      0.1271152 = product of:
        0.635576 = sum of:
          0.11671204 = weight(abstract_txt:packages in 3092) [ClassicSimilarity], result of:
            0.11671204 = score(doc=3092,freq=1.0), product of:
              0.17770629 = queryWeight, product of:
                1.0762022 = boost
                7.0055394 = idf(docFreq=108, maxDocs=44218)
                0.023570422 = queryNorm
              0.65676934 = fieldWeight in 3092, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0055394 = idf(docFreq=108, maxDocs=44218)
                0.09375 = fieldNorm(doc=3092)
          0.046825167 = weight(abstract_txt:analysis in 3092) [ClassicSimilarity], result of:
            0.046825167 = score(doc=3092,freq=2.0), product of:
              0.09666707 = queryWeight, product of:
                1.1225269 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.023570422 = queryNorm
              0.48439622 = fieldWeight in 3092, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.09375 = fieldNorm(doc=3092)
          0.05726277 = weight(abstract_txt:subject in 3092) [ClassicSimilarity], result of:
            0.05726277 = score(doc=3092,freq=2.0), product of:
              0.11054538 = queryWeight, product of:
                1.2004049 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.023570422 = queryNorm
              0.5180024 = fieldWeight in 3092, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.09375 = fieldNorm(doc=3092)
          0.1649256 = weight(abstract_txt:automatic in 3092) [ClassicSimilarity], result of:
            0.1649256 = score(doc=3092,freq=3.0), product of:
              0.19548826 = queryWeight, product of:
                1.596312 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.023570422 = queryNorm
              0.8436599 = fieldWeight in 3092, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.09375 = fieldNorm(doc=3092)
          0.24985044 = weight(abstract_txt:indexing in 3092) [ClassicSimilarity], result of:
            0.24985044 = score(doc=3092,freq=5.0), product of:
              0.2740159 = queryWeight, product of:
                2.6727622 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.023570422 = queryNorm
              0.91181 = fieldWeight in 3092, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.09375 = fieldNorm(doc=3092)
        0.2 = coord(5/25)
    
  3. Poulter, A.: Filling in the blanks in RDA or remaining blank? : the strange case of FRSAD (2013) 0.13
    0.12607959 = sum of:
      0.12607959 = product of:
        0.39399874 = sum of:
          0.057493523 = weight(abstract_txt:moving in 980) [ClassicSimilarity], result of:
            0.057493523 = score(doc=980,freq=1.0), product of:
              0.15876707 = queryWeight, product of:
                1.0172383 = boost
                6.6217136 = idf(docFreq=159, maxDocs=44218)
                0.023570422 = queryNorm
              0.36212498 = fieldWeight in 980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6217136 = idf(docFreq=159, maxDocs=44218)
                0.0546875 = fieldNorm(doc=980)
          0.020800162 = weight(abstract_txt:data in 980) [ClassicSimilarity], result of:
            0.020800162 = score(doc=980,freq=2.0), product of:
              0.080610625 = queryWeight, product of:
                1.0250702 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.023570422 = queryNorm
              0.2580325 = fieldWeight in 980, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0546875 = fieldNorm(doc=980)
          0.00790288 = weight(abstract_txt:that in 980) [ClassicSimilarity], result of:
            0.00790288 = score(doc=980,freq=1.0), product of:
              0.06098811 = queryWeight, product of:
                1.0920076 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.023570422 = queryNorm
              0.12958068 = fieldWeight in 980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0546875 = fieldNorm(doc=980)
          0.07123087 = weight(abstract_txt:begin in 980) [ClassicSimilarity], result of:
            0.07123087 = score(doc=980,freq=1.0), product of:
              0.18314427 = queryWeight, product of:
                1.0925444 = boost
                7.11192 = idf(docFreq=97, maxDocs=44218)
                0.023570422 = queryNorm
              0.38893312 = fieldWeight in 980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.11192 = idf(docFreq=97, maxDocs=44218)
                0.0546875 = fieldNorm(doc=980)
          0.047239378 = weight(abstract_txt:subject in 980) [ClassicSimilarity], result of:
            0.047239378 = score(doc=980,freq=4.0), product of:
              0.11054538 = queryWeight, product of:
                1.2004049 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.023570422 = queryNorm
              0.42733017 = fieldWeight in 980, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.0546875 = fieldNorm(doc=980)
          0.02996469 = weight(abstract_txt:bibliographic in 980) [ClassicSimilarity], result of:
            0.02996469 = score(doc=980,freq=1.0), product of:
              0.12954809 = queryWeight, product of:
                1.29949 = boost
                4.229516 = idf(docFreq=1749, maxDocs=44218)
                0.023570422 = queryNorm
              0.23130167 = fieldWeight in 980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.229516 = idf(docFreq=1749, maxDocs=44218)
                0.0546875 = fieldNorm(doc=980)
          0.09418764 = weight(abstract_txt:costs in 980) [ClassicSimilarity], result of:
            0.09418764 = score(doc=980,freq=1.0), product of:
              0.2779844 = queryWeight, product of:
                1.9035648 = boost
                6.195629 = idf(docFreq=244, maxDocs=44218)
                0.023570422 = queryNorm
              0.33882347 = fieldWeight in 980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.195629 = idf(docFreq=244, maxDocs=44218)
                0.0546875 = fieldNorm(doc=980)
          0.06517963 = weight(abstract_txt:indexing in 980) [ClassicSimilarity], result of:
            0.06517963 = score(doc=980,freq=1.0), product of:
              0.2740159 = queryWeight, product of:
                2.6727622 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.023570422 = queryNorm
              0.23786807 = fieldWeight in 980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.0546875 = fieldNorm(doc=980)
        0.32 = coord(8/25)
    
  4. Golub, K.; Hansson, J.; Soergel, D.; Tudhope, D.: Managing classification in libraries : a methodological outline for evaluating automatic subject indexing and classification in Swedish library catalogues (2015) 0.10
    0.10387596 = sum of:
      0.10387596 = product of:
        0.37098557 = sum of:
          0.016809069 = weight(abstract_txt:data in 2300) [ClassicSimilarity], result of:
            0.016809069 = score(doc=2300,freq=1.0), product of:
              0.080610625 = queryWeight, product of:
                1.0250702 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.023570422 = queryNorm
              0.20852174 = fieldWeight in 2300, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=2300)
          0.031216776 = weight(abstract_txt:analysis in 2300) [ClassicSimilarity], result of:
            0.031216776 = score(doc=2300,freq=2.0), product of:
              0.09666707 = queryWeight, product of:
                1.1225269 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.023570422 = queryNorm
              0.3229308 = fieldWeight in 2300, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0625 = fieldNorm(doc=2300)
          0.060360264 = weight(abstract_txt:subject in 2300) [ClassicSimilarity], result of:
            0.060360264 = score(doc=2300,freq=5.0), product of:
              0.11054538 = queryWeight, product of:
                1.2004049 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.023570422 = queryNorm
              0.5460225 = fieldWeight in 2300, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.0625 = fieldNorm(doc=2300)
          0.048430245 = weight(abstract_txt:bibliographic in 2300) [ClassicSimilarity], result of:
            0.048430245 = score(doc=2300,freq=2.0), product of:
              0.12954809 = queryWeight, product of:
                1.29949 = boost
                4.229516 = idf(docFreq=1749, maxDocs=44218)
                0.023570422 = queryNorm
              0.3738399 = fieldWeight in 2300, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.229516 = idf(docFreq=1749, maxDocs=44218)
                0.0625 = fieldNorm(doc=2300)
          0.049904093 = weight(abstract_txt:applied in 2300) [ClassicSimilarity], result of:
            0.049904093 = score(doc=2300,freq=1.0), product of:
              0.16651523 = queryWeight, product of:
                1.4732771 = boost
                4.79515 = idf(docFreq=993, maxDocs=44218)
                0.023570422 = queryNorm
              0.29969686 = fieldWeight in 2300, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.79515 = idf(docFreq=993, maxDocs=44218)
                0.0625 = fieldNorm(doc=2300)
          0.089774124 = weight(abstract_txt:automatic in 2300) [ClassicSimilarity], result of:
            0.089774124 = score(doc=2300,freq=2.0), product of:
              0.19548826 = queryWeight, product of:
                1.596312 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.023570422 = queryNorm
              0.45923027 = fieldWeight in 2300, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0625 = fieldNorm(doc=2300)
          0.07449101 = weight(abstract_txt:indexing in 2300) [ClassicSimilarity], result of:
            0.07449101 = score(doc=2300,freq=1.0), product of:
              0.2740159 = queryWeight, product of:
                2.6727622 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.023570422 = queryNorm
              0.27184922 = fieldWeight in 2300, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.0625 = fieldNorm(doc=2300)
        0.28 = coord(7/25)
    
  5. Das, A.; Jain, A.: Indexing the World Wide Web : the journey so far (2012) 0.10
    0.09953108 = sum of:
      0.09953108 = product of:
        0.4976554 = sum of:
          0.0363927 = weight(abstract_txt:data in 95) [ClassicSimilarity], result of:
            0.0363927 = score(doc=95,freq=3.0), product of:
              0.080610625 = queryWeight, product of:
                1.0250702 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.023570422 = queryNorm
              0.4514628 = fieldWeight in 95, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.078125 = fieldNorm(doc=95)
          0.011289829 = weight(abstract_txt:that in 95) [ClassicSimilarity], result of:
            0.011289829 = score(doc=95,freq=1.0), product of:
              0.06098811 = queryWeight, product of:
                1.0920076 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.023570422 = queryNorm
              0.18511525 = fieldWeight in 95, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=95)
          0.10545769 = weight(abstract_txt:trade in 95) [ClassicSimilarity], result of:
            0.10545769 = score(doc=95,freq=1.0), product of:
              0.18755646 = queryWeight, product of:
                1.1056266 = boost
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.023570422 = queryNorm
              0.5622717 = fieldWeight in 95, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.078125 = fieldNorm(doc=95)
          0.1832374 = weight(abstract_txt:offs in 95) [ClassicSimilarity], result of:
            0.1832374 = score(doc=95,freq=1.0), product of:
              0.27107486 = queryWeight, product of:
                1.32919 = boost
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.023570422 = queryNorm
              0.675966 = fieldWeight in 95, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.078125 = fieldNorm(doc=95)
          0.16127776 = weight(abstract_txt:indexing in 95) [ClassicSimilarity], result of:
            0.16127776 = score(doc=95,freq=3.0), product of:
              0.2740159 = queryWeight, product of:
                2.6727622 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.023570422 = queryNorm
              0.5885708 = fieldWeight in 95, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.078125 = fieldNorm(doc=95)
        0.2 = coord(5/25)
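
The content-based trees add two factors to the single-term case: a per-term boost and a coordination factor, coord(matched/total), that scales the summed term scores. The sketch below recomputes the last hit (doc=95) under the same assumed ClassicSimilarity formulas. The helper name and the tuple layout are illustrative only; all numbers are copied from the listing.

```python
import math


def classic_term_score(freq, doc_freq, max_docs, field_norm, query_norm, boost):
    """queryWeight * fieldWeight for one term, as shown in the explain tree."""
    idf = 1.0 + math.log(max_docs / (doc_freq + 1))
    query_weight = boost * idf * query_norm
    field_weight = math.sqrt(freq) * idf * field_norm
    return query_weight * field_weight


MAX_DOCS = 44218
QUERY_NORM = 0.023570422   # queryNorm reported for the abstract_txt query
FIELD_NORM = 0.078125      # fieldNorm(doc=95)
QUERY_TERMS = 25           # denominator of coord(5/25)

# (term, freq, docFreq, boost) for the five terms matched in doc 95
matched_terms = [
    ("data", 3.0, 4274, 1.0250702),
    ("that", 1.0, 11241, 1.0920076),
    ("trade", 1.0, 89, 1.1056266),
    ("offs", 1.0, 20, 1.32919),
    ("indexing", 3.0, 1551, 2.6727622),
]

raw_sum = sum(
    classic_term_score(freq, doc_freq, MAX_DOCS, FIELD_NORM, QUERY_NORM, boost)
    for _term, freq, doc_freq, boost in matched_terms
)
coord = len(matched_terms) / QUERY_TERMS
final_score = raw_sum * coord
print(f"sum={raw_sum:.6f} coord={coord:.2f} score={final_score:.6f}")
```

The coord factor explains why a hit matching more of the 25 query terms can outrank one with a larger raw sum: the third hit has the smallest raw sum of the top three but the highest coord (8/25).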