Document (#2312)

Author
Milstead, J.L.
Title
Methodologies for subject analysis in bibliographic databases
Source
Information processing and management. 28(1992) no.3, S.407-431
Year
1992
Abstract
The goal of the study was to determine the state of the art of subject analysis as applied to large bibliographic data bases. The intent was to gather and evaluate information, casting it in a form that could be applied by management. There was no attempt to determine actual costs or trade-offs among costs and possible benefits. Commercial automatic indexing packages were also reviewed. The overall conclusion was that data base producers should begin working seriously on upgrading their thesauri and codifying their indexing policies as a means of moving toward development of machine aids to indexing, but that fully automatic indexing is not yet ready for wholesale implementation
Theme
Automatisches Indexieren

Similar documents (author)

  1. Milstead, J.L.: Database design : Indexing applications (1989) 5.38
    5.380101 = sum of:
      5.380101 = weight(author_txt:milstead in 867) [ClassicSimilarity], result of:
        5.380101 = fieldWeight in 867, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.608162 = idf(docFreq=20, maxDocs=42306)
          0.625 = fieldNorm(doc=867)
    
  2. Milstead, J.L.: Specifications for thesaurus software (1991) 5.38
    5.380101 = sum of:
      5.380101 = weight(author_txt:milstead in 2291) [ClassicSimilarity], result of:
        5.380101 = fieldWeight in 2291, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.608162 = idf(docFreq=20, maxDocs=42306)
          0.625 = fieldNorm(doc=2291)
    
  3. Milstead, J.L.: Natural versus inverted word order in subject headings (1980) 5.38
    5.380101 = sum of:
      5.380101 = weight(author_txt:milstead in 2867) [ClassicSimilarity], result of:
        5.380101 = fieldWeight in 2867, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.608162 = idf(docFreq=20, maxDocs=42306)
          0.625 = fieldNorm(doc=2867)
    
  4. Milstead, J.L.: Methodologies for subject analysis in bibliographic databases (1992) 5.38
    5.380101 = sum of:
      5.380101 = weight(author_txt:milstead in 3092) [ClassicSimilarity], result of:
        5.380101 = fieldWeight in 3092, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.608162 = idf(docFreq=20, maxDocs=42306)
          0.625 = fieldNorm(doc=3092)
    
  5. Milstead, J.L.: Thesaurus software packages (1990) 5.38
    5.380101 = sum of:
      5.380101 = weight(author_txt:milstead in 4868) [ClassicSimilarity], result of:
        5.380101 = fieldWeight in 4868, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.608162 = idf(docFreq=20, maxDocs=42306)
          0.625 = fieldNorm(doc=4868)
    

Similar documents (content)

  1. Keyser, P. de: Indexing : from thesauri to the Semantic Web (2012) 0.18
    0.18274722 = sum of:
      0.18274722 = product of:
        0.7614468 = sum of:
          0.08797424 = weight(abstract_txt:moving in 116) [ClassicSimilarity], result of:
            0.08797424 = score(doc=116,freq=1.0), product of:
              0.16970634 = queryWeight, product of:
                1.0365304 = boost
                6.6354046 = idf(docFreq=150, maxDocs=42306)
                0.024674516 = queryNorm
              0.518391 = fieldWeight in 116, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6354046 = idf(docFreq=150, maxDocs=42306)
                0.078125 = fieldNorm(doc=116)
          0.012564394 = weight(abstract_txt:that in 116) [ClassicSimilarity], result of:
            0.012564394 = score(doc=116,freq=1.0), product of:
              0.0668748 = queryWeight, product of:
                1.1270025 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.024674516 = queryNorm
              0.18787934 = fieldWeight in 116, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.078125 = fieldNorm(doc=116)
          0.03598475 = weight(abstract_txt:subject in 116) [ClassicSimilarity], result of:
            0.03598475 = score(doc=116,freq=1.0), product of:
              0.117819384 = queryWeight, product of:
                1.2213956 = boost
                3.9094145 = idf(docFreq=2305, maxDocs=42306)
                0.024674516 = queryNorm
              0.30542302 = fieldWeight in 116, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9094145 = idf(docFreq=2305, maxDocs=42306)
                0.078125 = fieldNorm(doc=116)
          0.068066485 = weight(abstract_txt:applied in 116) [ClassicSimilarity], result of:
            0.068066485 = score(doc=116,freq=1.0), product of:
              0.18020214 = queryWeight, product of:
                1.5105252 = boost
                4.8348536 = idf(docFreq=913, maxDocs=42306)
                0.024674516 = queryNorm
              0.37772295 = fieldWeight in 116, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8348536 = idf(docFreq=913, maxDocs=42306)
                0.078125 = fieldNorm(doc=116)
          0.18890975 = weight(abstract_txt:automatic in 116) [ClassicSimilarity], result of:
            0.18890975 = score(doc=116,freq=5.0), product of:
              0.20812167 = queryWeight, product of:
                1.6233294 = boost
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.024674516 = queryNorm
              0.907689 = fieldWeight in 116, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.078125 = fieldNorm(doc=116)
          0.36794716 = weight(abstract_txt:indexing in 116) [ClassicSimilarity], result of:
            0.36794716 = score(doc=116,freq=14.0), product of:
              0.29015407 = queryWeight, product of:
                2.7106743 = boost
                4.3381314 = idf(docFreq=1501, maxDocs=42306)
                0.024674516 = queryNorm
              1.2681096 = fieldWeight in 116, product of:
                3.7416575 = tf(freq=14.0), with freq of:
                  14.0 = termFreq=14.0
                4.3381314 = idf(docFreq=1501, maxDocs=42306)
                0.078125 = fieldNorm(doc=116)
        0.24 = coord(6/25)
    
  2. Milstead, J.L.: Methodologies for subject analysis in bibliographic databases (1992) 0.13
    0.13484149 = sum of:
      0.13484149 = product of:
        0.67420745 = sum of:
          0.121902436 = weight(abstract_txt:packages in 3092) [ClassicSimilarity], result of:
            0.121902436 = score(doc=3092,freq=1.0), product of:
              0.1867878 = queryWeight, product of:
                1.0874449 = boost
                6.961336 = idf(docFreq=108, maxDocs=42306)
                0.024674516 = queryNorm
              0.65262526 = fieldWeight in 3092, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.961336 = idf(docFreq=108, maxDocs=42306)
                0.09375 = fieldNorm(doc=3092)
          0.051773686 = weight(abstract_txt:analysis in 3092) [ClassicSimilarity], result of:
            0.051773686 = score(doc=3092,freq=2.0), product of:
              0.105539 = queryWeight, product of:
                1.1559911 = boost
                3.7000692 = idf(docFreq=2842, maxDocs=42306)
                0.024674516 = queryNorm
              0.4905645 = fieldWeight in 3092, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7000692 = idf(docFreq=2842, maxDocs=42306)
                0.09375 = fieldNorm(doc=3092)
          0.061068147 = weight(abstract_txt:subject in 3092) [ClassicSimilarity], result of:
            0.061068147 = score(doc=3092,freq=2.0), product of:
              0.117819384 = queryWeight, product of:
                1.2213956 = boost
                3.9094145 = idf(docFreq=2305, maxDocs=42306)
                0.024674516 = queryNorm
              0.51832 = fieldWeight in 3092, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9094145 = idf(docFreq=2305, maxDocs=42306)
                0.09375 = fieldNorm(doc=3092)
          0.17559463 = weight(abstract_txt:automatic in 3092) [ClassicSimilarity], result of:
            0.17559463 = score(doc=3092,freq=3.0), product of:
              0.20812167 = queryWeight, product of:
                1.6233294 = boost
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.024674516 = queryNorm
              0.8437114 = fieldWeight in 3092, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.09375 = fieldNorm(doc=3092)
          0.26386857 = weight(abstract_txt:indexing in 3092) [ClassicSimilarity], result of:
            0.26386857 = score(doc=3092,freq=5.0), product of:
              0.29015407 = queryWeight, product of:
                2.7106743 = boost
                4.3381314 = idf(docFreq=1501, maxDocs=42306)
                0.024674516 = queryNorm
              0.90940845 = fieldWeight in 3092, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.3381314 = idf(docFreq=1501, maxDocs=42306)
                0.09375 = fieldNorm(doc=3092)
        0.2 = coord(5/25)
    
  3. Poulter, A.: Filling in the blanks in RDA or remaining blank? : the strange case of FRSAD (2013) 0.13
    0.13481414 = sum of:
      0.13481414 = product of:
        0.4212942 = sum of:
          0.061581966 = weight(abstract_txt:moving in 2981) [ClassicSimilarity], result of:
            0.061581966 = score(doc=2981,freq=1.0), product of:
              0.16970634 = queryWeight, product of:
                1.0365304 = boost
                6.6354046 = idf(docFreq=150, maxDocs=42306)
                0.024674516 = queryNorm
              0.36287367 = fieldWeight in 2981, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6354046 = idf(docFreq=150, maxDocs=42306)
                0.0546875 = fieldNorm(doc=2981)
          0.023076802 = weight(abstract_txt:data in 2981) [ClassicSimilarity], result of:
            0.023076802 = score(doc=2981,freq=2.0), product of:
              0.08820899 = queryWeight, product of:
                1.0568283 = boost
                3.382671 = idf(docFreq=3904, maxDocs=42306)
                0.024674516 = queryNorm
              0.2616151 = fieldWeight in 2981, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.382671 = idf(docFreq=3904, maxDocs=42306)
                0.0546875 = fieldNorm(doc=2981)
          0.0771424 = weight(abstract_txt:begin in 2981) [ClassicSimilarity], result of:
            0.0771424 = score(doc=2981,freq=1.0), product of:
              0.19720799 = queryWeight, product of:
                1.1173655 = boost
                7.1528745 = idf(docFreq=89, maxDocs=42306)
                0.024674516 = queryNorm
              0.39117283 = fieldWeight in 2981, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1528745 = idf(docFreq=89, maxDocs=42306)
                0.0546875 = fieldNorm(doc=2981)
          0.008795075 = weight(abstract_txt:that in 2981) [ClassicSimilarity], result of:
            0.008795075 = score(doc=2981,freq=1.0), product of:
              0.0668748 = queryWeight, product of:
                1.1270025 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.024674516 = queryNorm
              0.13151553 = fieldWeight in 2981, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.0546875 = fieldNorm(doc=2981)
          0.05037865 = weight(abstract_txt:subject in 2981) [ClassicSimilarity], result of:
            0.05037865 = score(doc=2981,freq=4.0), product of:
              0.117819384 = queryWeight, product of:
                1.2213956 = boost
                3.9094145 = idf(docFreq=2305, maxDocs=42306)
                0.024674516 = queryNorm
              0.42759222 = fieldWeight in 2981, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.9094145 = idf(docFreq=2305, maxDocs=42306)
                0.0546875 = fieldNorm(doc=2981)
          0.031753745 = weight(abstract_txt:bibliographic in 2981) [ClassicSimilarity], result of:
            0.031753745 = score(doc=2981,freq=1.0), product of:
              0.13748933 = queryWeight, product of:
                1.3194183 = boost
                4.223163 = idf(docFreq=1684, maxDocs=42306)
                0.024674516 = queryNorm
              0.23095423 = fieldWeight in 2981, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.223163 = idf(docFreq=1684, maxDocs=42306)
                0.0546875 = fieldNorm(doc=2981)
          0.099728964 = weight(abstract_txt:costs in 2981) [ClassicSimilarity], result of:
            0.099728964 = score(doc=2981,freq=1.0), product of:
              0.29486275 = queryWeight, product of:
                1.9322262 = boost
                6.184624 = idf(docFreq=236, maxDocs=42306)
                0.024674516 = queryNorm
              0.33822164 = fieldWeight in 2981, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.184624 = idf(docFreq=236, maxDocs=42306)
                0.0546875 = fieldNorm(doc=2981)
          0.06883661 = weight(abstract_txt:indexing in 2981) [ClassicSimilarity], result of:
            0.06883661 = score(doc=2981,freq=1.0), product of:
              0.29015407 = queryWeight, product of:
                2.7106743 = boost
                4.3381314 = idf(docFreq=1501, maxDocs=42306)
                0.024674516 = queryNorm
              0.23724157 = fieldWeight in 2981, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3381314 = idf(docFreq=1501, maxDocs=42306)
                0.0546875 = fieldNorm(doc=2981)
        0.32 = coord(8/25)
    
  4. Hudon, M.: Conceptual compatibility in controlled language tools used to index and access the content of moving image collections (2004) 0.13
    0.12732534 = sum of:
      0.12732534 = product of:
        0.5305223 = sum of:
          0.023699034 = weight(abstract_txt:their in 3656) [ClassicSimilarity], result of:
            0.023699034 = score(doc=3656,freq=1.0), product of:
              0.078977615 = queryWeight, product of:
                3.2007766 = idf(docFreq=4683, maxDocs=42306)
                0.024674516 = queryNorm
              0.3000728 = fieldWeight in 3656, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2007766 = idf(docFreq=4683, maxDocs=42306)
                0.09375 = fieldNorm(doc=3656)
          0.14929722 = weight(abstract_txt:moving in 3656) [ClassicSimilarity], result of:
            0.14929722 = score(doc=3656,freq=2.0), product of:
              0.16970634 = queryWeight, product of:
                1.0365304 = boost
                6.6354046 = idf(docFreq=150, maxDocs=42306)
                0.024674516 = queryNorm
              0.8797386 = fieldWeight in 3656, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.6354046 = idf(docFreq=150, maxDocs=42306)
                0.09375 = fieldNorm(doc=3656)
          0.027973311 = weight(abstract_txt:data in 3656) [ClassicSimilarity], result of:
            0.027973311 = score(doc=3656,freq=1.0), product of:
              0.08820899 = queryWeight, product of:
                1.0568283 = boost
                3.382671 = idf(docFreq=3904, maxDocs=42306)
                0.024674516 = queryNorm
              0.3171254 = fieldWeight in 3656, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.382671 = idf(docFreq=3904, maxDocs=42306)
                0.09375 = fieldNorm(doc=3656)
          0.021322483 = weight(abstract_txt:that in 3656) [ClassicSimilarity], result of:
            0.021322483 = score(doc=3656,freq=2.0), product of:
              0.0668748 = queryWeight, product of:
                1.1270025 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.024674516 = queryNorm
              0.31884181 = fieldWeight in 3656, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.09375 = fieldNorm(doc=3656)
          0.1038385 = weight(abstract_txt:determine in 3656) [ClassicSimilarity], result of:
            0.1038385 = score(doc=3656,freq=1.0), product of:
              0.21147345 = queryWeight, product of:
                1.636349 = boost
                5.2375875 = idf(docFreq=610, maxDocs=42306)
                0.024674516 = queryNorm
              0.49102384 = fieldWeight in 3656, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2375875 = idf(docFreq=610, maxDocs=42306)
                0.09375 = fieldNorm(doc=3656)
          0.2043917 = weight(abstract_txt:indexing in 3656) [ClassicSimilarity], result of:
            0.2043917 = score(doc=3656,freq=3.0), product of:
              0.29015407 = queryWeight, product of:
                2.7106743 = boost
                4.3381314 = idf(docFreq=1501, maxDocs=42306)
                0.024674516 = queryNorm
              0.70442474 = fieldWeight in 3656, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.3381314 = idf(docFreq=1501, maxDocs=42306)
                0.09375 = fieldNorm(doc=3656)
        0.24 = coord(6/25)
    
  5. Jun, W.: ¬A knowledge network constructed by integrating classification, thesaurus and metadata in a digital library (2003) 0.12
    0.11513771 = sum of:
      0.11513771 = product of:
        0.35980535 = sum of:
          0.019550707 = weight(abstract_txt:their in 2255) [ClassicSimilarity], result of:
            0.019550707 = score(doc=2255,freq=2.0), product of:
              0.078977615 = queryWeight, product of:
                3.2007766 = idf(docFreq=4683, maxDocs=42306)
                0.024674516 = queryNorm
              0.24754745 = fieldWeight in 2255, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.2007766 = idf(docFreq=4683, maxDocs=42306)
                0.0546875 = fieldNorm(doc=2255)
          0.012438115 = weight(abstract_txt:that in 2255) [ClassicSimilarity], result of:
            0.012438115 = score(doc=2255,freq=2.0), product of:
              0.0668748 = queryWeight, product of:
                1.1270025 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.024674516 = queryNorm
              0.18599105 = fieldWeight in 2255, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.0546875 = fieldNorm(doc=2255)
          0.021355556 = weight(abstract_txt:analysis in 2255) [ClassicSimilarity], result of:
            0.021355556 = score(doc=2255,freq=1.0), product of:
              0.105539 = queryWeight, product of:
                1.1559911 = boost
                3.7000692 = idf(docFreq=2842, maxDocs=42306)
                0.024674516 = queryNorm
              0.20234753 = fieldWeight in 2255, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7000692 = idf(docFreq=2842, maxDocs=42306)
                0.0546875 = fieldNorm(doc=2255)
          0.025189325 = weight(abstract_txt:subject in 2255) [ClassicSimilarity], result of:
            0.025189325 = score(doc=2255,freq=1.0), product of:
              0.117819384 = queryWeight, product of:
                1.2213956 = boost
                3.9094145 = idf(docFreq=2305, maxDocs=42306)
                0.024674516 = queryNorm
              0.21379611 = fieldWeight in 2255, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9094145 = idf(docFreq=2305, maxDocs=42306)
                0.0546875 = fieldNorm(doc=2255)
          0.104521714 = weight(abstract_txt:seriously in 2255) [ClassicSimilarity], result of:
            0.104521714 = score(doc=2255,freq=1.0), product of:
              0.24147198 = queryWeight, product of:
                1.236421 = boost
                7.9150147 = idf(docFreq=41, maxDocs=42306)
                0.024674516 = queryNorm
              0.43285236 = fieldWeight in 2255, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9150147 = idf(docFreq=41, maxDocs=42306)
                0.0546875 = fieldNorm(doc=2255)
          0.031753745 = weight(abstract_txt:bibliographic in 2255) [ClassicSimilarity], result of:
            0.031753745 = score(doc=2255,freq=1.0), product of:
              0.13748933 = queryWeight, product of:
                1.3194183 = boost
                4.223163 = idf(docFreq=1684, maxDocs=42306)
                0.024674516 = queryNorm
              0.23095423 = fieldWeight in 2255, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.223163 = idf(docFreq=1684, maxDocs=42306)
                0.0546875 = fieldNorm(doc=2255)
          0.047646537 = weight(abstract_txt:applied in 2255) [ClassicSimilarity], result of:
            0.047646537 = score(doc=2255,freq=1.0), product of:
              0.18020214 = queryWeight, product of:
                1.5105252 = boost
                4.8348536 = idf(docFreq=913, maxDocs=42306)
                0.024674516 = queryNorm
              0.26440606 = fieldWeight in 2255, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8348536 = idf(docFreq=913, maxDocs=42306)
                0.0546875 = fieldNorm(doc=2255)
          0.09734966 = weight(abstract_txt:indexing in 2255) [ClassicSimilarity], result of:
            0.09734966 = score(doc=2255,freq=2.0), product of:
              0.29015407 = queryWeight, product of:
                2.7106743 = boost
                4.3381314 = idf(docFreq=1501, maxDocs=42306)
                0.024674516 = queryNorm
              0.33551022 = fieldWeight in 2255, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3381314 = idf(docFreq=1501, maxDocs=42306)
                0.0546875 = fieldNorm(doc=2255)
        0.32 = coord(8/25)