Document (#21569)

Author
Jianchao, X.
Ming, H.
Milin, S.
Title
On indexing descriptors for document archive
Source
Journal of the China Society for Scientific and Technical Information. 17(1998) no.4, S.263-265
Year
1998
Abstract
Describes a method of indexing the descriptors of the full text of document archives. Explains how the method organizes the thesaurus of descriptors, and mixes both keyword and index terms from the thesaurus. Presents a procedure for weighting descriptors and discusses the technical issues involved
Footnote
[In Chinesisch]

Similar documents (content)

  1. Ferber, R.: Automated indexing with thesaurus descriptors : a co-occurence based approach to multilingual retrieval (1997) 0.34
    0.3383961 = sum of:
      0.3383961 = product of:
        1.1602153 = sum of:
          0.00814804 = weight(abstract_txt:from in 5558) [ClassicSimilarity], result of:
            0.00814804 = score(doc=5558,freq=2.0), product of:
              0.03285982 = queryWeight, product of:
                2.805388 = idf(docFreq=6898, maxDocs=41962)
                0.011713111 = queryNorm
              0.2479636 = fieldWeight in 5558, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.805388 = idf(docFreq=6898, maxDocs=41962)
                0.0625 = fieldNorm(doc=5558)
          0.024720693 = weight(abstract_txt:terms in 5558) [ClassicSimilarity], result of:
            0.024720693 = score(doc=5558,freq=2.0), product of:
              0.06886579 = queryWeight, product of:
                1.4476687 = boost
                4.061272 = idf(docFreq=1964, maxDocs=41962)
                0.011713111 = queryNorm
              0.35896912 = fieldWeight in 5558, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.061272 = idf(docFreq=1964, maxDocs=41962)
                0.0625 = fieldNorm(doc=5558)
          0.08878249 = weight(abstract_txt:weighting in 5558) [ClassicSimilarity], result of:
            0.08878249 = score(doc=5558,freq=1.0), product of:
              0.20348136 = queryWeight, product of:
                2.4884546 = boost
                6.9810805 = idf(docFreq=105, maxDocs=41962)
                0.011713111 = queryNorm
              0.43631753 = fieldWeight in 5558, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9810805 = idf(docFreq=105, maxDocs=41962)
                0.0625 = fieldNorm(doc=5558)
          0.057917926 = weight(abstract_txt:document in 5558) [ClassicSimilarity], result of:
            0.057917926 = score(doc=5558,freq=2.0), product of:
              0.15305533 = queryWeight, product of:
                3.0521553 = boost
                4.28124 = idf(docFreq=1576, maxDocs=41962)
                0.011713111 = queryNorm
              0.3784117 = fieldWeight in 5558, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.28124 = idf(docFreq=1576, maxDocs=41962)
                0.0625 = fieldNorm(doc=5558)
          0.0849722 = weight(abstract_txt:indexing in 5558) [ClassicSimilarity], result of:
            0.0849722 = score(doc=5558,freq=4.0), product of:
              0.15684873 = queryWeight, product of:
                3.089747 = boost
                4.3339696 = idf(docFreq=1495, maxDocs=41962)
                0.011713111 = queryNorm
              0.5417462 = fieldWeight in 5558, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.3339696 = idf(docFreq=1495, maxDocs=41962)
                0.0625 = fieldNorm(doc=5558)
          0.142387 = weight(abstract_txt:thesaurus in 5558) [ClassicSimilarity], result of:
            0.142387 = score(doc=5558,freq=4.0), product of:
              0.22128059 = queryWeight, product of:
                3.6698985 = boost
                5.1477447 = idf(docFreq=662, maxDocs=41962)
                0.011713111 = queryNorm
              0.6434681 = fieldWeight in 5558, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.1477447 = idf(docFreq=662, maxDocs=41962)
                0.0625 = fieldNorm(doc=5558)
          0.7532869 = weight(abstract_txt:descriptors in 5558) [ClassicSimilarity], result of:
            0.7532869 = score(doc=5558,freq=6.0), product of:
              0.73946273 = queryWeight, product of:
                9.487582 = boost
                6.654087 = idf(docFreq=146, maxDocs=41962)
                0.011713111 = queryNorm
              1.0186949 = fieldWeight in 5558, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.654087 = idf(docFreq=146, maxDocs=41962)
                0.0625 = fieldNorm(doc=5558)
        0.29166666 = coord(7/24)
    
  2. Loosjes, T.P.; Tichelaar, P.A.; Goossens, J.; Stuurman, P.: Ontsluiting op onderwerp (1977) 0.34
    0.3361016 = sum of:
      0.3361016 = product of:
        1.0083047 = sum of:
          0.0072019175 = weight(abstract_txt:from in 979) [ClassicSimilarity], result of:
            0.0072019175 = score(doc=979,freq=1.0), product of:
              0.03285982 = queryWeight, product of:
                2.805388 = idf(docFreq=6898, maxDocs=41962)
                0.011713111 = queryNorm
              0.21917093 = fieldWeight in 979, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.805388 = idf(docFreq=6898, maxDocs=41962)
                0.078125 = fieldNorm(doc=979)
          0.021760235 = weight(abstract_txt:text in 979) [ClassicSimilarity], result of:
            0.021760235 = score(doc=979,freq=1.0), product of:
              0.0686766 = queryWeight, product of:
                1.4456787 = boost
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.011713111 = queryNorm
              0.31685078 = fieldWeight in 979, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.078125 = fieldNorm(doc=979)
          0.0345793 = weight(abstract_txt:index in 979) [ClassicSimilarity], result of:
            0.0345793 = score(doc=979,freq=1.0), product of:
              0.0935213 = queryWeight, product of:
                1.6870295 = boost
                4.7327724 = idf(docFreq=1003, maxDocs=41962)
                0.011713111 = queryNorm
              0.36974785 = fieldWeight in 979, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7327724 = idf(docFreq=1003, maxDocs=41962)
                0.078125 = fieldNorm(doc=979)
          0.03872213 = weight(abstract_txt:full in 979) [ClassicSimilarity], result of:
            0.03872213 = score(doc=979,freq=1.0), product of:
              0.10084923 = queryWeight, product of:
                1.7518774 = boost
                4.9146957 = idf(docFreq=836, maxDocs=41962)
                0.011713111 = queryNorm
              0.3839606 = fieldWeight in 979, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9146957 = idf(docFreq=836, maxDocs=41962)
                0.078125 = fieldNorm(doc=979)
          0.053107627 = weight(abstract_txt:indexing in 979) [ClassicSimilarity], result of:
            0.053107627 = score(doc=979,freq=1.0), product of:
              0.15684873 = queryWeight, product of:
                3.089747 = boost
                4.3339696 = idf(docFreq=1495, maxDocs=41962)
                0.011713111 = queryNorm
              0.33859137 = fieldWeight in 979, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3339696 = idf(docFreq=1495, maxDocs=41962)
                0.078125 = fieldNorm(doc=979)
          0.06126215 = weight(abstract_txt:method in 979) [ClassicSimilarity], result of:
            0.06126215 = score(doc=979,freq=1.0), product of:
              0.17251939 = queryWeight, product of:
                3.2404203 = boost
                4.545318 = idf(docFreq=1210, maxDocs=41962)
                0.011713111 = queryNorm
              0.355103 = fieldWeight in 979, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.545318 = idf(docFreq=1210, maxDocs=41962)
                0.078125 = fieldNorm(doc=979)
          0.12585351 = weight(abstract_txt:thesaurus in 979) [ClassicSimilarity], result of:
            0.12585351 = score(doc=979,freq=2.0), product of:
              0.22128059 = queryWeight, product of:
                3.6698985 = boost
                5.1477447 = idf(docFreq=662, maxDocs=41962)
                0.011713111 = queryNorm
              0.5687508 = fieldWeight in 979, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1477447 = idf(docFreq=662, maxDocs=41962)
                0.078125 = fieldNorm(doc=979)
          0.66581786 = weight(abstract_txt:descriptors in 979) [ClassicSimilarity], result of:
            0.66581786 = score(doc=979,freq=3.0), product of:
              0.73946273 = queryWeight, product of:
                9.487582 = boost
                6.654087 = idf(docFreq=146, maxDocs=41962)
                0.011713111 = queryNorm
              0.90040755 = fieldWeight in 979, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.654087 = idf(docFreq=146, maxDocs=41962)
                0.078125 = fieldNorm(doc=979)
        0.33333334 = coord(8/24)
    
  3. Lu, K.; Mao, J.; Li, G.: Toward effective automated weighted subject indexing : a comparison of different approaches in different environments (2018) 0.30
    0.29915953 = sum of:
      0.29915953 = product of:
        0.8974786 = sum of:
          0.005761534 = weight(abstract_txt:from in 857) [ClassicSimilarity], result of:
            0.005761534 = score(doc=857,freq=1.0), product of:
              0.03285982 = queryWeight, product of:
                2.805388 = idf(docFreq=6898, maxDocs=41962)
                0.011713111 = queryNorm
              0.17533675 = fieldWeight in 857, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.805388 = idf(docFreq=6898, maxDocs=41962)
                0.0625 = fieldNorm(doc=857)
          0.024618892 = weight(abstract_txt:text in 857) [ClassicSimilarity], result of:
            0.024618892 = score(doc=857,freq=2.0), product of:
              0.0686766 = queryWeight, product of:
                1.4456787 = boost
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.011713111 = queryNorm
              0.35847571 = fieldWeight in 857, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.0625 = fieldNorm(doc=857)
          0.043809094 = weight(abstract_txt:full in 857) [ClassicSimilarity], result of:
            0.043809094 = score(doc=857,freq=2.0), product of:
              0.10084923 = queryWeight, product of:
                1.7518774 = boost
                4.9146957 = idf(docFreq=836, maxDocs=41962)
                0.011713111 = queryNorm
              0.43440184 = fieldWeight in 857, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.9146957 = idf(docFreq=836, maxDocs=41962)
                0.0625 = fieldNorm(doc=857)
          0.17756498 = weight(abstract_txt:weighting in 857) [ClassicSimilarity], result of:
            0.17756498 = score(doc=857,freq=4.0), product of:
              0.20348136 = queryWeight, product of:
                2.4884546 = boost
                6.9810805 = idf(docFreq=105, maxDocs=41962)
                0.011713111 = queryNorm
              0.87263507 = fieldWeight in 857, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.9810805 = idf(docFreq=105, maxDocs=41962)
                0.0625 = fieldNorm(doc=857)
          0.04095416 = weight(abstract_txt:document in 857) [ClassicSimilarity], result of:
            0.04095416 = score(doc=857,freq=1.0), product of:
              0.15305533 = queryWeight, product of:
                3.0521553 = boost
                4.28124 = idf(docFreq=1576, maxDocs=41962)
                0.011713111 = queryNorm
              0.2675775 = fieldWeight in 857, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.28124 = idf(docFreq=1576, maxDocs=41962)
                0.0625 = fieldNorm(doc=857)
          0.0849722 = weight(abstract_txt:indexing in 857) [ClassicSimilarity], result of:
            0.0849722 = score(doc=857,freq=4.0), product of:
              0.15684873 = queryWeight, product of:
                3.089747 = boost
                4.3339696 = idf(docFreq=1495, maxDocs=41962)
                0.011713111 = queryNorm
              0.5417462 = fieldWeight in 857, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.3339696 = idf(docFreq=1495, maxDocs=41962)
                0.0625 = fieldNorm(doc=857)
          0.08488732 = weight(abstract_txt:method in 857) [ClassicSimilarity], result of:
            0.08488732 = score(doc=857,freq=3.0), product of:
              0.17251939 = queryWeight, product of:
                3.2404203 = boost
                4.545318 = idf(docFreq=1210, maxDocs=41962)
                0.011713111 = queryNorm
              0.4920451 = fieldWeight in 857, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.545318 = idf(docFreq=1210, maxDocs=41962)
                0.0625 = fieldNorm(doc=857)
          0.4349104 = weight(abstract_txt:descriptors in 857) [ClassicSimilarity], result of:
            0.4349104 = score(doc=857,freq=2.0), product of:
              0.73946273 = queryWeight, product of:
                9.487582 = boost
                6.654087 = idf(docFreq=146, maxDocs=41962)
                0.011713111 = queryNorm
              0.58814377 = fieldWeight in 857, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.654087 = idf(docFreq=146, maxDocs=41962)
                0.0625 = fieldNorm(doc=857)
        0.33333334 = coord(8/24)
    
  4. Fagan, J.L.: ¬The effectiveness of a nonsyntactic approach to automatic phrase indexing for document retrieval (1989) 0.30
    0.2978799 = sum of:
      0.2978799 = product of:
        1.0213026 = sum of:
          0.005761534 = weight(abstract_txt:from in 3846) [ClassicSimilarity], result of:
            0.005761534 = score(doc=3846,freq=1.0), product of:
              0.03285982 = queryWeight, product of:
                2.805388 = idf(docFreq=6898, maxDocs=41962)
                0.011713111 = queryNorm
              0.17533675 = fieldWeight in 3846, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.805388 = idf(docFreq=6898, maxDocs=41962)
                0.0625 = fieldNorm(doc=3846)
          0.024618892 = weight(abstract_txt:text in 3846) [ClassicSimilarity], result of:
            0.024618892 = score(doc=3846,freq=2.0), product of:
              0.0686766 = queryWeight, product of:
                1.4456787 = boost
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.011713111 = queryNorm
              0.35847571 = fieldWeight in 3846, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.0625 = fieldNorm(doc=3846)
          0.13000733 = weight(abstract_txt:procedure in 3846) [ClassicSimilarity], result of:
            0.13000733 = score(doc=3846,freq=3.0), product of:
              0.1819329 = queryWeight, product of:
                2.353006 = boost
                6.6010947 = idf(docFreq=154, maxDocs=41962)
                0.011713111 = queryNorm
              0.7145895 = fieldWeight in 3846, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.6010947 = idf(docFreq=154, maxDocs=41962)
                0.0625 = fieldNorm(doc=3846)
          0.09157629 = weight(abstract_txt:document in 3846) [ClassicSimilarity], result of:
            0.09157629 = score(doc=3846,freq=5.0), product of:
              0.15305533 = queryWeight, product of:
                3.0521553 = boost
                4.28124 = idf(docFreq=1576, maxDocs=41962)
                0.011713111 = queryNorm
              0.5983215 = fieldWeight in 3846, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.28124 = idf(docFreq=1576, maxDocs=41962)
                0.0625 = fieldNorm(doc=3846)
          0.0849722 = weight(abstract_txt:indexing in 3846) [ClassicSimilarity], result of:
            0.0849722 = score(doc=3846,freq=4.0), product of:
              0.15684873 = queryWeight, product of:
                3.089747 = boost
                4.3339696 = idf(docFreq=1495, maxDocs=41962)
                0.011713111 = queryNorm
              0.5417462 = fieldWeight in 3846, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.3339696 = idf(docFreq=1495, maxDocs=41962)
                0.0625 = fieldNorm(doc=3846)
          0.06931021 = weight(abstract_txt:method in 3846) [ClassicSimilarity], result of:
            0.06931021 = score(doc=3846,freq=2.0), product of:
              0.17251939 = queryWeight, product of:
                3.2404203 = boost
                4.545318 = idf(docFreq=1210, maxDocs=41962)
                0.011713111 = queryNorm
              0.40175316 = fieldWeight in 3846, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.545318 = idf(docFreq=1210, maxDocs=41962)
                0.0625 = fieldNorm(doc=3846)
          0.61505616 = weight(abstract_txt:descriptors in 3846) [ClassicSimilarity], result of:
            0.61505616 = score(doc=3846,freq=4.0), product of:
              0.73946273 = queryWeight, product of:
                9.487582 = boost
                6.654087 = idf(docFreq=146, maxDocs=41962)
                0.011713111 = queryNorm
              0.8317609 = fieldWeight in 3846, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.654087 = idf(docFreq=146, maxDocs=41962)
                0.0625 = fieldNorm(doc=3846)
        0.29166666 = coord(7/24)
    
  5. Gopinath, M.A.: Descriptors and their role in information retrieval (1993) 0.26
    0.2579883 = sum of:
      0.2579883 = product of:
        1.2383438 = sum of:
          0.011523068 = weight(abstract_txt:from in 7802) [ClassicSimilarity], result of:
            0.011523068 = score(doc=7802,freq=1.0), product of:
              0.03285982 = queryWeight, product of:
                2.805388 = idf(docFreq=6898, maxDocs=41962)
                0.011713111 = queryNorm
              0.3506735 = fieldWeight in 7802, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.805388 = idf(docFreq=6898, maxDocs=41962)
                0.125 = fieldNorm(doc=7802)
          0.031653296 = weight(abstract_txt:discusses in 7802) [ClassicSimilarity], result of:
            0.031653296 = score(doc=7802,freq=1.0), product of:
              0.064451404 = queryWeight, product of:
                1.4005016 = boost
                3.9289503 = idf(docFreq=2242, maxDocs=41962)
                0.011713111 = queryNorm
              0.4911188 = fieldWeight in 7802, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9289503 = idf(docFreq=2242, maxDocs=41962)
                0.125 = fieldNorm(doc=7802)
          0.034816373 = weight(abstract_txt:text in 7802) [ClassicSimilarity], result of:
            0.034816373 = score(doc=7802,freq=1.0), product of:
              0.0686766 = queryWeight, product of:
                1.4456787 = boost
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.011713111 = queryNorm
              0.5069612 = fieldWeight in 7802, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.125 = fieldNorm(doc=7802)
          0.09504248 = weight(abstract_txt:explains in 7802) [ClassicSimilarity], result of:
            0.09504248 = score(doc=7802,freq=1.0), product of:
              0.13414206 = queryWeight, product of:
                2.0204582 = boost
                5.668169 = idf(docFreq=393, maxDocs=41962)
                0.011713111 = queryNorm
              0.7085211 = fieldWeight in 7802, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.668169 = idf(docFreq=393, maxDocs=41962)
                0.125 = fieldNorm(doc=7802)
          1.0653086 = weight(abstract_txt:descriptors in 7802) [ClassicSimilarity], result of:
            1.0653086 = score(doc=7802,freq=3.0), product of:
              0.73946273 = queryWeight, product of:
                9.487582 = boost
                6.654087 = idf(docFreq=146, maxDocs=41962)
                0.011713111 = queryNorm
              1.4406521 = fieldWeight in 7802, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.654087 = idf(docFreq=146, maxDocs=41962)
                0.125 = fieldNorm(doc=7802)
        0.20833333 = coord(5/24)