Document (#21243)

Author
Yongcheng, W.
Xiaoming, G.
Lixia, W.
Title
Automatic indexing on subject of Chinese text
Source
Journal of the China Society for Scientific and Technical Information. 17(1998) no.3, S.219-225.
Year
1998
Abstract
Outlines the underlying ideas, the basic algorithm and structure of CSAIS 2.1, an automatic indexing system for the subjects of Chinese documents, developed by the authors in 1993
Footnote
[In Chinesisch]
Theme
Automatisches Indexieren

Similar documents (content)

  1. Li, Z.: Research on dynamic morphological indexing (1998) 0.39
    0.38933688 = sum of:
      0.38933688 = product of:
        1.3237454 = sum of:
          0.061965965 = weight(abstract_txt:documents in 4243) [ClassicSimilarity], result of:
            0.061965965 = score(doc=4243,freq=1.0), product of:
              0.12044542 = queryWeight, product of:
                1.2231513 = boost
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.023925291 = queryNorm
              0.5144734 = fieldWeight in 4243, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.125 = fieldNorm(doc=4243)
          0.16707098 = weight(abstract_txt:algorithm in 4243) [ClassicSimilarity], result of:
            0.16707098 = score(doc=4243,freq=1.0), product of:
              0.2333219 = queryWeight, product of:
                1.7024046 = boost
                5.7284284 = idf(docFreq=373, maxDocs=42306)
                0.023925291 = queryNorm
              0.71605355 = fieldWeight in 4243, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7284284 = idf(docFreq=373, maxDocs=42306)
                0.125 = fieldNorm(doc=4243)
          0.2902436 = weight(abstract_txt:indexing in 4243) [ClassicSimilarity], result of:
            0.2902436 = score(doc=4243,freq=4.0), product of:
              0.26762083 = queryWeight, product of:
                2.5784576 = boost
                4.3381314 = idf(docFreq=1501, maxDocs=42306)
                0.023925291 = queryNorm
              1.0845329 = fieldWeight in 4243, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.3381314 = idf(docFreq=1501, maxDocs=42306)
                0.125 = fieldNorm(doc=4243)
          0.35263515 = weight(abstract_txt:automatic in 4243) [ClassicSimilarity], result of:
            0.35263515 = score(doc=4243,freq=2.0), product of:
              0.38391808 = queryWeight, product of:
                3.0882988 = boost
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.023925291 = queryNorm
              0.91851664 = fieldWeight in 4243, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.125 = fieldNorm(doc=4243)
          0.4518297 = weight(abstract_txt:chinese in 4243) [ClassicSimilarity], result of:
            0.4518297 = score(doc=4243,freq=1.0), product of:
              0.5706214 = queryWeight, product of:
                3.7650785 = boost
                6.334564 = idf(docFreq=203, maxDocs=42306)
                0.023925291 = queryNorm
              0.7918205 = fieldWeight in 4243, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.334564 = idf(docFreq=203, maxDocs=42306)
                0.125 = fieldNorm(doc=4243)
        0.29411766 = coord(5/17)
    
  2. Wan, T.-L.; Evens, M.; Wan, Y.-W.; Pao, Y.-Y.: Experiments with automatic indexing and a relational thesaurus in a Chinese information retrieval system (1997) 0.37
    0.37292013 = sum of:
      0.37292013 = product of:
        1.2679284 = sum of:
          0.04398803 = weight(abstract_txt:system in 957) [ClassicSimilarity], result of:
            0.04398803 = score(doc=957,freq=3.0), product of:
              0.08050631 = queryWeight, product of:
                3.3649042 = idf(docFreq=3974, maxDocs=42306)
                0.023925291 = queryNorm
              0.5463923 = fieldWeight in 957, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3649042 = idf(docFreq=3974, maxDocs=42306)
                0.09375 = fieldNorm(doc=957)
          0.04647447 = weight(abstract_txt:documents in 957) [ClassicSimilarity], result of:
            0.04647447 = score(doc=957,freq=1.0), product of:
              0.12044542 = queryWeight, product of:
                1.2231513 = boost
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.023925291 = queryNorm
              0.38585502 = fieldWeight in 957, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.09375 = fieldNorm(doc=957)
          0.26660576 = weight(abstract_txt:indexing in 957) [ClassicSimilarity], result of:
            0.26660576 = score(doc=957,freq=6.0), product of:
              0.26762083 = queryWeight, product of:
                2.5784576 = boost
                4.3381314 = idf(docFreq=1501, maxDocs=42306)
                0.023925291 = queryNorm
              0.9962071 = fieldWeight in 957, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.3381314 = idf(docFreq=1501, maxDocs=42306)
                0.09375 = fieldNorm(doc=957)
          0.32391605 = weight(abstract_txt:automatic in 957) [ClassicSimilarity], result of:
            0.32391605 = score(doc=957,freq=3.0), product of:
              0.38391808 = queryWeight, product of:
                3.0882988 = boost
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.023925291 = queryNorm
              0.8437114 = fieldWeight in 957, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.09375 = fieldNorm(doc=957)
          0.586944 = weight(abstract_txt:chinese in 957) [ClassicSimilarity], result of:
            0.586944 = score(doc=957,freq=3.0), product of:
              0.5706214 = queryWeight, product of:
                3.7650785 = boost
                6.334564 = idf(docFreq=203, maxDocs=42306)
                0.023925291 = queryNorm
              1.028605 = fieldWeight in 957, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.334564 = idf(docFreq=203, maxDocs=42306)
                0.09375 = fieldNorm(doc=957)
        0.29411766 = coord(5/17)
    
  3. Yang, C.C.; Li, K.W.: ¬A heuristic method based on a statistical approach for chinese text segmentation (2005) 0.37
    0.37097034 = sum of:
      0.37097034 = product of:
        1.0510826 = sum of:
          0.07820808 = weight(abstract_txt:text in 581) [ClassicSimilarity], result of:
            0.07820808 = score(doc=581,freq=7.0), product of:
              0.11672842 = queryWeight, product of:
                1.2041299 = boost
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.023925291 = queryNorm
              0.6700004 = fieldWeight in 581, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.0625 = fieldNorm(doc=581)
          0.033081394 = weight(abstract_txt:developed in 581) [ClassicSimilarity], result of:
            0.033081394 = score(doc=581,freq=1.0), product of:
              0.12582415 = queryWeight, product of:
                1.2501642 = boost
                4.2066827 = idf(docFreq=1712, maxDocs=42306)
                0.023925291 = queryNorm
              0.26291767 = fieldWeight in 581, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2066827 = idf(docFreq=1712, maxDocs=42306)
                0.0625 = fieldNorm(doc=581)
          0.06481232 = weight(abstract_txt:authors in 581) [ClassicSimilarity], result of:
            0.06481232 = score(doc=581,freq=2.0), product of:
              0.15636392 = queryWeight, product of:
                1.3936486 = boost
                4.689494 = idf(docFreq=1056, maxDocs=42306)
                0.023925291 = queryNorm
              0.41449663 = fieldWeight in 581, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.689494 = idf(docFreq=1056, maxDocs=42306)
                0.0625 = fieldNorm(doc=581)
          0.0725609 = weight(abstract_txt:indexing in 581) [ClassicSimilarity], result of:
            0.0725609 = score(doc=581,freq=1.0), product of:
              0.26762083 = queryWeight, product of:
                2.5784576 = boost
                4.3381314 = idf(docFreq=1501, maxDocs=42306)
                0.023925291 = queryNorm
              0.2711332 = fieldWeight in 581, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3381314 = idf(docFreq=1501, maxDocs=42306)
                0.0625 = fieldNorm(doc=581)
          0.12467535 = weight(abstract_txt:automatic in 581) [ClassicSimilarity], result of:
            0.12467535 = score(doc=581,freq=1.0), product of:
              0.38391808 = queryWeight, product of:
                3.0882988 = boost
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.023925291 = queryNorm
              0.32474467 = fieldWeight in 581, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.0625 = fieldNorm(doc=581)
          0.67774457 = weight(abstract_txt:chinese in 581) [ClassicSimilarity], result of:
            0.67774457 = score(doc=581,freq=9.0), product of:
              0.5706214 = queryWeight, product of:
                3.7650785 = boost
                6.334564 = idf(docFreq=203, maxDocs=42306)
                0.023925291 = queryNorm
              1.1877308 = fieldWeight in 581, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                6.334564 = idf(docFreq=203, maxDocs=42306)
                0.0625 = fieldNorm(doc=581)
        0.3529412 = coord(6/17)
    
  4. Wang, F.L.; Yang, C.C.: Mining Web data for Chinese segmentation (2007) 0.30
    0.29523247 = sum of:
      0.29523247 = product of:
        1.0037904 = sum of:
          0.029559879 = weight(abstract_txt:text in 2605) [ClassicSimilarity], result of:
            0.029559879 = score(doc=2605,freq=1.0), product of:
              0.11672842 = queryWeight, product of:
                1.2041299 = boost
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.023925291 = queryNorm
              0.25323635 = fieldWeight in 2605, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.0625 = fieldNorm(doc=2605)
          0.06928005 = weight(abstract_txt:documents in 2605) [ClassicSimilarity], result of:
            0.06928005 = score(doc=2605,freq=5.0), product of:
              0.12044542 = queryWeight, product of:
                1.2231513 = boost
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.023925291 = queryNorm
              0.5751987 = fieldWeight in 2605, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.0625 = fieldNorm(doc=2605)
          0.20461933 = weight(abstract_txt:algorithm in 2605) [ClassicSimilarity], result of:
            0.20461933 = score(doc=2605,freq=6.0), product of:
              0.2333219 = queryWeight, product of:
                1.7024046 = boost
                5.7284284 = idf(docFreq=373, maxDocs=42306)
                0.023925291 = queryNorm
              0.8769829 = fieldWeight in 2605, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.7284284 = idf(docFreq=373, maxDocs=42306)
                0.0625 = fieldNorm(doc=2605)
          0.1026166 = weight(abstract_txt:indexing in 2605) [ClassicSimilarity], result of:
            0.1026166 = score(doc=2605,freq=2.0), product of:
              0.26762083 = queryWeight, product of:
                2.5784576 = boost
                4.3381314 = idf(docFreq=1501, maxDocs=42306)
                0.023925291 = queryNorm
              0.38344026 = fieldWeight in 2605, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3381314 = idf(docFreq=1501, maxDocs=42306)
                0.0625 = fieldNorm(doc=2605)
          0.59771454 = weight(abstract_txt:chinese in 2605) [ClassicSimilarity], result of:
            0.59771454 = score(doc=2605,freq=7.0), product of:
              0.5706214 = queryWeight, product of:
                3.7650785 = boost
                6.334564 = idf(docFreq=203, maxDocs=42306)
                0.023925291 = queryNorm
              1.0474801 = fieldWeight in 2605, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.334564 = idf(docFreq=203, maxDocs=42306)
                0.0625 = fieldNorm(doc=2605)
        0.29411766 = coord(5/17)
    
  5. Shen, Z.: CJK: the unique need of Chinese, Japanese, and Korean language cataloging (1993) 0.29
    0.2859824 = sum of:
      0.2859824 = product of:
        0.9723401 = sum of:
          0.047888104 = weight(abstract_txt:system in 4727) [ClassicSimilarity], result of:
            0.047888104 = score(doc=4727,freq=2.0), product of:
              0.08050631 = queryWeight, product of:
                3.3649042 = idf(docFreq=3974, maxDocs=42306)
                0.023925291 = queryNorm
              0.59483665 = fieldWeight in 4727, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3649042 = idf(docFreq=3974, maxDocs=42306)
                0.125 = fieldNorm(doc=4727)
          0.06616279 = weight(abstract_txt:developed in 4727) [ClassicSimilarity], result of:
            0.06616279 = score(doc=4727,freq=1.0), product of:
              0.12582415 = queryWeight, product of:
                1.2501642 = boost
                4.2066827 = idf(docFreq=1712, maxDocs=42306)
                0.023925291 = queryNorm
              0.52583534 = fieldWeight in 4727, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2066827 = idf(docFreq=1712, maxDocs=42306)
                0.125 = fieldNorm(doc=4727)
          0.18900087 = weight(abstract_txt:outlines in 4727) [ClassicSimilarity], result of:
            0.18900087 = score(doc=4727,freq=2.0), product of:
              0.2010577 = queryWeight, product of:
                1.5803213 = boost
                5.31763 = idf(docFreq=563, maxDocs=42306)
                0.023925291 = queryNorm
              0.940033 = fieldWeight in 4727, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.31763 = idf(docFreq=563, maxDocs=42306)
                0.125 = fieldNorm(doc=4727)
          0.2174587 = weight(abstract_txt:1993 in 4727) [ClassicSimilarity], result of:
            0.2174587 = score(doc=4727,freq=1.0), product of:
              0.2781459 = queryWeight, product of:
                1.8587517 = boost
                6.2545214 = idf(docFreq=220, maxDocs=42306)
                0.023925291 = queryNorm
              0.7818152 = fieldWeight in 4727, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2545214 = idf(docFreq=220, maxDocs=42306)
                0.125 = fieldNorm(doc=4727)
          0.4518297 = weight(abstract_txt:chinese in 4727) [ClassicSimilarity], result of:
            0.4518297 = score(doc=4727,freq=1.0), product of:
              0.5706214 = queryWeight, product of:
                3.7650785 = boost
                6.334564 = idf(docFreq=203, maxDocs=42306)
                0.023925291 = queryNorm
              0.7918205 = fieldWeight in 4727, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.334564 = idf(docFreq=203, maxDocs=42306)
                0.125 = fieldNorm(doc=4727)
        0.29411766 = coord(5/17)