Document (#34712)

Author
Bose, I.
Chen, X.
Title
¬A method for extension of generative topographic mapping for fuzzy clustering
Source
Journal of the American Society for Information Science and Technology. 60(2009) no.2, p.363-371
Year
2009
Abstract
In this paper, a new method for fuzzy clustering is proposed that combines generative topographic mapping (GTM) and Fuzzy c-means (FCM) clustering. GTM is used to generate latent variables and their posterior probabilities. These two provide the distribution of the input data in the latent space. FCM determines the seeds of clusters, as well as the resultant clusters and the corresponding membership functions of the input data, based on the latent variables obtained from GTM. Experiments are conducted to compare the results obtained using FCM and the Gustafson-Kessel (GK) algorithm with the proposed method in terms of four cluster-validity indexes. Using simulated and benchmark data sets, it is observed that the hybrid method (GTMFCM) performs better than FCM and GK algorithms in terms of these indexes. It is also found that the superiority of GTMFCM over FCM and GK algorithms becomes more pronounced with the increase in the dimensionality of the input data set.
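
The clustering step described in the abstract can be illustrated with a minimal sketch of standard fuzzy c-means. This is not the authors' GTMFCM implementation: the GTM stage is omitted, and the hypothetical `latent` points below merely stand in for the latent-space representation that GTM would supply. The function name `fuzzy_c_means`, the fuzzifier `m=2.0`, and the stopping tolerance are all assumptions made for illustration.

```python
import math
import random


def fuzzy_c_means(points, c, m=2.0, n_iter=100, tol=1e-6, seed=0):
    """Standard fuzzy c-means on a list of equal-length coordinate tuples.

    Returns (centers, memberships); memberships[i][j] is the degree to
    which point i belongs to cluster j, and each row sums to 1.
    """
    rng = random.Random(seed)
    n, dim = len(points), len(points[0])
    # Random initial memberships, each row normalised to sum to 1.
    u = []
    for _ in range(n):
        row = [rng.random() + 1e-9 for _ in range(c)]
        total = sum(row)
        u.append([v / total for v in row])
    centers = []
    for _ in range(n_iter):
        # Centre update: mean of the points weighted by membership ** m.
        centers = []
        for j in range(c):
            w = [u[i][j] ** m for i in range(n)]
            w_sum = sum(w)
            centers.append(tuple(
                sum(w[i] * points[i][k] for i in range(n)) / w_sum
                for k in range(dim)
            ))
        # Membership update: normalised inverse distances to the centres.
        shift = 0.0
        for i, p in enumerate(points):
            d = [max(math.dist(p, ctr), 1e-12) for ctr in centers]
            inv = [dj ** (-2.0 / (m - 1.0)) for dj in d]
            inv_sum = sum(inv)
            new_row = [v / inv_sum for v in inv]
            shift = max(shift, max(abs(a - b) for a, b in zip(new_row, u[i])))
            u[i] = new_row
        if shift < tol:
            break
    return centers, u


# Hypothetical 2-D latent coordinates (stand-ins for GTM output).
latent = [(-1.0, -1.1), (-0.9, -1.0), (-1.1, -0.9),
          (1.0, 1.1), (0.9, 1.0), (1.1, 0.9)]
centers, memberships = fuzzy_c_means(latent, c=2)
```

Each row of `memberships` is a soft assignment; taking the index of the largest entry gives a hard cluster label if one is needed.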

Similar documents (author)

  1. Chen, Y.N.; Chen, S.J.: ¬A metadata practice of the IFLA FRBR model : a case study for the National Palace Museum in Taipei (2004) 4.35
    4.3499155 = sum of:
      4.3499155 = weight(author_txt:chen in 3384) [ClassicSimilarity], result of:
        4.3499155 = fieldWeight in 3384, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          6.1517096 = idf(docFreq=255, maxDocs=44218)
          0.5 = fieldNorm(doc=3384)
    
  2. Chen, C.C.; Chen, H.H.; Chen, K.H.: ¬The design of the XML/Metadata management system (2000) 4.00
    3.9956524 = sum of:
      3.9956524 = weight(author_txt:chen in 4633) [ClassicSimilarity], result of:
        3.9956524 = fieldWeight in 4633, product of:
          1.7320508 = tf(freq=3.0), with freq of:
            3.0 = termFreq=3.0
          6.1517096 = idf(docFreq=255, maxDocs=44218)
          0.375 = fieldNorm(doc=4633)
    
  3. Chen, W.Y.: Observations on cataloguing and classification (1991) 3.84
    3.8448186 = sum of:
      3.8448186 = weight(author_txt:chen in 4184) [ClassicSimilarity], result of:
        3.8448186 = fieldWeight in 4184, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.1517096 = idf(docFreq=255, maxDocs=44218)
          0.625 = fieldNorm(doc=4184)
    
  4. Chen, H.: Knowledge-based document retrieval : framework and design (1992) 3.84
    3.8448186 = sum of:
      3.8448186 = weight(author_txt:chen in 5283) [ClassicSimilarity], result of:
        3.8448186 = fieldWeight in 5283, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.1517096 = idf(docFreq=255, maxDocs=44218)
          0.625 = fieldNorm(doc=5283)
    
  5. Chen, P.S.: On inference rules of logic-based information retrieval systems (1994) 3.84
    3.8448186 = sum of:
      3.8448186 = weight(author_txt:chen in 6731) [ClassicSimilarity], result of:
        3.8448186 = fieldWeight in 6731, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.1517096 = idf(docFreq=255, maxDocs=44218)
          0.625 = fieldNorm(doc=6731)
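
The author-match scores above are each a single fieldWeight, the product of tf, idf, and fieldNorm. A minimal sketch of that arithmetic follows, assuming tf is the square root of the term frequency and idf is 1 + ln(maxDocs / (docFreq + 1)); these formulas are inferred from the numbers shown and reproduce them, and the function names are hypothetical. The fieldNorm is taken as given from the explanation rather than recomputed.

```python
import math


def classic_tf(freq):
    # tf(freq=2.0) = 1.4142135 in the listing, i.e. sqrt(freq).
    return math.sqrt(freq)


def classic_idf(doc_freq, max_docs):
    # idf(docFreq=255, maxDocs=44218) = 6.1517096 in the listing.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq, doc_freq, max_docs, field_norm):
    # fieldWeight = tf * idf * fieldNorm
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# First author hit (doc 3384): termFreq=2.0, fieldNorm=0.5.
weight_3384 = field_weight(freq=2.0, doc_freq=255, max_docs=44218, field_norm=0.5)
# Third author hit (doc 4184): termFreq=1.0, fieldNorm=0.625.
weight_4184 = field_weight(freq=1.0, doc_freq=255, max_docs=44218, field_norm=0.625)
```

Within rounding, `weight_3384` and `weight_4184` agree with the listed 4.3499155 and 3.8448186.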
    

Similar documents (content)

  1. Liu, Y.; Li, W.; Huang, Z.; Fang, Q.: ¬A fast method based on multiple clustering for name disambiguation in bibliographic citations (2015) 0.19
    0.19339111 = sum of:
      0.19339111 = product of:
        0.6043472 = sum of:
          0.0181545 = weight(abstract_txt:terms in 1672) [ClassicSimilarity], result of:
            0.0181545 = score(doc=1672,freq=1.0), product of:
              0.071830265 = queryWeight, product of:
                1.0614729 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.016734077 = queryNorm
              0.25274166 = fieldWeight in 1672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=1672)
          0.026884539 = weight(abstract_txt:proposed in 1672) [ClassicSimilarity], result of:
            0.026884539 = score(doc=1672,freq=1.0), product of:
              0.09332249 = queryWeight, product of:
                1.2098968 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.016734077 = queryNorm
              0.2880821 = fieldWeight in 1672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.0625 = fieldNorm(doc=1672)
          0.07406844 = weight(abstract_txt:obtained in 1672) [ClassicSimilarity], result of:
            0.07406844 = score(doc=1672,freq=2.0), product of:
              0.14556715 = queryWeight, product of:
                1.5110779 = boost
                5.756716 = idf(docFreq=379, maxDocs=44218)
                0.016734077 = queryNorm
              0.5088266 = fieldWeight in 1672, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.756716 = idf(docFreq=379, maxDocs=44218)
                0.0625 = fieldNorm(doc=1672)
          0.07592029 = weight(abstract_txt:clusters in 1672) [ClassicSimilarity], result of:
            0.07592029 = score(doc=1672,freq=1.0), product of:
              0.18644747 = queryWeight, product of:
                1.7101469 = boost
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.016734077 = queryNorm
              0.407194 = fieldWeight in 1672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.0625 = fieldNorm(doc=1672)
          0.020390967 = weight(abstract_txt:data in 1672) [ClassicSimilarity], result of:
            0.020390967 = score(doc=1672,freq=1.0), product of:
              0.09778821 = queryWeight, product of:
                1.7515131 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.016734077 = queryNorm
              0.20852174 = fieldWeight in 1672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=1672)
          0.05006525 = weight(abstract_txt:method in 1672) [ClassicSimilarity], result of:
            0.05006525 = score(doc=1672,freq=1.0), product of:
              0.17797221 = queryWeight, product of:
                2.362905 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.016734077 = queryNorm
              0.28130937 = fieldWeight in 1672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=1672)
          0.19783367 = weight(abstract_txt:clustering in 1672) [ClassicSimilarity], result of:
            0.19783367 = score(doc=1672,freq=4.0), product of:
              0.25460202 = queryWeight, product of:
                2.4475508 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.016734077 = queryNorm
              0.77703106 = fieldWeight in 1672, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.0625 = fieldNorm(doc=1672)
          0.14102961 = weight(abstract_txt:latent in 1672) [ClassicSimilarity], result of:
            0.14102961 = score(doc=1672,freq=1.0), product of:
              0.32251894 = queryWeight, product of:
                2.754726 = boost
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.016734077 = queryNorm
              0.43727544 = fieldWeight in 1672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.0625 = fieldNorm(doc=1672)
        0.32 = coord(8/25)
    
  2. Miyamoto, S.: Information clustering based on fuzzy multisets (2003) 0.17
    0.16545567 = sum of:
      0.16545567 = product of:
        0.8272783 = sum of:
          0.047525603 = weight(abstract_txt:proposed in 1071) [ClassicSimilarity], result of:
            0.047525603 = score(doc=1071,freq=2.0), product of:
              0.09332249 = queryWeight, product of:
                1.2098968 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.016734077 = queryNorm
              0.509262 = fieldWeight in 1071, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.078125 = fieldNorm(doc=1071)
          0.1105349 = weight(abstract_txt:algorithms in 1071) [ClassicSimilarity], result of:
            0.1105349 = score(doc=1071,freq=3.0), product of:
              0.14311016 = queryWeight, product of:
                1.4982711 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.016734077 = queryNorm
              0.77237636 = fieldWeight in 1071, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.078125 = fieldNorm(doc=1071)
          0.06258156 = weight(abstract_txt:method in 1071) [ClassicSimilarity], result of:
            0.06258156 = score(doc=1071,freq=1.0), product of:
              0.17797221 = queryWeight, product of:
                2.362905 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.016734077 = queryNorm
              0.3516367 = fieldWeight in 1071, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.078125 = fieldNorm(doc=1071)
          0.27648097 = weight(abstract_txt:clustering in 1071) [ClassicSimilarity], result of:
            0.27648097 = score(doc=1071,freq=5.0), product of:
              0.25460202 = queryWeight, product of:
                2.4475508 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.016734077 = queryNorm
              1.0859339 = fieldWeight in 1071, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.078125 = fieldNorm(doc=1071)
          0.33015528 = weight(abstract_txt:fuzzy in 1071) [ClassicSimilarity], result of:
            0.33015528 = score(doc=1071,freq=4.0), product of:
              0.30869803 = queryWeight, product of:
                2.6950555 = boost
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.016734077 = queryNorm
              1.0695089 = fieldWeight in 1071, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.078125 = fieldNorm(doc=1071)
        0.2 = coord(5/25)
    
  3. Polanco, X.; Francois, C.: Data clustering and cluster mapping or visualization in text processing and mining (2000) 0.15
    0.15368491 = sum of:
      0.15368491 = product of:
        0.6403538 = sum of:
          0.033605676 = weight(abstract_txt:proposed in 129) [ClassicSimilarity], result of:
            0.033605676 = score(doc=129,freq=1.0), product of:
              0.09332249 = queryWeight, product of:
                1.2098968 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.016734077 = queryNorm
              0.36010262 = fieldWeight in 129, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.078125 = fieldNorm(doc=129)
          0.114655666 = weight(abstract_txt:mapping in 129) [ClassicSimilarity], result of:
            0.114655666 = score(doc=129,freq=3.0), product of:
              0.1466452 = queryWeight, product of:
                1.5166631 = boost
                5.777993 = idf(docFreq=371, maxDocs=44218)
                0.016734077 = queryNorm
              0.7818576 = fieldWeight in 129, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.777993 = idf(docFreq=371, maxDocs=44218)
                0.078125 = fieldNorm(doc=129)
          0.16437225 = weight(abstract_txt:clusters in 129) [ClassicSimilarity], result of:
            0.16437225 = score(doc=129,freq=3.0), product of:
              0.18644747 = queryWeight, product of:
                1.7101469 = boost
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.016734077 = queryNorm
              0.88160086 = fieldWeight in 129, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.078125 = fieldNorm(doc=129)
          0.05097742 = weight(abstract_txt:data in 129) [ClassicSimilarity], result of:
            0.05097742 = score(doc=129,freq=4.0), product of:
              0.09778821 = queryWeight, product of:
                1.7515131 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.016734077 = queryNorm
              0.52130437 = fieldWeight in 129, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.078125 = fieldNorm(doc=129)
          0.06258156 = weight(abstract_txt:method in 129) [ClassicSimilarity], result of:
            0.06258156 = score(doc=129,freq=1.0), product of:
              0.17797221 = queryWeight, product of:
                2.362905 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.016734077 = queryNorm
              0.3516367 = fieldWeight in 129, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.078125 = fieldNorm(doc=129)
          0.21416123 = weight(abstract_txt:clustering in 129) [ClassicSimilarity], result of:
            0.21416123 = score(doc=129,freq=3.0), product of:
              0.25460202 = queryWeight, product of:
                2.4475508 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.016734077 = queryNorm
              0.8411608 = fieldWeight in 129, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.078125 = fieldNorm(doc=129)
        0.24 = coord(6/25)
    
  4. Chen, R.-S.; Hu, Y.-C.: ¬A novel method for discovering Fuzzy sequential patterns using the simple Fuzzy partition method (2003) 0.14
    0.13755117 = sum of:
      0.13755117 = product of:
        0.6877558 = sum of:
          0.026884539 = weight(abstract_txt:proposed in 1614) [ClassicSimilarity], result of:
            0.026884539 = score(doc=1614,freq=1.0), product of:
              0.09332249 = queryWeight, product of:
                1.2098968 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.016734077 = queryNorm
              0.2880821 = fieldWeight in 1614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.0625 = fieldNorm(doc=1614)
          0.052374292 = weight(abstract_txt:obtained in 1614) [ClassicSimilarity], result of:
            0.052374292 = score(doc=1614,freq=1.0), product of:
              0.14556715 = queryWeight, product of:
                1.5110779 = boost
                5.756716 = idf(docFreq=379, maxDocs=44218)
                0.016734077 = queryNorm
              0.35979474 = fieldWeight in 1614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.756716 = idf(docFreq=379, maxDocs=44218)
                0.0625 = fieldNorm(doc=1614)
          0.020390967 = weight(abstract_txt:data in 1614) [ClassicSimilarity], result of:
            0.020390967 = score(doc=1614,freq=1.0), product of:
              0.09778821 = queryWeight, product of:
                1.7515131 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.016734077 = queryNorm
              0.20852174 = fieldWeight in 1614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=1614)
          0.11194931 = weight(abstract_txt:method in 1614) [ClassicSimilarity], result of:
            0.11194931 = score(doc=1614,freq=5.0), product of:
              0.17797221 = queryWeight, product of:
                2.362905 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.016734077 = queryNorm
              0.6290269 = fieldWeight in 1614, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=1614)
          0.4761567 = weight(abstract_txt:fuzzy in 1614) [ClassicSimilarity], result of:
            0.4761567 = score(doc=1614,freq=13.0), product of:
              0.30869803 = queryWeight, product of:
                2.6950555 = boost
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.016734077 = queryNorm
              1.5424676 = fieldWeight in 1614, product of:
                3.6055512 = tf(freq=13.0), with freq of:
                  13.0 = termFreq=13.0
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.0625 = fieldNorm(doc=1614)
        0.2 = coord(5/25)
    
  5. Zhu, W.Z.; Allen, R.B.: Document clustering using the LSI subspace signature model (2013) 0.13
    0.12658139 = sum of:
      0.12658139 = product of:
        0.6329069 = sum of:
          0.022693127 = weight(abstract_txt:terms in 690) [ClassicSimilarity], result of:
            0.022693127 = score(doc=690,freq=1.0), product of:
              0.071830265 = queryWeight, product of:
                1.0614729 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.016734077 = queryNorm
              0.3159271 = fieldWeight in 690, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=690)
          0.06381735 = weight(abstract_txt:algorithms in 690) [ClassicSimilarity], result of:
            0.06381735 = score(doc=690,freq=1.0), product of:
              0.14311016 = queryWeight, product of:
                1.4982711 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.016734077 = queryNorm
              0.4459317 = fieldWeight in 690, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.078125 = fieldNorm(doc=690)
          0.06619648 = weight(abstract_txt:mapping in 690) [ClassicSimilarity], result of:
            0.06619648 = score(doc=690,freq=1.0), product of:
              0.1466452 = queryWeight, product of:
                1.5166631 = boost
                5.777993 = idf(docFreq=371, maxDocs=44218)
                0.016734077 = queryNorm
              0.4514057 = fieldWeight in 690, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.777993 = idf(docFreq=371, maxDocs=44218)
                0.078125 = fieldNorm(doc=690)
          0.17486191 = weight(abstract_txt:clustering in 690) [ClassicSimilarity], result of:
            0.17486191 = score(doc=690,freq=2.0), product of:
              0.25460202 = queryWeight, product of:
                2.4475508 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.016734077 = queryNorm
              0.6868049 = fieldWeight in 690, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.078125 = fieldNorm(doc=690)
          0.30533808 = weight(abstract_txt:latent in 690) [ClassicSimilarity], result of:
            0.30533808 = score(doc=690,freq=3.0), product of:
              0.32251894 = queryWeight, product of:
                2.754726 = boost
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.016734077 = queryNorm
              0.9467291 = fieldWeight in 690, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.078125 = fieldNorm(doc=690)
        0.2 = coord(5/25)
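
The content-match scores combine several term scores: each is queryWeight (boost * idf * queryNorm) times fieldWeight (tf * idf * fieldNorm), and the sum is scaled by the coord factor. As a hedged sketch, the following recomputes the last hit (doc 690) from the raw values in its explanation. The tf and idf formulas are assumptions inferred from the listed numbers, and the constant and function names are hypothetical.

```python
import math

MAX_DOCS = 44218
QUERY_NORM = 0.016734077
FIELD_NORM = 0.078125      # fieldNorm(doc=690)
COORD = 5 / 25             # 5 of the 25 query terms matched

# (term, boost, docFreq, termFreq) copied from the explanation for doc 690.
MATCHED_TERMS = [
    ("terms", 1.0614729, 2106, 1.0),
    ("algorithms", 1.4982711, 398, 1.0),
    ("mapping", 1.5166631, 371, 1.0),
    ("clustering", 2.4475508, 239, 2.0),
    ("latent", 2.754726, 109, 3.0),
]


def idf(doc_freq, max_docs=MAX_DOCS):
    # Assumed form: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost, doc_freq, freq):
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * FIELD_NORM
    return query_weight * field_weight


# Document score = coord * sum of the per-term scores.
score_690 = COORD * sum(term_score(b, df, f) for _, b, df, f in MATCHED_TERMS)
```

Within rounding, `score_690` agrees with the listed 0.12658139. Because idf enters both queryWeight and fieldWeight, rare terms such as "latent" dominate the sum.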