Document (#33368)

Author
Dang, E.K.F.
Luk, R.W.P.
Ho, K.S.
Chan, S.C.F.
Lee, D.L.
Title
¬A new measure of clustering effectiveness : algorithms and experimental studies
Source
Journal of the American Society for Information Science and Technology. 59(2008) no.3, S.390-406
Year
2008
Abstract
We propose a new optimal clustering effectiveness measure, called CS1, based on a combination of clusters rather than selecting a single optimal cluster as in the traditional MK1 measure. For hierarchical clustering, we present an algorithm to compute CS1, defined by seeking the optimal combinations of disjoint clusters obtained by cutting the hierarchical structure at a certain similarity level. By reformulating the optimization to a 0-1 linear fractional programming problem, we demonstrate that an exact solution can be obtained by a linear time algorithm. We further discuss how our approach can be generalized to more general problems involving overlapping clusters, and we show how optimal estimates can be obtained by greedy algorithms.
Theme
Automatisches Klassifizieren

Similar documents (author)

  1. Dang, E.K.F.; Luk, R.W.P.; Allan, J.: Beyond bag-of-words : bigram-enhanced context-dependent term weights (2014) 4.44
    4.436089 = sum of:
      4.436089 = product of:
        5.9147854 = sum of:
          1.8448054 = weight(author_txt:r.w.p in 1283) [ClassicSimilarity], result of:
            1.8448054 = score(doc=1283,freq=1.0), product of:
              0.52366644 = queryWeight, product of:
                1.286461 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.04333049 = queryNorm
              3.5228634 = fieldWeight in 1283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.375 = fieldNorm(doc=1283)
          1.9075745 = weight(author_txt:dang in 1283) [ClassicSimilarity], result of:
            1.9075745 = score(doc=1283,freq=1.0), product of:
              0.53547853 = queryWeight, product of:
                1.3008891 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.04333049 = queryNorm
              3.5623734 = fieldWeight in 1283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.375 = fieldNorm(doc=1283)
          2.1624055 = weight(author_txt:e.k.f in 1283) [ClassicSimilarity], result of:
            2.1624055 = score(doc=1283,freq=1.0), product of:
              0.58216465 = queryWeight, product of:
                1.3564137 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.04333049 = queryNorm
              3.7144227 = fieldWeight in 1283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.375 = fieldNorm(doc=1283)
        0.75 = coord(3/4)
    
  2. Dang, E.K.F.; Luk, R.W.P.; Allan, J.: ¬A context-dependent relevance model (2016) 4.44
    4.436089 = sum of:
      4.436089 = product of:
        5.9147854 = sum of:
          1.8448054 = weight(author_txt:r.w.p in 2778) [ClassicSimilarity], result of:
            1.8448054 = score(doc=2778,freq=1.0), product of:
              0.52366644 = queryWeight, product of:
                1.286461 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.04333049 = queryNorm
              3.5228634 = fieldWeight in 2778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.375 = fieldNorm(doc=2778)
          1.9075745 = weight(author_txt:dang in 2778) [ClassicSimilarity], result of:
            1.9075745 = score(doc=2778,freq=1.0), product of:
              0.53547853 = queryWeight, product of:
                1.3008891 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.04333049 = queryNorm
              3.5623734 = fieldWeight in 2778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.375 = fieldNorm(doc=2778)
          2.1624055 = weight(author_txt:e.k.f in 2778) [ClassicSimilarity], result of:
            2.1624055 = score(doc=2778,freq=1.0), product of:
              0.58216465 = queryWeight, product of:
                1.3564137 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.04333049 = queryNorm
              3.7144227 = fieldWeight in 2778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.375 = fieldNorm(doc=2778)
        0.75 = coord(3/4)
    
  3. Dang, E.K.F.; Luk, R.W.P.; Allan, J.: ¬A retrieval model family based on the probability ranking principle for ad hoc retrieval (2022) 4.44
    4.436089 = sum of:
      4.436089 = product of:
        5.9147854 = sum of:
          1.8448054 = weight(author_txt:r.w.p in 638) [ClassicSimilarity], result of:
            1.8448054 = score(doc=638,freq=1.0), product of:
              0.52366644 = queryWeight, product of:
                1.286461 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.04333049 = queryNorm
              3.5228634 = fieldWeight in 638, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.375 = fieldNorm(doc=638)
          1.9075745 = weight(author_txt:dang in 638) [ClassicSimilarity], result of:
            1.9075745 = score(doc=638,freq=1.0), product of:
              0.53547853 = queryWeight, product of:
                1.3008891 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.04333049 = queryNorm
              3.5623734 = fieldWeight in 638, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.375 = fieldNorm(doc=638)
          2.1624055 = weight(author_txt:e.k.f in 638) [ClassicSimilarity], result of:
            2.1624055 = score(doc=638,freq=1.0), product of:
              0.58216465 = queryWeight, product of:
                1.3564137 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.04333049 = queryNorm
              3.7144227 = fieldWeight in 638, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.375 = fieldNorm(doc=638)
        0.75 = coord(3/4)
    
  4. Dang, E.K.F.; Luk, R.W.P.; Allan, J.; Ho, K.S.; Chung, K.F.L.; Lee, D.L.: ¬A new context-dependent term weight computed by boost and discount using relevance information (2010) 2.96
    2.9573927 = sum of:
      2.9573927 = product of:
        3.94319 = sum of:
          1.2298702 = weight(author_txt:r.w.p in 4120) [ClassicSimilarity], result of:
            1.2298702 = score(doc=4120,freq=1.0), product of:
              0.52366644 = queryWeight, product of:
                1.286461 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.04333049 = queryNorm
              2.3485756 = fieldWeight in 4120, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.25 = fieldNorm(doc=4120)
          1.2717164 = weight(author_txt:dang in 4120) [ClassicSimilarity], result of:
            1.2717164 = score(doc=4120,freq=1.0), product of:
              0.53547853 = queryWeight, product of:
                1.3008891 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.04333049 = queryNorm
              2.3749156 = fieldWeight in 4120, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.25 = fieldNorm(doc=4120)
          1.4416038 = weight(author_txt:e.k.f in 4120) [ClassicSimilarity], result of:
            1.4416038 = score(doc=4120,freq=1.0), product of:
              0.58216465 = queryWeight, product of:
                1.3564137 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.04333049 = queryNorm
              2.476282 = fieldWeight in 4120, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.25 = fieldNorm(doc=4120)
        0.75 = coord(3/4)
    
  5. Luk, R.W.P.; Leong, H.V.; Dillon, T.S.; Chan, A.T.S.; Croft, W.B.; Allen, J.: ¬A survey in indexing and searching XML documents (2002) 0.90
    0.9037632 = sum of:
      0.9037632 = product of:
        1.8075264 = sum of:
          0.57765615 = weight(author_txt:chan in 460) [ClassicSimilarity], result of:
            0.57765615 = score(doc=460,freq=1.0), product of:
              0.31641823 = queryWeight, product of:
                7.3024383 = idf(docFreq=80, maxDocs=44218)
                0.04333049 = queryNorm
              1.8256096 = fieldWeight in 460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3024383 = idf(docFreq=80, maxDocs=44218)
                0.25 = fieldNorm(doc=460)
          1.2298702 = weight(author_txt:r.w.p in 460) [ClassicSimilarity], result of:
            1.2298702 = score(doc=460,freq=1.0), product of:
              0.52366644 = queryWeight, product of:
                1.286461 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.04333049 = queryNorm
              2.3485756 = fieldWeight in 460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.25 = fieldNorm(doc=460)
        0.5 = coord(2/4)
    

Similar documents (content)

  1. Mather, L.A.: ¬A linear algebra measure of cluster quality (2000) 0.20
    0.20135255 = sum of:
      0.20135255 = product of:
        0.838969 = sum of:
          0.07498123 = weight(abstract_txt:cluster in 4767) [ClassicSimilarity], result of:
            0.07498123 = score(doc=4767,freq=3.0), product of:
              0.10575742 = queryWeight, product of:
                1.005263 = boost
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.01606313 = queryNorm
              0.70899254 = fieldWeight in 4767, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.0625 = fieldNorm(doc=4767)
          0.10952445 = weight(abstract_txt:disjoint in 4767) [ClassicSimilarity], result of:
            0.10952445 = score(doc=4767,freq=1.0), product of:
              0.1963618 = queryWeight, product of:
                1.369786 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.01606313 = queryNorm
              0.55776864 = fieldWeight in 4767, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.0625 = fieldNorm(doc=4767)
          0.081052944 = weight(abstract_txt:algorithms in 4767) [ClassicSimilarity], result of:
            0.081052944 = score(doc=4767,freq=2.0), product of:
              0.16065545 = queryWeight, product of:
                1.7522134 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.01606313 = queryNorm
              0.5045141 = fieldWeight in 4767, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=4767)
          0.12989326 = weight(abstract_txt:linear in 4767) [ClassicSimilarity], result of:
            0.12989326 = score(doc=4767,freq=2.0), product of:
              0.22000912 = queryWeight, product of:
                2.0504992 = boost
                6.6796074 = idf(docFreq=150, maxDocs=44218)
                0.01606313 = queryNorm
              0.59039944 = fieldWeight in 4767, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.6796074 = idf(docFreq=150, maxDocs=44218)
                0.0625 = fieldNorm(doc=4767)
          0.22208805 = weight(abstract_txt:clustering in 4767) [ClassicSimilarity], result of:
            0.22208805 = score(doc=4767,freq=4.0), product of:
              0.2858162 = queryWeight, product of:
                2.8623867 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.01606313 = queryNorm
              0.77703106 = fieldWeight in 4767, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.0625 = fieldNorm(doc=4767)
          0.22142911 = weight(abstract_txt:clusters in 4767) [ClassicSimilarity], result of:
            0.22142911 = score(doc=4767,freq=3.0), product of:
              0.31395885 = queryWeight, product of:
                3.0 = boost
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.01606313 = queryNorm
              0.70528066 = fieldWeight in 4767, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.0625 = fieldNorm(doc=4767)
        0.24 = coord(6/25)
    
  2. Bose, I.; Chen, X.: ¬A method for extension of generative topographic mapping for fuzzy clustering (2009) 0.19
    0.1932423 = sum of:
      0.1932423 = product of:
        0.80517626 = sum of:
          0.054113038 = weight(abstract_txt:cluster in 2711) [ClassicSimilarity], result of:
            0.054113038 = score(doc=2711,freq=1.0), product of:
              0.10575742 = queryWeight, product of:
                1.005263 = boost
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.01606313 = queryNorm
              0.5116713 = fieldWeight in 2711, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.078125 = fieldNorm(doc=2711)
          0.07154715 = weight(abstract_txt:algorithm in 2711) [ClassicSimilarity], result of:
            0.07154715 = score(doc=2711,freq=1.0), product of:
              0.16051458 = queryWeight, product of:
                1.7514449 = boost
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.01606313 = queryNorm
              0.44573617 = fieldWeight in 2711, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.078125 = fieldNorm(doc=2711)
          0.101316184 = weight(abstract_txt:algorithms in 2711) [ClassicSimilarity], result of:
            0.101316184 = score(doc=2711,freq=2.0), product of:
              0.16065545 = queryWeight, product of:
                1.7522134 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.01606313 = queryNorm
              0.63064265 = fieldWeight in 2711, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.078125 = fieldNorm(doc=2711)
          0.1559048 = weight(abstract_txt:obtained in 2711) [ClassicSimilarity], result of:
            0.1559048 = score(doc=2711,freq=2.0), product of:
              0.24512051 = queryWeight, product of:
                2.6507862 = boost
                5.756716 = idf(docFreq=379, maxDocs=44218)
                0.01606313 = queryNorm
              0.6360333 = fieldWeight in 2711, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.756716 = idf(docFreq=379, maxDocs=44218)
                0.078125 = fieldNorm(doc=2711)
          0.19629996 = weight(abstract_txt:clustering in 2711) [ClassicSimilarity], result of:
            0.19629996 = score(doc=2711,freq=2.0), product of:
              0.2858162 = queryWeight, product of:
                2.8623867 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.01606313 = queryNorm
              0.6868049 = fieldWeight in 2711, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.078125 = fieldNorm(doc=2711)
          0.22599514 = weight(abstract_txt:clusters in 2711) [ClassicSimilarity], result of:
            0.22599514 = score(doc=2711,freq=2.0), product of:
              0.31395885 = queryWeight, product of:
                3.0 = boost
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.01606313 = queryNorm
              0.7198241 = fieldWeight in 2711, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.078125 = fieldNorm(doc=2711)
        0.24 = coord(6/25)
    
  3. Cathey, R.J.; Jensen, E.C.; Beitzel, S.M.; Frieder, O.; Grossman, D.: Exploiting parallelism to support scalable hierarchical clustering (2007) 0.18
    0.18436329 = sum of:
      0.18436329 = product of:
        0.7681804 = sum of:
          0.043290433 = weight(abstract_txt:cluster in 448) [ClassicSimilarity], result of:
            0.043290433 = score(doc=448,freq=1.0), product of:
              0.10575742 = queryWeight, product of:
                1.005263 = boost
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.01606313 = queryNorm
              0.40933704 = fieldWeight in 448, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.0625 = fieldNorm(doc=448)
          0.12798743 = weight(abstract_txt:algorithm in 448) [ClassicSimilarity], result of:
            0.12798743 = score(doc=448,freq=5.0), product of:
              0.16051458 = queryWeight, product of:
                1.7514449 = boost
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.01606313 = queryNorm
              0.7973571 = fieldWeight in 448, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.0625 = fieldNorm(doc=448)
          0.081052944 = weight(abstract_txt:algorithms in 448) [ClassicSimilarity], result of:
            0.081052944 = score(doc=448,freq=2.0), product of:
              0.16065545 = queryWeight, product of:
                1.7522134 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.01606313 = queryNorm
              0.5045141 = fieldWeight in 448, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=448)
          0.11600616 = weight(abstract_txt:hierarchical in 448) [ClassicSimilarity], result of:
            0.11600616 = score(doc=448,freq=4.0), product of:
              0.16194229 = queryWeight, product of:
                1.7592169 = boost
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.01606313 = queryNorm
              0.71634257 = fieldWeight in 448, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.0625 = fieldNorm(doc=448)
          0.27200124 = weight(abstract_txt:clustering in 448) [ClassicSimilarity], result of:
            0.27200124 = score(doc=448,freq=6.0), product of:
              0.2858162 = queryWeight, product of:
                2.8623867 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.01606313 = queryNorm
              0.95166487 = fieldWeight in 448, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.0625 = fieldNorm(doc=448)
          0.12784216 = weight(abstract_txt:clusters in 448) [ClassicSimilarity], result of:
            0.12784216 = score(doc=448,freq=1.0), product of:
              0.31395885 = queryWeight, product of:
                3.0 = boost
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.01606313 = queryNorm
              0.407194 = fieldWeight in 448, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.0625 = fieldNorm(doc=448)
        0.24 = coord(6/25)
    
  4. Burgin, R.: ¬The retrieval effectiveness of 5 clustering algorithms as a function of indexing exhaustivity (1995) 0.18
    0.18422917 = sum of:
      0.18422917 = product of:
        0.7676215 = sum of:
          0.043290433 = weight(abstract_txt:cluster in 3365) [ClassicSimilarity], result of:
            0.043290433 = score(doc=3365,freq=1.0), product of:
              0.10575742 = queryWeight, product of:
                1.005263 = boost
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.01606313 = queryNorm
              0.40933704 = fieldWeight in 3365, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.0625 = fieldNorm(doc=3365)
          0.0816855 = weight(abstract_txt:effectiveness in 3365) [ClassicSimilarity], result of:
            0.0816855 = score(doc=3365,freq=4.0), product of:
              0.12817487 = queryWeight, product of:
                1.565095 = boost
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.01606313 = queryNorm
              0.6372973 = fieldWeight in 3365, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.0625 = fieldNorm(doc=3365)
          0.05800308 = weight(abstract_txt:hierarchical in 3365) [ClassicSimilarity], result of:
            0.05800308 = score(doc=3365,freq=1.0), product of:
              0.16194229 = queryWeight, product of:
                1.7592169 = boost
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.01606313 = queryNorm
              0.35817128 = fieldWeight in 3365, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.0625 = fieldNorm(doc=3365)
          0.27200124 = weight(abstract_txt:clustering in 3365) [ClassicSimilarity], result of:
            0.27200124 = score(doc=3365,freq=6.0), product of:
              0.2858162 = queryWeight, product of:
                2.8623867 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.01606313 = queryNorm
              0.95166487 = fieldWeight in 3365, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.0625 = fieldNorm(doc=3365)
          0.12784216 = weight(abstract_txt:clusters in 3365) [ClassicSimilarity], result of:
            0.12784216 = score(doc=3365,freq=1.0), product of:
              0.31395885 = queryWeight, product of:
                3.0 = boost
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.01606313 = queryNorm
              0.407194 = fieldWeight in 3365, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.0625 = fieldNorm(doc=3365)
          0.18479906 = weight(abstract_txt:optimal in 3365) [ClassicSimilarity], result of:
            0.18479906 = score(doc=3365,freq=1.0), product of:
              0.44177666 = queryWeight, product of:
                4.1091843 = boost
                6.6929407 = idf(docFreq=148, maxDocs=44218)
                0.01606313 = queryNorm
              0.4183088 = fieldWeight in 3365, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6929407 = idf(docFreq=148, maxDocs=44218)
                0.0625 = fieldNorm(doc=3365)
        0.24 = coord(6/25)
    
  5. Kishida, K.: High-speed rough clustering for very large document collections (2010) 0.16
    0.15893328 = sum of:
      0.15893328 = product of:
        0.662222 = sum of:
          0.043290433 = weight(abstract_txt:cluster in 3463) [ClassicSimilarity], result of:
            0.043290433 = score(doc=3463,freq=1.0), product of:
              0.10575742 = queryWeight, product of:
                1.005263 = boost
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.01606313 = queryNorm
              0.40933704 = fieldWeight in 3463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.0625 = fieldNorm(doc=3463)
          0.04084275 = weight(abstract_txt:effectiveness in 3463) [ClassicSimilarity], result of:
            0.04084275 = score(doc=3463,freq=1.0), product of:
              0.12817487 = queryWeight, product of:
                1.565095 = boost
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.01606313 = queryNorm
              0.31864864 = fieldWeight in 3463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.0625 = fieldNorm(doc=3463)
          0.09913864 = weight(abstract_txt:algorithm in 3463) [ClassicSimilarity], result of:
            0.09913864 = score(doc=3463,freq=3.0), product of:
              0.16051458 = queryWeight, product of:
                1.7514449 = boost
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.01606313 = queryNorm
              0.6176301 = fieldWeight in 3463, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.0625 = fieldNorm(doc=3463)
          0.05731309 = weight(abstract_txt:algorithms in 3463) [ClassicSimilarity], result of:
            0.05731309 = score(doc=3463,freq=1.0), product of:
              0.16065545 = queryWeight, product of:
                1.7522134 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.01606313 = queryNorm
              0.35674536 = fieldWeight in 3463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=3463)
          0.2937949 = weight(abstract_txt:clustering in 3463) [ClassicSimilarity], result of:
            0.2937949 = score(doc=3463,freq=7.0), product of:
              0.2858162 = queryWeight, product of:
                2.8623867 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.01606313 = queryNorm
              1.0279155 = fieldWeight in 3463, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.0625 = fieldNorm(doc=3463)
          0.12784216 = weight(abstract_txt:clusters in 3463) [ClassicSimilarity], result of:
            0.12784216 = score(doc=3463,freq=1.0), product of:
              0.31395885 = queryWeight, product of:
                3.0 = boost
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.01606313 = queryNorm
              0.407194 = fieldWeight in 3463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.0625 = fieldNorm(doc=3463)
        0.24 = coord(6/25)