Document (#2415)

Author
Griffiths, A.
Robinson, L.A.
Willett, P.
Title
Hierarchic agglomerative clustering methods for automatic document classification
Source
Journal of documentation. 40(1984) no.3, S.175-205
Year
1984
Theme
Automatisches Indexieren

Similar documents (author)

  1. Griffiths, A.; Luckhurst, H.C.; Willett, P.: Using interdocument similarity information in document retrieval systems (1986) 2.66
    2.6568456 = sum of:
      2.6568456 = product of:
        3.985268 = sum of:
          1.6304511 = weight(author_txt:willett in 2415) [ClassicSimilarity], result of:
            1.6304511 = score(doc=2415,freq=1.0), product of:
              0.541876 = queryWeight, product of:
                1.0664501 = boost
                8.023735 = idf(docFreq=36, maxDocs=41550)
                0.063326105 = queryNorm
              3.0089006 = fieldWeight in 2415, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.023735 = idf(docFreq=36, maxDocs=41550)
                0.375 = fieldNorm(doc=2415)
          2.3548172 = weight(author_txt:griffiths in 2415) [ClassicSimilarity], result of:
            2.3548172 = score(doc=2415,freq=1.0), product of:
              0.6923614 = queryWeight, product of:
                1.2054718 = boost
                9.069703 = idf(docFreq=12, maxDocs=41550)
                0.063326105 = queryNorm
              3.4011388 = fieldWeight in 2415, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.069703 = idf(docFreq=12, maxDocs=41550)
                0.375 = fieldNorm(doc=2415)
        0.6666667 = coord(2/3)
    
  2. Griffiths, R.: Health information (1993) 1.31
    1.3082318 = sum of:
      1.3082318 = product of:
        3.9246953 = sum of:
          3.9246953 = weight(author_txt:griffiths in 120) [ClassicSimilarity], result of:
            3.9246953 = score(doc=120,freq=1.0), product of:
              0.6923614 = queryWeight, product of:
                1.2054718 = boost
                9.069703 = idf(docFreq=12, maxDocs=41550)
                0.063326105 = queryNorm
              5.6685643 = fieldWeight in 120, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.069703 = idf(docFreq=12, maxDocs=41550)
                0.625 = fieldNorm(doc=120)
        0.33333334 = coord(1/3)
    
  3. Griffiths, J.: ¬The value of information and related systems, products and services (1982) 1.31
    1.3082318 = sum of:
      1.3082318 = product of:
        3.9246953 = sum of:
          3.9246953 = weight(author_txt:griffiths in 5903) [ClassicSimilarity], result of:
            3.9246953 = score(doc=5903,freq=1.0), product of:
              0.6923614 = queryWeight, product of:
                1.2054718 = boost
                9.069703 = idf(docFreq=12, maxDocs=41550)
                0.063326105 = queryNorm
              5.6685643 = fieldWeight in 5903, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.069703 = idf(docFreq=12, maxDocs=41550)
                0.625 = fieldNorm(doc=5903)
        0.33333334 = coord(1/3)
    
  4. Griffiths, P.: Personal searching gets the right results (1997) 1.31
    1.3082318 = sum of:
      1.3082318 = product of:
        3.9246953 = sum of:
          3.9246953 = weight(author_txt:griffiths in 2262) [ClassicSimilarity], result of:
            3.9246953 = score(doc=2262,freq=1.0), product of:
              0.6923614 = queryWeight, product of:
                1.2054718 = boost
                9.069703 = idf(docFreq=12, maxDocs=41550)
                0.063326105 = queryNorm
              5.6685643 = fieldWeight in 2262, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.069703 = idf(docFreq=12, maxDocs=41550)
                0.625 = fieldNorm(doc=2262)
        0.33333334 = coord(1/3)
    
  5. Griffiths, A.: Setting up a subject directory of Web sites : a case study of management links (1999) 1.31
    1.3082318 = sum of:
      1.3082318 = product of:
        3.9246953 = sum of:
          3.9246953 = weight(author_txt:griffiths in 6037) [ClassicSimilarity], result of:
            3.9246953 = score(doc=6037,freq=1.0), product of:
              0.6923614 = queryWeight, product of:
                1.2054718 = boost
                9.069703 = idf(docFreq=12, maxDocs=41550)
                0.063326105 = queryNorm
              5.6685643 = fieldWeight in 6037, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.069703 = idf(docFreq=12, maxDocs=41550)
                0.625 = fieldNorm(doc=6037)
        0.33333334 = coord(1/3)
    

Similar documents (content)

  1. Tombros, A.; Villa, R.; Rijsbergen, C.J. Van: ¬The effectiveness of query-specific hierarchic clustering in information retrieval (2002) 1.38
    1.3823001 = sum of:
      1.3823001 = product of:
        2.4190252 = sum of:
          0.041289248 = weight(abstract_txt:methods in 3586) [ClassicSimilarity], result of:
            0.041289248 = score(doc=3586,freq=1.0), product of:
              0.12610446 = queryWeight, product of:
                1.0455072 = boost
                4.190989 = idf(docFreq=1708, maxDocs=41550)
                0.028779741 = queryNorm
              0.327421 = fieldWeight in 3586, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.190989 = idf(docFreq=1708, maxDocs=41550)
                0.078125 = fieldNorm(doc=3586)
          0.061982233 = weight(abstract_txt:document in 3586) [ClassicSimilarity], result of:
            0.061982233 = score(doc=3586,freq=2.0), product of:
              0.13122219 = queryWeight, product of:
                1.0665113 = boost
                4.275185 = idf(docFreq=1570, maxDocs=41550)
                0.028779741 = queryNorm
              0.47234568 = fieldWeight in 3586, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.275185 = idf(docFreq=1570, maxDocs=41550)
                0.078125 = fieldNorm(doc=3586)
          0.35840467 = weight(abstract_txt:clustering in 3586) [ClassicSimilarity], result of:
            0.35840467 = score(doc=3586,freq=7.0), product of:
              0.27843395 = queryWeight, product of:
                1.5535417 = boost
                6.227481 = idf(docFreq=222, maxDocs=41550)
                0.028779741 = queryNorm
              1.2872161 = fieldWeight in 3586, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.227481 = idf(docFreq=222, maxDocs=41550)
                0.078125 = fieldNorm(doc=3586)
          1.957349 = weight(title_txt:hierarchic in 3586) [ClassicSimilarity], result of:
            1.957349 = score(doc=3586,freq=1.0), product of:
              0.6555079 = queryWeight, product of:
                2.3836956 = boost
                9.555211 = idf(docFreq=7, maxDocs=41550)
                0.028779741 = queryNorm
              2.9860034 = fieldWeight in 3586, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.555211 = idf(docFreq=7, maxDocs=41550)
                0.3125 = fieldNorm(doc=3586)
        0.5714286 = coord(4/7)
    
  2. Kirriemuir, J.W.; Willet, P.: Identification of duplicate and near-duplicate full-text records in database search-outputs using hierarchic cluster analysis (1995) 1.00
    0.9969161 = sum of:
      0.9969161 = product of:
        1.7446032 = sum of:
          0.043354798 = weight(abstract_txt:classification in 2498) [ClassicSimilarity], result of:
            0.043354798 = score(doc=2498,freq=1.0), product of:
              0.11536562 = queryWeight, product of:
                4.00857 = idf(docFreq=2050, maxDocs=41550)
                0.028779741 = queryNorm
              0.37580347 = fieldWeight in 2498, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.00857 = idf(docFreq=2050, maxDocs=41550)
                0.09375 = fieldNorm(doc=2498)
          0.049547102 = weight(abstract_txt:methods in 2498) [ClassicSimilarity], result of:
            0.049547102 = score(doc=2498,freq=1.0), product of:
              0.12610446 = queryWeight, product of:
                1.0455072 = boost
                4.190989 = idf(docFreq=1708, maxDocs=41550)
                0.028779741 = queryNorm
              0.39290524 = fieldWeight in 2498, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.190989 = idf(docFreq=1708, maxDocs=41550)
                0.09375 = fieldNorm(doc=2498)
          0.2815571 = weight(abstract_txt:clustering in 2498) [ClassicSimilarity], result of:
            0.2815571 = score(doc=2498,freq=3.0), product of:
              0.27843395 = queryWeight, product of:
                1.5535417 = boost
                6.227481 = idf(docFreq=222, maxDocs=41550)
                0.028779741 = queryNorm
              1.0112169 = fieldWeight in 2498, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.227481 = idf(docFreq=222, maxDocs=41550)
                0.09375 = fieldNorm(doc=2498)
          1.3701441 = weight(title_txt:hierarchic in 2498) [ClassicSimilarity], result of:
            1.3701441 = score(doc=2498,freq=1.0), product of:
              0.6555079 = queryWeight, product of:
                2.3836956 = boost
                9.555211 = idf(docFreq=7, maxDocs=41550)
                0.028779741 = queryNorm
              2.0902023 = fieldWeight in 2498, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.555211 = idf(docFreq=7, maxDocs=41550)
                0.21875 = fieldNorm(doc=2498)
        0.5714286 = coord(4/7)
    
  3. Miyamoto, S.: Information clustering based an fuzzy multisets (2003) 0.49
    0.49113622 = sum of:
      0.49113622 = product of:
        0.85948837 = sum of:
          0.041289248 = weight(abstract_txt:methods in 3071) [ClassicSimilarity], result of:
            0.041289248 = score(doc=3071,freq=1.0), product of:
              0.12610446 = queryWeight, product of:
                1.0455072 = boost
                4.190989 = idf(docFreq=1708, maxDocs=41550)
                0.028779741 = queryNorm
              0.327421 = fieldWeight in 3071, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.190989 = idf(docFreq=1708, maxDocs=41550)
                0.078125 = fieldNorm(doc=3071)
          0.04382806 = weight(abstract_txt:document in 3071) [ClassicSimilarity], result of:
            0.04382806 = score(doc=3071,freq=1.0), product of:
              0.13122219 = queryWeight, product of:
                1.0665113 = boost
                4.275185 = idf(docFreq=1570, maxDocs=41550)
                0.028779741 = queryNorm
              0.33399883 = fieldWeight in 3071, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.275185 = idf(docFreq=1570, maxDocs=41550)
                0.078125 = fieldNorm(doc=3071)
          0.3029072 = weight(abstract_txt:clustering in 3071) [ClassicSimilarity], result of:
            0.3029072 = score(doc=3071,freq=5.0), product of:
              0.27843395 = queryWeight, product of:
                1.5535417 = boost
                6.227481 = idf(docFreq=222, maxDocs=41550)
                0.028779741 = queryNorm
              1.0878961 = fieldWeight in 3071, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.227481 = idf(docFreq=222, maxDocs=41550)
                0.078125 = fieldNorm(doc=3071)
          0.4714639 = weight(abstract_txt:agglomerative in 3071) [ClassicSimilarity], result of:
            0.4714639 = score(doc=3071,freq=1.0), product of:
              0.6394473 = queryWeight, product of:
                2.354313 = boost
                9.437428 = idf(docFreq=8, maxDocs=41550)
                0.028779741 = queryNorm
              0.7372991 = fieldWeight in 3071, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.437428 = idf(docFreq=8, maxDocs=41550)
                0.078125 = fieldNorm(doc=3071)
        0.5714286 = coord(4/7)
    
  4. Rijsbergen, C.J. van: ¬A fast hierarchic clustering algorithm (1970) 0.39
    0.39146978 = sum of:
      0.39146978 = product of:
        2.7402883 = sum of:
          2.7402883 = weight(title_txt:hierarchic in 3300) [ClassicSimilarity], result of:
            2.7402883 = score(doc=3300,freq=1.0), product of:
              0.6555079 = queryWeight, product of:
                2.3836956 = boost
                9.555211 = idf(docFreq=7, maxDocs=41550)
                0.028779741 = queryNorm
              4.1804047 = fieldWeight in 3300, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.555211 = idf(docFreq=7, maxDocs=41550)
                0.4375 = fieldNorm(doc=3300)
        0.14285715 = coord(1/7)
    
  5. Cathey, R.J.; Jensen, E.C.; Beitzel, S.M.; Frieder, O.; Grossman, D.: Exploiting parallelism to support scalable hierarchical clustering (2007) 0.36
    0.36361754 = sum of:
      0.36361754 = product of:
        0.8484409 = sum of:
          0.04958579 = weight(abstract_txt:document in 2448) [ClassicSimilarity], result of:
            0.04958579 = score(doc=2448,freq=2.0), product of:
              0.13122219 = queryWeight, product of:
                1.0665113 = boost
                4.275185 = idf(docFreq=1570, maxDocs=41550)
                0.028779741 = queryNorm
              0.37787655 = fieldWeight in 2448, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.275185 = idf(docFreq=1570, maxDocs=41550)
                0.0625 = fieldNorm(doc=2448)
          0.2654546 = weight(abstract_txt:clustering in 2448) [ClassicSimilarity], result of:
            0.2654546 = score(doc=2448,freq=6.0), product of:
              0.27843395 = queryWeight, product of:
                1.5535417 = boost
                6.227481 = idf(docFreq=222, maxDocs=41550)
                0.028779741 = queryNorm
              0.95338446 = fieldWeight in 2448, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.227481 = idf(docFreq=222, maxDocs=41550)
                0.0625 = fieldNorm(doc=2448)
          0.53340054 = weight(abstract_txt:agglomerative in 2448) [ClassicSimilarity], result of:
            0.53340054 = score(doc=2448,freq=2.0), product of:
              0.6394473 = queryWeight, product of:
                2.354313 = boost
                9.437428 = idf(docFreq=8, maxDocs=41550)
                0.028779741 = queryNorm
              0.8341587 = fieldWeight in 2448, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.437428 = idf(docFreq=8, maxDocs=41550)
                0.0625 = fieldNorm(doc=2448)
        0.42857143 = coord(3/7)