Document (#2415)

Author
Griffiths, A.
Robinson, L.A.
Willett, P.
Title
Hierarchic agglomerative clustering methods for automatic document classification
Source
Journal of documentation. 40(1984) no.3, S.175-205
Year
1984
Theme
Automatisches Indexieren

Similar documents (author)

  1. Griffiths, A.; Luckhurst, H.C.; Willett, P.: Using interdocument similarity information in document retrieval systems (1986) 2.66
    2.6617312 = sum of:
      2.6617312 = product of:
        3.9925966 = sum of:
          1.6242388 = weight(author_txt:willett in 2415) [ClassicSimilarity], result of:
            1.6242388 = score(doc=2415,freq=1.0), product of:
              0.54094416 = queryWeight, product of:
                1.0696396 = boost
                8.006933 = idf(docFreq=37, maxDocs=41962)
                0.06316097 = queryNorm
              3.0026 = fieldWeight in 2415, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.006933 = idf(docFreq=37, maxDocs=41962)
                0.375 = fieldNorm(doc=2415)
          2.3683577 = weight(author_txt:griffiths in 2415) [ClassicSimilarity], result of:
            2.3683577 = score(doc=2415,freq=1.0), product of:
              0.6955858 = queryWeight, product of:
                1.2129323 = boost
                9.079571 = idf(docFreq=12, maxDocs=41962)
                0.06316097 = queryNorm
              3.404839 = fieldWeight in 2415, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.079571 = idf(docFreq=12, maxDocs=41962)
                0.375 = fieldNorm(doc=2415)
        0.6666667 = coord(2/3)
    
  2. Griffiths, R.: Health information (1993) 1.32
    1.3157543 = sum of:
      1.3157543 = product of:
        3.9472628 = sum of:
          3.9472628 = weight(author_txt:griffiths in 120) [ClassicSimilarity], result of:
            3.9472628 = score(doc=120,freq=1.0), product of:
              0.6955858 = queryWeight, product of:
                1.2129323 = boost
                9.079571 = idf(docFreq=12, maxDocs=41962)
                0.06316097 = queryNorm
              5.6747317 = fieldWeight in 120, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.079571 = idf(docFreq=12, maxDocs=41962)
                0.625 = fieldNorm(doc=120)
        0.33333334 = coord(1/3)
    
  3. Griffiths, J.: ¬The value of information and related systems, products and services (1982) 1.32
    1.3157543 = sum of:
      1.3157543 = product of:
        3.9472628 = sum of:
          3.9472628 = weight(author_txt:griffiths in 5904) [ClassicSimilarity], result of:
            3.9472628 = score(doc=5904,freq=1.0), product of:
              0.6955858 = queryWeight, product of:
                1.2129323 = boost
                9.079571 = idf(docFreq=12, maxDocs=41962)
                0.06316097 = queryNorm
              5.6747317 = fieldWeight in 5904, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.079571 = idf(docFreq=12, maxDocs=41962)
                0.625 = fieldNorm(doc=5904)
        0.33333334 = coord(1/3)
    
  4. Griffiths, P.: Personal searching gets the right results (1997) 1.32
    1.3157543 = sum of:
      1.3157543 = product of:
        3.9472628 = sum of:
          3.9472628 = weight(author_txt:griffiths in 2198) [ClassicSimilarity], result of:
            3.9472628 = score(doc=2198,freq=1.0), product of:
              0.6955858 = queryWeight, product of:
                1.2129323 = boost
                9.079571 = idf(docFreq=12, maxDocs=41962)
                0.06316097 = queryNorm
              5.6747317 = fieldWeight in 2198, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.079571 = idf(docFreq=12, maxDocs=41962)
                0.625 = fieldNorm(doc=2198)
        0.33333334 = coord(1/3)
    
  5. Griffiths, A.: Setting up a subject directory of Web sites : a case study of management links (1999) 1.32
    1.3157543 = sum of:
      1.3157543 = product of:
        3.9472628 = sum of:
          3.9472628 = weight(author_txt:griffiths in 5973) [ClassicSimilarity], result of:
            3.9472628 = score(doc=5973,freq=1.0), product of:
              0.6955858 = queryWeight, product of:
                1.2129323 = boost
                9.079571 = idf(docFreq=12, maxDocs=41962)
                0.06316097 = queryNorm
              5.6747317 = fieldWeight in 5973, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.079571 = idf(docFreq=12, maxDocs=41962)
                0.625 = fieldNorm(doc=5973)
        0.33333334 = coord(1/3)
    

Similar documents (content)

  1. Tombros, A.; Villa, R.; Rijsbergen, C.J. Van: ¬The effectiveness of query-specific hierarchic clustering in information retrieval (2002) 1.38
    1.3833686 = sum of:
      1.3833686 = product of:
        2.4208949 = sum of:
          0.04134446 = weight(abstract_txt:methods in 3587) [ClassicSimilarity], result of:
            0.04134446 = score(doc=3587,freq=1.0), product of:
              0.12613419 = queryWeight, product of:
                1.0449744 = boost
                4.195604 = idf(docFreq=1717, maxDocs=41962)
                0.028769523 = queryNorm
              0.32778156 = fieldWeight in 3587, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.195604 = idf(docFreq=1717, maxDocs=41962)
                0.078125 = fieldNorm(doc=3587)
          0.062123742 = weight(abstract_txt:document in 3587) [ClassicSimilarity], result of:
            0.062123742 = score(doc=3587,freq=2.0), product of:
              0.13133577 = queryWeight, product of:
                1.0663034 = boost
                4.28124 = idf(docFreq=1576, maxDocs=41962)
                0.028769523 = queryNorm
              0.47301465 = fieldWeight in 3587, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.28124 = idf(docFreq=1576, maxDocs=41962)
                0.078125 = fieldNorm(doc=3587)
          0.35786268 = weight(abstract_txt:clustering in 3587) [ClassicSimilarity], result of:
            0.35786268 = score(doc=3587,freq=7.0), product of:
              0.27797103 = queryWeight, product of:
                1.551276 = boost
                6.2284193 = idf(docFreq=224, maxDocs=41962)
                0.028769523 = queryNorm
              1.28741 = fieldWeight in 3587, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.2284193 = idf(docFreq=224, maxDocs=41962)
                0.078125 = fieldNorm(doc=3587)
          1.959564 = weight(title_txt:hierarchic in 3587) [ClassicSimilarity], result of:
            1.959564 = score(doc=3587,freq=1.0), product of:
              0.6555728 = queryWeight, product of:
                2.382318 = boost
                9.565078 = idf(docFreq=7, maxDocs=41962)
                0.028769523 = queryNorm
              2.9890869 = fieldWeight in 3587, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.565078 = idf(docFreq=7, maxDocs=41962)
                0.3125 = fieldNorm(doc=3587)
        0.5714286 = coord(4/7)
    
  2. Kirriemuir, J.W.; Willet, P.: Identification of duplicate and near-duplicate full-text records in database search-outputs using hierarchic cluster analysis (1995) 1.00
    0.9976678 = sum of:
      0.9976678 = product of:
        1.7459185 = sum of:
          0.043479197 = weight(abstract_txt:classification in 2498) [ClassicSimilarity], result of:
            0.043479197 = score(doc=2498,freq=1.0), product of:
              0.11551049 = queryWeight, product of:
                4.01503 = idf(docFreq=2057, maxDocs=41962)
                0.028769523 = queryNorm
              0.37640905 = fieldWeight in 2498, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.01503 = idf(docFreq=2057, maxDocs=41962)
                0.09375 = fieldNorm(doc=2498)
          0.04961335 = weight(abstract_txt:methods in 2498) [ClassicSimilarity], result of:
            0.04961335 = score(doc=2498,freq=1.0), product of:
              0.12613419 = queryWeight, product of:
                1.0449744 = boost
                4.195604 = idf(docFreq=1717, maxDocs=41962)
                0.028769523 = queryNorm
              0.39333785 = fieldWeight in 2498, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.195604 = idf(docFreq=1717, maxDocs=41962)
                0.09375 = fieldNorm(doc=2498)
          0.28113136 = weight(abstract_txt:clustering in 2498) [ClassicSimilarity], result of:
            0.28113136 = score(doc=2498,freq=3.0), product of:
              0.27797103 = queryWeight, product of:
                1.551276 = boost
                6.2284193 = idf(docFreq=224, maxDocs=41962)
                0.028769523 = queryNorm
              1.0113692 = fieldWeight in 2498, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.2284193 = idf(docFreq=224, maxDocs=41962)
                0.09375 = fieldNorm(doc=2498)
          1.3716947 = weight(title_txt:hierarchic in 2498) [ClassicSimilarity], result of:
            1.3716947 = score(doc=2498,freq=1.0), product of:
              0.6555728 = queryWeight, product of:
                2.382318 = boost
                9.565078 = idf(docFreq=7, maxDocs=41962)
                0.028769523 = queryNorm
              2.0923607 = fieldWeight in 2498, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.565078 = idf(docFreq=7, maxDocs=41962)
                0.21875 = fieldNorm(doc=2498)
        0.5714286 = coord(4/7)
    
  3. Miyamoto, S.: Information clustering based an fuzzy multisets (2003) 0.49
    0.49127853 = sum of:
      0.49127853 = product of:
        0.8597374 = sum of:
          0.04134446 = weight(abstract_txt:methods in 3072) [ClassicSimilarity], result of:
            0.04134446 = score(doc=3072,freq=1.0), product of:
              0.12613419 = queryWeight, product of:
                1.0449744 = boost
                4.195604 = idf(docFreq=1717, maxDocs=41962)
                0.028769523 = queryNorm
              0.32778156 = fieldWeight in 3072, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.195604 = idf(docFreq=1717, maxDocs=41962)
                0.078125 = fieldNorm(doc=3072)
          0.04392812 = weight(abstract_txt:document in 3072) [ClassicSimilarity], result of:
            0.04392812 = score(doc=3072,freq=1.0), product of:
              0.13133577 = queryWeight, product of:
                1.0663034 = boost
                4.28124 = idf(docFreq=1576, maxDocs=41962)
                0.028769523 = queryNorm
              0.33447188 = fieldWeight in 3072, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.28124 = idf(docFreq=1576, maxDocs=41962)
                0.078125 = fieldNorm(doc=3072)
          0.30244917 = weight(abstract_txt:clustering in 3072) [ClassicSimilarity], result of:
            0.30244917 = score(doc=3072,freq=5.0), product of:
              0.27797103 = queryWeight, product of:
                1.551276 = boost
                6.2284193 = idf(docFreq=224, maxDocs=41962)
                0.028769523 = queryNorm
              1.08806 = fieldWeight in 3072, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.2284193 = idf(docFreq=224, maxDocs=41962)
                0.078125 = fieldNorm(doc=3072)
          0.47201565 = weight(abstract_txt:agglomerative in 3072) [ClassicSimilarity], result of:
            0.47201565 = score(doc=3072,freq=1.0), product of:
              0.63952696 = queryWeight, product of:
                2.3529825 = boost
                9.447295 = idf(docFreq=8, maxDocs=41962)
                0.028769523 = queryNorm
              0.73806995 = fieldWeight in 3072, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.447295 = idf(docFreq=8, maxDocs=41962)
                0.078125 = fieldNorm(doc=3072)
        0.5714286 = coord(4/7)
    
  4. Rijsbergen, C.J. van: ¬A fast hierarchic clustering algorithm (1970) 0.39
    0.3919128 = sum of:
      0.3919128 = product of:
        2.7433894 = sum of:
          2.7433894 = weight(title_txt:hierarchic in 3300) [ClassicSimilarity], result of:
            2.7433894 = score(doc=3300,freq=1.0), product of:
              0.6555728 = queryWeight, product of:
                2.382318 = boost
                9.565078 = idf(docFreq=7, maxDocs=41962)
                0.028769523 = queryNorm
              4.1847215 = fieldWeight in 3300, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.565078 = idf(docFreq=7, maxDocs=41962)
                0.4375 = fieldNorm(doc=3300)
        0.14285715 = coord(1/7)
    
  5. Cathey, R.J.; Jensen, E.C.; Beitzel, S.M.; Frieder, O.; Grossman, D.: Exploiting parallelism to support scalable hierarchical clustering (2007) 0.36
    0.3637615 = sum of:
      0.3637615 = product of:
        0.8487769 = sum of:
          0.04969899 = weight(abstract_txt:document in 2449) [ClassicSimilarity], result of:
            0.04969899 = score(doc=2449,freq=2.0), product of:
              0.13133577 = queryWeight, product of:
                1.0663034 = boost
                4.28124 = idf(docFreq=1576, maxDocs=41962)
                0.028769523 = queryNorm
              0.3784117 = fieldWeight in 2449, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.28124 = idf(docFreq=1576, maxDocs=41962)
                0.0625 = fieldNorm(doc=2449)
          0.26505318 = weight(abstract_txt:clustering in 2449) [ClassicSimilarity], result of:
            0.26505318 = score(doc=2449,freq=6.0), product of:
              0.27797103 = queryWeight, product of:
                1.551276 = boost
                6.2284193 = idf(docFreq=224, maxDocs=41962)
                0.028769523 = queryNorm
              0.9535281 = fieldWeight in 2449, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.2284193 = idf(docFreq=224, maxDocs=41962)
                0.0625 = fieldNorm(doc=2449)
          0.5340247 = weight(abstract_txt:agglomerative in 2449) [ClassicSimilarity], result of:
            0.5340247 = score(doc=2449,freq=2.0), product of:
              0.63952696 = queryWeight, product of:
                2.3529825 = boost
                9.447295 = idf(docFreq=8, maxDocs=41962)
                0.028769523 = queryNorm
              0.8350308 = fieldWeight in 2449, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.447295 = idf(docFreq=8, maxDocs=41962)
                0.0625 = fieldNorm(doc=2449)
        0.42857143 = coord(3/7)