Document (#2415)

Author
Griffiths, A.
Robinson, L.A.
Willett, P.
Title
Hierarchic agglomerative clustering methods for automatic document classification
Source
Journal of documentation. 40(1984) no.3, S.175-205
Year
1984
Theme
Automatisches Indexieren

Similar documents (author)

  1. Griffiths, A.; Luckhurst, H.C.; Willett, P.: Using interdocument similarity information in document retrieval systems (1986) 2.68
    2.6769378 = sum of:
      2.6769378 = product of:
        4.0154066 = sum of:
          1.6254108 = weight(author_txt:willett in 2415) [ClassicSimilarity], result of:
            1.6254108 = score(doc=2415,freq=1.0), product of:
              0.54104054 = queryWeight, product of:
                1.0766784 = boost
                8.011283 = idf(docFreq=38, maxDocs=43254)
                0.06272516 = queryNorm
              3.004231 = fieldWeight in 2415, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.011283 = idf(docFreq=38, maxDocs=43254)
                0.375 = fieldNorm(doc=2415)
          2.389996 = weight(author_txt:griffiths in 2415) [ClassicSimilarity], result of:
            2.389996 = score(doc=2415,freq=1.0), product of:
              0.6996044 = queryWeight, product of:
                1.2243268 = boost
                9.109896 = idf(docFreq=12, maxDocs=43254)
                0.06272516 = queryNorm
              3.416211 = fieldWeight in 2415, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.109896 = idf(docFreq=12, maxDocs=43254)
                0.375 = fieldNorm(doc=2415)
        0.6666667 = coord(2/3)
    
  2. Griffiths, R.: Health information (1993) 1.33
    1.3277756 = sum of:
      1.3277756 = product of:
        3.9833267 = sum of:
          3.9833267 = weight(author_txt:griffiths in 1120) [ClassicSimilarity], result of:
            3.9833267 = score(doc=1120,freq=1.0), product of:
              0.6996044 = queryWeight, product of:
                1.2243268 = boost
                9.109896 = idf(docFreq=12, maxDocs=43254)
                0.06272516 = queryNorm
              5.6936846 = fieldWeight in 1120, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.109896 = idf(docFreq=12, maxDocs=43254)
                0.625 = fieldNorm(doc=1120)
        0.33333334 = coord(1/3)
    
  3. Griffiths, J.: ¬The value of information and related systems, products and services (1982) 1.33
    1.3277756 = sum of:
      1.3277756 = product of:
        3.9833267 = sum of:
          3.9833267 = weight(author_txt:griffiths in 6904) [ClassicSimilarity], result of:
            3.9833267 = score(doc=6904,freq=1.0), product of:
              0.6996044 = queryWeight, product of:
                1.2243268 = boost
                9.109896 = idf(docFreq=12, maxDocs=43254)
                0.06272516 = queryNorm
              5.6936846 = fieldWeight in 6904, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.109896 = idf(docFreq=12, maxDocs=43254)
                0.625 = fieldNorm(doc=6904)
        0.33333334 = coord(1/3)
    
  4. Griffiths, P.: Personal searching gets the right results (1997) 1.33
    1.3277756 = sum of:
      1.3277756 = product of:
        3.9833267 = sum of:
          3.9833267 = weight(author_txt:griffiths in 2785) [ClassicSimilarity], result of:
            3.9833267 = score(doc=2785,freq=1.0), product of:
              0.6996044 = queryWeight, product of:
                1.2243268 = boost
                9.109896 = idf(docFreq=12, maxDocs=43254)
                0.06272516 = queryNorm
              5.6936846 = fieldWeight in 2785, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.109896 = idf(docFreq=12, maxDocs=43254)
                0.625 = fieldNorm(doc=2785)
        0.33333334 = coord(1/3)
    
  5. Griffiths, A.: Setting up a subject directory of Web sites : a case study of management links (1999) 1.33
    1.3277756 = sum of:
      1.3277756 = product of:
        3.9833267 = sum of:
          3.9833267 = weight(author_txt:griffiths in 6560) [ClassicSimilarity], result of:
            3.9833267 = score(doc=6560,freq=1.0), product of:
              0.6996044 = queryWeight, product of:
                1.2243268 = boost
                9.109896 = idf(docFreq=12, maxDocs=43254)
                0.06272516 = queryNorm
              5.6936846 = fieldWeight in 6560, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.109896 = idf(docFreq=12, maxDocs=43254)
                0.625 = fieldNorm(doc=6560)
        0.33333334 = coord(1/3)
    

Similar documents (content)

  1. Tombros, A.; Villa, R.; Rijsbergen, C.J. Van: ¬The effectiveness of query-specific hierarchic clustering in information retrieval (2002) 1.39
    1.3868927 = sum of:
      1.3868927 = product of:
        2.427062 = sum of:
          0.040254496 = weight(abstract_txt:methods in 4587) [ClassicSimilarity], result of:
            0.040254496 = score(doc=4587,freq=1.0), product of:
              0.123696156 = queryWeight, product of:
                1.0419223 = boost
                4.1655097 = idf(docFreq=1824, maxDocs=43254)
                0.028500516 = queryNorm
              0.32543045 = fieldWeight in 4587, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1655097 = idf(docFreq=1824, maxDocs=43254)
                0.078125 = fieldNorm(doc=4587)
          0.06192807 = weight(abstract_txt:document in 4587) [ClassicSimilarity], result of:
            0.06192807 = score(doc=4587,freq=2.0), product of:
              0.13083634 = queryWeight, product of:
                1.0715722 = boost
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.028500516 = queryNorm
              0.47332472 = fieldWeight in 4587, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.078125 = fieldNorm(doc=4587)
          0.3567226 = weight(abstract_txt:clustering in 4587) [ClassicSimilarity], result of:
            0.3567226 = score(doc=4587,freq=7.0), product of:
              0.27690727 = queryWeight, product of:
                1.5589222 = boost
                6.232427 = idf(docFreq=230, maxDocs=43254)
                0.028500516 = queryNorm
              1.2882385 = fieldWeight in 4587, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.232427 = idf(docFreq=230, maxDocs=43254)
                0.078125 = fieldNorm(doc=4587)
          1.9681569 = weight(title_txt:hierarchic in 4587) [ClassicSimilarity], result of:
            1.9681569 = score(doc=4587,freq=1.0), product of:
              0.6563665 = queryWeight, product of:
                2.4001062 = boost
                9.595404 = idf(docFreq=7, maxDocs=43254)
                0.028500516 = queryNorm
              2.9985638 = fieldWeight in 4587, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.595404 = idf(docFreq=7, maxDocs=43254)
                0.3125 = fieldNorm(doc=4587)
        0.5714286 = coord(4/7)
    
  2. Kirriemuir, J.W.; Willet, P.: Identification of duplicate and near-duplicate full-text records in database search-outputs using hierarchic cluster analysis (1995) 1.00
    0.999404 = sum of:
      0.999404 = product of:
        1.7489569 = sum of:
          0.042706065 = weight(abstract_txt:classification in 3498) [ClassicSimilarity], result of:
            0.042706065 = score(doc=3498,freq=1.0), product of:
              0.11394244 = queryWeight, product of:
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.028500516 = queryNorm
              0.37480387 = fieldWeight in 3498, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.09375 = fieldNorm(doc=3498)
          0.048305392 = weight(abstract_txt:methods in 3498) [ClassicSimilarity], result of:
            0.048305392 = score(doc=3498,freq=1.0), product of:
              0.123696156 = queryWeight, product of:
                1.0419223 = boost
                4.1655097 = idf(docFreq=1824, maxDocs=43254)
                0.028500516 = queryNorm
              0.39051652 = fieldWeight in 3498, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1655097 = idf(docFreq=1824, maxDocs=43254)
                0.09375 = fieldNorm(doc=3498)
          0.28023568 = weight(abstract_txt:clustering in 3498) [ClassicSimilarity], result of:
            0.28023568 = score(doc=3498,freq=3.0), product of:
              0.27690727 = queryWeight, product of:
                1.5589222 = boost
                6.232427 = idf(docFreq=230, maxDocs=43254)
                0.028500516 = queryNorm
              1.01202 = fieldWeight in 3498, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.232427 = idf(docFreq=230, maxDocs=43254)
                0.09375 = fieldNorm(doc=3498)
          1.3777097 = weight(title_txt:hierarchic in 3498) [ClassicSimilarity], result of:
            1.3777097 = score(doc=3498,freq=1.0), product of:
              0.6563665 = queryWeight, product of:
                2.4001062 = boost
                9.595404 = idf(docFreq=7, maxDocs=43254)
                0.028500516 = queryNorm
              2.0989945 = fieldWeight in 3498, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.595404 = idf(docFreq=7, maxDocs=43254)
                0.21875 = fieldNorm(doc=3498)
        0.5714286 = coord(4/7)
    
  3. Miyamoto, S.: Information clustering based an fuzzy multisets (2003) 0.49
    0.49124074 = sum of:
      0.49124074 = product of:
        0.85967124 = sum of:
          0.040254496 = weight(abstract_txt:methods in 3072) [ClassicSimilarity], result of:
            0.040254496 = score(doc=3072,freq=1.0), product of:
              0.123696156 = queryWeight, product of:
                1.0419223 = boost
                4.1655097 = idf(docFreq=1824, maxDocs=43254)
                0.028500516 = queryNorm
              0.32543045 = fieldWeight in 3072, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1655097 = idf(docFreq=1824, maxDocs=43254)
                0.078125 = fieldNorm(doc=3072)
          0.043789763 = weight(abstract_txt:document in 3072) [ClassicSimilarity], result of:
            0.043789763 = score(doc=3072,freq=1.0), product of:
              0.13083634 = queryWeight, product of:
                1.0715722 = boost
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.028500516 = queryNorm
              0.33469114 = fieldWeight in 3072, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.078125 = fieldNorm(doc=3072)
          0.3014856 = weight(abstract_txt:clustering in 3072) [ClassicSimilarity], result of:
            0.3014856 = score(doc=3072,freq=5.0), product of:
              0.27690727 = queryWeight, product of:
                1.5589222 = boost
                6.232427 = idf(docFreq=230, maxDocs=43254)
                0.028500516 = queryNorm
              1.0887601 = fieldWeight in 3072, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.232427 = idf(docFreq=230, maxDocs=43254)
                0.078125 = fieldNorm(doc=3072)
          0.4741414 = weight(abstract_txt:agglomerative in 3072) [ClassicSimilarity], result of:
            0.4741414 = score(doc=3072,freq=1.0), product of:
              0.64035165 = queryWeight, product of:
                2.370645 = boost
                9.47762 = idf(docFreq=8, maxDocs=43254)
                0.028500516 = queryNorm
              0.74043906 = fieldWeight in 3072, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.47762 = idf(docFreq=8, maxDocs=43254)
                0.078125 = fieldNorm(doc=3072)
        0.5714286 = coord(4/7)
    
  4. Rijsbergen, C.J. van: ¬A fast hierarchic clustering algorithm (1970) 0.39
    0.39363137 = sum of:
      0.39363137 = product of:
        2.7554195 = sum of:
          2.7554195 = weight(title_txt:hierarchic in 3300) [ClassicSimilarity], result of:
            2.7554195 = score(doc=3300,freq=1.0), product of:
              0.6563665 = queryWeight, product of:
                2.4001062 = boost
                9.595404 = idf(docFreq=7, maxDocs=43254)
                0.028500516 = queryNorm
              4.197989 = fieldWeight in 3300, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.595404 = idf(docFreq=7, maxDocs=43254)
                0.4375 = fieldNorm(doc=3300)
        0.14285715 = coord(1/7)
    
  5. Cathey, R.J.; Jensen, E.C.; Beitzel, S.M.; Frieder, O.; Grossman, D.: Exploiting parallelism to support scalable hierarchical clustering (2007) 0.36
    0.36436325 = sum of:
      0.36436325 = product of:
        0.8501809 = sum of:
          0.04954246 = weight(abstract_txt:document in 2449) [ClassicSimilarity], result of:
            0.04954246 = score(doc=2449,freq=2.0), product of:
              0.13083634 = queryWeight, product of:
                1.0715722 = boost
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.028500516 = queryNorm
              0.37865978 = fieldWeight in 2449, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.0625 = fieldNorm(doc=2449)
          0.26420876 = weight(abstract_txt:clustering in 2449) [ClassicSimilarity], result of:
            0.26420876 = score(doc=2449,freq=6.0), product of:
              0.27690727 = queryWeight, product of:
                1.5589222 = boost
                6.232427 = idf(docFreq=230, maxDocs=43254)
                0.028500516 = queryNorm
              0.9541417 = fieldWeight in 2449, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.232427 = idf(docFreq=230, maxDocs=43254)
                0.0625 = fieldNorm(doc=2449)
          0.5364297 = weight(abstract_txt:agglomerative in 2449) [ClassicSimilarity], result of:
            0.5364297 = score(doc=2449,freq=2.0), product of:
              0.64035165 = queryWeight, product of:
                2.370645 = boost
                9.47762 = idf(docFreq=8, maxDocs=43254)
                0.028500516 = queryNorm
              0.83771116 = fieldWeight in 2449, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.47762 = idf(docFreq=8, maxDocs=43254)
                0.0625 = fieldNorm(doc=2449)
        0.42857143 = coord(3/7)