Document (#2415)

Author
Griffiths, A.
Robinson, L.A.
Willett, P.
Title
Hierarchic agglomerative clustering methods for automatic document classification
Source
Journal of documentation. 40(1984) no.3, pp.175-205
Year
1984
Theme
Automatic indexing

Similar documents (author)

  1. Griffiths, A.; Luckhurst, H.C.; Willett, P.: Using interdocument similarity information in document retrieval systems (1986) 2.68
    2.6833405 = sum of:
      2.6833405 = product of:
        4.0250106 = sum of:
          1.6302615 = weight(author_txt:willett in 2415) [ClassicSimilarity], result of:
            1.6302615 = score(doc=2415,freq=1.0), product of:
              0.5411662 = queryWeight, product of:
                1.076452 = boost
                8.033325 = idf(docFreq=38, maxDocs=44218)
                0.06258073 = queryNorm
              3.012497 = fieldWeight in 2415, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.033325 = idf(docFreq=38, maxDocs=44218)
                0.375 = fieldNorm(doc=2415)
          2.394749 = weight(author_txt:griffiths in 2415) [ClassicSimilarity], result of:
            2.394749 = score(doc=2415,freq=1.0), product of:
              0.6993036 = queryWeight, product of:
                1.2236642 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.06258073 = queryNorm
              3.4244766 = fieldWeight in 2415, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.375 = fieldNorm(doc=2415)
        0.6666667 = coord(2/3)
    
  2. Griffiths, R.: Health information (1993) 1.33
    1.3304162 = sum of:
      1.3304162 = product of:
        3.9912484 = sum of:
          3.9912484 = weight(author_txt:griffiths in 51) [ClassicSimilarity], result of:
            3.9912484 = score(doc=51,freq=1.0), product of:
              0.6993036 = queryWeight, product of:
                1.2236642 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.06258073 = queryNorm
              5.7074614 = fieldWeight in 51, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.625 = fieldNorm(doc=51)
        0.33333334 = coord(1/3)
    
  3. Griffiths, J.: The value of information and related systems, products and services (1982) 1.33
    1.3304162 = sum of:
      1.3304162 = product of:
        3.9912484 = sum of:
          3.9912484 = weight(author_txt:griffiths in 5835) [ClassicSimilarity], result of:
            3.9912484 = score(doc=5835,freq=1.0), product of:
              0.6993036 = queryWeight, product of:
                1.2236642 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.06258073 = queryNorm
              5.7074614 = fieldWeight in 5835, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.625 = fieldNorm(doc=5835)
        0.33333334 = coord(1/3)
    
  4. Griffiths, P.: Personal searching gets the right results (1997) 1.33
    1.3304162 = sum of:
      1.3304162 = product of:
        3.9912484 = sum of:
          3.9912484 = weight(author_txt:griffiths in 784) [ClassicSimilarity], result of:
            3.9912484 = score(doc=784,freq=1.0), product of:
              0.6993036 = queryWeight, product of:
                1.2236642 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.06258073 = queryNorm
              5.7074614 = fieldWeight in 784, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.625 = fieldNorm(doc=784)
        0.33333334 = coord(1/3)
    
  5. Griffiths, A.: Setting up a subject directory of Web sites : a case study of management links (1999) 1.33
    1.3304162 = sum of:
      1.3304162 = product of:
        3.9912484 = sum of:
          3.9912484 = weight(author_txt:griffiths in 4559) [ClassicSimilarity], result of:
            3.9912484 = score(doc=4559,freq=1.0), product of:
              0.6993036 = queryWeight, product of:
                1.2236642 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.06258073 = queryNorm
              5.7074614 = fieldWeight in 4559, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.625 = fieldNorm(doc=4559)
        0.33333334 = coord(1/3)
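
The score trees above follow the layout of Lucene's ClassicSimilarity (TF-IDF) explanation: each term weight is queryWeight times fieldWeight, the term weights are summed, and the sum is scaled by the coord factor. As a minimal sketch, the following Python recomputes the score of hit 1 from the factors printed in the listing. The function and variable names are illustrative assumptions, not part of the record or of any Lucene API.

```python
# Recompute the explain tree of hit 1 in "Similar documents (author)".
# All numbers are copied from the listing above; names are illustrative only.

QUERY_NORM = 0.06258073   # queryNorm, shared by all terms of this query
FIELD_NORM = 0.375        # fieldNorm(doc=2415)

# term -> (boost, idf, tf) exactly as printed in the explanation
TERMS = {
    "willett": (1.076452, 8.033325, 1.0),
    "griffiths": (1.2236642, 9.131938, 1.0),
}


def term_score(boost, idf, tf, query_norm, field_norm):
    """Score of one matching term: queryWeight * fieldWeight."""
    query_weight = boost * idf * query_norm   # query side
    field_weight = tf * idf * field_norm      # document side
    return query_weight * field_weight


term_scores = {
    term: term_score(boost, idf, tf, QUERY_NORM, FIELD_NORM)
    for term, (boost, idf, tf) in TERMS.items()
}

coord = 2 / 3  # coord(2/3): two of the three query clauses matched
score = sum(term_scores.values()) * coord

print(term_scores)
print(score)
```

Up to floating-point rounding, the result should agree with the 2.6833405 shown for hit 1. Hits 2 to 5 match only one clause, so their sum is multiplied by coord(1/3) instead.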
    

Similar documents (content)

  1. Tombros, A.; Villa, R.; Rijsbergen, C.J. Van: The effectiveness of query-specific hierarchic clustering in information retrieval (2002) 1.40
    1.400907 = sum of:
      1.400907 = product of:
        2.4515872 = sum of:
          0.039939094 = weight(abstract_txt:methods in 2586) [ClassicSimilarity], result of:
            0.039939094 = score(doc=2586,freq=1.0), product of:
              0.123282135 = queryWeight, product of:
                1.0387459 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.028620869 = queryNorm
              0.32396498 = fieldWeight in 2586, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.078125 = fieldNorm(doc=2586)
          0.06265459 = weight(abstract_txt:document in 2586) [ClassicSimilarity], result of:
            0.06265459 = score(doc=2586,freq=2.0), product of:
              0.13210724 = queryWeight, product of:
                1.0752825 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.028620869 = queryNorm
              0.4742707 = fieldWeight in 2586, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.078125 = fieldNorm(doc=2586)
          0.3559658 = weight(abstract_txt:clustering in 2586) [ClassicSimilarity], result of:
            0.3559658 = score(doc=2586,freq=7.0), product of:
              0.27703896 = queryWeight, product of:
                1.5571471 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.028620869 = queryNorm
              1.2848943 = fieldWeight in 2586, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.078125 = fieldNorm(doc=2586)
          1.9930278 = weight(title_txt:hierarchic in 2586) [ClassicSimilarity], result of:
            1.9930278 = score(doc=2586,freq=1.0), product of:
              0.6631375 = queryWeight, product of:
                2.4091344 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.028620869 = queryNorm
              3.005452 = fieldWeight in 2586, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.3125 = fieldNorm(doc=2586)
        0.5714286 = coord(4/7)
    
  2. Kirriemuir, J.W.; Willett, P.: Identification of duplicate and near-duplicate full-text records in database search-outputs using hierarchic cluster analysis (1995) 1.01
    1.0088279 = sum of:
      1.0088279 = product of:
        1.7654488 = sum of:
          0.04276136 = weight(abstract_txt:classification in 2429) [ClassicSimilarity], result of:
            0.04276136 = score(doc=2429,freq=1.0), product of:
              0.11425666 = queryWeight, product of:
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.028620869 = queryNorm
              0.37425706 = fieldWeight in 2429, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.09375 = fieldNorm(doc=2429)
          0.047926918 = weight(abstract_txt:methods in 2429) [ClassicSimilarity], result of:
            0.047926918 = score(doc=2429,freq=1.0), product of:
              0.123282135 = queryWeight, product of:
                1.0387459 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.028620869 = queryNorm
              0.388758 = fieldWeight in 2429, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.09375 = fieldNorm(doc=2429)
          0.27964118 = weight(abstract_txt:clustering in 2429) [ClassicSimilarity], result of:
            0.27964118 = score(doc=2429,freq=3.0), product of:
              0.27703896 = queryWeight, product of:
                1.5571471 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.028620869 = queryNorm
              1.009393 = fieldWeight in 2429, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.09375 = fieldNorm(doc=2429)
          1.3951194 = weight(title_txt:hierarchic in 2429) [ClassicSimilarity], result of:
            1.3951194 = score(doc=2429,freq=1.0), product of:
              0.6631375 = queryWeight, product of:
                2.4091344 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.028620869 = queryNorm
              2.1038163 = fieldWeight in 2429, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.21875 = fieldNorm(doc=2429)
        0.5714286 = coord(4/7)
    
  3. Miyamoto, S.: Information clustering based on fuzzy multisets (2003) 0.49
    0.48540714 = sum of:
      0.48540714 = product of:
        0.84946245 = sum of:
          0.039939094 = weight(abstract_txt:methods in 1071) [ClassicSimilarity], result of:
            0.039939094 = score(doc=1071,freq=1.0), product of:
              0.123282135 = queryWeight, product of:
                1.0387459 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.028620869 = queryNorm
              0.32396498 = fieldWeight in 1071, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.078125 = fieldNorm(doc=1071)
          0.044303488 = weight(abstract_txt:document in 1071) [ClassicSimilarity], result of:
            0.044303488 = score(doc=1071,freq=1.0), product of:
              0.13210724 = queryWeight, product of:
                1.0752825 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.028620869 = queryNorm
              0.33536002 = fieldWeight in 1071, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.078125 = fieldNorm(doc=1071)
          0.300846 = weight(abstract_txt:clustering in 1071) [ClassicSimilarity], result of:
            0.300846 = score(doc=1071,freq=5.0), product of:
              0.27703896 = queryWeight, product of:
                1.5571471 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.028620869 = queryNorm
              1.0859339 = fieldWeight in 1071, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.078125 = fieldNorm(doc=1071)
          0.46437386 = weight(abstract_txt:agglomerative in 1071) [ClassicSimilarity], result of:
            0.46437386 = score(doc=1071,freq=1.0), product of:
              0.6327224 = queryWeight, product of:
                2.3532379 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.028620869 = queryNorm
              0.7339299 = fieldWeight in 1071, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.078125 = fieldNorm(doc=1071)
        0.5714286 = coord(4/7)
    
  4. Rijsbergen, C.J. van: A fast hierarchic clustering algorithm (1970) 0.40
    0.39860556 = sum of:
      0.39860556 = product of:
        2.7902389 = sum of:
          2.7902389 = weight(title_txt:hierarchic in 3300) [ClassicSimilarity], result of:
            2.7902389 = score(doc=3300,freq=1.0), product of:
              0.6631375 = queryWeight, product of:
                2.4091344 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.028620869 = queryNorm
              4.2076325 = fieldWeight in 3300, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.4375 = fieldNorm(doc=3300)
        0.14285715 = coord(1/7)
    
  5. Cathey, R.J.; Jensen, E.C.; Beitzel, S.M.; Frieder, O.; Grossman, D.: Exploiting parallelism to support scalable hierarchical clustering (2007) 0.36
    0.3596361 = sum of:
      0.3596361 = product of:
        0.8391509 = sum of:
          0.050123677 = weight(abstract_txt:document in 448) [ClassicSimilarity], result of:
            0.050123677 = score(doc=448,freq=2.0), product of:
              0.13210724 = queryWeight, product of:
                1.0752825 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.028620869 = queryNorm
              0.37941656 = fieldWeight in 448, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=448)
          0.26364824 = weight(abstract_txt:clustering in 448) [ClassicSimilarity], result of:
            0.26364824 = score(doc=448,freq=6.0), product of:
              0.27703896 = queryWeight, product of:
                1.5571471 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.028620869 = queryNorm
              0.95166487 = fieldWeight in 448, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.0625 = fieldNorm(doc=448)
          0.525379 = weight(abstract_txt:agglomerative in 448) [ClassicSimilarity], result of:
            0.525379 = score(doc=448,freq=2.0), product of:
              0.6327224 = queryWeight, product of:
                2.3532379 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.028620869 = queryNorm
              0.8303468 = fieldWeight in 448, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.0625 = fieldNorm(doc=448)
        0.42857143 = coord(3/7)
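
The idf and tf values in these listings are consistent with the ClassicSimilarity definitions idf = 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq). As a hedged sketch, the Python below derives both from the raw statistics and then recomputes hit 4 of the content list (a single match on title_txt:hierarchic). The boost, queryNorm and fieldNorm values are taken as given from the listing; the helper names are illustrative assumptions.

```python
import math

# Illustrative helpers; the formulas are the ClassicSimilarity definitions
# of idf and tf, the inputs are copied from the listing above.


def idf(doc_freq, max_docs):
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def tf(freq):
    """Term frequency factor: square root of the raw term count."""
    return math.sqrt(freq)


MAX_DOCS = 44218

# Hit 4: title_txt:hierarchic in doc 3300
boost = 2.4091344
query_norm = 0.028620869
field_norm = 0.4375
term_idf = idf(doc_freq=7, max_docs=MAX_DOCS)

query_weight = boost * term_idf * query_norm
field_weight = tf(1.0) * term_idf * field_norm
coord = 1 / 7  # coord(1/7): one of seven query clauses matched

hit4_score = query_weight * field_weight * coord

print(term_idf)     # compare with idf(docFreq=7, maxDocs=44218)
print(tf(7.0))      # compare with tf(freq=7.0) under hit 1
print(hit4_score)   # compare with the total shown for hit 4
```

The rare title term "hierarchic" (docFreq=7) has a much higher idf than common abstract terms such as "methods" (docFreq=1900), which is why a single title match can outweigh several abstract matches before the coord factor is applied.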