Document (#2605)

Author
Willett, P.
Title
Recent trends in hierarchic document clustering : a critical review
Source
Information processing and management. 24(1988) no.5, S.577-597
Year
1988
Theme
Automatisches Indexieren
Literaturübersicht

Similar documents (author)

  1. Willett, P.: Best-match text retrieval (1993) 5.02
    5.020828 = sum of:
      5.020828 = weight(author_txt:willett in 7818) [ClassicSimilarity], result of:
        5.020828 = score(doc=7818,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.033325 = idf(docFreq=38, maxDocs=44218)
            0.12448145 = queryNorm
          5.0208282 = fieldWeight in 7818, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.033325 = idf(docFreq=38, maxDocs=44218)
            0.625 = fieldNorm(doc=7818)
    
  2. Willett, P.: From chemical documentation to chemoinformatics : 50 years of chemical information science (2009) 5.02
    5.020828 = sum of:
      5.020828 = weight(author_txt:willett in 3656) [ClassicSimilarity], result of:
        5.020828 = score(doc=3656,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.033325 = idf(docFreq=38, maxDocs=44218)
            0.12448145 = queryNorm
          5.0208282 = fieldWeight in 3656, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.033325 = idf(docFreq=38, maxDocs=44218)
            0.625 = fieldNorm(doc=3656)
    
  3. Perry, R.; Willett, P.: ¬A revies of the use of inverted files for best match searching in information retrieval systems (1983) 4.02
    4.016662 = sum of:
      4.016662 = weight(author_txt:willett in 2701) [ClassicSimilarity], result of:
        4.016662 = score(doc=2701,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.033325 = idf(docFreq=38, maxDocs=44218)
            0.12448145 = queryNorm
          4.0166626 = fieldWeight in 2701, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.033325 = idf(docFreq=38, maxDocs=44218)
            0.5 = fieldNorm(doc=2701)
    
  4. Robertson, A.M.; Willett, P.: Retrieval techniques for historical English text : searching the sixteenth and seventeenth century titles in the Catalogue of Caterbury Cathedral Library using spelling-correction methods (1992) 4.02
    4.016662 = sum of:
      4.016662 = weight(author_txt:willett in 4209) [ClassicSimilarity], result of:
        4.016662 = score(doc=4209,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.033325 = idf(docFreq=38, maxDocs=44218)
            0.12448145 = queryNorm
          4.0166626 = fieldWeight in 4209, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.033325 = idf(docFreq=38, maxDocs=44218)
            0.5 = fieldNorm(doc=4209)
    
  5. Shaw, R.J.; Willett, P.: On the non-random nature of nearest-neighbour document clusters (1993) 4.02
    4.016662 = sum of:
      4.016662 = weight(author_txt:willett in 5817) [ClassicSimilarity], result of:
        4.016662 = score(doc=5817,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.033325 = idf(docFreq=38, maxDocs=44218)
            0.12448145 = queryNorm
          4.0166626 = fieldWeight in 5817, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.033325 = idf(docFreq=38, maxDocs=44218)
            0.5 = fieldNorm(doc=5817)
    

Similar documents (content)

  1. Tombros, A.; Villa, R.; Rijsbergen, C.J. Van: ¬The effectiveness of query-specific hierarchic clustering in information retrieval (2002) 1.25
    1.2489802 = sum of:
      1.2489802 = product of:
        2.914287 = sum of:
          0.075713135 = weight(abstract_txt:document in 2586) [ClassicSimilarity], result of:
            0.075713135 = score(doc=2586,freq=2.0), product of:
              0.15964118 = queryWeight, product of:
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.037189785 = queryNorm
              0.4742707 = fieldWeight in 2586, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.078125 = fieldNorm(doc=2586)
          0.4301566 = weight(abstract_txt:clustering in 2586) [ClassicSimilarity], result of:
            0.4301566 = score(doc=2586,freq=7.0), product of:
              0.33477974 = queryWeight, product of:
                1.4481286 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.037189785 = queryNorm
              1.2848943 = fieldWeight in 2586, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.078125 = fieldNorm(doc=2586)
          2.4084172 = weight(title_txt:hierarchic in 2586) [ClassicSimilarity], result of:
            2.4084172 = score(doc=2586,freq=1.0), product of:
              0.80134946 = queryWeight, product of:
                2.2404668 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.037189785 = queryNorm
              3.005452 = fieldWeight in 2586, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.3125 = fieldNorm(doc=2586)
        0.42857143 = coord(3/7)
    
  2. Kirriemuir, J.W.; Willet, P.: Identification of duplicate and near-duplicate full-text records in database search-outputs using hierarchic cluster analysis (1995) 0.58
    0.57823324 = sum of:
      0.57823324 = product of:
        2.0238163 = sum of:
          0.33792433 = weight(abstract_txt:clustering in 2429) [ClassicSimilarity], result of:
            0.33792433 = score(doc=2429,freq=3.0), product of:
              0.33477974 = queryWeight, product of:
                1.4481286 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.037189785 = queryNorm
              1.009393 = fieldWeight in 2429, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.09375 = fieldNorm(doc=2429)
          1.685892 = weight(title_txt:hierarchic in 2429) [ClassicSimilarity], result of:
            1.685892 = score(doc=2429,freq=1.0), product of:
              0.80134946 = queryWeight, product of:
                2.2404668 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.037189785 = queryNorm
              2.1038163 = fieldWeight in 2429, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.21875 = fieldNorm(doc=2429)
        0.2857143 = coord(2/7)
    
  3. Rijsbergen, C.J. van: ¬A fast hierarchic clustering algorithm (1970) 0.48
    0.48168343 = sum of:
      0.48168343 = product of:
        3.371784 = sum of:
          3.371784 = weight(title_txt:hierarchic in 3300) [ClassicSimilarity], result of:
            3.371784 = score(doc=3300,freq=1.0), product of:
              0.80134946 = queryWeight, product of:
                2.2404668 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.037189785 = queryNorm
              4.2076325 = fieldWeight in 3300, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.4375 = fieldNorm(doc=3300)
        0.14285715 = coord(1/7)
    
  4. Voorhees, E.M.: Implementing agglomerative hierarchic clustering algorithms for use in document retrieval (1986) 0.34
    0.34405962 = sum of:
      0.34405962 = product of:
        2.4084172 = sum of:
          2.4084172 = weight(title_txt:hierarchic in 402) [ClassicSimilarity], result of:
            2.4084172 = score(doc=402,freq=1.0), product of:
              0.80134946 = queryWeight, product of:
                2.2404668 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.037189785 = queryNorm
              3.005452 = fieldWeight in 402, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.3125 = fieldNorm(doc=402)
        0.14285715 = coord(1/7)
    
  5. Griffiths, A.; Robinson, L.A.; Willett, P.: Hierarchic agglomerative clustering methods for automatic document classification (1984) 0.34
    0.34405962 = sum of:
      0.34405962 = product of:
        2.4084172 = sum of:
          2.4084172 = weight(title_txt:hierarchic in 2414) [ClassicSimilarity], result of:
            2.4084172 = score(doc=2414,freq=1.0), product of:
              0.80134946 = queryWeight, product of:
                2.2404668 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.037189785 = queryNorm
              3.005452 = fieldWeight in 2414, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.3125 = fieldNorm(doc=2414)
        0.14285715 = coord(1/7)