Document (#403)

Author
Voorhees, E.M.
Title
Implementing agglomerative hierarchic clustering algorithms for use in document retrieval
Source
Information processing and management. 22(1986) no.6, S.465-476
Year
1986
Theme
Automatisches Indexieren
Retrievalalgorithmen

Similar documents (author)

  1. Voorhees, E.M.: Question answering in TREC (2005) 5.54
    5.5397964 = sum of:
      5.5397964 = weight(author_txt:voorhees in 6487) [ClassicSimilarity], result of:
        5.5397964 = fieldWeight in 6487, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.863674 = idf(docFreq=16, maxDocs=44218)
          0.625 = fieldNorm(doc=6487)
    
  2. Voorhees, E.M.: Variations in relevance judgements and the measurement of retrieval effectiveness (2000) 5.54
    5.5397964 = sum of:
      5.5397964 = weight(author_txt:voorhees in 8710) [ClassicSimilarity], result of:
        5.5397964 = fieldWeight in 8710, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.863674 = idf(docFreq=16, maxDocs=44218)
          0.625 = fieldNorm(doc=8710)
    
  3. Voorhees, E.M.: Using WordNet to disambiguate word senses for text retrieval (1993) 5.54
    5.5397964 = sum of:
      5.5397964 = weight(author_txt:voorhees in 8799) [ClassicSimilarity], result of:
        5.5397964 = fieldWeight in 8799, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.863674 = idf(docFreq=16, maxDocs=44218)
          0.625 = fieldNorm(doc=8799)
    
  4. Voorhees, E.M.: On test collections for adaptive information retrieval (2008) 5.54
    5.5397964 = sum of:
      5.5397964 = weight(author_txt:voorhees in 2444) [ClassicSimilarity], result of:
        5.5397964 = fieldWeight in 2444, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.863674 = idf(docFreq=16, maxDocs=44218)
          0.625 = fieldNorm(doc=2444)
    
  5. Voorhees, E.M.: Text REtrieval Conference (TREC) (2009) 5.54
    5.5397964 = sum of:
      5.5397964 = weight(author_txt:voorhees in 3890) [ClassicSimilarity], result of:
        5.5397964 = fieldWeight in 3890, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.863674 = idf(docFreq=16, maxDocs=44218)
          0.625 = fieldNorm(doc=3890)
    

Similar documents (content)

  1. Tombros, A.; Villa, R.; Rijsbergen, C.J. Van: ¬The effectiveness of query-specific hierarchic clustering in information retrieval (2002) 1.34
    1.3393832 = sum of:
      1.3393832 = product of:
        2.3439205 = sum of:
          0.031870507 = weight(abstract_txt:retrieval in 2586) [ClassicSimilarity], result of:
            0.031870507 = score(doc=2586,freq=2.0), product of:
              0.083006434 = queryWeight, product of:
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.023885787 = queryNorm
              0.38395226 = fieldWeight in 2586, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=2586)
          0.06006703 = weight(abstract_txt:document in 2586) [ClassicSimilarity], result of:
            0.06006703 = score(doc=2586,freq=2.0), product of:
              0.12665136 = queryWeight, product of:
                1.2352334 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.023885787 = queryNorm
              0.4742707 = fieldWeight in 2586, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.078125 = fieldNorm(doc=2586)
          0.34126478 = weight(abstract_txt:clustering in 2586) [ClassicSimilarity], result of:
            0.34126478 = score(doc=2586,freq=7.0), product of:
              0.26559755 = queryWeight, product of:
                1.7887768 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.023885787 = queryNorm
              1.2848943 = fieldWeight in 2586, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.078125 = fieldNorm(doc=2586)
          1.9107182 = weight(title_txt:hierarchic in 2586) [ClassicSimilarity], result of:
            1.9107182 = score(doc=2586,freq=1.0), product of:
              0.6357507 = queryWeight, product of:
                2.7674994 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.023885787 = queryNorm
              3.005452 = fieldWeight in 2586, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.3125 = fieldNorm(doc=2586)
        0.5714286 = coord(4/7)
    
  2. Miyamoto, S.: Information clustering based an fuzzy multisets (2003) 0.70
    0.70066017 = sum of:
      0.70066017 = product of:
        0.98092425 = sum of:
          0.031870507 = weight(abstract_txt:retrieval in 1071) [ClassicSimilarity], result of:
            0.031870507 = score(doc=1071,freq=2.0), product of:
              0.083006434 = queryWeight, product of:
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.023885787 = queryNorm
              0.38395226 = fieldWeight in 1071, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=1071)
          0.042473804 = weight(abstract_txt:document in 1071) [ClassicSimilarity], result of:
            0.042473804 = score(doc=1071,freq=1.0), product of:
              0.12665136 = queryWeight, product of:
                1.2352334 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.023885787 = queryNorm
              0.33536002 = fieldWeight in 1071, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.078125 = fieldNorm(doc=1071)
          0.17296289 = weight(abstract_txt:algorithms in 1071) [ClassicSimilarity], result of:
            0.17296289 = score(doc=1071,freq=3.0), product of:
              0.22393602 = queryWeight, product of:
                1.6425027 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.023885787 = queryNorm
              0.77237636 = fieldWeight in 1071, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.078125 = fieldNorm(doc=1071)
          0.2884214 = weight(abstract_txt:clustering in 1071) [ClassicSimilarity], result of:
            0.2884214 = score(doc=1071,freq=5.0), product of:
              0.26559755 = queryWeight, product of:
                1.7887768 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.023885787 = queryNorm
              1.0859339 = fieldWeight in 1071, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.078125 = fieldNorm(doc=1071)
          0.44519567 = weight(abstract_txt:agglomerative in 1071) [ClassicSimilarity], result of:
            0.44519567 = score(doc=1071,freq=1.0), product of:
              0.6065916 = queryWeight, product of:
                2.7032878 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.023885787 = queryNorm
              0.7339299 = fieldWeight in 1071, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.078125 = fieldNorm(doc=1071)
        0.71428573 = coord(5/7)
    
  3. Cathey, R.J.; Jensen, E.C.; Beitzel, S.M.; Frieder, O.; Grossman, D.: Exploiting parallelism to support scalable hierarchical clustering (2007) 0.67
    0.66821605 = sum of:
      0.66821605 = product of:
        0.93550247 = sum of:
          0.018028682 = weight(abstract_txt:retrieval in 448) [ClassicSimilarity], result of:
            0.018028682 = score(doc=448,freq=1.0), product of:
              0.083006434 = queryWeight, product of:
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.023885787 = queryNorm
              0.21719621 = fieldWeight in 448, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=448)
          0.048053622 = weight(abstract_txt:document in 448) [ClassicSimilarity], result of:
            0.048053622 = score(doc=448,freq=2.0), product of:
              0.12665136 = queryWeight, product of:
                1.2352334 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.023885787 = queryNorm
              0.37941656 = fieldWeight in 448, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=448)
          0.11297888 = weight(abstract_txt:algorithms in 448) [ClassicSimilarity], result of:
            0.11297888 = score(doc=448,freq=2.0), product of:
              0.22393602 = queryWeight, product of:
                1.6425027 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.023885787 = queryNorm
              0.5045141 = fieldWeight in 448, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=448)
          0.25275984 = weight(abstract_txt:clustering in 448) [ClassicSimilarity], result of:
            0.25275984 = score(doc=448,freq=6.0), product of:
              0.26559755 = queryWeight, product of:
                1.7887768 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.023885787 = queryNorm
              0.95166487 = fieldWeight in 448, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.0625 = fieldNorm(doc=448)
          0.5036814 = weight(abstract_txt:agglomerative in 448) [ClassicSimilarity], result of:
            0.5036814 = score(doc=448,freq=2.0), product of:
              0.6065916 = queryWeight, product of:
                2.7032878 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.023885787 = queryNorm
              0.8303468 = fieldWeight in 448, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.0625 = fieldNorm(doc=448)
        0.71428573 = coord(5/7)
    
  4. Kirriemuir, J.W.; Willet, P.: Identification of duplicate and near-duplicate full-text records in database search-outputs using hierarchic cluster analysis (1995) 0.46
    0.45874146 = sum of:
      0.45874146 = product of:
        1.605595 = sum of:
          0.2680923 = weight(abstract_txt:clustering in 2429) [ClassicSimilarity], result of:
            0.2680923 = score(doc=2429,freq=3.0), product of:
              0.26559755 = queryWeight, product of:
                1.7887768 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.023885787 = queryNorm
              1.009393 = fieldWeight in 2429, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.09375 = fieldNorm(doc=2429)
          1.3375027 = weight(title_txt:hierarchic in 2429) [ClassicSimilarity], result of:
            1.3375027 = score(doc=2429,freq=1.0), product of:
              0.6357507 = queryWeight, product of:
                2.7674994 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.023885787 = queryNorm
              2.1038163 = fieldWeight in 2429, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.21875 = fieldNorm(doc=2429)
        0.2857143 = coord(2/7)
    
  5. Rijsbergen, C.J. van: ¬A fast hierarchic clustering algorithm (1970) 0.38
    0.38214365 = sum of:
      0.38214365 = product of:
        2.6750054 = sum of:
          2.6750054 = weight(title_txt:hierarchic in 3300) [ClassicSimilarity], result of:
            2.6750054 = score(doc=3300,freq=1.0), product of:
              0.6357507 = queryWeight, product of:
                2.7674994 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.023885787 = queryNorm
              4.2076325 = fieldWeight in 3300, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.4375 = fieldNorm(doc=3300)
        0.14285715 = coord(1/7)