Document (#403)

Author
Voorhees, E.M.
Title
Implementing agglomerative hierarchic clustering algorithms for use in document retrieval
Source
Information processing and management. 22(1986) no.6, p.465-476
Year
1986
Theme
Automatic indexing
Retrieval algorithms

Similar documents (author)

  1. Voorhees, E.M.: Question answering in TREC (2005) 5.53
    5.52602 = sum of:
      5.52602 = weight(author_txt:voorhees in 6487) [ClassicSimilarity], result of:
        5.52602 = fieldWeight in 6487, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.841632 = idf(docFreq=16, maxDocs=43254)
          0.625 = fieldNorm(doc=6487)
    
  2. Voorhees, E.M.: Variations in relevance judgements and the measurement of retrieval effectiveness (2000) 5.53
    5.52602 = sum of:
      5.52602 = weight(author_txt:voorhees in 710) [ClassicSimilarity], result of:
        5.52602 = fieldWeight in 710, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.841632 = idf(docFreq=16, maxDocs=43254)
          0.625 = fieldNorm(doc=710)
    
  3. Voorhees, E.M.: Using WordNet to disambiguate word senses for text retrieval (1993) 5.53
    5.52602 = sum of:
      5.52602 = weight(author_txt:voorhees in 799) [ClassicSimilarity], result of:
        5.52602 = fieldWeight in 799, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.841632 = idf(docFreq=16, maxDocs=43254)
          0.625 = fieldNorm(doc=799)
    
  4. Voorhees, E.M.: On test collections for adaptive information retrieval (2008) 5.53
    5.52602 = sum of:
      5.52602 = weight(author_txt:voorhees in 4445) [ClassicSimilarity], result of:
        5.52602 = fieldWeight in 4445, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.841632 = idf(docFreq=16, maxDocs=43254)
          0.625 = fieldNorm(doc=4445)
    
  5. Voorhees, E.M.: Text REtrieval Conference (TREC) (2009) 5.53
    5.52602 = sum of:
      5.52602 = weight(author_txt:voorhees in 355) [ClassicSimilarity], result of:
        5.52602 = fieldWeight in 355, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.841632 = idf(docFreq=16, maxDocs=43254)
          0.625 = fieldNorm(doc=355)
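
The author scores above can be reproduced by hand. The following is a minimal sketch, assuming the classic TF-IDF form tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which is the form that matches the idf value printed in the listing; other Lucene versions may use a slightly different idf formula. The function names are illustrative and are not part of any Lucene API.

```python
import math


def classic_tf(freq: float) -> float:
    # Term-frequency factor: square root of the raw term frequency.
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    # Inverse document frequency in the form that reproduces the listing:
    # 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    # fieldWeight = tf * idf * fieldNorm, as in the explain tree.
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values taken from the author_txt:voorhees entries above.
author_idf = classic_idf(doc_freq=16, max_docs=43254)
author_score = field_weight(freq=1.0, doc_freq=16, max_docs=43254, field_norm=0.625)

print(f"idf   = {author_idf:.6f}")
print(f"score = {author_score:.5f}")
```

All five author matches share the same freq, docFreq, and fieldNorm, which is why they receive the same score.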
    

Similar documents (content)

  1. Tombros, A.; Villa, R.; Rijsbergen, C.J. Van: The effectiveness of query-specific hierarchic clustering in information retrieval (2002) 1.32
    1.3247265 = sum of:
      1.3247265 = product of:
        2.3182712 = sum of:
          0.031526517 = weight(abstract_txt:retrieval in 4587) [ClassicSimilarity], result of:
            0.031526517 = score(doc=4587,freq=2.0), product of:
              0.082234494 = queryWeight, product of:
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.023699386 = queryNorm
              0.38337338 = fieldWeight in 4587, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.078125 = fieldNorm(doc=4587)
          0.059331823 = weight(abstract_txt:document in 4587) [ClassicSimilarity], result of:
            0.059331823 = score(doc=4587,freq=2.0), product of:
              0.1253512 = queryWeight, product of:
                1.2346312 = boost
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.023699386 = queryNorm
              0.47332472 = fieldWeight in 4587, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.078125 = fieldNorm(doc=4587)
          0.34176758 = weight(abstract_txt:clustering in 4587) [ClassicSimilarity], result of:
            0.34176758 = score(doc=4587,freq=7.0), product of:
              0.26529837 = queryWeight, product of:
                1.7961403 = boost
                6.232427 = idf(docFreq=230, maxDocs=43254)
                0.023699386 = queryNorm
              1.2882385 = fieldWeight in 4587, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.232427 = idf(docFreq=230, maxDocs=43254)
                0.078125 = fieldNorm(doc=4587)
          1.8856452 = weight(title_txt:hierarchic in 4587) [ClassicSimilarity], result of:
            1.8856452 = score(doc=4587,freq=1.0), product of:
              0.62884945 = queryWeight, product of:
                2.7653258 = boost
                9.595404 = idf(docFreq=7, maxDocs=43254)
                0.023699386 = queryNorm
              2.9985638 = fieldWeight in 4587, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.595404 = idf(docFreq=7, maxDocs=43254)
                0.3125 = fieldNorm(doc=4587)
        0.5714286 = coord(4/7)
    
  2. Miyamoto, S.: Information clustering based on fuzzy multisets (2003) 0.71
    0.7086435 = sum of:
      0.7086435 = product of:
        0.99210083 = sum of:
          0.031526517 = weight(abstract_txt:retrieval in 3072) [ClassicSimilarity], result of:
            0.031526517 = score(doc=3072,freq=2.0), product of:
              0.082234494 = queryWeight, product of:
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.023699386 = queryNorm
              0.38337338 = fieldWeight in 3072, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.078125 = fieldNorm(doc=3072)
          0.041953936 = weight(abstract_txt:document in 3072) [ClassicSimilarity], result of:
            0.041953936 = score(doc=3072,freq=1.0), product of:
              0.1253512 = queryWeight, product of:
                1.2346312 = boost
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.023699386 = queryNorm
              0.33469114 = fieldWeight in 3072, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.078125 = fieldNorm(doc=3072)
          0.17551035 = weight(abstract_txt:algorithms in 3072) [ClassicSimilarity], result of:
            0.17551035 = score(doc=3072,freq=3.0), product of:
              0.22565316 = queryWeight, product of:
                1.6565086 = boost
                5.747919 = idf(docFreq=374, maxDocs=43254)
                0.023699386 = queryNorm
              0.7777881 = fieldWeight in 3072, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.747919 = idf(docFreq=374, maxDocs=43254)
                0.078125 = fieldNorm(doc=3072)
          0.28884628 = weight(abstract_txt:clustering in 3072) [ClassicSimilarity], result of:
            0.28884628 = score(doc=3072,freq=5.0), product of:
              0.26529837 = queryWeight, product of:
                1.7961403 = boost
                6.232427 = idf(docFreq=230, maxDocs=43254)
                0.023699386 = queryNorm
              1.0887601 = fieldWeight in 3072, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.232427 = idf(docFreq=230, maxDocs=43254)
                0.078125 = fieldNorm(doc=3072)
          0.45426372 = weight(abstract_txt:agglomerative in 3072) [ClassicSimilarity], result of:
            0.45426372 = score(doc=3072,freq=1.0), product of:
              0.6135059 = queryWeight, product of:
                2.7313814 = boost
                9.47762 = idf(docFreq=8, maxDocs=43254)
                0.023699386 = queryNorm
              0.74043906 = fieldWeight in 3072, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.47762 = idf(docFreq=8, maxDocs=43254)
                0.078125 = fieldNorm(doc=3072)
        0.71428573 = coord(5/7)
    
  3. Cathey, R.J.; Jensen, E.C.; Beitzel, S.M.; Frieder, O.; Grossman, D.: Exploiting parallelism to support scalable hierarchical clustering (2007) 0.68
    0.6764396 = sum of:
      0.6764396 = product of:
        0.9470154 = sum of:
          0.017834092 = weight(abstract_txt:retrieval in 2449) [ClassicSimilarity], result of:
            0.017834092 = score(doc=2449,freq=1.0), product of:
              0.082234494 = queryWeight, product of:
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.023699386 = queryNorm
              0.21686874 = fieldWeight in 2449, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.0625 = fieldNorm(doc=2449)
          0.047465462 = weight(abstract_txt:document in 2449) [ClassicSimilarity], result of:
            0.047465462 = score(doc=2449,freq=2.0), product of:
              0.1253512 = queryWeight, product of:
                1.2346312 = boost
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.023699386 = queryNorm
              0.37865978 = fieldWeight in 2449, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.0625 = fieldNorm(doc=2449)
          0.11464287 = weight(abstract_txt:algorithms in 2449) [ClassicSimilarity], result of:
            0.11464287 = score(doc=2449,freq=2.0), product of:
              0.22565316 = queryWeight, product of:
                1.6565086 = boost
                5.747919 = idf(docFreq=374, maxDocs=43254)
                0.023699386 = queryNorm
              0.5080491 = fieldWeight in 2449, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.747919 = idf(docFreq=374, maxDocs=43254)
                0.0625 = fieldNorm(doc=2449)
          0.25313222 = weight(abstract_txt:clustering in 2449) [ClassicSimilarity], result of:
            0.25313222 = score(doc=2449,freq=6.0), product of:
              0.26529837 = queryWeight, product of:
                1.7961403 = boost
                6.232427 = idf(docFreq=230, maxDocs=43254)
                0.023699386 = queryNorm
              0.9541417 = fieldWeight in 2449, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.232427 = idf(docFreq=230, maxDocs=43254)
                0.0625 = fieldNorm(doc=2449)
          0.51394075 = weight(abstract_txt:agglomerative in 2449) [ClassicSimilarity], result of:
            0.51394075 = score(doc=2449,freq=2.0), product of:
              0.6135059 = queryWeight, product of:
                2.7313814 = boost
                9.47762 = idf(docFreq=8, maxDocs=43254)
                0.023699386 = queryNorm
              0.83771116 = fieldWeight in 2449, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.47762 = idf(docFreq=8, maxDocs=43254)
                0.0625 = fieldNorm(doc=2449)
        0.71428573 = coord(5/7)
    
  4. Kirriemuir, J.W.; Willett, P.: Identification of duplicate and near-duplicate full-text records in database search-outputs using hierarchic cluster analysis (1995) 0.45
    0.45383966 = sum of:
      0.45383966 = product of:
        1.5884387 = sum of:
          0.26848724 = weight(abstract_txt:clustering in 3498) [ClassicSimilarity], result of:
            0.26848724 = score(doc=3498,freq=3.0), product of:
              0.26529837 = queryWeight, product of:
                1.7961403 = boost
                6.232427 = idf(docFreq=230, maxDocs=43254)
                0.023699386 = queryNorm
              1.01202 = fieldWeight in 3498, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.232427 = idf(docFreq=230, maxDocs=43254)
                0.09375 = fieldNorm(doc=3498)
          1.3199515 = weight(title_txt:hierarchic in 3498) [ClassicSimilarity], result of:
            1.3199515 = score(doc=3498,freq=1.0), product of:
              0.62884945 = queryWeight, product of:
                2.7653258 = boost
                9.595404 = idf(docFreq=7, maxDocs=43254)
                0.023699386 = queryNorm
              2.0989945 = fieldWeight in 3498, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.595404 = idf(docFreq=7, maxDocs=43254)
                0.21875 = fieldNorm(doc=3498)
        0.2857143 = coord(2/7)
    
  5. Rijsbergen, C.J. van: A fast hierarchic clustering algorithm (1970) 0.38
    0.37712902 = sum of:
      0.37712902 = product of:
        2.639903 = sum of:
          2.639903 = weight(title_txt:hierarchic in 3300) [ClassicSimilarity], result of:
            2.639903 = score(doc=3300,freq=1.0), product of:
              0.62884945 = queryWeight, product of:
                2.7653258 = boost
                9.595404 = idf(docFreq=7, maxDocs=43254)
                0.023699386 = queryNorm
              4.197989 = fieldWeight in 3300, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.595404 = idf(docFreq=7, maxDocs=43254)
                0.4375 = fieldNorm(doc=3300)
        0.14285715 = coord(1/7)
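
The content scores combine several matching terms. The sketch below recomputes entries 4 and 5 from the numbers in the listing, assuming that each term score is queryWeight * fieldWeight, with queryWeight = boost * idf * queryNorm and fieldWeight = sqrt(freq) * idf * fieldNorm, and that the sum is then scaled by coord = matched terms / query terms. The helper names are hypothetical, and the results agree with the listing only up to single-precision rounding.

```python
import math

MAX_DOCS = 43254          # maxDocs from the listing
QUERY_NORM = 0.023699386  # queryNorm from the listing
QUERY_TERMS = 7           # denominator of coord(n/7)


def idf(doc_freq: int) -> float:
    # Same idf form as the listing: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(MAX_DOCS / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, field_norm: float, boost: float = 1.0) -> float:
    # score = queryWeight * fieldWeight for one matching term.
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    field_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight


def document_score(term_scores: list[float]) -> float:
    # Sum of the term scores, scaled by the fraction of query terms matched.
    coord = len(term_scores) / QUERY_TERMS
    return sum(term_scores) * coord


# Entry 5 (doc 3300): only title_txt:hierarchic matches, so coord = 1/7.
score_3300 = document_score([
    term_score(freq=1.0, doc_freq=7, field_norm=0.4375, boost=2.7653258),
])

# Entry 4 (doc 3498): abstract_txt:clustering and title_txt:hierarchic, coord = 2/7.
score_3498 = document_score([
    term_score(freq=3.0, doc_freq=230, field_norm=0.09375, boost=1.7961403),
    term_score(freq=1.0, doc_freq=7, field_norm=0.21875, boost=2.7653258),
])

print(f"doc 3300: {score_3300:.4f}")
print(f"doc 3498: {score_3498:.4f}")
```

The coord factor explains the ranking: entry 5 has the largest single term score in the list, but it matches only one of the seven query terms and therefore ranks last.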