Document (#5490)

Author
Croft, W.B.
Title
Clustering large files of documents using the single link method
Source
Journal of the American Society for Information Science. 28(1977), pp. 341-344
Year
1977
Theme
Automatic indexing

Similar documents (author)

  1. Croft, W.B.: Approaches to intelligent information retrieval (1987) 5.02
    5.020828 = sum of:
      5.020828 = weight(author_txt:croft in 1094) [ClassicSimilarity], result of:
        5.020828 = score(doc=1094,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.033325 = idf(docFreq=38, maxDocs=44218)
            0.12448145 = queryNorm
          5.0208282 = fieldWeight in 1094, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.033325 = idf(docFreq=38, maxDocs=44218)
            0.625 = fieldNorm(doc=1094)
    
  2. Croft, W.B.: Knowledge-based and statistical approaches to text retrieval (1993) 5.02
    5.020828 = sum of:
      5.020828 = weight(author_txt:croft in 7863) [ClassicSimilarity], result of:
        5.020828 = score(doc=7863,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.033325 = idf(docFreq=38, maxDocs=44218)
            0.12448145 = queryNorm
          5.0208282 = fieldWeight in 7863, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.033325 = idf(docFreq=38, maxDocs=44218)
            0.625 = fieldNorm(doc=7863)
    
  3. Croft, W.B.: Hypertext and information retrieval : what are the fundamental concepts? (1990) 5.02
    5.020828 = sum of:
      5.020828 = weight(author_txt:croft in 8003) [ClassicSimilarity], result of:
        5.020828 = score(doc=8003,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.033325 = idf(docFreq=38, maxDocs=44218)
            0.12448145 = queryNorm
          5.0208282 = fieldWeight in 8003, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.033325 = idf(docFreq=38, maxDocs=44218)
            0.625 = fieldNorm(doc=8003)
    
  4. Croft, W.B.: What do people want from information retrieval? : the top 10 research issues for companies that use and sell IR systems (1995) 5.02
    5.020828 = sum of:
      5.020828 = weight(author_txt:croft in 3402) [ClassicSimilarity], result of:
        5.020828 = score(doc=3402,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.033325 = idf(docFreq=38, maxDocs=44218)
            0.12448145 = queryNorm
          5.0208282 = fieldWeight in 3402, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.033325 = idf(docFreq=38, maxDocs=44218)
            0.625 = fieldNorm(doc=3402)
    
  5. Croft, W.B.: Effective retrieval based on combining evidence from the corpus and users (1995) 5.02
    5.020828 = sum of:
      5.020828 = weight(author_txt:croft in 4489) [ClassicSimilarity], result of:
        5.020828 = score(doc=4489,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.033325 = idf(docFreq=38, maxDocs=44218)
            0.12448145 = queryNorm
          5.0208282 = fieldWeight in 4489, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.033325 = idf(docFreq=38, maxDocs=44218)
            0.625 = fieldNorm(doc=4489)
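
The five author matches above all carry the same score because every factor in their explain trees is identical. The arithmetic can be retraced with the minimal sketch below. It assumes the classic TF-IDF definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); these are assumptions chosen because they reproduce the values listed, and the function names (`idf`, `tf`, `term_score`) are hypothetical helpers, not part of any library.

```python
import math


def idf(doc_freq, max_docs):
    # Assumed form of the inverse document frequency; it reproduces
    # the listed value 8.033325 for docFreq=38, maxDocs=44218.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def tf(freq):
    # Assumed term-frequency factor: square root of the raw count.
    return math.sqrt(freq)


def term_score(freq, doc_freq, max_docs, field_norm, query_norm, boost=1.0):
    # score = queryWeight * fieldWeight, as in the explain tree.
    i = idf(doc_freq, max_docs)
    query_weight = boost * i * query_norm    # idf * queryNorm (boost is 1 here)
    field_weight = tf(freq) * i * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight


# Values copied from the author_txt:croft entries above.
author_score = term_score(
    freq=1.0,
    doc_freq=38,
    max_docs=44218,
    field_norm=0.625,
    query_norm=0.12448145,
)
print(round(author_score, 2))
```

The result should agree with the listed 5.02 up to rounding; small differences in later digits are expected because the listing was presumably produced with single-precision floats.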
    

Similar documents (content)

  1. Burgin, R.: The retrieval effectiveness of 5 clustering algorithms as a function of indexing exhaustivity (1995) 0.77
    0.766325 = sum of:
      0.766325 = product of:
        1.22612 = sum of:
          0.07375534 = weight(abstract_txt:large in 3365) [ClassicSimilarity], result of:
            0.07375534 = score(doc=3365,freq=1.0), product of:
              0.2649443 = queryWeight, product of:
                1.2861497 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.04624919 = queryNorm
              0.27838057 = fieldWeight in 3365, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.0625 = fieldNorm(doc=3365)
          0.07610782 = weight(abstract_txt:method in 3365) [ClassicSimilarity], result of:
            0.07610782 = score(doc=3365,freq=1.0), product of:
              0.2705485 = queryWeight, product of:
                1.2996812 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.04624919 = queryNorm
              0.28130937 = fieldWeight in 3365, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=3365)
          0.23806258 = weight(abstract_txt:single in 3365) [ClassicSimilarity], result of:
            0.23806258 = score(doc=3365,freq=4.0), product of:
              0.36452976 = queryWeight, product of:
                1.5086231 = boost
                5.2245407 = idf(docFreq=646, maxDocs=44218)
                0.04624919 = queryNorm
              0.6530676 = fieldWeight in 3365, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.2245407 = idf(docFreq=646, maxDocs=44218)
                0.0625 = fieldNorm(doc=3365)
          0.34708574 = weight(abstract_txt:link in 3365) [ClassicSimilarity], result of:
            0.34708574 = score(doc=3365,freq=5.0), product of:
              0.43510434 = queryWeight, product of:
                1.648204 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.04624919 = queryNorm
              0.7977069 = fieldWeight in 3365, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=3365)
          0.49110857 = weight(abstract_txt:clustering in 3365) [ClassicSimilarity], result of:
            0.49110857 = score(doc=3365,freq=6.0), product of:
              0.516052 = queryWeight, product of:
                1.7949858 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.04624919 = queryNorm
              0.95166487 = fieldWeight in 3365, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.0625 = fieldNorm(doc=3365)
        0.625 = coord(5/8)
    
  2. Salton, G.: Fast document classification in automatic information retrieval (1978) 0.50
    0.49991533 = sum of:
      0.49991533 = product of:
        0.99983066 = sum of:
          0.12907185 = weight(abstract_txt:large in 2331) [ClassicSimilarity], result of:
            0.12907185 = score(doc=2331,freq=1.0), product of:
              0.2649443 = queryWeight, product of:
                1.2861497 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.04624919 = queryNorm
              0.487166 = fieldWeight in 2331, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.109375 = fieldNorm(doc=2331)
          0.1331887 = weight(abstract_txt:method in 2331) [ClassicSimilarity], result of:
            0.1331887 = score(doc=2331,freq=1.0), product of:
              0.2705485 = queryWeight, product of:
                1.2996812 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.04624919 = queryNorm
              0.4922914 = fieldWeight in 2331, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.109375 = fieldNorm(doc=2331)
          0.38670522 = weight(abstract_txt:files in 2331) [ClassicSimilarity], result of:
            0.38670522 = score(doc=2331,freq=2.0), product of:
              0.43702897 = queryWeight, product of:
                1.6518453 = boost
                5.720536 = idf(docFreq=393, maxDocs=44218)
                0.04624919 = queryNorm
              0.8848503 = fieldWeight in 2331, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.720536 = idf(docFreq=393, maxDocs=44218)
                0.109375 = fieldNorm(doc=2331)
          0.3508649 = weight(abstract_txt:clustering in 2331) [ClassicSimilarity], result of:
            0.3508649 = score(doc=2331,freq=1.0), product of:
              0.516052 = queryWeight, product of:
                1.7949858 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.04624919 = queryNorm
              0.6799022 = fieldWeight in 2331, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.109375 = fieldNorm(doc=2331)
        0.5 = coord(4/8)
    
  3. Rasmussen, E.: Clustering algorithms (1992) 0.49
    0.48788562 = sum of:
      0.48788562 = product of:
        0.780617 = sum of:
          0.058427896 = weight(abstract_txt:documents in 3513) [ClassicSimilarity], result of:
            0.058427896 = score(doc=3513,freq=1.0), product of:
              0.22683273 = queryWeight, product of:
                1.1900553 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.04624919 = queryNorm
              0.2575814 = fieldWeight in 3513, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=3513)
          0.07375534 = weight(abstract_txt:large in 3513) [ClassicSimilarity], result of:
            0.07375534 = score(doc=3513,freq=1.0), product of:
              0.2649443 = queryWeight, product of:
                1.2861497 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.04624919 = queryNorm
              0.27838057 = fieldWeight in 3513, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.0625 = fieldNorm(doc=3513)
          0.13182262 = weight(abstract_txt:method in 3513) [ClassicSimilarity], result of:
            0.13182262 = score(doc=3513,freq=3.0), product of:
              0.2705485 = queryWeight, product of:
                1.2996812 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.04624919 = queryNorm
              0.4872421 = fieldWeight in 3513, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=3513)
          0.20616822 = weight(abstract_txt:single in 3513) [ClassicSimilarity], result of:
            0.20616822 = score(doc=3513,freq=3.0), product of:
              0.36452976 = queryWeight, product of:
                1.5086231 = boost
                5.2245407 = idf(docFreq=646, maxDocs=44218)
                0.04624919 = queryNorm
              0.5655731 = fieldWeight in 3513, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.2245407 = idf(docFreq=646, maxDocs=44218)
                0.0625 = fieldNorm(doc=3513)
          0.31044292 = weight(abstract_txt:link in 3513) [ClassicSimilarity], result of:
            0.31044292 = score(doc=3513,freq=4.0), product of:
              0.43510434 = queryWeight, product of:
                1.648204 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.04624919 = queryNorm
              0.7134907 = fieldWeight in 3513, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=3513)
        0.625 = coord(5/8)
    
  4. Broder, A.Z.: Syntactic clustering of the Web (1997) 0.45
    0.44984183 = sum of:
      0.44984183 = product of:
        0.89968365 = sum of:
          0.06933442 = weight(abstract_txt:using in 2671) [ClassicSimilarity], result of:
            0.06933442 = score(doc=2671,freq=1.0), product of:
              0.16016643 = queryWeight, product of:
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.04624919 = queryNorm
              0.43288982 = fieldWeight in 2671, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.125 = fieldNorm(doc=2671)
          0.11685579 = weight(abstract_txt:documents in 2671) [ClassicSimilarity], result of:
            0.11685579 = score(doc=2671,freq=1.0), product of:
              0.22683273 = queryWeight, product of:
                1.1900553 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.04624919 = queryNorm
              0.5151628 = fieldWeight in 2671, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.125 = fieldNorm(doc=2671)
          0.312505 = weight(abstract_txt:files in 2671) [ClassicSimilarity], result of:
            0.312505 = score(doc=2671,freq=1.0), product of:
              0.43702897 = queryWeight, product of:
                1.6518453 = boost
                5.720536 = idf(docFreq=393, maxDocs=44218)
                0.04624919 = queryNorm
              0.715067 = fieldWeight in 2671, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.720536 = idf(docFreq=393, maxDocs=44218)
                0.125 = fieldNorm(doc=2671)
          0.40098843 = weight(abstract_txt:clustering in 2671) [ClassicSimilarity], result of:
            0.40098843 = score(doc=2671,freq=1.0), product of:
              0.516052 = queryWeight, product of:
                1.7949858 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.04624919 = queryNorm
              0.77703106 = fieldWeight in 2671, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.125 = fieldNorm(doc=2671)
        0.5 = coord(4/8)
    
  5. Girdhar, N.; Bharadwaj, K.K.: Community detection in signed social networks using multiobjective genetic algorithm (2019) 0.44
    0.43697 = sum of:
      0.43697 = product of:
        0.699152 = sum of:
          0.03466721 = weight(abstract_txt:using in 5318) [ClassicSimilarity], result of:
            0.03466721 = score(doc=5318,freq=1.0), product of:
              0.16016643 = queryWeight, product of:
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.04624919 = queryNorm
              0.21644491 = fieldWeight in 5318, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.0625 = fieldNorm(doc=5318)
          0.07610782 = weight(abstract_txt:method in 5318) [ClassicSimilarity], result of:
            0.07610782 = score(doc=5318,freq=1.0), product of:
              0.2705485 = queryWeight, product of:
                1.2996812 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.04624919 = queryNorm
              0.28130937 = fieldWeight in 5318, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=5318)
          0.11903129 = weight(abstract_txt:single in 5318) [ClassicSimilarity], result of:
            0.11903129 = score(doc=5318,freq=1.0), product of:
              0.36452976 = queryWeight, product of:
                1.5086231 = boost
                5.2245407 = idf(docFreq=646, maxDocs=44218)
                0.04624919 = queryNorm
              0.3265338 = fieldWeight in 5318, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2245407 = idf(docFreq=646, maxDocs=44218)
                0.0625 = fieldNorm(doc=5318)
          0.26885146 = weight(abstract_txt:link in 5318) [ClassicSimilarity], result of:
            0.26885146 = score(doc=5318,freq=3.0), product of:
              0.43510434 = queryWeight, product of:
                1.648204 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.04624919 = queryNorm
              0.6179011 = fieldWeight in 5318, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=5318)
          0.20049421 = weight(abstract_txt:clustering in 5318) [ClassicSimilarity], result of:
            0.20049421 = score(doc=5318,freq=1.0), product of:
              0.516052 = queryWeight, product of:
                1.7949858 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.04624919 = queryNorm
              0.38851553 = fieldWeight in 5318, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.0625 = fieldNorm(doc=5318)
        0.625 = coord(5/8)
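
For the content matches, each score is the sum of the per-term weights multiplied by a coordination factor, the fraction of query terms found in the document. The sketch below retraces the first entry (doc 3365) under the same assumed definitions, tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The constants are copied from the listing; the names `MATCHED_TERMS`, `term_weight`, and `QUERY_TERM_COUNT` are hypothetical and introduced only for illustration.

```python
import math

MAX_DOCS = 44218
QUERY_NORM = 0.04624919
FIELD_NORM = 0.0625       # fieldNorm(doc=3365)
QUERY_TERM_COUNT = 8      # denominator of coord(5/8)

# (term, termFreq, docFreq, boost) as listed for doc 3365
MATCHED_TERMS = [
    ("large", 1.0, 1397, 1.2861497),
    ("method", 1.0, 1333, 1.2996812),
    ("single", 4.0, 646, 1.5086231),
    ("link", 5.0, 398, 1.648204),
    ("clustering", 6.0, 239, 1.7949858),
]


def idf(doc_freq):
    # Assumed idf form that reproduces the listed idf values.
    return 1.0 + math.log(MAX_DOCS / (doc_freq + 1))


def term_weight(freq, doc_freq, boost):
    # queryWeight (boost * idf * queryNorm) times
    # fieldWeight (sqrt(freq) * idf * fieldNorm).
    i = idf(doc_freq)
    return (boost * i * QUERY_NORM) * (math.sqrt(freq) * i * FIELD_NORM)


raw_sum = sum(term_weight(f, df, b) for _, f, df, b in MATCHED_TERMS)
coord = len(MATCHED_TERMS) / QUERY_TERM_COUNT  # 5 of 8 query terms matched
total = raw_sum * coord
print(round(raw_sum, 3), coord, round(total, 2))
```

Under these assumptions the sum should come out close to the listed 1.22612 and the final score close to 0.77. The coordination factor explains why entries matching only four of the eight query terms are scaled by 0.5 instead of 0.625.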