Document (#26104)

Author
Subramanian, S.
Shafer, K.E.
Title
Clustering
Source
http://www.oclc.org/research/publications/arr/1997/
Year
1998
Abstract
This article presents our exploration of computer science clustering algorithms as they relate to the Scorpion system. Scorpion is a research project at OCLC that explores the indexing and cataloging of electronic resources. For a more complete description of Scorpion, please visit the Scorpion Web site at <http://purl.oclc.org/scorpion>.
Theme
Automatic classification
Internet
Object
Scorpion
DDC

Similar documents (author)

  1. Shafer, K.: Scorpion Project explores using Dewey to organize the Web (1996) 5.47
    5.47028 = sum of:
      5.47028 = weight(author_txt:shafer in 6750) [ClassicSimilarity], result of:
        5.47028 = fieldWeight in 6750, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.752448 = idf(docFreq=18, maxDocs=44218)
          0.625 = fieldNorm(doc=6750)
    
  2. Shafer, K.: Scorpion helps catalog the Web (1997) 5.47
    5.47028 = sum of:
      5.47028 = weight(author_txt:shafer in 2532) [ClassicSimilarity], result of:
        5.47028 = fieldWeight in 2532, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.752448 = idf(docFreq=18, maxDocs=44218)
          0.625 = fieldNorm(doc=2532)
    
  3. Shafer, K.E.: Manipulating Tagged text (2001) 5.47
    5.47028 = sum of:
      5.47028 = weight(author_txt:shafer in 4011) [ClassicSimilarity], result of:
        5.47028 = fieldWeight in 4011, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.752448 = idf(docFreq=18, maxDocs=44218)
          0.625 = fieldNorm(doc=4011)
    
  4. Shafer, K.E.: Mantis Project : A Toolkit for Cataloging (2001) 5.47
    5.47028 = sum of:
      5.47028 = weight(author_txt:shafer in 1028) [ClassicSimilarity], result of:
        5.47028 = fieldWeight in 1028, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.752448 = idf(docFreq=18, maxDocs=44218)
          0.625 = fieldNorm(doc=1028)
    
  5. Shafer, K.E.: Translating Mathematical Markup for Electronic Journals (2001) 5.47
    5.47028 = sum of:
      5.47028 = weight(author_txt:shafer in 1030) [ClassicSimilarity], result of:
        5.47028 = fieldWeight in 1030, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.752448 = idf(docFreq=18, maxDocs=44218)
          0.625 = fieldNorm(doc=1030)
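
The author scores above all follow the same arithmetic, which can be reproduced with a short sketch. The tf and idf formulas below are inferred from the numbers in the listing (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))) and are assumed to match the ClassicSimilarity version that produced it; the function names are hypothetical, and fieldNorm is taken as given rather than derived.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency, as inferred from the listing:
    1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq: float) -> float:
    """Term frequency factor: square root of the raw term count."""
    return math.sqrt(freq)


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm (fieldNorm copied from the listing)."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from the author_txt:shafer entries in the listing.
author_score = field_weight(freq=1.0, doc_freq=18, max_docs=44218, field_norm=0.625)
print(author_score)  # approximately 5.47028
```

Because every author match has freq=1.0, the same docFreq, and the same fieldNorm, all five documents receive the same score.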
    

Similar documents (content)

  1. Shafer, K.: Scorpion helps catalog the Web (1997) 1.00
    1.0027685 = sum of:
      1.0027685 = product of:
        4.178202 = sum of:
          0.02226554 = weight(abstract_txt:resources in 2532) [ClassicSimilarity], result of:
            0.02226554 = score(doc=2532,freq=2.0), product of:
              0.034088805 = queryWeight, product of:
                1.2521638 = boost
                4.2226825 = idf(docFreq=1761, maxDocs=44218)
                0.006447068 = queryNorm
              0.65316284 = fieldWeight in 2532, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2226825 = idf(docFreq=1761, maxDocs=44218)
                0.109375 = fieldNorm(doc=2532)
          0.016413659 = weight(abstract_txt:electronic in 2532) [ClassicSimilarity], result of:
            0.016413659 = score(doc=2532,freq=1.0), product of:
              0.035048537 = queryWeight, product of:
                1.269668 = boost
                4.281712 = idf(docFreq=1660, maxDocs=44218)
                0.006447068 = queryNorm
              0.46831226 = fieldWeight in 2532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.281712 = idf(docFreq=1660, maxDocs=44218)
                0.109375 = fieldNorm(doc=2532)
          0.017206686 = weight(abstract_txt:indexing in 2532) [ClassicSimilarity], result of:
            0.017206686 = score(doc=2532,freq=1.0), product of:
              0.03616855 = queryWeight, product of:
                1.2897953 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.006447068 = queryNorm
              0.47573614 = fieldWeight in 2532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.109375 = fieldNorm(doc=2532)
          0.01755027 = weight(abstract_txt:project in 2532) [ClassicSimilarity], result of:
            0.01755027 = score(doc=2532,freq=1.0), product of:
              0.036648437 = queryWeight, product of:
                1.2983236 = boost
                4.378348 = idf(docFreq=1507, maxDocs=44218)
                0.006447068 = queryNorm
              0.4788818 = fieldWeight in 2532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.378348 = idf(docFreq=1507, maxDocs=44218)
                0.109375 = fieldNorm(doc=2532)
          0.040676683 = weight(abstract_txt:oclc in 2532) [ClassicSimilarity], result of:
            0.040676683 = score(doc=2532,freq=1.0), product of:
              0.06418447 = queryWeight, product of:
                1.7181861 = boost
                5.794254 = idf(docFreq=365, maxDocs=44218)
                0.006447068 = queryNorm
              0.6337465 = fieldWeight in 2532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.794254 = idf(docFreq=365, maxDocs=44218)
                0.109375 = fieldNorm(doc=2532)
          4.0640893 = weight(title_txt:scorpion in 2532) [ClassicSimilarity], result of:
            4.0640893 = score(doc=2532,freq=1.0), product of:
              0.9378322 = queryWeight, product of:
                14.685975 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.006447068 = queryNorm
              4.333493 = fieldWeight in 2532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.4375 = fieldNorm(doc=2532)
        0.24 = coord(6/25)
    
  2. Shafer, K.E.: Evaluating Scorpion results (1998) 0.95
    0.9487462 = sum of:
      0.9487462 = product of:
        4.743731 = sum of:
          0.013753552 = weight(abstract_txt:science in 1569) [ClassicSimilarity], result of:
            0.013753552 = score(doc=1569,freq=1.0), product of:
              0.028498033 = queryWeight, product of:
                1.1448871 = boost
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.006447068 = queryNorm
              0.48261407 = fieldWeight in 1569, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.125 = fieldNorm(doc=1569)
          0.018758468 = weight(abstract_txt:electronic in 1569) [ClassicSimilarity], result of:
            0.018758468 = score(doc=1569,freq=1.0), product of:
              0.035048537 = queryWeight, product of:
                1.269668 = boost
                4.281712 = idf(docFreq=1660, maxDocs=44218)
                0.006447068 = queryNorm
              0.535214 = fieldWeight in 1569, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.281712 = idf(docFreq=1660, maxDocs=44218)
                0.125 = fieldNorm(doc=1569)
          0.020057451 = weight(abstract_txt:project in 1569) [ClassicSimilarity], result of:
            0.020057451 = score(doc=1569,freq=1.0), product of:
              0.036648437 = queryWeight, product of:
                1.2983236 = boost
                4.378348 = idf(docFreq=1507, maxDocs=44218)
                0.006447068 = queryNorm
              0.5472935 = fieldWeight in 1569, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.378348 = idf(docFreq=1507, maxDocs=44218)
                0.125 = fieldNorm(doc=1569)
          0.04648764 = weight(abstract_txt:oclc in 1569) [ClassicSimilarity], result of:
            0.04648764 = score(doc=1569,freq=1.0), product of:
              0.06418447 = queryWeight, product of:
                1.7181861 = boost
                5.794254 = idf(docFreq=365, maxDocs=44218)
                0.006447068 = queryNorm
              0.7242817 = fieldWeight in 1569, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.794254 = idf(docFreq=365, maxDocs=44218)
                0.125 = fieldNorm(doc=1569)
          4.644674 = weight(title_txt:scorpion in 1569) [ClassicSimilarity], result of:
            4.644674 = score(doc=1569,freq=1.0), product of:
              0.9378322 = queryWeight, product of:
                14.685975 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.006447068 = queryNorm
              4.952564 = fieldWeight in 1569, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.5 = fieldNorm(doc=1569)
        0.2 = coord(5/25)
    
  3. Shafer, K.: Scorpion Project explores using Dewey to organize the Web (1996) 0.72
    0.71807104 = sum of:
      0.71807104 = product of:
        2.9919627 = sum of:
          0.010315164 = weight(abstract_txt:science in 6750) [ClassicSimilarity], result of:
            0.010315164 = score(doc=6750,freq=1.0), product of:
              0.028498033 = queryWeight, product of:
                1.1448871 = boost
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.006447068 = queryNorm
              0.36196056 = fieldWeight in 6750, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.09375 = fieldNorm(doc=6750)
          0.014068851 = weight(abstract_txt:electronic in 6750) [ClassicSimilarity], result of:
            0.014068851 = score(doc=6750,freq=1.0), product of:
              0.035048537 = queryWeight, product of:
                1.269668 = boost
                4.281712 = idf(docFreq=1660, maxDocs=44218)
                0.006447068 = queryNorm
              0.40141052 = fieldWeight in 6750, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.281712 = idf(docFreq=1660, maxDocs=44218)
                0.09375 = fieldNorm(doc=6750)
          0.014748587 = weight(abstract_txt:indexing in 6750) [ClassicSimilarity], result of:
            0.014748587 = score(doc=6750,freq=1.0), product of:
              0.03616855 = queryWeight, product of:
                1.2897953 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.006447068 = queryNorm
              0.40777382 = fieldWeight in 6750, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.09375 = fieldNorm(doc=6750)
          0.015043089 = weight(abstract_txt:project in 6750) [ClassicSimilarity], result of:
            0.015043089 = score(doc=6750,freq=1.0), product of:
              0.036648437 = queryWeight, product of:
                1.2983236 = boost
                4.378348 = idf(docFreq=1507, maxDocs=44218)
                0.006447068 = queryNorm
              0.41047013 = fieldWeight in 6750, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.378348 = idf(docFreq=1507, maxDocs=44218)
                0.09375 = fieldNorm(doc=6750)
          0.03486573 = weight(abstract_txt:oclc in 6750) [ClassicSimilarity], result of:
            0.03486573 = score(doc=6750,freq=1.0), product of:
              0.06418447 = queryWeight, product of:
                1.7181861 = boost
                5.794254 = idf(docFreq=365, maxDocs=44218)
                0.006447068 = queryNorm
              0.5432113 = fieldWeight in 6750, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.794254 = idf(docFreq=365, maxDocs=44218)
                0.09375 = fieldNorm(doc=6750)
          2.9029212 = weight(title_txt:scorpion in 6750) [ClassicSimilarity], result of:
            2.9029212 = score(doc=6750,freq=1.0), product of:
              0.9378322 = queryWeight, product of:
                14.685975 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.006447068 = queryNorm
              3.0953524 = fieldWeight in 6750, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.3125 = fieldNorm(doc=6750)
        0.24 = coord(6/25)
    
  4. Shafer, K.E.: Evaluating Scorpion Results (2001) 0.57
    0.5686583 = sum of:
      0.5686583 = product of:
        4.738819 = sum of:
          0.04498319 = weight(abstract_txt:resources in 4085) [ClassicSimilarity], result of:
            0.04498319 = score(doc=4085,freq=1.0), product of:
              0.034088805 = queryWeight, product of:
                1.2521638 = boost
                4.2226825 = idf(docFreq=1761, maxDocs=44218)
                0.006447068 = queryNorm
              1.3195883 = fieldWeight in 4085, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2226825 = idf(docFreq=1761, maxDocs=44218)
                0.3125 = fieldNorm(doc=4085)
          0.049161956 = weight(abstract_txt:indexing in 4085) [ClassicSimilarity], result of:
            0.049161956 = score(doc=4085,freq=1.0), product of:
              0.03616855 = queryWeight, product of:
                1.2897953 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.006447068 = queryNorm
              1.359246 = fieldWeight in 4085, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.3125 = fieldNorm(doc=4085)
          4.644674 = weight(title_txt:scorpion in 4085) [ClassicSimilarity], result of:
            4.644674 = score(doc=4085,freq=1.0), product of:
              0.9378322 = queryWeight, product of:
                14.685975 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.006447068 = queryNorm
              4.952564 = fieldWeight in 4085, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.5 = fieldNorm(doc=4085)
        0.12 = coord(3/25)
    
  5. Shafer, K.E.: Automatic Subject Assignment via the Scorpion System (2001) 0.14
    0.13934019 = sum of:
      0.13934019 = product of:
        3.483505 = sum of:
          3.483505 = weight(title_txt:scorpion in 1043) [ClassicSimilarity], result of:
            3.483505 = score(doc=1043,freq=1.0), product of:
              0.9378322 = queryWeight, product of:
                14.685975 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.006447068 = queryNorm
              3.7144227 = fieldWeight in 1043, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.375 = fieldNorm(doc=1043)
        0.04 = coord(1/25)
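
The content scores combine per-term scores with a coordination factor. A minimal sketch of that arithmetic follows, assuming the same inferred idf formula as in the listing (1 + ln(maxDocs / (docFreq + 1))). The boost, queryNorm, and fieldNorm values are copied from the listing rather than derived, and all function and variable names are hypothetical.

```python
import math

MAX_DOCS = 44218
QUERY_NORM = 0.006447068  # copied from the listing; not derived here
QUERY_TERMS = 25          # denominator of coord(n/25) in the listing


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Inverse document frequency as inferred from the listing."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost: float, doc_freq: int, freq: float, field_norm: float) -> float:
    """score = queryWeight * fieldWeight for one matching term."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


def document_score(terms) -> float:
    """Sum of the term scores, scaled by coord = matching terms / query terms."""
    coord = len(terms) / QUERY_TERMS
    return coord * sum(term_score(*t) for t in terms)


# (boost, docFreq, freq, fieldNorm) copied from the entries for docs 4085 and 1043.
doc_4085 = [
    (1.2521638, 1761, 1.0, 0.3125),  # abstract_txt:resources
    (1.2897953, 1551, 1.0, 0.3125),  # abstract_txt:indexing
    (14.685975, 5, 1.0, 0.5),        # title_txt:scorpion
]
doc_1043 = [(14.685975, 5, 1.0, 0.375)]  # title_txt:scorpion only

print(document_score(doc_4085))  # approximately 0.5687
print(document_score(doc_1043))  # approximately 0.1393
```

The title term dominates each sum because of its large boost and high idf, while the coord factor is what separates documents that match only the title term from those that also match abstract terms.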