Document (#26105)

Author
Subramanian, S.
Shafer, K.E.
Title
Clustering
Source
http://www.oclc.org/research/publications/arr/1997/
Year
1998
Abstract
This article presents our exploration of computer science clustering algorithms as they relate to the Scorpion system. Scorpion is a research project at OCLC that explores the indexing and cataloging of electronic resources. For a more complete description of Scorpion, please visit the Scorpion Web site at <http://purl.oclc.org/scorpion>.
Theme
Automatic classification
Internet
Object
Scorpion
DDC

Similar documents (author)

  1. Shafer, K.: Scorpion Project explores using Dewey to organize the Web (1996) 5.48
    5.4828243 = sum of:
      5.4828243 = weight(author_txt:shafer in 6819) [ClassicSimilarity], result of:
        5.4828243 = fieldWeight in 6819, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.772519 = idf(docFreq=17, maxDocs=42740)
          0.625 = fieldNorm(doc=6819)
    
  2. Shafer, K.: Scorpion helps catalog the Web (1997) 5.48
    5.4828243 = sum of:
      5.4828243 = weight(author_txt:shafer in 3533) [ClassicSimilarity], result of:
        5.4828243 = fieldWeight in 3533, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.772519 = idf(docFreq=17, maxDocs=42740)
          0.625 = fieldNorm(doc=3533)
    
  3. Shafer, K.E.: Manipulating Tagged text (2001) 5.48
    5.4828243 = sum of:
      5.4828243 = weight(author_txt:shafer in 5012) [ClassicSimilarity], result of:
        5.4828243 = fieldWeight in 5012, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.772519 = idf(docFreq=17, maxDocs=42740)
          0.625 = fieldNorm(doc=5012)
    
  4. Shafer, K.E.: Mantis Project : A Toolkit for Cataloging (2001) 5.48
    5.4828243 = sum of:
      5.4828243 = weight(author_txt:shafer in 2029) [ClassicSimilarity], result of:
        5.4828243 = fieldWeight in 2029, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.772519 = idf(docFreq=17, maxDocs=42740)
          0.625 = fieldNorm(doc=2029)
    
  5. Shafer, K.E.: Translating Mathematical Markup for Electronic Journals (2001) 5.48
    5.4828243 = sum of:
      5.4828243 = weight(author_txt:shafer in 2031) [ClassicSimilarity], result of:
        5.4828243 = fieldWeight in 2031, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.772519 = idf(docFreq=17, maxDocs=42740)
          0.625 = fieldNorm(doc=2031)
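
The author scores above are all the same number because each one is a single fieldWeight term: tf times idf times fieldNorm. The sketch below recomputes that product. It assumes the older ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which reproduce the figures in this record; the function names are illustrative and not part of any Lucene API, and fieldNorm is taken from the listing as given rather than derived.

```python
import math


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw frequency (assumed form)."""
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency in the form that matches the listing:
    1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the explain tree."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from the author_txt:shafer entries above.
idf = classic_idf(doc_freq=17, max_docs=42740)
author_score = field_weight(freq=1.0, doc_freq=17, max_docs=42740, field_norm=0.625)

print(round(idf, 4))           # about 8.7725
print(round(author_score, 4))  # about 5.4828
```

Lucene does this arithmetic in single precision, so the last digits of a double-precision recomputation can differ slightly from the listed values.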
    

Similar documents (content)

  1. Shafer, K.: Scorpion helps catalog the Web (1997) 1.00
    0.9985525 = sum of:
      0.9985525 = product of:
        4.1606355 = sum of:
          0.02245755 = weight(abstract_txt:resources in 3533) [ClassicSimilarity], result of:
            0.02245755 = score(doc=3533,freq=2.0), product of:
              0.03435356 = queryWeight, product of:
                1.2565644 = boost
                4.226273 = idf(docFreq=1696, maxDocs=42740)
                0.006468886 = queryNorm
              0.65371823 = fieldWeight in 3533, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.226273 = idf(docFreq=1696, maxDocs=42740)
                0.109375 = fieldNorm(doc=3533)
          0.016136544 = weight(abstract_txt:electronic in 3533) [ClassicSimilarity], result of:
            0.016136544 = score(doc=3533,freq=1.0), product of:
              0.03472273 = queryWeight, product of:
                1.2632979 = boost
                4.2489204 = idf(docFreq=1658, maxDocs=42740)
                0.006468886 = queryNorm
              0.46472567 = fieldWeight in 3533, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2489204 = idf(docFreq=1658, maxDocs=42740)
                0.109375 = fieldNorm(doc=3533)
          0.017201172 = weight(abstract_txt:indexing in 3533) [ClassicSimilarity], result of:
            0.017201172 = score(doc=3533,freq=1.0), product of:
              0.03623366 = queryWeight, product of:
                1.2904909 = boost
                4.34038 = idf(docFreq=1513, maxDocs=42740)
                0.006468886 = queryNorm
              0.4747291 = fieldWeight in 3533, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.34038 = idf(docFreq=1513, maxDocs=42740)
                0.109375 = fieldNorm(doc=3533)
          0.017661469 = weight(abstract_txt:project in 3533) [ClassicSimilarity], result of:
            0.017661469 = score(doc=3533,freq=1.0), product of:
              0.03687721 = queryWeight, product of:
                1.3019007 = boost
                4.3787556 = idf(docFreq=1456, maxDocs=42740)
                0.006468886 = queryNorm
              0.4789264 = fieldWeight in 3533, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3787556 = idf(docFreq=1456, maxDocs=42740)
                0.109375 = fieldNorm(doc=3533)
          0.04043743 = weight(abstract_txt:oclc in 3533) [ClassicSimilarity], result of:
            0.04043743 = score(doc=3533,freq=1.0), product of:
              0.06406132 = queryWeight, product of:
                1.715919 = boost
                5.7712464 = idf(docFreq=361, maxDocs=42740)
                0.006468886 = queryNorm
              0.63123006 = fieldWeight in 3533, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7712464 = idf(docFreq=361, maxDocs=42740)
                0.109375 = fieldNorm(doc=3533)
          4.0467415 = weight(title_txt:scorpion in 3533) [ClassicSimilarity], result of:
            4.0467415 = score(doc=3533,freq=1.0), product of:
              0.93704516 = queryWeight, product of:
                14.674527 = boost
                9.871131 = idf(docFreq=5, maxDocs=42740)
                0.006468886 = queryNorm
              4.3186197 = fieldWeight in 3533, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.871131 = idf(docFreq=5, maxDocs=42740)
                0.4375 = fieldNorm(doc=3533)
        0.24 = coord(6/25)
    
  2. Shafer, K.E.: Evaluating Scorpion results (1998) 0.94
    0.9448 = sum of:
      0.9448 = product of:
        4.724 = sum of:
          0.0143113965 = weight(abstract_txt:science in 2570) [ClassicSimilarity], result of:
            0.0143113965 = score(doc=2570,freq=1.0), product of:
              0.02932245 = queryWeight, product of:
                1.1609111 = boost
                3.904557 = idf(docFreq=2340, maxDocs=42740)
                0.006468886 = queryNorm
              0.48806962 = fieldWeight in 2570, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.904557 = idf(docFreq=2340, maxDocs=42740)
                0.125 = fieldNorm(doc=2570)
          0.018441765 = weight(abstract_txt:electronic in 2570) [ClassicSimilarity], result of:
            0.018441765 = score(doc=2570,freq=1.0), product of:
              0.03472273 = queryWeight, product of:
                1.2632979 = boost
                4.2489204 = idf(docFreq=1658, maxDocs=42740)
                0.006468886 = queryNorm
              0.53111506 = fieldWeight in 2570, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2489204 = idf(docFreq=1658, maxDocs=42740)
                0.125 = fieldNorm(doc=2570)
          0.020184537 = weight(abstract_txt:project in 2570) [ClassicSimilarity], result of:
            0.020184537 = score(doc=2570,freq=1.0), product of:
              0.03687721 = queryWeight, product of:
                1.3019007 = boost
                4.3787556 = idf(docFreq=1456, maxDocs=42740)
                0.006468886 = queryNorm
              0.54734445 = fieldWeight in 2570, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3787556 = idf(docFreq=1456, maxDocs=42740)
                0.125 = fieldNorm(doc=2570)
          0.046214208 = weight(abstract_txt:oclc in 2570) [ClassicSimilarity], result of:
            0.046214208 = score(doc=2570,freq=1.0), product of:
              0.06406132 = queryWeight, product of:
                1.715919 = boost
                5.7712464 = idf(docFreq=361, maxDocs=42740)
                0.006468886 = queryNorm
              0.7214058 = fieldWeight in 2570, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7712464 = idf(docFreq=361, maxDocs=42740)
                0.125 = fieldNorm(doc=2570)
          4.624848 = weight(title_txt:scorpion in 2570) [ClassicSimilarity], result of:
            4.624848 = score(doc=2570,freq=1.0), product of:
              0.93704516 = queryWeight, product of:
                14.674527 = boost
                9.871131 = idf(docFreq=5, maxDocs=42740)
                0.006468886 = queryNorm
              4.9355655 = fieldWeight in 2570, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.871131 = idf(docFreq=5, maxDocs=42740)
                0.5 = fieldNorm(doc=2570)
        0.2 = coord(5/25)
    
  3. Shafer, K.: Scorpion Project explores using Dewey to organize the Web (1996) 0.72
    0.715113 = sum of:
      0.715113 = product of:
        2.9796376 = sum of:
          0.010733548 = weight(abstract_txt:science in 6819) [ClassicSimilarity], result of:
            0.010733548 = score(doc=6819,freq=1.0), product of:
              0.02932245 = queryWeight, product of:
                1.1609111 = boost
                3.904557 = idf(docFreq=2340, maxDocs=42740)
                0.006468886 = queryNorm
              0.3660522 = fieldWeight in 6819, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.904557 = idf(docFreq=2340, maxDocs=42740)
                0.09375 = fieldNorm(doc=6819)
          0.013831324 = weight(abstract_txt:electronic in 6819) [ClassicSimilarity], result of:
            0.013831324 = score(doc=6819,freq=1.0), product of:
              0.03472273 = queryWeight, product of:
                1.2632979 = boost
                4.2489204 = idf(docFreq=1658, maxDocs=42740)
                0.006468886 = queryNorm
              0.3983363 = fieldWeight in 6819, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2489204 = idf(docFreq=1658, maxDocs=42740)
                0.09375 = fieldNorm(doc=6819)
          0.014743863 = weight(abstract_txt:indexing in 6819) [ClassicSimilarity], result of:
            0.014743863 = score(doc=6819,freq=1.0), product of:
              0.03623366 = queryWeight, product of:
                1.2904909 = boost
                4.34038 = idf(docFreq=1513, maxDocs=42740)
                0.006468886 = queryNorm
              0.40691066 = fieldWeight in 6819, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.34038 = idf(docFreq=1513, maxDocs=42740)
                0.09375 = fieldNorm(doc=6819)
          0.015138403 = weight(abstract_txt:project in 6819) [ClassicSimilarity], result of:
            0.015138403 = score(doc=6819,freq=1.0), product of:
              0.03687721 = queryWeight, product of:
                1.3019007 = boost
                4.3787556 = idf(docFreq=1456, maxDocs=42740)
                0.006468886 = queryNorm
              0.41050833 = fieldWeight in 6819, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3787556 = idf(docFreq=1456, maxDocs=42740)
                0.09375 = fieldNorm(doc=6819)
          0.034660656 = weight(abstract_txt:oclc in 6819) [ClassicSimilarity], result of:
            0.034660656 = score(doc=6819,freq=1.0), product of:
              0.06406132 = queryWeight, product of:
                1.715919 = boost
                5.7712464 = idf(docFreq=361, maxDocs=42740)
                0.006468886 = queryNorm
              0.54105437 = fieldWeight in 6819, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7712464 = idf(docFreq=361, maxDocs=42740)
                0.09375 = fieldNorm(doc=6819)
          2.8905299 = weight(title_txt:scorpion in 6819) [ClassicSimilarity], result of:
            2.8905299 = score(doc=6819,freq=1.0), product of:
              0.93704516 = queryWeight, product of:
                14.674527 = boost
                9.871131 = idf(docFreq=5, maxDocs=42740)
                0.006468886 = queryNorm
              3.0847285 = fieldWeight in 6819, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.871131 = idf(docFreq=5, maxDocs=42740)
                0.3125 = fieldNorm(doc=6819)
        0.24 = coord(6/25)
    
  4. Shafer, K.E.: Evaluating Scorpion Results (2001) 0.57
    0.5663238 = sum of:
      0.5663238 = product of:
        4.719365 = sum of:
          0.045371104 = weight(abstract_txt:resources in 86) [ClassicSimilarity], result of:
            0.045371104 = score(doc=86,freq=1.0), product of:
              0.03435356 = queryWeight, product of:
                1.2565644 = boost
                4.226273 = idf(docFreq=1696, maxDocs=42740)
                0.006468886 = queryNorm
              1.3207103 = fieldWeight in 86, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.226273 = idf(docFreq=1696, maxDocs=42740)
                0.3125 = fieldNorm(doc=86)
          0.049146205 = weight(abstract_txt:indexing in 86) [ClassicSimilarity], result of:
            0.049146205 = score(doc=86,freq=1.0), product of:
              0.03623366 = queryWeight, product of:
                1.2904909 = boost
                4.34038 = idf(docFreq=1513, maxDocs=42740)
                0.006468886 = queryNorm
              1.3563688 = fieldWeight in 86, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.34038 = idf(docFreq=1513, maxDocs=42740)
                0.3125 = fieldNorm(doc=86)
          4.624848 = weight(title_txt:scorpion in 86) [ClassicSimilarity], result of:
            4.624848 = score(doc=86,freq=1.0), product of:
              0.93704516 = queryWeight, product of:
                14.674527 = boost
                9.871131 = idf(docFreq=5, maxDocs=42740)
                0.006468886 = queryNorm
              4.9355655 = fieldWeight in 86, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.871131 = idf(docFreq=5, maxDocs=42740)
                0.5 = fieldNorm(doc=86)
        0.12 = coord(3/25)
    
  5. Shafer, K.E.: Automatic Subject Assignment via the Scorpion System (2001) 0.14
    0.13874543 = sum of:
      0.13874543 = product of:
        3.4686358 = sum of:
          3.4686358 = weight(title_txt:scorpion in 2044) [ClassicSimilarity], result of:
            3.4686358 = score(doc=2044,freq=1.0), product of:
              0.93704516 = queryWeight, product of:
                14.674527 = boost
                9.871131 = idf(docFreq=5, maxDocs=42740)
                0.006468886 = queryNorm
              3.701674 = fieldWeight in 2044, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.871131 = idf(docFreq=5, maxDocs=42740)
                0.375 = fieldNorm(doc=2044)
        0.04 = coord(1/25)
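
The content scores combine three steps: each matching term contributes queryWeight times fieldWeight, the contributions are summed, and the sum is scaled by coord, the fraction of query clauses that matched. The sketch below recomputes the last entry (doc 2044), which has a single matching term. It assumes the same tf and idf definitions that reproduce the figures in this record; the helper names are hypothetical, and boost, queryNorm and fieldNorm are copied from the listing rather than derived.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """idf in the form that matches the listing: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, field_norm, boost, query_norm):
    """Score of one matching term: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm            # query-side factor
    field_weight = math.sqrt(freq) * idf * field_norm  # document-side factor
    return query_weight * field_weight


def document_score(term_scores, total_clauses):
    """Sum of the term scores, scaled by coord = matched clauses / total clauses."""
    coord = len(term_scores) / total_clauses
    return sum(term_scores) * coord


# Values copied from the title_txt:scorpion entry for doc 2044 above.
scorpion = term_score(
    freq=1.0, doc_freq=5, max_docs=42740,
    field_norm=0.375, boost=14.674527, query_norm=0.006468886,
)
total = document_score([scorpion], total_clauses=25)

print(round(scorpion, 4))  # about 3.4686
print(round(total, 4))     # about 0.1387
```

The coord factor explains the ranking: doc 2044 matches the rare title term just as the higher-ranked records do, but with only 1 of 25 clauses matched its sum is multiplied by 0.04.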