Document (#2332)

Author
Salton, G.
Title
Fast document classification in automatic information retrieval
Source
Kooperation in der Klassifikation I. Proc. der Sekt.1-3 der 2. Fachtagung der Gesellschaft für Klassifikation, Frankfurt-Hoechst, 6.-7.4.1978. Bearb.: W. Dahlberg
Imprint
Frankfurt : Gesellschaft für Klassifikation
Year
1978
Pages
S.129-146
Series
Studien zur Klassifikation; Bd.2
Abstract
A classified or clustered file is one where related or similar records are grouped into classes or clusters of items in such a way that all itmes within a cluster are jointly retrievable. Clustered files are easily adapted to to broad and narrow search strategies, and simple file updating methods are available. An inexpensive file clustering method applicable to large files is given together with appropriate file search methods
Theme
Automatisches Indexieren

Similar documents (author)

  1. Salton, G.: Another look at automatic text-retrieval systems (1986) 4.85
    4.8517637 = sum of:
      4.8517637 = weight(author_txt:salton in 1356) [ClassicSimilarity], result of:
        4.8517637 = fieldWeight in 1356, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.762822 = idf(docFreq=49, maxDocs=43254)
          0.625 = fieldNorm(doc=1356)
    
  2. Salton, G.: ¬A new comparison between conventional indexing (MEDLARS) and automatic text processing (SMART) (1972) 4.85
    4.8517637 = sum of:
      4.8517637 = weight(author_txt:salton in 2325) [ClassicSimilarity], result of:
        4.8517637 = fieldWeight in 2325, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.762822 = idf(docFreq=49, maxDocs=43254)
          0.625 = fieldNorm(doc=2325)
    
  3. Salton, G.: Future prospects for text-based information retrieval (1990) 4.85
    4.8517637 = sum of:
      4.8517637 = weight(author_txt:salton in 2327) [ClassicSimilarity], result of:
        4.8517637 = fieldWeight in 2327, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.762822 = idf(docFreq=49, maxDocs=43254)
          0.625 = fieldNorm(doc=2327)
    
  4. Salton, G.: Expert systems and information retrieval (1987) 4.85
    4.8517637 = sum of:
      4.8517637 = weight(author_txt:salton in 2837) [ClassicSimilarity], result of:
        4.8517637 = fieldWeight in 2837, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.762822 = idf(docFreq=49, maxDocs=43254)
          0.625 = fieldNorm(doc=2837)
    
  5. Salton, G.: Historical note: the past thirty years in information retrieval (1987) 4.85
    4.8517637 = sum of:
      4.8517637 = weight(author_txt:salton in 3910) [ClassicSimilarity], result of:
        4.8517637 = fieldWeight in 3910, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.762822 = idf(docFreq=49, maxDocs=43254)
          0.625 = fieldNorm(doc=3910)
    

Similar documents (content)

  1. O'Neill, E.T.; Bennett, R.; Kammerer, K.: Using authorities to improve subject searches (2012) 0.17
    0.16673642 = sum of:
      0.16673642 = product of:
        0.6947351 = sum of:
          0.04172519 = weight(abstract_txt:appropriate in 1775) [ClassicSimilarity], result of:
            0.04172519 = score(doc=1775,freq=1.0), product of:
              0.100349635 = queryWeight, product of:
                1.0121912 = boost
                5.3222156 = idf(docFreq=573, maxDocs=43254)
                0.018627767 = queryNorm
              0.4157981 = fieldWeight in 1775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3222156 = idf(docFreq=573, maxDocs=43254)
                0.078125 = fieldNorm(doc=1775)
          0.04197282 = weight(abstract_txt:simple in 1775) [ClassicSimilarity], result of:
            0.04197282 = score(doc=1775,freq=1.0), product of:
              0.10074628 = queryWeight, product of:
                1.0141896 = boost
                5.3327236 = idf(docFreq=567, maxDocs=43254)
                0.018627767 = queryNorm
              0.41661903 = fieldWeight in 1775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3327236 = idf(docFreq=567, maxDocs=43254)
                0.078125 = fieldNorm(doc=1775)
          0.0541565 = weight(abstract_txt:fast in 1775) [ClassicSimilarity], result of:
            0.0541565 = score(doc=1775,freq=1.0), product of:
              0.11940358 = queryWeight, product of:
                1.1041125 = boost
                5.805548 = idf(docFreq=353, maxDocs=43254)
                0.018627767 = queryNorm
              0.45355844 = fieldWeight in 1775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.805548 = idf(docFreq=353, maxDocs=43254)
                0.078125 = fieldNorm(doc=1775)
          0.026982097 = weight(abstract_txt:search in 1775) [ClassicSimilarity], result of:
            0.026982097 = score(doc=1775,freq=1.0), product of:
              0.0945462 = queryWeight, product of:
                1.389446 = boost
                3.6529322 = idf(docFreq=3046, maxDocs=43254)
                0.018627767 = queryNorm
              0.2853853 = fieldWeight in 1775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6529322 = idf(docFreq=3046, maxDocs=43254)
                0.078125 = fieldNorm(doc=1775)
          0.1031212 = weight(abstract_txt:files in 1775) [ClassicSimilarity], result of:
            0.1031212 = score(doc=1775,freq=1.0), product of:
              0.23111364 = queryWeight, product of:
                2.1723633 = boost
                5.7112656 = idf(docFreq=388, maxDocs=43254)
                0.018627767 = queryNorm
              0.44619262 = fieldWeight in 1775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7112656 = idf(docFreq=388, maxDocs=43254)
                0.078125 = fieldNorm(doc=1775)
          0.4267773 = weight(abstract_txt:file in 1775) [ClassicSimilarity], result of:
            0.4267773 = score(doc=1775,freq=5.0), product of:
              0.4389494 = queryWeight, product of:
                4.2339125 = boost
                5.5655975 = idf(docFreq=449, maxDocs=43254)
                0.018627767 = queryNorm
              0.9722699 = fieldWeight in 1775, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.5655975 = idf(docFreq=449, maxDocs=43254)
                0.078125 = fieldNorm(doc=1775)
        0.24 = coord(6/25)
    
  2. Lee, D.L.; Ren, L.: Document ranking on weight-partitioned signature files (1996) 0.16
    0.15939161 = sum of:
      0.15939161 = product of:
        0.796958 = sum of:
          0.048282735 = weight(abstract_txt:together in 4418) [ClassicSimilarity], result of:
            0.048282735 = score(doc=4418,freq=1.0), product of:
              0.0979469 = queryWeight, product of:
                5.258113 = idf(docFreq=611, maxDocs=43254)
                0.018627767 = queryNorm
              0.49294809 = fieldWeight in 4418, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.258113 = idf(docFreq=611, maxDocs=43254)
                0.09375 = fieldNorm(doc=4418)
          0.032378517 = weight(abstract_txt:search in 4418) [ClassicSimilarity], result of:
            0.032378517 = score(doc=4418,freq=1.0), product of:
              0.0945462 = queryWeight, product of:
                1.389446 = boost
                3.6529322 = idf(docFreq=3046, maxDocs=43254)
                0.018627767 = queryNorm
              0.3424624 = fieldWeight in 4418, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6529322 = idf(docFreq=3046, maxDocs=43254)
                0.09375 = fieldNorm(doc=4418)
          0.13448589 = weight(abstract_txt:grouped in 4418) [ClassicSimilarity], result of:
            0.13448589 = score(doc=4418,freq=1.0), product of:
              0.19390123 = queryWeight, product of:
                1.4070027 = boost
                7.398179 = idf(docFreq=71, maxDocs=43254)
                0.018627767 = queryNorm
              0.6935793 = fieldWeight in 4418, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.398179 = idf(docFreq=71, maxDocs=43254)
                0.09375 = fieldNorm(doc=4418)
          0.12374544 = weight(abstract_txt:files in 4418) [ClassicSimilarity], result of:
            0.12374544 = score(doc=4418,freq=1.0), product of:
              0.23111364 = queryWeight, product of:
                2.1723633 = boost
                5.7112656 = idf(docFreq=388, maxDocs=43254)
                0.018627767 = queryNorm
              0.53543115 = fieldWeight in 4418, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7112656 = idf(docFreq=388, maxDocs=43254)
                0.09375 = fieldNorm(doc=4418)
          0.45806545 = weight(abstract_txt:file in 4418) [ClassicSimilarity], result of:
            0.45806545 = score(doc=4418,freq=4.0), product of:
              0.4389494 = queryWeight, product of:
                4.2339125 = boost
                5.5655975 = idf(docFreq=449, maxDocs=43254)
                0.018627767 = queryNorm
              1.0435495 = fieldWeight in 4418, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.5655975 = idf(docFreq=449, maxDocs=43254)
                0.09375 = fieldNorm(doc=4418)
        0.2 = coord(5/25)
    
  3. O'Neill, E.T.; Bennett, R.; Kammerer, K.: Using authorities to improve subject searches (2014) 0.13
    0.1335506 = sum of:
      0.1335506 = product of:
        0.667753 = sum of:
          0.04172519 = weight(abstract_txt:appropriate in 3435) [ClassicSimilarity], result of:
            0.04172519 = score(doc=3435,freq=1.0), product of:
              0.100349635 = queryWeight, product of:
                1.0121912 = boost
                5.3222156 = idf(docFreq=573, maxDocs=43254)
                0.018627767 = queryNorm
              0.4157981 = fieldWeight in 3435, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3222156 = idf(docFreq=573, maxDocs=43254)
                0.078125 = fieldNorm(doc=3435)
          0.04197282 = weight(abstract_txt:simple in 3435) [ClassicSimilarity], result of:
            0.04197282 = score(doc=3435,freq=1.0), product of:
              0.10074628 = queryWeight, product of:
                1.0141896 = boost
                5.3327236 = idf(docFreq=567, maxDocs=43254)
                0.018627767 = queryNorm
              0.41661903 = fieldWeight in 3435, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3327236 = idf(docFreq=567, maxDocs=43254)
                0.078125 = fieldNorm(doc=3435)
          0.0541565 = weight(abstract_txt:fast in 3435) [ClassicSimilarity], result of:
            0.0541565 = score(doc=3435,freq=1.0), product of:
              0.11940358 = queryWeight, product of:
                1.1041125 = boost
                5.805548 = idf(docFreq=353, maxDocs=43254)
                0.018627767 = queryNorm
              0.45355844 = fieldWeight in 3435, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.805548 = idf(docFreq=353, maxDocs=43254)
                0.078125 = fieldNorm(doc=3435)
          0.1031212 = weight(abstract_txt:files in 3435) [ClassicSimilarity], result of:
            0.1031212 = score(doc=3435,freq=1.0), product of:
              0.23111364 = queryWeight, product of:
                2.1723633 = boost
                5.7112656 = idf(docFreq=388, maxDocs=43254)
                0.018627767 = queryNorm
              0.44619262 = fieldWeight in 3435, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7112656 = idf(docFreq=388, maxDocs=43254)
                0.078125 = fieldNorm(doc=3435)
          0.4267773 = weight(abstract_txt:file in 3435) [ClassicSimilarity], result of:
            0.4267773 = score(doc=3435,freq=5.0), product of:
              0.4389494 = queryWeight, product of:
                4.2339125 = boost
                5.5655975 = idf(docFreq=449, maxDocs=43254)
                0.018627767 = queryNorm
              0.9722699 = fieldWeight in 3435, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.5655975 = idf(docFreq=449, maxDocs=43254)
                0.078125 = fieldNorm(doc=3435)
        0.2 = coord(5/25)
    
  4. Zamir, O.; Etzioni, O.: Grouper : a dynamic clustering interface to Web search results (1999) 0.13
    0.12596332 = sum of:
      0.12596332 = product of:
        0.52484715 = sum of:
          0.04197282 = weight(abstract_txt:simple in 1208) [ClassicSimilarity], result of:
            0.04197282 = score(doc=1208,freq=1.0), product of:
              0.10074628 = queryWeight, product of:
                1.0141896 = boost
                5.3327236 = idf(docFreq=567, maxDocs=43254)
                0.018627767 = queryNorm
              0.41661903 = fieldWeight in 1208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3327236 = idf(docFreq=567, maxDocs=43254)
                0.078125 = fieldNorm(doc=1208)
          0.0541565 = weight(abstract_txt:fast in 1208) [ClassicSimilarity], result of:
            0.0541565 = score(doc=1208,freq=1.0), product of:
              0.11940358 = queryWeight, product of:
                1.1041125 = boost
                5.805548 = idf(docFreq=353, maxDocs=43254)
                0.018627767 = queryNorm
              0.45355844 = fieldWeight in 1208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.805548 = idf(docFreq=353, maxDocs=43254)
                0.078125 = fieldNorm(doc=1208)
          0.13400547 = weight(abstract_txt:clustering in 1208) [ClassicSimilarity], result of:
            0.13400547 = score(doc=1208,freq=4.0), product of:
              0.13760851 = queryWeight, product of:
                1.1852974 = boost
                6.232427 = idf(docFreq=230, maxDocs=43254)
                0.018627767 = queryNorm
              0.97381675 = fieldWeight in 1208, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.232427 = idf(docFreq=230, maxDocs=43254)
                0.078125 = fieldNorm(doc=1208)
          0.13296421 = weight(abstract_txt:clusters in 1208) [ClassicSimilarity], result of:
            0.13296421 = score(doc=1208,freq=3.0), product of:
              0.15067217 = queryWeight, product of:
                1.2402841 = boost
                6.5215535 = idf(docFreq=172, maxDocs=43254)
                0.018627767 = queryNorm
              0.8824736 = fieldWeight in 1208, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.5215535 = idf(docFreq=172, maxDocs=43254)
                0.078125 = fieldNorm(doc=1208)
          0.1347661 = weight(abstract_txt:cluster in 1208) [ClassicSimilarity], result of:
            0.1347661 = score(doc=1208,freq=3.0), product of:
              0.15203035 = queryWeight, product of:
                1.2458616 = boost
                6.550881 = idf(docFreq=167, maxDocs=43254)
                0.018627767 = queryNorm
              0.88644207 = fieldWeight in 1208, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.550881 = idf(docFreq=167, maxDocs=43254)
                0.078125 = fieldNorm(doc=1208)
          0.026982097 = weight(abstract_txt:search in 1208) [ClassicSimilarity], result of:
            0.026982097 = score(doc=1208,freq=1.0), product of:
              0.0945462 = queryWeight, product of:
                1.389446 = boost
                3.6529322 = idf(docFreq=3046, maxDocs=43254)
                0.018627767 = queryNorm
              0.2853853 = fieldWeight in 1208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6529322 = idf(docFreq=3046, maxDocs=43254)
                0.078125 = fieldNorm(doc=1208)
        0.24 = coord(6/25)
    
  5. Rasmussen, E.: Clustering algorithms (1992) 0.12
    0.11912315 = sum of:
      0.11912315 = product of:
        0.59561574 = sum of:
          0.07727277 = weight(abstract_txt:items in 5514) [ClassicSimilarity], result of:
            0.07727277 = score(doc=5514,freq=4.0), product of:
              0.11062534 = queryWeight, product of:
                1.0627521 = boost
                5.5880704 = idf(docFreq=439, maxDocs=43254)
                0.018627767 = queryNorm
              0.6985088 = fieldWeight in 5514, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.5880704 = idf(docFreq=439, maxDocs=43254)
                0.0625 = fieldNorm(doc=5514)
          0.08685185 = weight(abstract_txt:clusters in 5514) [ClassicSimilarity], result of:
            0.08685185 = score(doc=5514,freq=2.0), product of:
              0.15067217 = queryWeight, product of:
                1.2402841 = boost
                6.5215535 = idf(docFreq=172, maxDocs=43254)
                0.018627767 = queryNorm
              0.5764293 = fieldWeight in 5514, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5215535 = idf(docFreq=172, maxDocs=43254)
                0.0625 = fieldNorm(doc=5514)
          0.13918583 = weight(abstract_txt:cluster in 5514) [ClassicSimilarity], result of:
            0.13918583 = score(doc=5514,freq=5.0), product of:
              0.15203035 = queryWeight, product of:
                1.2458616 = boost
                6.550881 = idf(docFreq=167, maxDocs=43254)
                0.018627767 = queryNorm
              0.91551346 = fieldWeight in 5514, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.550881 = idf(docFreq=167, maxDocs=43254)
                0.0625 = fieldNorm(doc=5514)
          0.078400895 = weight(abstract_txt:methods in 5514) [ClassicSimilarity], result of:
            0.078400895 = score(doc=5514,freq=6.0), product of:
              0.12294113 = queryWeight, product of:
                1.5844125 = boost
                4.1655097 = idf(docFreq=1824, maxDocs=43254)
                0.018627767 = queryNorm
              0.63771087 = fieldWeight in 5514, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.1655097 = idf(docFreq=1824, maxDocs=43254)
                0.0625 = fieldNorm(doc=5514)
          0.21390437 = weight(abstract_txt:clustered in 5514) [ClassicSimilarity], result of:
            0.21390437 = score(doc=5514,freq=1.0), product of:
              0.43619436 = queryWeight, product of:
                2.9844182 = boost
                7.846204 = idf(docFreq=45, maxDocs=43254)
                0.018627767 = queryNorm
              0.49038774 = fieldWeight in 5514, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.846204 = idf(docFreq=45, maxDocs=43254)
                0.0625 = fieldNorm(doc=5514)
        0.2 = coord(5/25)