Document (#7155)

Author
Ottaviani, J.S.
Title
¬The fractal nature of relevance : a hypothesis
Source
Journal of the American Society for Information Science. 45(1994) no.4, p.263-272
Year
1994
Abstract
This article proposes a new model, based on fractal geometry, for clusters of relevant documents. It reflects the relatively simple iterative search process used by interactive online searchers. The untested model has the additional attractive feature of highlighting the logarithmic growth of clusters, which produces complexities in relevance judgements and document clusters not realized by typical models. It indicates that clusters formed using dynamic search strategies appear topologically distinct and indecomposable, and result from chaotic processes. The model also provides an intuitive definition and representation of cluster dimension which differentiates between clusters where typical models do not. The fractal model, then, gives an indication of what I believe are the limits on clustering relevant documents.

Similar documents (content)

  1. Yang, C.C.; Wang, F.L.: Hierarchical summarization of large documents (2008) 0.17
    0.16731645 = sum of:
      0.16731645 = product of:
        1.0457278 = sum of:
          0.02987 = weight(abstract_txt:documents in 1719) [ClassicSimilarity], result of:
            0.02987 = score(doc=1719,freq=2.0), product of:
              0.08199846 = queryWeight, product of:
                1.2494437 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.015924087 = queryNorm
              0.36427513 = fieldWeight in 1719, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=1719)
          0.042670492 = weight(abstract_txt:models in 1719) [ClassicSimilarity], result of:
            0.042670492 = score(doc=1719,freq=2.0), product of:
              0.10400814 = queryWeight, product of:
                1.4071729 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.015924087 = queryNorm
              0.4102611 = fieldWeight in 1719, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.0625 = fieldNorm(doc=1719)
          0.05405676 = weight(abstract_txt:model in 1719) [ClassicSimilarity], result of:
            0.05405676 = score(doc=1719,freq=2.0), product of:
              0.15342362 = queryWeight, product of:
                2.4169905 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.015924087 = queryNorm
              0.35233662 = fieldWeight in 1719, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.0625 = fieldNorm(doc=1719)
          0.9191306 = weight(abstract_txt:fractal in 1719) [ClassicSimilarity], result of:
            0.9191306 = score(doc=1719,freq=6.0), product of:
              0.6390827 = queryWeight, product of:
                4.272066 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.015924087 = queryNorm
              1.438203 = fieldWeight in 1719, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.0625 = fieldNorm(doc=1719)
        0.16 = coord(4/25)
    
  2. Abdo, A.H.; Cointet, J.-P.; Bourret, P.; Cambrosio, A.: Domain-topic models with chained dimensions : charting an emergent domain of a major oncology conference (2022) 0.14
    0.1350806 = sum of:
      0.1350806 = product of:
        0.5628359 = sum of:
          0.043314196 = weight(abstract_txt:dimension in 619) [ClassicSimilarity], result of:
            0.043314196 = score(doc=619,freq=1.0), product of:
              0.10505153 = queryWeight, product of:
                6.5970206 = idf(docFreq=163, maxDocs=44218)
                0.015924087 = queryNorm
              0.4123138 = fieldWeight in 619, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5970206 = idf(docFreq=163, maxDocs=44218)
                0.0625 = fieldNorm(doc=619)
          0.036583126 = weight(abstract_txt:documents in 619) [ClassicSimilarity], result of:
            0.036583126 = score(doc=619,freq=3.0), product of:
              0.08199846 = queryWeight, product of:
                1.2494437 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.015924087 = queryNorm
              0.44614407 = fieldWeight in 619, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=619)
          0.030055316 = weight(abstract_txt:relevant in 619) [ClassicSimilarity], result of:
            0.030055316 = score(doc=619,freq=1.0), product of:
              0.10373845 = queryWeight, product of:
                1.4053473 = boost
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.015924087 = queryNorm
              0.28972206 = fieldWeight in 619, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.0625 = fieldNorm(doc=619)
          0.042670492 = weight(abstract_txt:models in 619) [ClassicSimilarity], result of:
            0.042670492 = score(doc=619,freq=2.0), product of:
              0.10400814 = queryWeight, product of:
                1.4071729 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.015924087 = queryNorm
              0.4102611 = fieldWeight in 619, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.0625 = fieldNorm(doc=619)
          0.07644781 = weight(abstract_txt:model in 619) [ClassicSimilarity], result of:
            0.07644781 = score(doc=619,freq=4.0), product of:
              0.15342362 = queryWeight, product of:
                2.4169905 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.015924087 = queryNorm
              0.49827924 = fieldWeight in 619, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.0625 = fieldNorm(doc=619)
          0.33376494 = weight(abstract_txt:clusters in 619) [ClassicSimilarity], result of:
            0.33376494 = score(doc=619,freq=4.0), product of:
              0.4098353 = queryWeight, product of:
                3.950331 = boost
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.015924087 = queryNorm
              0.814388 = fieldWeight in 619, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.0625 = fieldNorm(doc=619)
        0.24 = coord(6/25)
    
  3. Desai, M.; Spink, A.: ¬An algorithm to cluster documents based on relevance (2005) 0.08
    0.08468062 = sum of:
      0.08468062 = product of:
        0.42340308 = sum of:
          0.029539049 = weight(abstract_txt:search in 1035) [ClassicSimilarity], result of:
            0.029539049 = score(doc=1035,freq=4.0), product of:
              0.0646006 = queryWeight, product of:
                1.1090014 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.015924087 = queryNorm
              0.45725656 = fieldWeight in 1035, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0625 = fieldNorm(doc=1035)
          0.05588165 = weight(abstract_txt:documents in 1035) [ClassicSimilarity], result of:
            0.05588165 = score(doc=1035,freq=7.0), product of:
              0.08199846 = queryWeight, product of:
                1.2494437 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.015924087 = queryNorm
              0.6814963 = fieldWeight in 1035, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=1035)
          0.09016595 = weight(abstract_txt:relevant in 1035) [ClassicSimilarity], result of:
            0.09016595 = score(doc=1035,freq=9.0), product of:
              0.10373845 = queryWeight, product of:
                1.4053473 = boost
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.015924087 = queryNorm
              0.86916614 = fieldWeight in 1035, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.0625 = fieldNorm(doc=1035)
          0.08093396 = weight(abstract_txt:relevance in 1035) [ClassicSimilarity], result of:
            0.08093396 = score(doc=1035,freq=5.0), product of:
              0.11742379 = queryWeight, product of:
                1.4951744 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.015924087 = queryNorm
              0.6892467 = fieldWeight in 1035, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.0625 = fieldNorm(doc=1035)
          0.16688247 = weight(abstract_txt:clusters in 1035) [ClassicSimilarity], result of:
            0.16688247 = score(doc=1035,freq=1.0), product of:
              0.4098353 = queryWeight, product of:
                3.950331 = boost
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.015924087 = queryNorm
              0.407194 = fieldWeight in 1035, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.0625 = fieldNorm(doc=1035)
        0.2 = coord(5/25)
    
  4. Losee, R.M.; Church Jr., L.: Are two document clusters better than one? : the cluster performance question for information retrieval (2005) 0.08
    0.08163058 = sum of:
      0.08163058 = product of:
        0.51019114 = sum of:
          0.07020588 = weight(abstract_txt:hypothesis in 3270) [ClassicSimilarity], result of:
            0.07020588 = score(doc=3270,freq=1.0), product of:
              0.110620864 = queryWeight, product of:
                1.0261654 = boost
                6.769634 = idf(docFreq=137, maxDocs=44218)
                0.015924087 = queryNorm
              0.63465315 = fieldWeight in 3270, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.769634 = idf(docFreq=137, maxDocs=44218)
                0.09375 = fieldNorm(doc=3270)
          0.031681918 = weight(abstract_txt:documents in 3270) [ClassicSimilarity], result of:
            0.031681918 = score(doc=3270,freq=1.0), product of:
              0.08199846 = queryWeight, product of:
                1.2494437 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.015924087 = queryNorm
              0.38637212 = fieldWeight in 3270, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.09375 = fieldNorm(doc=3270)
          0.05429215 = weight(abstract_txt:relevance in 3270) [ClassicSimilarity], result of:
            0.05429215 = score(doc=3270,freq=1.0), product of:
              0.11742379 = queryWeight, product of:
                1.4951744 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.015924087 = queryNorm
              0.46236074 = fieldWeight in 3270, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.09375 = fieldNorm(doc=3270)
          0.35401118 = weight(abstract_txt:clusters in 3270) [ClassicSimilarity], result of:
            0.35401118 = score(doc=3270,freq=2.0), product of:
              0.4098353 = queryWeight, product of:
                3.950331 = boost
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.015924087 = queryNorm
              0.86378884 = fieldWeight in 3270, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.09375 = fieldNorm(doc=3270)
        0.16 = coord(4/25)
    
  5. Klobas, J.E.: Beyond information quality : fitness for purpose and electronic information resource use (1995) 0.08
    0.07997875 = sum of:
      0.07997875 = product of:
        0.3332448 = sum of:
          0.058317903 = weight(abstract_txt:formed in 1945) [ClassicSimilarity], result of:
            0.058317903 = score(doc=1945,freq=1.0), product of:
              0.110385016 = queryWeight, product of:
                1.0250708 = boost
                6.7624135 = idf(docFreq=138, maxDocs=44218)
                0.015924087 = queryNorm
              0.5283136 = fieldWeight in 1945, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7624135 = idf(docFreq=138, maxDocs=44218)
                0.078125 = fieldNorm(doc=1945)
          0.061322387 = weight(abstract_txt:believe in 1945) [ClassicSimilarity], result of:
            0.061322387 = score(doc=1945,freq=1.0), product of:
              0.11414448 = queryWeight, product of:
                1.0423805 = boost
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.015924087 = queryNorm
              0.5372348 = fieldWeight in 1945, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.078125 = fieldNorm(doc=1945)
          0.082865424 = weight(abstract_txt:indication in 1945) [ClassicSimilarity], result of:
            0.082865424 = score(doc=1945,freq=1.0), product of:
              0.13951614 = queryWeight, product of:
                1.1524206 = boost
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.015924087 = queryNorm
              0.59394866 = fieldWeight in 1945, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.078125 = fieldNorm(doc=1945)
          0.037715744 = weight(abstract_txt:models in 1945) [ClassicSimilarity], result of:
            0.037715744 = score(doc=1945,freq=1.0), product of:
              0.10400814 = queryWeight, product of:
                1.4071729 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.015924087 = queryNorm
              0.362623 = fieldWeight in 1945, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.078125 = fieldNorm(doc=1945)
          0.04524346 = weight(abstract_txt:relevance in 1945) [ClassicSimilarity], result of:
            0.04524346 = score(doc=1945,freq=1.0), product of:
              0.11742379 = queryWeight, product of:
                1.4951744 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.015924087 = queryNorm
              0.38530064 = fieldWeight in 1945, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.078125 = fieldNorm(doc=1945)
          0.047779877 = weight(abstract_txt:model in 1945) [ClassicSimilarity], result of:
            0.047779877 = score(doc=1945,freq=1.0), product of:
              0.15342362 = queryWeight, product of:
                2.4169905 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.015924087 = queryNorm
              0.31142452 = fieldWeight in 1945, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.078125 = fieldNorm(doc=1945)
        0.24 = coord(6/25)
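
The score trees above follow the TF-IDF scheme of Lucene's ClassicSimilarity: each matching term contributes queryWeight x fieldWeight, the contributions are summed, and the sum is scaled by coord (matched terms / query terms). The sketch below recomputes the score of the first similar document (doc 1719) from the values printed in its tree. It is an illustration only: the function and variable names are assumptions made for this example, not part of the record or of the Lucene API, and the formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)) are taken to be the ClassicSimilarity defaults.

```python
import math

# Constants copied from the explain output for doc 1719.
MAX_DOCS = 44218
QUERY_NORM = 0.015924087
FIELD_NORM = 0.0625


def tf(freq):
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, boost, field_norm=FIELD_NORM):
    """Score of one term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = tf(freq) * term_idf * field_norm
    return query_weight * field_weight


# (term, freq in doc 1719, docFreq, boost), as listed in the tree.
terms = [
    ("documents", 2.0, 1949, 1.2494437),
    ("models", 2.0, 1158, 1.4071729),
    ("model", 2.0, 2231, 2.4169905),
    ("fractal", 6.0, 9, 4.272066),
]

term_sum = sum(term_score(freq, df, boost) for _, freq, df, boost in terms)
coord = 4 / 25  # 4 of the 25 query terms matched
score = term_sum * coord

print(f"sum of term scores: {term_sum:.7f}")
print(f"final score:        {score:.7f}")
```

The result should agree with the listed 0.16731645 up to rounding, since the index stores these factors as 32-bit floats while Python computes in double precision. Most of the score comes from the rare term "fractal" (docFreq=9), whose high idf enters both queryWeight and fieldWeight.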