Document (#13107)

Author
O'Kane, K.C.
Title
Generating hierarchical document indices from common denominators in large document collections
Source
Information processing and management. 32(1996) no.2, p.105-115
Year
1996
Abstract
Describes an effective, simple and efficient algorithm for the computer generation of hierarchical indices from document-term matrices by calculating common denominator vectors from the document vector set. The procedure produces an intuitive, user-friendly hierarchical index of a document collection, not unlike what a manual indexer would produce when creating an index or outline of the collection. When presented through a graphical user interface, the resulting index gives the user a natural, easily comprehended view of the document collection and permits general browsing and informal searching with an access method that requires no keyboard entry or prior knowledge of the vocabulary.
Theme
Automatic indexing
Indexes

Similar documents (content)

  1. Hartman, J.H.; Proebsting, T.A.; Sundaram, R.: Index-based hyperlinks (1997) 0.22
    0.218187 = sum of:
      0.218187 = product of:
        1.090935 = sum of:
          0.049968533 = weight(abstract_txt:user in 4724) [ClassicSimilarity], result of:
            0.049968533 = score(doc=4724,freq=1.0), product of:
              0.12417237 = queryWeight, product of:
                1.6692661 = boost
                3.6792014 = idf(docFreq=2967, maxDocs=43254)
                0.020218356 = queryNorm
              0.40241265 = fieldWeight in 4724, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6792014 = idf(docFreq=2967, maxDocs=43254)
                0.109375 = fieldNorm(doc=4724)
          0.10701132 = weight(abstract_txt:index in 4724) [ClassicSimilarity], result of:
            0.10701132 = score(doc=4724,freq=1.0), product of:
              0.20630687 = queryWeight, product of:
                2.1516416 = boost
                4.7423973 = idf(docFreq=1024, maxDocs=43254)
                0.020218356 = queryNorm
              0.5186997 = fieldWeight in 4724, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7423973 = idf(docFreq=1024, maxDocs=43254)
                0.109375 = fieldNorm(doc=4724)
          0.58618015 = weight(abstract_txt:indices in 4724) [ClassicSimilarity], result of:
            0.58618015 = score(doc=4724,freq=5.0), product of:
              0.32751223 = queryWeight, product of:
                2.2135086 = boost
                7.318136 = idf(docFreq=77, maxDocs=43254)
                0.020218356 = queryNorm
              1.7897961 = fieldWeight in 4724, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.318136 = idf(docFreq=77, maxDocs=43254)
                0.109375 = fieldNorm(doc=4724)
          0.1900035 = weight(abstract_txt:hierarchical in 4724) [ClassicSimilarity], result of:
            0.1900035 = score(doc=4724,freq=1.0), product of:
              0.3025067 = queryWeight, product of:
                2.605437 = boost
                5.7426 = idf(docFreq=376, maxDocs=43254)
                0.020218356 = queryNorm
              0.6280969 = fieldWeight in 4724, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7426 = idf(docFreq=376, maxDocs=43254)
                0.109375 = fieldNorm(doc=4724)
          0.15777147 = weight(abstract_txt:document in 4724) [ClassicSimilarity], result of:
            0.15777147 = score(doc=4724,freq=1.0), product of:
              0.33671016 = queryWeight, product of:
                3.887373 = boost
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.020218356 = queryNorm
              0.4685676 = fieldWeight in 4724, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.109375 = fieldNorm(doc=4724)
        0.2 = coord(5/25)
    
  2. Kim, P.J.; Lee, J.Y.; Park, J.-H.: Developing a new collection-evaluation method : mapping and the user-side h-index (2009) 0.16
    0.16232347 = sum of:
      0.16232347 = product of:
        0.67634785 = sum of:
          0.055248994 = weight(abstract_txt:procedure in 172) [ClassicSimilarity], result of:
            0.055248994 = score(doc=172,freq=1.0), product of:
              0.13368882 = queryWeight, product of:
                6.61225 = idf(docFreq=157, maxDocs=43254)
                0.020218356 = queryNorm
              0.41326562 = fieldWeight in 172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.61225 = idf(docFreq=157, maxDocs=43254)
                0.0625 = fieldNorm(doc=172)
          0.009073149 = weight(abstract_txt:with in 172) [ClassicSimilarity], result of:
            0.009073149 = score(doc=172,freq=1.0), product of:
              0.05782176 = queryWeight, product of:
                1.1390918 = boost
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.020218356 = queryNorm
              0.15691583 = fieldWeight in 172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.0625 = fieldNorm(doc=172)
          0.06384745 = weight(abstract_txt:user in 172) [ClassicSimilarity], result of:
            0.06384745 = score(doc=172,freq=5.0), product of:
              0.12417237 = queryWeight, product of:
                1.6692661 = boost
                3.6792014 = idf(docFreq=2967, maxDocs=43254)
                0.020218356 = queryNorm
              0.51418406 = fieldWeight in 172, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.6792014 = idf(docFreq=2967, maxDocs=43254)
                0.0625 = fieldNorm(doc=172)
          0.15288295 = weight(abstract_txt:collection in 172) [ClassicSimilarity], result of:
            0.15288295 = score(doc=172,freq=7.0), product of:
              0.19866711 = queryWeight, product of:
                2.111427 = boost
                4.653761 = idf(docFreq=1119, maxDocs=43254)
                0.020218356 = queryNorm
              0.76954335 = fieldWeight in 172, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.653761 = idf(docFreq=1119, maxDocs=43254)
                0.0625 = fieldNorm(doc=172)
          0.18344797 = weight(abstract_txt:index in 172) [ClassicSimilarity], result of:
            0.18344797 = score(doc=172,freq=9.0), product of:
              0.20630687 = queryWeight, product of:
                2.1516416 = boost
                4.7423973 = idf(docFreq=1024, maxDocs=43254)
                0.020218356 = queryNorm
              0.8891995 = fieldWeight in 172, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                4.7423973 = idf(docFreq=1024, maxDocs=43254)
                0.0625 = fieldNorm(doc=172)
          0.21184734 = weight(abstract_txt:indices in 172) [ClassicSimilarity], result of:
            0.21184734 = score(doc=172,freq=2.0), product of:
              0.32751223 = queryWeight, product of:
                2.2135086 = boost
                7.318136 = idf(docFreq=77, maxDocs=43254)
                0.020218356 = queryNorm
              0.64683795 = fieldWeight in 172, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.318136 = idf(docFreq=77, maxDocs=43254)
                0.0625 = fieldNorm(doc=172)
        0.24 = coord(6/25)
    
  3. Kim, Y.W.; Kim, J.H.: A model of knowledge based information retrieval with hierarchical concept graph (1990) 0.14
    0.13997312 = sum of:
      0.13997312 = product of:
        0.5832213 = sum of:
          0.12656547 = weight(abstract_txt:intuitive in 3909) [ClassicSimilarity], result of:
            0.12656547 = score(doc=3909,freq=2.0), product of:
              0.15890552 = queryWeight, product of:
                1.0902396 = boost
                7.2089367 = idf(docFreq=86, maxDocs=43254)
                0.020218356 = queryNorm
              0.7964825 = fieldWeight in 3909, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.2089367 = idf(docFreq=86, maxDocs=43254)
                0.078125 = fieldNorm(doc=3909)
          0.019643946 = weight(abstract_txt:with in 3909) [ClassicSimilarity], result of:
            0.019643946 = score(doc=3909,freq=3.0), product of:
              0.05782176 = queryWeight, product of:
                1.1390918 = boost
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.020218356 = queryNorm
              0.33973274 = fieldWeight in 3909, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.078125 = fieldNorm(doc=3909)
          0.021788733 = weight(abstract_txt:from in 3909) [ClassicSimilarity], result of:
            0.021788733 = score(doc=3909,freq=2.0), product of:
              0.07092357 = queryWeight, product of:
                1.2615613 = boost
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.020218356 = queryNorm
              0.3072143 = fieldWeight in 3909, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.078125 = fieldNorm(doc=3909)
          0.03569181 = weight(abstract_txt:user in 3909) [ClassicSimilarity], result of:
            0.03569181 = score(doc=3909,freq=1.0), product of:
              0.12417237 = queryWeight, product of:
                1.6692661 = boost
                3.6792014 = idf(docFreq=2967, maxDocs=43254)
                0.020218356 = queryNorm
              0.28743762 = fieldWeight in 3909, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6792014 = idf(docFreq=2967, maxDocs=43254)
                0.078125 = fieldNorm(doc=3909)
          0.108097754 = weight(abstract_txt:index in 3909) [ClassicSimilarity], result of:
            0.108097754 = score(doc=3909,freq=2.0), product of:
              0.20630687 = queryWeight, product of:
                2.1516416 = boost
                4.7423973 = idf(docFreq=1024, maxDocs=43254)
                0.020218356 = queryNorm
              0.52396584 = fieldWeight in 3909, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7423973 = idf(docFreq=1024, maxDocs=43254)
                0.078125 = fieldNorm(doc=3909)
          0.27143356 = weight(abstract_txt:hierarchical in 3909) [ClassicSimilarity], result of:
            0.27143356 = score(doc=3909,freq=4.0), product of:
              0.3025067 = queryWeight, product of:
                2.605437 = boost
                5.7426 = idf(docFreq=376, maxDocs=43254)
                0.020218356 = queryNorm
              0.8972812 = fieldWeight in 3909, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.7426 = idf(docFreq=376, maxDocs=43254)
                0.078125 = fieldNorm(doc=3909)
        0.24 = coord(6/25)
    
  4. Kim, G.: Relationship between index term specificity and relevance judgment (2006) 0.13
    0.12518631 = sum of:
      0.12518631 = product of:
        0.62593156 = sum of:
          0.019247057 = weight(abstract_txt:with in 2987) [ClassicSimilarity], result of:
            0.019247057 = score(doc=2987,freq=2.0), product of:
              0.05782176 = queryWeight, product of:
                1.1390918 = boost
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.020218356 = queryNorm
              0.33286873 = fieldWeight in 2987, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.09375 = fieldNorm(doc=2987)
          0.026146479 = weight(abstract_txt:from in 2987) [ClassicSimilarity], result of:
            0.026146479 = score(doc=2987,freq=2.0), product of:
              0.07092357 = queryWeight, product of:
                1.2615613 = boost
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.020218356 = queryNorm
              0.36865714 = fieldWeight in 2987, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.09375 = fieldNorm(doc=2987)
          0.18344797 = weight(abstract_txt:index in 2987) [ClassicSimilarity], result of:
            0.18344797 = score(doc=2987,freq=4.0), product of:
              0.20630687 = queryWeight, product of:
                2.1516416 = boost
                4.7423973 = idf(docFreq=1024, maxDocs=43254)
                0.020218356 = queryNorm
              0.8891995 = fieldWeight in 2987, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.7423973 = idf(docFreq=1024, maxDocs=43254)
                0.09375 = fieldNorm(doc=2987)
          0.16286016 = weight(abstract_txt:hierarchical in 2987) [ClassicSimilarity], result of:
            0.16286016 = score(doc=2987,freq=1.0), product of:
              0.3025067 = queryWeight, product of:
                2.605437 = boost
                5.7426 = idf(docFreq=376, maxDocs=43254)
                0.020218356 = queryNorm
              0.53836876 = fieldWeight in 2987, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7426 = idf(docFreq=376, maxDocs=43254)
                0.09375 = fieldNorm(doc=2987)
          0.23422988 = weight(abstract_txt:document in 2987) [ClassicSimilarity], result of:
            0.23422988 = score(doc=2987,freq=3.0), product of:
              0.33671016 = queryWeight, product of:
                3.887373 = boost
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.020218356 = queryNorm
              0.6956425 = fieldWeight in 2987, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.09375 = fieldNorm(doc=2987)
        0.2 = coord(5/25)
    
  5. Furner, J.: On Recommending (2002) 0.12
    0.12173883 = sum of:
      0.12173883 = product of:
        0.43478152 = sum of:
          0.060697023 = weight(abstract_txt:generating in 244) [ClassicSimilarity], result of:
            0.060697023 = score(doc=244,freq=1.0), product of:
              0.14233896 = queryWeight, product of:
                1.0318447 = boost
                6.822815 = idf(docFreq=127, maxDocs=43254)
                0.020218356 = queryNorm
              0.42642593 = fieldWeight in 244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.822815 = idf(docFreq=127, maxDocs=43254)
                0.0625 = fieldNorm(doc=244)
          0.06587065 = weight(abstract_txt:indexer in 244) [ClassicSimilarity], result of:
            0.06587065 = score(doc=244,freq=1.0), product of:
              0.15031655 = queryWeight, product of:
                1.0603662 = boost
                7.011406 = idf(docFreq=105, maxDocs=43254)
                0.020218356 = queryNorm
              0.43821287 = fieldWeight in 244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.011406 = idf(docFreq=105, maxDocs=43254)
                0.0625 = fieldNorm(doc=244)
          0.012325568 = weight(abstract_txt:from in 244) [ClassicSimilarity], result of:
            0.012325568 = score(doc=244,freq=1.0), product of:
              0.07092357 = queryWeight, product of:
                1.2615613 = boost
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.020218356 = queryNorm
              0.17378664 = fieldWeight in 244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.0625 = fieldNorm(doc=244)
          0.04945602 = weight(abstract_txt:user in 244) [ClassicSimilarity], result of:
            0.04945602 = score(doc=244,freq=3.0), product of:
              0.12417237 = queryWeight, product of:
                1.6692661 = boost
                3.6792014 = idf(docFreq=2967, maxDocs=43254)
                0.020218356 = queryNorm
              0.3982852 = fieldWeight in 244, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6792014 = idf(docFreq=2967, maxDocs=43254)
                0.0625 = fieldNorm(doc=244)
          0.057784326 = weight(abstract_txt:collection in 244) [ClassicSimilarity], result of:
            0.057784326 = score(doc=244,freq=1.0), product of:
              0.19866711 = queryWeight, product of:
                2.111427 = boost
                4.653761 = idf(docFreq=1119, maxDocs=43254)
                0.020218356 = queryNorm
              0.29086006 = fieldWeight in 244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.653761 = idf(docFreq=1119, maxDocs=43254)
                0.0625 = fieldNorm(doc=244)
          0.06114932 = weight(abstract_txt:index in 244) [ClassicSimilarity], result of:
            0.06114932 = score(doc=244,freq=1.0), product of:
              0.20630687 = queryWeight, product of:
                2.1516416 = boost
                4.7423973 = idf(docFreq=1024, maxDocs=43254)
                0.020218356 = queryNorm
              0.29639983 = fieldWeight in 244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7423973 = idf(docFreq=1024, maxDocs=43254)
                0.0625 = fieldNorm(doc=244)
          0.1274986 = weight(abstract_txt:document in 244) [ClassicSimilarity], result of:
            0.1274986 = score(doc=244,freq=2.0), product of:
              0.33671016 = queryWeight, product of:
                3.887373 = boost
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.020218356 = queryNorm
              0.37865978 = fieldWeight in 244, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.0625 = fieldNorm(doc=244)
        0.28 = coord(7/25)
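
The score breakdowns above all follow the same pattern: each matching term contributes queryWeight × fieldWeight, the contributions are summed, and the sum is scaled by the coord factor (matched terms / query terms). As a minimal sketch, the Python below recomputes the score of the first hit (doc 4724) from the values listed in its breakdown. The function and variable names are illustrative only, and the idf formula 1 + ln(maxDocs / (docFreq + 1)) is an assumption chosen because it reproduces the listed idf values; it is not taken from the record itself.

```python
import math

# Values copied from the breakdowns above.
MAX_DOCS = 43254
QUERY_NORM = 0.020218356


def idf(doc_freq, max_docs=MAX_DOCS):
    # Assumed formula; it reproduces the idf values shown in the listing.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, boost, field_norm, query_norm=QUERY_NORM):
    # score = queryWeight * fieldWeight, as in each "product of" block.
    tf = math.sqrt(freq)                          # tf(freq) = sqrt(termFreq)
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * query_norm
    field_weight = tf * term_idf * field_norm
    return query_weight * field_weight


def hit_score(terms, field_norm, matched, total):
    # Sum the per-term scores, then apply coord(matched/total).
    raw = sum(term_score(f, df, b, field_norm) for f, df, b in terms.values())
    return raw * (matched / total)


# Hit 1 (doc 4724): term -> (termFreq, docFreq, boost)
HIT1_TERMS = {
    "user": (1.0, 2967, 1.6692661),
    "index": (1.0, 1024, 2.1516416),
    "indices": (5.0, 77, 2.2135086),
    "hierarchical": (1.0, 376, 2.605437),
    "document": (1.0, 1620, 3.887373),
}
HIT1_FIELD_NORM = 0.109375

score = hit_score(HIT1_TERMS, HIT1_FIELD_NORM, matched=5, total=25)
print(round(score, 2))  # prints 0.22, the score shown for hit 1
```

The listed values were presumably computed in single precision, so a recomputation in double precision agrees with them only to about six significant digits. The rare term "indices" (docFreq=77, freq=5.0) contributes more than half of the raw sum for this hit, which is why it ranks first despite matching only 5 of the 25 query terms.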