Document (#13107)

Author
O'Kane, K.C.
Title
Generating hierarchical document indices from common denominators in large document collections
Source
Information processing and management. 32(1996) no.2, S.105-115
Year
1996
Abstract
Describes an effective, simple and efficient algorithm for computer generation of hierarchical indices from Document Term matrices by means of calculating common denominator vectors from the document vector set. This procedure produces an intuitive, user friendly hierarchical index of a document collection not unlike that which would be expected had a manual indexer set about to create an index or outline of a collection. The resulting index, when presented with a graphical user interface, provides the user with a natural easily comprehended view of the document collection, permits general browsing and informal search activities with an access method that requires no keyboard entry or prior knowledge of the vocabulary
Theme
Automatisches Indexieren
Register

Similar documents (content)

  1. Hartman, J.H.; Proebsting, T.A.; Sundaram, R.: Index-based hyperlinks (1997) 0.22
    0.21813507 = sum of:
      0.21813507 = product of:
        1.0906754 = sum of:
          0.050151788 = weight(abstract_txt:user in 2723) [ClassicSimilarity], result of:
            0.050151788 = score(doc=2723,freq=1.0), product of:
              0.12448083 = queryWeight, product of:
                1.6781778 = boost
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.020137178 = queryNorm
              0.40288764 = fieldWeight in 2723, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.109375 = fieldNorm(doc=2723)
          0.10746859 = weight(abstract_txt:index in 2723) [ClassicSimilarity], result of:
            0.10746859 = score(doc=2723,freq=1.0), product of:
              0.20690258 = queryWeight, product of:
                2.1635637 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.020137178 = queryNorm
              0.5194164 = fieldWeight in 2723, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.109375 = fieldNorm(doc=2723)
          0.58546466 = weight(abstract_txt:indices in 2723) [ClassicSimilarity], result of:
            0.58546466 = score(doc=2723,freq=5.0), product of:
              0.32725897 = queryWeight, product of:
                2.2217076 = boost
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.020137178 = queryNorm
              1.788995 = fieldWeight in 2723, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.109375 = fieldNorm(doc=2723)
          0.18885177 = weight(abstract_txt:hierarchical in 2723) [ClassicSimilarity], result of:
            0.18885177 = score(doc=2723,freq=1.0), product of:
              0.30129522 = queryWeight, product of:
                2.6108558 = boost
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.020137178 = queryNorm
              0.62679976 = fieldWeight in 2723, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.109375 = fieldNorm(doc=2723)
          0.15873861 = weight(abstract_txt:document in 2723) [ClassicSimilarity], result of:
            0.15873861 = score(doc=2723,freq=1.0), product of:
              0.3380985 = queryWeight, product of:
                3.9113202 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.020137178 = queryNorm
              0.46950403 = fieldWeight in 2723, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.109375 = fieldNorm(doc=2723)
        0.2 = coord(5/25)
    
  2. Kim, P.J.; Lee, J.Y.; Park, J.-H.: Developing a new collection-evaluation method : mapping and the user-side h-index (2009) 0.16
    0.16219565 = sum of:
      0.16219565 = product of:
        0.6758152 = sum of:
          0.054572884 = weight(abstract_txt:procedure in 3171) [ClassicSimilarity], result of:
            0.054572884 = score(doc=3171,freq=1.0), product of:
              0.13260129 = queryWeight, product of:
                6.5848994 = idf(docFreq=165, maxDocs=44218)
                0.020137178 = queryNorm
              0.4115562 = fieldWeight in 3171, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5848994 = idf(docFreq=165, maxDocs=44218)
                0.0625 = fieldNorm(doc=3171)
          0.008956365 = weight(abstract_txt:with in 3171) [ClassicSimilarity], result of:
            0.008956365 = score(doc=3171,freq=1.0), product of:
              0.05732685 = queryWeight, product of:
                1.1388481 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.020137178 = queryNorm
              0.15623334 = fieldWeight in 3171, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=3171)
          0.0640816 = weight(abstract_txt:user in 3171) [ClassicSimilarity], result of:
            0.0640816 = score(doc=3171,freq=5.0), product of:
              0.12448083 = queryWeight, product of:
                1.6781778 = boost
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.020137178 = queryNorm
              0.51479095 = fieldWeight in 3171, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.0625 = fieldNorm(doc=3171)
          0.1523837 = weight(abstract_txt:collection in 3171) [ClassicSimilarity], result of:
            0.1523837 = score(doc=3171,freq=7.0), product of:
              0.19824241 = queryWeight, product of:
                2.1178005 = boost
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.020137178 = queryNorm
              0.76867354 = fieldWeight in 3171, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.0625 = fieldNorm(doc=3171)
          0.18423188 = weight(abstract_txt:index in 3171) [ClassicSimilarity], result of:
            0.18423188 = score(doc=3171,freq=9.0), product of:
              0.20690258 = queryWeight, product of:
                2.1635637 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.020137178 = queryNorm
              0.8904281 = fieldWeight in 3171, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.0625 = fieldNorm(doc=3171)
          0.21158879 = weight(abstract_txt:indices in 3171) [ClassicSimilarity], result of:
            0.21158879 = score(doc=3171,freq=2.0), product of:
              0.32725897 = queryWeight, product of:
                2.2217076 = boost
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.020137178 = queryNorm
              0.64654845 = fieldWeight in 3171, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.0625 = fieldNorm(doc=3171)
        0.24 = coord(6/25)
    
  3. Safder, I.; Ali, M.; Aljohani, N.R.; Nawaz, R.; Hassan, S.-U.: Neural machine translation for in-text citation classification (2023) 0.15
    0.14729631 = sum of:
      0.14729631 = product of:
        0.52605826 = sum of:
          0.06788195 = weight(abstract_txt:unlike in 1053) [ClassicSimilarity], result of:
            0.06788195 = score(doc=1053,freq=1.0), product of:
              0.15336727 = queryWeight, product of:
                1.0754555 = boost
                7.0817666 = idf(docFreq=100, maxDocs=44218)
                0.020137178 = queryNorm
              0.4426104 = fieldWeight in 1053, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0817666 = idf(docFreq=100, maxDocs=44218)
                0.0625 = fieldNorm(doc=1053)
          0.012666213 = weight(abstract_txt:with in 1053) [ClassicSimilarity], result of:
            0.012666213 = score(doc=1053,freq=2.0), product of:
              0.05732685 = queryWeight, product of:
                1.1388481 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.020137178 = queryNorm
              0.22094731 = fieldWeight in 1053, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=1053)
          0.09087813 = weight(abstract_txt:vectors in 1053) [ClassicSimilarity], result of:
            0.09087813 = score(doc=1053,freq=1.0), product of:
              0.18629566 = queryWeight, product of:
                1.1852978 = boost
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.020137178 = queryNorm
              0.4878167 = fieldWeight in 1053, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.0625 = fieldNorm(doc=1053)
          0.101047516 = weight(abstract_txt:calculating in 1053) [ClassicSimilarity], result of:
            0.101047516 = score(doc=1053,freq=1.0), product of:
              0.19994639 = queryWeight, product of:
                1.2279563 = boost
                8.085969 = idf(docFreq=36, maxDocs=44218)
                0.020137178 = queryNorm
              0.50537306 = fieldWeight in 1053, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.085969 = idf(docFreq=36, maxDocs=44218)
                0.0625 = fieldNorm(doc=1053)
          0.017120818 = weight(abstract_txt:from in 1053) [ClassicSimilarity], result of:
            0.017120818 = score(doc=1053,freq=2.0), product of:
              0.07008255 = queryWeight, product of:
                1.2591913 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.020137178 = queryNorm
              0.24429502 = fieldWeight in 1053, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=1053)
          0.08684774 = weight(abstract_txt:index in 1053) [ClassicSimilarity], result of:
            0.08684774 = score(doc=1053,freq=2.0), product of:
              0.20690258 = queryWeight, product of:
                2.1635637 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.020137178 = queryNorm
              0.41975182 = fieldWeight in 1053, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.0625 = fieldNorm(doc=1053)
          0.14961587 = weight(abstract_txt:indices in 1053) [ClassicSimilarity], result of:
            0.14961587 = score(doc=1053,freq=1.0), product of:
              0.32725897 = queryWeight, product of:
                2.2217076 = boost
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.020137178 = queryNorm
              0.4571788 = fieldWeight in 1053, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.0625 = fieldNorm(doc=1053)
        0.28 = coord(7/25)
    
  4. Kim, Y.W.; Kim, J.H.: ¬A model of knowledge based information retrieval with hierarchical concept graph (1990) 0.14
    0.13956179 = sum of:
      0.13956179 = product of:
        0.58150744 = sum of:
          0.12654476 = weight(abstract_txt:intuitive in 3909) [ClassicSimilarity], result of:
            0.12654476 = score(doc=3909,freq=2.0), product of:
              0.15889464 = queryWeight, product of:
                1.0946637 = boost
                7.208251 = idf(docFreq=88, maxDocs=44218)
                0.020137178 = queryNorm
              0.79640675 = fieldWeight in 3909, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.208251 = idf(docFreq=88, maxDocs=44218)
                0.078125 = fieldNorm(doc=3909)
          0.019391099 = weight(abstract_txt:with in 3909) [ClassicSimilarity], result of:
            0.019391099 = score(doc=3909,freq=3.0), product of:
              0.05732685 = queryWeight, product of:
                1.1388481 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.020137178 = queryNorm
              0.3382551 = fieldWeight in 3909, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.078125 = fieldNorm(doc=3909)
          0.021401023 = weight(abstract_txt:from in 3909) [ClassicSimilarity], result of:
            0.021401023 = score(doc=3909,freq=2.0), product of:
              0.07008255 = queryWeight, product of:
                1.2591913 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.020137178 = queryNorm
              0.30536878 = fieldWeight in 3909, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.078125 = fieldNorm(doc=3909)
          0.035822704 = weight(abstract_txt:user in 3909) [ClassicSimilarity], result of:
            0.035822704 = score(doc=3909,freq=1.0), product of:
              0.12448083 = queryWeight, product of:
                1.6781778 = boost
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.020137178 = queryNorm
              0.2877769 = fieldWeight in 3909, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.078125 = fieldNorm(doc=3909)
          0.10855967 = weight(abstract_txt:index in 3909) [ClassicSimilarity], result of:
            0.10855967 = score(doc=3909,freq=2.0), product of:
              0.20690258 = queryWeight, product of:
                2.1635637 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.020137178 = queryNorm
              0.5246898 = fieldWeight in 3909, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.078125 = fieldNorm(doc=3909)
          0.26978824 = weight(abstract_txt:hierarchical in 3909) [ClassicSimilarity], result of:
            0.26978824 = score(doc=3909,freq=4.0), product of:
              0.30129522 = queryWeight, product of:
                2.6108558 = boost
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.020137178 = queryNorm
              0.8954282 = fieldWeight in 3909, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.078125 = fieldNorm(doc=3909)
        0.24 = coord(6/25)
    
  5. Crestani, F.; Vegas, J.; Fuente, P. de la: ¬A graphical user interface for the retrieval of hierarchically structured documents (2004) 0.14
    0.13728571 = sum of:
      0.13728571 = product of:
        0.5720238 = sum of:
          0.09837815 = weight(abstract_txt:graphical in 2555) [ClassicSimilarity], result of:
            0.09837815 = score(doc=2555,freq=2.0), product of:
              0.13434213 = queryWeight, product of:
                1.0065428 = boost
                6.627983 = idf(docFreq=158, maxDocs=44218)
                0.020137178 = queryNorm
              0.7322956 = fieldWeight in 2555, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.627983 = idf(docFreq=158, maxDocs=44218)
                0.078125 = fieldNorm(doc=2555)
          0.08948066 = weight(abstract_txt:intuitive in 2555) [ClassicSimilarity], result of:
            0.08948066 = score(doc=2555,freq=1.0), product of:
              0.15889464 = queryWeight, product of:
                1.0946637 = boost
                7.208251 = idf(docFreq=88, maxDocs=44218)
                0.020137178 = queryNorm
              0.5631446 = fieldWeight in 2555, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.208251 = idf(docFreq=88, maxDocs=44218)
                0.078125 = fieldNorm(doc=2555)
          0.011195456 = weight(abstract_txt:with in 2555) [ClassicSimilarity], result of:
            0.011195456 = score(doc=2555,freq=1.0), product of:
              0.05732685 = queryWeight, product of:
                1.1388481 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.020137178 = queryNorm
              0.19529167 = fieldWeight in 2555, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.078125 = fieldNorm(doc=2555)
          0.015132809 = weight(abstract_txt:from in 2555) [ClassicSimilarity], result of:
            0.015132809 = score(doc=2555,freq=1.0), product of:
              0.07008255 = queryWeight, product of:
                1.2591913 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.020137178 = queryNorm
              0.21592833 = fieldWeight in 2555, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.078125 = fieldNorm(doc=2555)
          0.080102004 = weight(abstract_txt:user in 2555) [ClassicSimilarity], result of:
            0.080102004 = score(doc=2555,freq=5.0), product of:
              0.12448083 = queryWeight, product of:
                1.6781778 = boost
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.020137178 = queryNorm
              0.6434887 = fieldWeight in 2555, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.078125 = fieldNorm(doc=2555)
          0.27773473 = weight(abstract_txt:document in 2555) [ClassicSimilarity], result of:
            0.27773473 = score(doc=2555,freq=6.0), product of:
              0.3380985 = queryWeight, product of:
                3.9113202 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.020137178 = queryNorm
              0.82146096 = fieldWeight in 2555, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.078125 = fieldNorm(doc=2555)
        0.24 = coord(6/25)