Document (#16323)

Author
Smeaton, A.F.
Morrissey, P.J.
Title
Experiments on the automatic construction of hypertext from texts
Source
New review of hypermedia and multimedia. 1995, no.1, S.23-39
Year
1995
Abstract
Describes an approach to semi-automatically generate a hypertext from linear texts, based on initially creatign nodes and composite nodes composed of 'mini-hypertexts'. Node-node similarity values are computed using standard information retrieval techniques and these similarity measures are then used to selectively create node-node links based on the strength of similarity between them. The process is a novel one because the link creation process also uses values from a dynamically computed metric which measures the topological compactness of the overall hypertext being generated. Describes experiments on generating a hypertext from a collection of 846 software product descriptions comprising 8,5 MBytes of text which yield some guidelines on how the process should be automated. This text to hypertext conversion method is put into the context of an overall hypertext authoring tool currently under development
Theme
Hypertext

Similar documents (author)

  1. Smeaton, A.F.: Prospects for intelligent, language-based information retrieval (1991) 5.38
    5.378652 = sum of:
      5.378652 = weight(author_txt:smeaton in 3700) [ClassicSimilarity], result of:
        5.378652 = score(doc=3700,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.6058445 = idf(docFreq=21, maxDocs=44218)
            0.1162001 = queryNorm
          5.3786526 = fieldWeight in 3700, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.6058445 = idf(docFreq=21, maxDocs=44218)
            0.625 = fieldNorm(doc=3700)
    
  2. Smeaton, A.F.: Retrieving information from hypertext : issues and problems (1991) 5.38
    5.378652 = sum of:
      5.378652 = weight(author_txt:smeaton in 4278) [ClassicSimilarity], result of:
        5.378652 = score(doc=4278,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.6058445 = idf(docFreq=21, maxDocs=44218)
            0.1162001 = queryNorm
          5.3786526 = fieldWeight in 4278, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.6058445 = idf(docFreq=21, maxDocs=44218)
            0.625 = fieldNorm(doc=4278)
    
  3. Smeaton, A.F.: Progress in the application of natural language processing to information retrieval tasks (1992) 5.38
    5.378652 = sum of:
      5.378652 = weight(author_txt:smeaton in 7080) [ClassicSimilarity], result of:
        5.378652 = score(doc=7080,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.6058445 = idf(docFreq=21, maxDocs=44218)
            0.1162001 = queryNorm
          5.3786526 = fieldWeight in 7080, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.6058445 = idf(docFreq=21, maxDocs=44218)
            0.625 = fieldNorm(doc=7080)
    
  4. Smeaton, A.F.: Information retrieval and hypertext : competing technologies or complementary access methods (1992) 5.38
    5.378652 = sum of:
      5.378652 = weight(author_txt:smeaton in 7503) [ClassicSimilarity], result of:
        5.378652 = score(doc=7503,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.6058445 = idf(docFreq=21, maxDocs=44218)
            0.1162001 = queryNorm
          5.3786526 = fieldWeight in 7503, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.6058445 = idf(docFreq=21, maxDocs=44218)
            0.625 = fieldNorm(doc=7503)
    
  5. Smeaton, A.F.: Natural language processing used in information retrieval tasks : an overview of achievements to date (1995) 5.38
    5.378652 = sum of:
      5.378652 = weight(author_txt:smeaton in 1265) [ClassicSimilarity], result of:
        5.378652 = score(doc=1265,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.6058445 = idf(docFreq=21, maxDocs=44218)
            0.1162001 = queryNorm
          5.3786526 = fieldWeight in 1265, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.6058445 = idf(docFreq=21, maxDocs=44218)
            0.625 = fieldNorm(doc=1265)
    

Similar documents (content)

  1. Langford, D.: Broadbutton node linking : a generalised approach to hyperbase navigation (1990) 0.27
    0.27198383 = sum of:
      0.27198383 = product of:
        0.9713708 = sum of:
          0.069745295 = weight(abstract_txt:dynamically in 4919) [ClassicSimilarity], result of:
            0.069745295 = score(doc=4919,freq=1.0), product of:
              0.120977305 = queryWeight, product of:
                1.0139258 = boost
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.01616876 = queryNorm
              0.57651556 = fieldWeight in 4919, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.078125 = fieldNorm(doc=4919)
          0.02745162 = weight(abstract_txt:describes in 4919) [ClassicSimilarity], result of:
            0.02745162 = score(doc=4919,freq=2.0), product of:
              0.06497395 = queryWeight, product of:
                1.0508455 = boost
                3.8240511 = idf(docFreq=2624, maxDocs=44218)
                0.01616876 = queryNorm
              0.42250192 = fieldWeight in 4919, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.8240511 = idf(docFreq=2624, maxDocs=44218)
                0.078125 = fieldNorm(doc=4919)
          0.052573323 = weight(abstract_txt:experiments in 4919) [ClassicSimilarity], result of:
            0.052573323 = score(doc=4919,freq=1.0), product of:
              0.126245 = queryWeight, product of:
                1.4647932 = boost
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.01616876 = queryNorm
              0.41643882 = fieldWeight in 4919, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.078125 = fieldNorm(doc=4919)
          0.014657864 = weight(abstract_txt:from in 4919) [ClassicSimilarity], result of:
            0.014657864 = score(doc=4919,freq=1.0), product of:
              0.06788301 = queryWeight, product of:
                1.5190244 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.01616876 = queryNorm
              0.21592833 = fieldWeight in 4919, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.078125 = fieldNorm(doc=4919)
          0.1701223 = weight(abstract_txt:nodes in 4919) [ClassicSimilarity], result of:
            0.1701223 = score(doc=4919,freq=2.0), product of:
              0.2192139 = queryWeight, product of:
                1.9302043 = boost
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.01616876 = queryNorm
              0.7760561 = fieldWeight in 4919, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.078125 = fieldNorm(doc=4919)
          0.44640636 = weight(abstract_txt:node in 4919) [ClassicSimilarity], result of:
            0.44640636 = score(doc=4919,freq=2.0), product of:
              0.5254413 = queryWeight, product of:
                4.226164 = boost
                7.689554 = idf(docFreq=54, maxDocs=44218)
                0.01616876 = queryNorm
              0.84958375 = fieldWeight in 4919, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.689554 = idf(docFreq=54, maxDocs=44218)
                0.078125 = fieldNorm(doc=4919)
          0.19041409 = weight(abstract_txt:hypertext in 4919) [ClassicSimilarity], result of:
            0.19041409 = score(doc=4919,freq=1.0), product of:
              0.42941487 = queryWeight, product of:
                4.6791654 = boost
                5.6758637 = idf(docFreq=411, maxDocs=44218)
                0.01616876 = queryNorm
              0.44342685 = fieldWeight in 4919, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6758637 = idf(docFreq=411, maxDocs=44218)
                0.078125 = fieldNorm(doc=4919)
        0.28 = coord(7/25)
    
  2. Pausch, R.; Detmer, J.: Node popularity as a hypertext browsing aid (1990) 0.17
    0.17474 = sum of:
      0.17474 = product of:
        1.092125 = sum of:
          0.023293473 = weight(abstract_txt:describes in 5913) [ClassicSimilarity], result of:
            0.023293473 = score(doc=5913,freq=1.0), product of:
              0.06497395 = queryWeight, product of:
                1.0508455 = boost
                3.8240511 = idf(docFreq=2624, maxDocs=44218)
                0.01616876 = queryNorm
              0.3585048 = fieldWeight in 5913, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8240511 = idf(docFreq=2624, maxDocs=44218)
                0.09375 = fieldNorm(doc=5913)
          0.18425399 = weight(abstract_txt:computed in 5913) [ClassicSimilarity], result of:
            0.18425399 = score(doc=5913,freq=1.0), product of:
              0.25794536 = queryWeight, product of:
                2.0937898 = boost
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.01616876 = queryNorm
              0.71431404 = fieldWeight in 5913, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.09375 = fieldNorm(doc=5913)
          0.65608066 = weight(abstract_txt:node in 5913) [ClassicSimilarity], result of:
            0.65608066 = score(doc=5913,freq=3.0), product of:
              0.5254413 = queryWeight, product of:
                4.226164 = boost
                7.689554 = idf(docFreq=54, maxDocs=44218)
                0.01616876 = queryNorm
              1.2486279 = fieldWeight in 5913, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.689554 = idf(docFreq=54, maxDocs=44218)
                0.09375 = fieldNorm(doc=5913)
          0.22849691 = weight(abstract_txt:hypertext in 5913) [ClassicSimilarity], result of:
            0.22849691 = score(doc=5913,freq=1.0), product of:
              0.42941487 = queryWeight, product of:
                4.6791654 = boost
                5.6758637 = idf(docFreq=411, maxDocs=44218)
                0.01616876 = queryNorm
              0.53211224 = fieldWeight in 5913, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6758637 = idf(docFreq=411, maxDocs=44218)
                0.09375 = fieldNorm(doc=5913)
        0.16 = coord(4/25)
    
  3. Falquet, G.; Guyot, J.; Nerima, L.: Languages and tools to specify hypertext views on databases (1999) 0.16
    0.16192235 = sum of:
      0.16192235 = product of:
        1.0120147 = sum of:
          0.014657864 = weight(abstract_txt:from in 3968) [ClassicSimilarity], result of:
            0.014657864 = score(doc=3968,freq=1.0), product of:
              0.06788301 = queryWeight, product of:
                1.5190244 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.01616876 = queryNorm
              0.21592833 = fieldWeight in 3968, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.078125 = fieldNorm(doc=3968)
          0.1701223 = weight(abstract_txt:nodes in 3968) [ClassicSimilarity], result of:
            0.1701223 = score(doc=3968,freq=2.0), product of:
              0.2192139 = queryWeight, product of:
                1.9302043 = boost
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.01616876 = queryNorm
              0.7760561 = fieldWeight in 3968, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.078125 = fieldNorm(doc=3968)
          0.44640636 = weight(abstract_txt:node in 3968) [ClassicSimilarity], result of:
            0.44640636 = score(doc=3968,freq=2.0), product of:
              0.5254413 = queryWeight, product of:
                4.226164 = boost
                7.689554 = idf(docFreq=54, maxDocs=44218)
                0.01616876 = queryNorm
              0.84958375 = fieldWeight in 3968, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.689554 = idf(docFreq=54, maxDocs=44218)
                0.078125 = fieldNorm(doc=3968)
          0.38082817 = weight(abstract_txt:hypertext in 3968) [ClassicSimilarity], result of:
            0.38082817 = score(doc=3968,freq=4.0), product of:
              0.42941487 = queryWeight, product of:
                4.6791654 = boost
                5.6758637 = idf(docFreq=411, maxDocs=44218)
                0.01616876 = queryNorm
              0.8868537 = fieldWeight in 3968, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.6758637 = idf(docFreq=411, maxDocs=44218)
                0.078125 = fieldNorm(doc=3968)
        0.16 = coord(4/25)
    
  4. Savoy, J.: ¬An extended vector-processing scheme for searching information in hypertext systems (1996) 0.15
    0.15397453 = sum of:
      0.15397453 = product of:
        0.76987267 = sum of:
          0.011726292 = weight(abstract_txt:from in 4036) [ClassicSimilarity], result of:
            0.011726292 = score(doc=4036,freq=1.0), product of:
              0.06788301 = queryWeight, product of:
                1.5190244 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.01616876 = queryNorm
              0.17274266 = fieldWeight in 4036, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=4036)
          0.0962357 = weight(abstract_txt:nodes in 4036) [ClassicSimilarity], result of:
            0.0962357 = score(doc=4036,freq=1.0), product of:
              0.2192139 = queryWeight, product of:
                1.9302043 = boost
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.01616876 = queryNorm
              0.43900365 = fieldWeight in 4036, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.0625 = fieldNorm(doc=4036)
          0.122835994 = weight(abstract_txt:computed in 4036) [ClassicSimilarity], result of:
            0.122835994 = score(doc=4036,freq=1.0), product of:
              0.25794536 = queryWeight, product of:
                2.0937898 = boost
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.01616876 = queryNorm
              0.47620937 = fieldWeight in 4036, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.0625 = fieldNorm(doc=4036)
          0.08208091 = weight(abstract_txt:similarity in 4036) [ClassicSimilarity], result of:
            0.08208091 = score(doc=4036,freq=1.0), product of:
              0.22568488 = queryWeight, product of:
                2.3986456 = boost
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.01616876 = queryNorm
              0.36369696 = fieldWeight in 4036, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.0625 = fieldNorm(doc=4036)
          0.45699382 = weight(abstract_txt:hypertext in 4036) [ClassicSimilarity], result of:
            0.45699382 = score(doc=4036,freq=9.0), product of:
              0.42941487 = queryWeight, product of:
                4.6791654 = boost
                5.6758637 = idf(docFreq=411, maxDocs=44218)
                0.01616876 = queryNorm
              1.0642245 = fieldWeight in 4036, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                5.6758637 = idf(docFreq=411, maxDocs=44218)
                0.0625 = fieldNorm(doc=4036)
        0.2 = coord(5/25)
    
  5. Jin, L.; Zhu, H.; Hall, P.: Adequate testing of hypertext applications (1997) 0.15
    0.1536717 = sum of:
      0.1536717 = product of:
        0.9604481 = sum of:
          0.04153793 = weight(abstract_txt:process in 408) [ClassicSimilarity], result of:
            0.04153793 = score(doc=408,freq=1.0), product of:
              0.109372996 = queryWeight, product of:
                1.6698209 = boost
                4.0510116 = idf(docFreq=2091, maxDocs=44218)
                0.01616876 = queryNorm
              0.37978232 = fieldWeight in 408, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0510116 = idf(docFreq=2091, maxDocs=44218)
                0.09375 = fieldNorm(doc=408)
          0.14435355 = weight(abstract_txt:nodes in 408) [ClassicSimilarity], result of:
            0.14435355 = score(doc=408,freq=1.0), product of:
              0.2192139 = queryWeight, product of:
                1.9302043 = boost
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.01616876 = queryNorm
              0.65850544 = fieldWeight in 408, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.09375 = fieldNorm(doc=408)
          0.37878838 = weight(abstract_txt:node in 408) [ClassicSimilarity], result of:
            0.37878838 = score(doc=408,freq=1.0), product of:
              0.5254413 = queryWeight, product of:
                4.226164 = boost
                7.689554 = idf(docFreq=54, maxDocs=44218)
                0.01616876 = queryNorm
              0.7208957 = fieldWeight in 408, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.689554 = idf(docFreq=54, maxDocs=44218)
                0.09375 = fieldNorm(doc=408)
          0.39576823 = weight(abstract_txt:hypertext in 408) [ClassicSimilarity], result of:
            0.39576823 = score(doc=408,freq=3.0), product of:
              0.42941487 = queryWeight, product of:
                4.6791654 = boost
                5.6758637 = idf(docFreq=411, maxDocs=44218)
                0.01616876 = queryNorm
              0.9216454 = fieldWeight in 408, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.6758637 = idf(docFreq=411, maxDocs=44218)
                0.09375 = fieldNorm(doc=408)
        0.16 = coord(4/25)