Document (#18152)

Author
Thistlewaite, P.
Title
Automatic construction and management of large open webs
Source
Information processing and management. 33(1997) no.2, S.161-173
Year
1997
Abstract
Reviews the problems associated with manually created or maintained hyperdocument links, and the consequent need for automated methods. A number of techniques have been applied to the problem, including pattern-matching, information retrieval, and natural language processing. Describes a system for the automatic detection and management of structural and referential links. Addresses the issues of link-set soundness and completeness, open link management, and the particular problem engendered by large volatile hyperbases
Footnote
Contribution to a special issue on methods and tools for the automatic construction of hypertext

Similar documents (content)

  1. Salton, G.: Automatic text structuring and summarization (1997) 0.08
    0.08339604 = sum of:
      0.08339604 = product of:
        0.5212253 = sum of:
          0.078483075 = weight(abstract_txt:pattern in 145) [ClassicSimilarity], result of:
            0.078483075 = score(doc=145,freq=1.0), product of:
              0.13403349 = queryWeight, product of:
                1.1872574 = boost
                6.2458487 = idf(docFreq=232, maxDocs=44218)
                0.018074945 = queryNorm
              0.5855483 = fieldWeight in 145, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2458487 = idf(docFreq=232, maxDocs=44218)
                0.09375 = fieldNorm(doc=145)
          0.18070418 = weight(abstract_txt:automatic in 145) [ClassicSimilarity], result of:
            0.18070418 = score(doc=145,freq=4.0), product of:
              0.18549466 = queryWeight, product of:
                1.9752357 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.018074945 = queryNorm
              0.97417456 = fieldWeight in 145, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.09375 = fieldNorm(doc=145)
          0.09261144 = weight(abstract_txt:links in 145) [ClassicSimilarity], result of:
            0.09261144 = score(doc=145,freq=1.0), product of:
              0.18857424 = queryWeight, product of:
                1.9915646 = boost
                5.2385488 = idf(docFreq=637, maxDocs=44218)
                0.018074945 = queryNorm
              0.49111396 = fieldWeight in 145, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2385488 = idf(docFreq=637, maxDocs=44218)
                0.09375 = fieldNorm(doc=145)
          0.16942659 = weight(abstract_txt:link in 145) [ClassicSimilarity], result of:
            0.16942659 = score(doc=145,freq=2.0), product of:
              0.22388087 = queryWeight, product of:
                2.1700099 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.018074945 = queryNorm
              0.75677115 = fieldWeight in 145, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.09375 = fieldNorm(doc=145)
        0.16 = coord(4/25)
    
  2. Sood, S.O.; Churchill, E.F.; Antin, J.: Automatic identification of personal insults on social news sites (2012) 0.07
    0.06747614 = sum of:
      0.06747614 = product of:
        0.33738068 = sum of:
          0.037778616 = weight(abstract_txt:automated in 4976) [ClassicSimilarity], result of:
            0.037778616 = score(doc=4976,freq=1.0), product of:
              0.107875 = queryWeight, product of:
                1.0651202 = boost
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.018074945 = queryNorm
              0.35020733 = fieldWeight in 4976, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.0625 = fieldNorm(doc=4976)
          0.11613766 = weight(abstract_txt:detection in 4976) [ClassicSimilarity], result of:
            0.11613766 = score(doc=4976,freq=3.0), product of:
              0.15813637 = queryWeight, product of:
                1.2895973 = boost
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.018074945 = queryNorm
              0.73441464 = fieldWeight in 4976, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.0625 = fieldNorm(doc=4976)
          0.0381158 = weight(abstract_txt:problem in 4976) [ClassicSimilarity], result of:
            0.0381158 = score(doc=4976,freq=1.0), product of:
              0.13672149 = queryWeight, product of:
                1.6957883 = boost
                4.460548 = idf(docFreq=1388, maxDocs=44218)
                0.018074945 = queryNorm
              0.27878425 = fieldWeight in 4976, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.460548 = idf(docFreq=1388, maxDocs=44218)
                0.0625 = fieldNorm(doc=4976)
          0.060234725 = weight(abstract_txt:automatic in 4976) [ClassicSimilarity], result of:
            0.060234725 = score(doc=4976,freq=1.0), product of:
              0.18549466 = queryWeight, product of:
                1.9752357 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.018074945 = queryNorm
              0.32472485 = fieldWeight in 4976, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0625 = fieldNorm(doc=4976)
          0.085113876 = weight(abstract_txt:management in 4976) [ClassicSimilarity], result of:
            0.085113876 = score(doc=4976,freq=3.0), product of:
              0.18539174 = queryWeight, product of:
                2.4184885 = boost
                4.2410107 = idf(docFreq=1729, maxDocs=44218)
                0.018074945 = queryNorm
              0.45910287 = fieldWeight in 4976, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2410107 = idf(docFreq=1729, maxDocs=44218)
                0.0625 = fieldNorm(doc=4976)
        0.2 = coord(5/25)
    
  3. May, A.D.: Automatic classification of e-mail messages by message type (1997) 0.06
    0.06231731 = sum of:
      0.06231731 = product of:
        0.3894832 = sum of:
          0.05666792 = weight(abstract_txt:automated in 6493) [ClassicSimilarity], result of:
            0.05666792 = score(doc=6493,freq=1.0), product of:
              0.107875 = queryWeight, product of:
                1.0651202 = boost
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.018074945 = queryNorm
              0.525311 = fieldWeight in 6493, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.09375 = fieldNorm(doc=6493)
          0.07125549 = weight(abstract_txt:matching in 6493) [ClassicSimilarity], result of:
            0.07125549 = score(doc=6493,freq=1.0), product of:
              0.12567286 = queryWeight, product of:
                1.1496323 = boost
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.018074945 = queryNorm
              0.56699187 = fieldWeight in 6493, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.09375 = fieldNorm(doc=6493)
          0.13378266 = weight(abstract_txt:manually in 6493) [ClassicSimilarity], result of:
            0.13378266 = score(doc=6493,freq=2.0), product of:
              0.15180491 = queryWeight, product of:
                1.2635171 = boost
                6.6470313 = idf(docFreq=155, maxDocs=44218)
                0.018074945 = queryNorm
              0.8812802 = fieldWeight in 6493, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.6470313 = idf(docFreq=155, maxDocs=44218)
                0.09375 = fieldNorm(doc=6493)
          0.12777714 = weight(abstract_txt:automatic in 6493) [ClassicSimilarity], result of:
            0.12777714 = score(doc=6493,freq=2.0), product of:
              0.18549466 = queryWeight, product of:
                1.9752357 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.018074945 = queryNorm
              0.6888454 = fieldWeight in 6493, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.09375 = fieldNorm(doc=6493)
        0.16 = coord(4/25)
    
  4. Maurer, H.: Object-oriented modelling of hyperstructure : overcoming the static link deficiency (1994) 0.06
    0.061396107 = sum of:
      0.061396107 = product of:
        0.38372567 = sum of:
          0.04743809 = weight(abstract_txt:large in 764) [ClassicSimilarity], result of:
            0.04743809 = score(doc=764,freq=1.0), product of:
              0.13632585 = queryWeight, product of:
                1.6933328 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.018074945 = queryNorm
              0.34797573 = fieldWeight in 764, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.078125 = fieldNorm(doc=764)
          0.13367309 = weight(abstract_txt:links in 764) [ClassicSimilarity], result of:
            0.13367309 = score(doc=764,freq=3.0), product of:
              0.18857424 = queryWeight, product of:
                1.9915646 = boost
                5.2385488 = idf(docFreq=637, maxDocs=44218)
                0.018074945 = queryNorm
              0.7088619 = fieldWeight in 764, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.2385488 = idf(docFreq=637, maxDocs=44218)
                0.078125 = fieldNorm(doc=764)
          0.14118883 = weight(abstract_txt:link in 764) [ClassicSimilarity], result of:
            0.14118883 = score(doc=764,freq=2.0), product of:
              0.22388087 = queryWeight, product of:
                2.1700099 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.018074945 = queryNorm
              0.63064265 = fieldWeight in 764, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.078125 = fieldNorm(doc=764)
          0.061425652 = weight(abstract_txt:management in 764) [ClassicSimilarity], result of:
            0.061425652 = score(doc=764,freq=1.0), product of:
              0.18539174 = queryWeight, product of:
                2.4184885 = boost
                4.2410107 = idf(docFreq=1729, maxDocs=44218)
                0.018074945 = queryNorm
              0.33132896 = fieldWeight in 764, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2410107 = idf(docFreq=1729, maxDocs=44218)
                0.078125 = fieldNorm(doc=764)
        0.16 = coord(4/25)
    
  5. ¬The Fourth Text Retrieval Conference (TREC-4) (1996) 0.06
    0.05984454 = sum of:
      0.05984454 = product of:
        0.37402838 = sum of:
          0.0831314 = weight(abstract_txt:matching in 7521) [ClassicSimilarity], result of:
            0.0831314 = score(doc=7521,freq=1.0), product of:
              0.12567286 = queryWeight, product of:
                1.1496323 = boost
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.018074945 = queryNorm
              0.6614905 = fieldWeight in 7521, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.109375 = fieldNorm(doc=7521)
          0.09156359 = weight(abstract_txt:pattern in 7521) [ClassicSimilarity], result of:
            0.09156359 = score(doc=7521,freq=1.0), product of:
              0.13403349 = queryWeight, product of:
                1.1872574 = boost
                6.2458487 = idf(docFreq=232, maxDocs=44218)
                0.018074945 = queryNorm
              0.6831397 = fieldWeight in 7521, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2458487 = idf(docFreq=232, maxDocs=44218)
                0.109375 = fieldNorm(doc=7521)
          0.093922615 = weight(abstract_txt:large in 7521) [ClassicSimilarity], result of:
            0.093922615 = score(doc=7521,freq=2.0), product of:
              0.13632585 = queryWeight, product of:
                1.6933328 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.018074945 = queryNorm
              0.68895674 = fieldWeight in 7521, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.109375 = fieldNorm(doc=7521)
          0.10541077 = weight(abstract_txt:automatic in 7521) [ClassicSimilarity], result of:
            0.10541077 = score(doc=7521,freq=1.0), product of:
              0.18549466 = queryWeight, product of:
                1.9752357 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.018074945 = queryNorm
              0.5682685 = fieldWeight in 7521, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.109375 = fieldNorm(doc=7521)
        0.16 = coord(4/25)