Document (#18153)

Author
Thistlewaite, P.
Title
Automatic construction and management of large open webs
Source
Information processing and management. 33(1997) no.2, S.161-173
Year
1997
Abstract
Reviews the problems associated with manually created or maintained hyperdocument links, and the consequent need for automated methods. A number of techniques have been applied to the problem, including pattern-matching, information retrieval, and natural language processing. Describes a system for the automatic detection and management of structural and referential links. Addresses the issues of link-set soundness and completeness, open link management, and the particular problem engendered by large volatile hyperbases
Footnote
Contribution to a special issue on methods and tools for the automatic construction of hypertext

Similar documents (content)

  1. Salton, G.: Automatic text structuring and summarization (1997) 0.08
    0.083310165 = sum of:
      0.083310165 = product of:
        0.52068853 = sum of:
          0.07847477 = weight(abstract_txt:pattern in 2146) [ClassicSimilarity], result of:
            0.07847477 = score(doc=2146,freq=1.0), product of:
              0.13402678 = queryWeight, product of:
                1.1848185 = boost
                6.245499 = idf(docFreq=227, maxDocs=43254)
                0.018112257 = queryNorm
              0.58551556 = fieldWeight in 2146, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.245499 = idf(docFreq=227, maxDocs=43254)
                0.09375 = fieldNorm(doc=2146)
          0.18079235 = weight(abstract_txt:automatic in 2146) [ClassicSimilarity], result of:
            0.18079235 = score(doc=2146,freq=4.0), product of:
              0.18555881 = queryWeight, product of:
                1.9715683 = boost
                5.1963353 = idf(docFreq=650, maxDocs=43254)
                0.018112257 = queryNorm
              0.9743129 = fieldWeight in 2146, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.1963353 = idf(docFreq=650, maxDocs=43254)
                0.09375 = fieldNorm(doc=2146)
          0.09237075 = weight(abstract_txt:links in 2146) [ClassicSimilarity], result of:
            0.09237075 = score(doc=2146,freq=1.0), product of:
              0.18825124 = queryWeight, product of:
                1.9858204 = boost
                5.2338986 = idf(docFreq=626, maxDocs=43254)
                0.018112257 = queryNorm
              0.490678 = fieldWeight in 2146, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2338986 = idf(docFreq=626, maxDocs=43254)
                0.09375 = fieldNorm(doc=2146)
          0.16905066 = weight(abstract_txt:link in 2146) [ClassicSimilarity], result of:
            0.16905066 = score(doc=2146,freq=2.0), product of:
              0.22355421 = queryWeight, product of:
                2.164026 = boost
                5.7035832 = idf(docFreq=391, maxDocs=43254)
                0.018112257 = queryNorm
              0.7561954 = fieldWeight in 2146, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7035832 = idf(docFreq=391, maxDocs=43254)
                0.09375 = fieldNorm(doc=2146)
        0.16 = coord(4/25)
    
  2. Sood, S.O.; Churchill, E.F.; Antin, J.: Automatic identification of personal insults on social news sites (2012) 0.07
    0.06786188 = sum of:
      0.06786188 = product of:
        0.3393094 = sum of:
          0.03812767 = weight(abstract_txt:automated in 1441) [ClassicSimilarity], result of:
            0.03812767 = score(doc=1441,freq=1.0), product of:
              0.108540684 = queryWeight, product of:
                1.0662335 = boost
                5.6204057 = idf(docFreq=425, maxDocs=43254)
                0.018112257 = queryNorm
              0.35127535 = fieldWeight in 1441, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6204057 = idf(docFreq=425, maxDocs=43254)
                0.0625 = fieldNorm(doc=1441)
          0.11773395 = weight(abstract_txt:detection in 1441) [ClassicSimilarity], result of:
            0.11773395 = score(doc=1441,freq=3.0), product of:
              0.15958539 = queryWeight, product of:
                1.2928632 = boost
                6.8150325 = idf(docFreq=128, maxDocs=43254)
                0.018112257 = queryNorm
              0.7377489 = fieldWeight in 1441, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.8150325 = idf(docFreq=128, maxDocs=43254)
                0.0625 = fieldNorm(doc=1441)
          0.03801878 = weight(abstract_txt:problem in 1441) [ClassicSimilarity], result of:
            0.03801878 = score(doc=1441,freq=1.0), product of:
              0.13649221 = queryWeight, product of:
                1.6909275 = boost
                4.4566684 = idf(docFreq=1363, maxDocs=43254)
                0.018112257 = queryNorm
              0.27854177 = fieldWeight in 1441, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4566684 = idf(docFreq=1363, maxDocs=43254)
                0.0625 = fieldNorm(doc=1441)
          0.060264114 = weight(abstract_txt:automatic in 1441) [ClassicSimilarity], result of:
            0.060264114 = score(doc=1441,freq=1.0), product of:
              0.18555881 = queryWeight, product of:
                1.9715683 = boost
                5.1963353 = idf(docFreq=650, maxDocs=43254)
                0.018112257 = queryNorm
              0.32477096 = fieldWeight in 1441, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1963353 = idf(docFreq=650, maxDocs=43254)
                0.0625 = fieldNorm(doc=1441)
          0.08516486 = weight(abstract_txt:management in 1441) [ClassicSimilarity], result of:
            0.08516486 = score(doc=1441,freq=3.0), product of:
              0.18546958 = queryWeight, product of:
                2.4140875 = boost
                4.24177 = idf(docFreq=1690, maxDocs=43254)
                0.018112257 = queryNorm
              0.45918503 = fieldWeight in 1441, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.24177 = idf(docFreq=1690, maxDocs=43254)
                0.0625 = fieldNorm(doc=1441)
        0.2 = coord(5/25)
    
  3. May, A.D.: Automatic classification of e-mail messages by message type (1997) 0.06
    0.062637046 = sum of:
      0.062637046 = product of:
        0.39148155 = sum of:
          0.057191502 = weight(abstract_txt:automated in 562) [ClassicSimilarity], result of:
            0.057191502 = score(doc=562,freq=1.0), product of:
              0.108540684 = queryWeight, product of:
                1.0662335 = boost
                5.6204057 = idf(docFreq=425, maxDocs=43254)
                0.018112257 = queryNorm
              0.52691305 = fieldWeight in 562, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6204057 = idf(docFreq=425, maxDocs=43254)
                0.09375 = fieldNorm(doc=562)
          0.07161965 = weight(abstract_txt:matching in 562) [ClassicSimilarity], result of:
            0.07161965 = score(doc=562,freq=1.0), product of:
              0.12610328 = queryWeight, product of:
                1.1492625 = boost
                6.058074 = idf(docFreq=274, maxDocs=43254)
                0.018112257 = queryNorm
              0.5679444 = fieldWeight in 562, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.058074 = idf(docFreq=274, maxDocs=43254)
                0.09375 = fieldNorm(doc=562)
          0.13483092 = weight(abstract_txt:manually in 562) [ClassicSimilarity], result of:
            0.13483092 = score(doc=562,freq=2.0), product of:
              0.15260002 = queryWeight, product of:
                1.2642511 = boost
                6.66421 = idf(docFreq=149, maxDocs=43254)
                0.018112257 = queryNorm
              0.88355774 = fieldWeight in 562, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.66421 = idf(docFreq=149, maxDocs=43254)
                0.09375 = fieldNorm(doc=562)
          0.12783948 = weight(abstract_txt:automatic in 562) [ClassicSimilarity], result of:
            0.12783948 = score(doc=562,freq=2.0), product of:
              0.18555881 = queryWeight, product of:
                1.9715683 = boost
                5.1963353 = idf(docFreq=650, maxDocs=43254)
                0.018112257 = queryNorm
              0.6889432 = fieldWeight in 562, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1963353 = idf(docFreq=650, maxDocs=43254)
                0.09375 = fieldNorm(doc=562)
        0.16 = coord(4/25)
    
  4. Maurer, H.: Object-oriented modelling of hyperstructure : overcoming the static link deficiency (1994) 0.06
    0.06136287 = sum of:
      0.06136287 = product of:
        0.38351795 = sum of:
          0.047854282 = weight(abstract_txt:large in 1833) [ClassicSimilarity], result of:
            0.047854282 = score(doc=1833,freq=1.0), product of:
              0.13712488 = queryWeight, product of:
                1.6948419 = boost
                4.466985 = idf(docFreq=1349, maxDocs=43254)
                0.018112257 = queryNorm
              0.34898323 = fieldWeight in 1833, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.466985 = idf(docFreq=1349, maxDocs=43254)
                0.078125 = fieldNorm(doc=1833)
          0.13332568 = weight(abstract_txt:links in 1833) [ClassicSimilarity], result of:
            0.13332568 = score(doc=1833,freq=3.0), product of:
              0.18825124 = queryWeight, product of:
                1.9858204 = boost
                5.2338986 = idf(docFreq=626, maxDocs=43254)
                0.018112257 = queryNorm
              0.70823264 = fieldWeight in 1833, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.2338986 = idf(docFreq=626, maxDocs=43254)
                0.078125 = fieldNorm(doc=1833)
          0.14087556 = weight(abstract_txt:link in 1833) [ClassicSimilarity], result of:
            0.14087556 = score(doc=1833,freq=2.0), product of:
              0.22355421 = queryWeight, product of:
                2.164026 = boost
                5.7035832 = idf(docFreq=391, maxDocs=43254)
                0.018112257 = queryNorm
              0.6301629 = fieldWeight in 1833, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7035832 = idf(docFreq=391, maxDocs=43254)
                0.078125 = fieldNorm(doc=1833)
          0.061462443 = weight(abstract_txt:management in 1833) [ClassicSimilarity], result of:
            0.061462443 = score(doc=1833,freq=1.0), product of:
              0.18546958 = queryWeight, product of:
                2.4140875 = boost
                4.24177 = idf(docFreq=1690, maxDocs=43254)
                0.018112257 = queryNorm
              0.33138826 = fieldWeight in 1833, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.24177 = idf(docFreq=1690, maxDocs=43254)
                0.078125 = fieldNorm(doc=1833)
        0.16 = coord(4/25)
    
  5. ¬The Fourth Text Retrieval Conference (TREC-4) (1996) 0.06
    0.06005104 = sum of:
      0.06005104 = product of:
        0.375319 = sum of:
          0.083556265 = weight(abstract_txt:matching in 591) [ClassicSimilarity], result of:
            0.083556265 = score(doc=591,freq=1.0), product of:
              0.12610328 = queryWeight, product of:
                1.1492625 = boost
                6.058074 = idf(docFreq=274, maxDocs=43254)
                0.018112257 = queryNorm
              0.6626018 = fieldWeight in 591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.058074 = idf(docFreq=274, maxDocs=43254)
                0.109375 = fieldNorm(doc=591)
          0.09155389 = weight(abstract_txt:pattern in 591) [ClassicSimilarity], result of:
            0.09155389 = score(doc=591,freq=1.0), product of:
              0.13402678 = queryWeight, product of:
                1.1848185 = boost
                6.245499 = idf(docFreq=227, maxDocs=43254)
                0.018112257 = queryNorm
              0.6831015 = fieldWeight in 591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.245499 = idf(docFreq=227, maxDocs=43254)
                0.109375 = fieldNorm(doc=591)
          0.094746634 = weight(abstract_txt:large in 591) [ClassicSimilarity], result of:
            0.094746634 = score(doc=591,freq=2.0), product of:
              0.13712488 = queryWeight, product of:
                1.6948419 = boost
                4.466985 = idf(docFreq=1349, maxDocs=43254)
                0.018112257 = queryNorm
              0.69095147 = fieldWeight in 591, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.466985 = idf(docFreq=1349, maxDocs=43254)
                0.109375 = fieldNorm(doc=591)
          0.1054622 = weight(abstract_txt:automatic in 591) [ClassicSimilarity], result of:
            0.1054622 = score(doc=591,freq=1.0), product of:
              0.18555881 = queryWeight, product of:
                1.9715683 = boost
                5.1963353 = idf(docFreq=650, maxDocs=43254)
                0.018112257 = queryNorm
              0.5683492 = fieldWeight in 591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1963353 = idf(docFreq=650, maxDocs=43254)
                0.109375 = fieldNorm(doc=591)
        0.16 = coord(4/25)