Document (#25204)

Author
Koehler, W.
Title
Web page change and persistence : a four-year longitudinal study
Source
Journal of the American Society for Information Science and Technology. 53(2002) no.2, pp.162-171
Year
2002
Abstract
Changes in the topography of the Web can be expressed in at least four ways: (1) more sites on more servers in more places, (2) more pages and objects added to existing sites and pages, (3) changes in traffic, and (4) modifications to existing text, graphic, and other Web objects. This article does not address the first three factors (more sites, more pages, more traffic) in the growth of the Web. It focuses instead on changes to an existing set of Web documents. The article documents changes to an aging set of Web pages, first identified and "collected" in December 1996 and followed weekly thereafter. Results are reported through February 2001. The article addresses two related phenomena: (1) the life cycle of Web objects, and (2) changes to Web objects. These data reaffirm that the half-life of a Web page is approximately 2 years. There is variation among Web pages by top-level domain and by page type (navigation, content). Web page content appears to stabilize over time; aging pages change less often than they once did.
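As a rough illustration of what a 2-year half-life implies, the sketch below assumes a simple constant-rate exponential decay. The abstract states only the half-life, so the decay model and the function name `surviving_fraction` are assumptions made for this example, not the article's method.

```python
def surviving_fraction(age_years: float, half_life_years: float = 2.0) -> float:
    """Fraction of an original page set still present after age_years.

    Assumes constant-rate exponential decay, which is an illustrative
    simplification; only the 2-year half-life comes from the abstract.
    """
    return 0.5 ** (age_years / half_life_years)


if __name__ == "__main__":
    # Print the modelled surviving share at a few ages.
    for years in (1, 2, 4):
        print(f"after {years} year(s): {surviving_fraction(years):.3f}")
```

Under this assumed model, about half of the set remains after two years and about a quarter after four.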
Theme
Internet
Informetrics
Object
WWW

Similar documents (author)

  1. Koehler, W.C.: Internet search note : specialized retrieval and Web search engines (1997) 6.09
    6.094361 = sum of:
      6.094361 = weight(author_txt:koehler in 769) [ClassicSimilarity], result of:
        6.094361 = fieldWeight in 769, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.625 = fieldNorm(doc=769)
    
  2. Koehler, W.: An analysis of Web page and Web site constancy and performance (1999) 6.09
    6.094361 = sum of:
      6.094361 = weight(author_txt:koehler in 2945) [ClassicSimilarity], result of:
        6.094361 = fieldWeight in 2945, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.625 = fieldNorm(doc=2945)
    
  3. Koehler, W.; Mincey, D.: FirstSearch and NetFirst - Web and dial-up access : plus ça change, plus c'est la même chose? (1996) 4.88
    4.8754888 = sum of:
      4.8754888 = weight(author_txt:koehler in 6532) [ClassicSimilarity], result of:
        4.8754888 = fieldWeight in 6532, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.5 = fieldNorm(doc=6532)
    
  4. Oguz, F.; Koehler, W.: URL decay at year 20 : a research note (2016) 4.88
    4.8754888 = sum of:
      4.8754888 = weight(author_txt:koehler in 2651) [ClassicSimilarity], result of:
        4.8754888 = fieldWeight in 2651, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.5 = fieldNorm(doc=2651)
    
  5. McDonnell, J.P.; Koehler Jr., W.C.; Carroll, B.C.: Cataloging challenges in an area studies virtual library catalog (ASVLC) : results of a case study (1999) 3.66
    3.6566167 = sum of:
      3.6566167 = weight(author_txt:koehler in 6101) [ClassicSimilarity], result of:
        3.6566167 = fieldWeight in 6101, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.375 = fieldNorm(doc=6101)
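
The author-match scores above all follow the same pattern: fieldWeight = tf × idf × fieldNorm. The sketch below reproduces them, assuming tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which are the formulas that match the tf and idf values printed in this listing. The function names are hypothetical and are not Lucene API calls.

```python
import math


def tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as it appears in the listing."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm."""
    return tf(freq) * idf(doc_freq, max_docs) * field_norm


if __name__ == "__main__":
    # fieldNorm values taken from the five author matches above.
    for norm in (0.625, 0.5, 0.375):
        print(norm, round(field_weight(1.0, 6, 44218, norm), 4))
```

Because the term frequency and idf are identical for every hit, the ranking within this list is driven entirely by fieldNorm, which is smaller for records whose author field lists more names.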
    

Similar documents (content)

  1. Spink, A.; Wolfram, D.; Jansen, B.J.; Saracevic, T.: Searching the Web : the public and their queries (2001) 0.22
    0.22115509 = sum of:
      0.22115509 = product of:
        0.6143197 = sum of:
          0.056943495 = weight(abstract_txt:december in 6980) [ClassicSimilarity], result of:
            0.056943495 = score(doc=6980,freq=1.0), product of:
              0.14972754 = queryWeight, product of:
                1.1324738 = boost
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.016295673 = queryNorm
              0.3803141 = fieldWeight in 6980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.046875 = fieldNorm(doc=6980)
          0.015572969 = weight(abstract_txt:content in 6980) [ClassicSimilarity], result of:
            0.015572969 = score(doc=6980,freq=1.0), product of:
              0.079480976 = queryWeight, product of:
                1.1668739 = boost
                4.17991 = idf(docFreq=1838, maxDocs=44218)
                0.016295673 = queryNorm
              0.19593328 = fieldWeight in 6980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.17991 = idf(docFreq=1838, maxDocs=44218)
                0.046875 = fieldNorm(doc=6980)
          0.054882023 = weight(abstract_txt:change in 6980) [ClassicSimilarity], result of:
            0.054882023 = score(doc=6980,freq=3.0), product of:
              0.12762289 = queryWeight, product of:
                1.4786202 = boost
                5.29663 = idf(docFreq=601, maxDocs=44218)
                0.016295673 = queryNorm
              0.43003276 = fieldWeight in 6980, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.29663 = idf(docFreq=601, maxDocs=44218)
                0.046875 = fieldNorm(doc=6980)
          0.017607775 = weight(abstract_txt:article in 6980) [ClassicSimilarity], result of:
            0.017607775 = score(doc=6980,freq=1.0), product of:
              0.09874512 = queryWeight, product of:
                1.5929266 = boost
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.016295673 = queryNorm
              0.17831539 = fieldWeight in 6980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.046875 = fieldNorm(doc=6980)
          0.07122041 = weight(abstract_txt:sites in 6980) [ClassicSimilarity], result of:
            0.07122041 = score(doc=6980,freq=2.0), product of:
              0.198963 = queryWeight, product of:
                2.2611227 = boost
                5.399778 = idf(docFreq=542, maxDocs=44218)
                0.016295673 = queryNorm
              0.35795808 = fieldWeight in 6980, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.399778 = idf(docFreq=542, maxDocs=44218)
                0.046875 = fieldNorm(doc=6980)
          0.041561402 = weight(abstract_txt:more in 6980) [ClassicSimilarity], result of:
            0.041561402 = score(doc=6980,freq=2.0), product of:
              0.18428433 = queryWeight, product of:
                3.324073 = boost
                3.402088 = idf(docFreq=4002, maxDocs=44218)
                0.016295673 = queryNorm
              0.22552869 = fieldWeight in 6980, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.402088 = idf(docFreq=4002, maxDocs=44218)
                0.046875 = fieldNorm(doc=6980)
          0.13004597 = weight(abstract_txt:page in 6980) [ClassicSimilarity], result of:
            0.13004597 = score(doc=6980,freq=2.0), product of:
              0.32714993 = queryWeight, product of:
                3.3479638 = boost
                5.9964437 = idf(docFreq=298, maxDocs=44218)
                0.016295673 = queryNorm
              0.39751184 = fieldWeight in 6980, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9964437 = idf(docFreq=298, maxDocs=44218)
                0.046875 = fieldNorm(doc=6980)
          0.06712981 = weight(abstract_txt:changes in 6980) [ClassicSimilarity], result of:
            0.06712981 = score(doc=6980,freq=1.0), product of:
              0.28571907 = queryWeight, product of:
                3.498098 = boost
                5.0122757 = idf(docFreq=799, maxDocs=44218)
                0.016295673 = queryNorm
              0.23495042 = fieldWeight in 6980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0122757 = idf(docFreq=799, maxDocs=44218)
                0.046875 = fieldNorm(doc=6980)
          0.15935582 = weight(abstract_txt:pages in 6980) [ClassicSimilarity], result of:
            0.15935582 = score(doc=6980,freq=2.0), product of:
              0.42883605 = queryWeight, product of:
                4.6946 = boost
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.016295673 = queryNorm
              0.3716008 = fieldWeight in 6980, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.046875 = fieldNorm(doc=6980)
        0.36 = coord(9/25)
    
  2. Craven, T.: Changes in metatag descriptions over time (2001) 0.18
    0.184741 = sum of:
      0.184741 = product of:
        0.92370504 = sum of:
          0.059659485 = weight(abstract_txt:four in 6601) [ClassicSimilarity], result of:
            0.059659485 = score(doc=6601,freq=1.0), product of:
              0.12258817 = queryWeight, product of:
                1.4491609 = boost
                5.191103 = idf(docFreq=668, maxDocs=44218)
                0.016295673 = queryNorm
              0.4866659 = fieldWeight in 6601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.191103 = idf(docFreq=668, maxDocs=44218)
                0.09375 = fieldNorm(doc=6601)
          0.0633723 = weight(abstract_txt:change in 6601) [ClassicSimilarity], result of:
            0.0633723 = score(doc=6601,freq=1.0), product of:
              0.12762289 = queryWeight, product of:
                1.4786202 = boost
                5.29663 = idf(docFreq=601, maxDocs=44218)
                0.016295673 = queryNorm
              0.49655905 = fieldWeight in 6601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.29663 = idf(docFreq=601, maxDocs=44218)
                0.09375 = fieldNorm(doc=6601)
          0.058776703 = weight(abstract_txt:more in 6601) [ClassicSimilarity], result of:
            0.058776703 = score(doc=6601,freq=1.0), product of:
              0.18428433 = queryWeight, product of:
                3.324073 = boost
                3.402088 = idf(docFreq=4002, maxDocs=44218)
                0.016295673 = queryNorm
              0.31894574 = fieldWeight in 6601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.402088 = idf(docFreq=4002, maxDocs=44218)
                0.09375 = fieldNorm(doc=6601)
          0.18987179 = weight(abstract_txt:changes in 6601) [ClassicSimilarity], result of:
            0.18987179 = score(doc=6601,freq=2.0), product of:
              0.28571907 = queryWeight, product of:
                3.498098 = boost
                5.0122757 = idf(docFreq=799, maxDocs=44218)
                0.016295673 = queryNorm
              0.6645401 = fieldWeight in 6601, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0122757 = idf(docFreq=799, maxDocs=44218)
                0.09375 = fieldNorm(doc=6601)
          0.5520248 = weight(abstract_txt:pages in 6601) [ClassicSimilarity], result of:
            0.5520248 = score(doc=6601,freq=6.0), product of:
              0.42883605 = queryWeight, product of:
                4.6946 = boost
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.016295673 = queryNorm
              1.287263 = fieldWeight in 6601, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.09375 = fieldNorm(doc=6601)
        0.2 = coord(5/25)
    
  3. Barsky, E.; Bar-Ilan, J.: The impact of task phrasing on the choice of search keywords and on the search process and success (2012) 0.18
    0.17596662 = sum of:
      0.17596662 = product of:
        0.7331943 = sum of:
          0.052275516 = weight(abstract_txt:modifications in 455) [ClassicSimilarity], result of:
            0.052275516 = score(doc=455,freq=1.0), product of:
              0.11674689 = queryWeight, product of:
                7.1642876 = idf(docFreq=92, maxDocs=44218)
                0.016295673 = queryNorm
              0.44776797 = fieldWeight in 455, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1642876 = idf(docFreq=92, maxDocs=44218)
                0.0625 = fieldNorm(doc=455)
          0.08543187 = weight(abstract_txt:persistence in 455) [ClassicSimilarity], result of:
            0.08543187 = score(doc=455,freq=1.0), product of:
              0.16197936 = queryWeight, product of:
                1.1778966 = boost
                8.43879 = idf(docFreq=25, maxDocs=44218)
                0.016295673 = queryNorm
              0.5274244 = fieldWeight in 455, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.43879 = idf(docFreq=25, maxDocs=44218)
                0.0625 = fieldNorm(doc=455)
          0.03977299 = weight(abstract_txt:four in 455) [ClassicSimilarity], result of:
            0.03977299 = score(doc=455,freq=1.0), product of:
              0.12258817 = queryWeight, product of:
                1.4491609 = boost
                5.191103 = idf(docFreq=668, maxDocs=44218)
                0.016295673 = queryNorm
              0.32444394 = fieldWeight in 455, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.191103 = idf(docFreq=668, maxDocs=44218)
                0.0625 = fieldNorm(doc=455)
          0.042911194 = weight(abstract_txt:existing in 455) [ClassicSimilarity], result of:
            0.042911194 = score(doc=455,freq=1.0), product of:
              0.14761616 = queryWeight, product of:
                1.9476231 = boost
                4.6511106 = idf(docFreq=1147, maxDocs=44218)
                0.016295673 = queryNorm
              0.29069442 = fieldWeight in 455, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6511106 = idf(docFreq=1147, maxDocs=44218)
                0.0625 = fieldNorm(doc=455)
          0.30032828 = weight(abstract_txt:page in 455) [ClassicSimilarity], result of:
            0.30032828 = score(doc=455,freq=6.0), product of:
              0.32714993 = queryWeight, product of:
                3.3479638 = boost
                5.9964437 = idf(docFreq=298, maxDocs=44218)
                0.016295673 = queryNorm
              0.9180142 = fieldWeight in 455, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.9964437 = idf(docFreq=298, maxDocs=44218)
                0.0625 = fieldNorm(doc=455)
          0.21247442 = weight(abstract_txt:pages in 455) [ClassicSimilarity], result of:
            0.21247442 = score(doc=455,freq=2.0), product of:
              0.42883605 = queryWeight, product of:
                4.6946 = boost
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.016295673 = queryNorm
              0.49546772 = fieldWeight in 455, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.0625 = fieldNorm(doc=455)
        0.24 = coord(6/25)
    
  4. Lawrence, S.; Giles, C.L.: Accessibility and distribution of information on the Web (1999) 0.17
    0.1697041 = sum of:
      0.1697041 = product of:
        0.7071004 = sum of:
          0.07441688 = weight(abstract_txt:february in 4952) [ClassicSimilarity], result of:
            0.07441688 = score(doc=4952,freq=1.0), product of:
              0.14773864 = queryWeight, product of:
                1.1249272 = boost
                8.059301 = idf(docFreq=37, maxDocs=44218)
                0.016295673 = queryNorm
              0.50370634 = fieldWeight in 4952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.059301 = idf(docFreq=37, maxDocs=44218)
                0.0625 = fieldNorm(doc=4952)
          0.075924665 = weight(abstract_txt:december in 4952) [ClassicSimilarity], result of:
            0.075924665 = score(doc=4952,freq=1.0), product of:
              0.14972754 = queryWeight, product of:
                1.1324738 = boost
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.016295673 = queryNorm
              0.5070855 = fieldWeight in 4952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.0625 = fieldNorm(doc=4952)
          0.035964232 = weight(abstract_txt:content in 4952) [ClassicSimilarity], result of:
            0.035964232 = score(doc=4952,freq=3.0), product of:
              0.079480976 = queryWeight, product of:
                1.1668739 = boost
                4.17991 = idf(docFreq=1838, maxDocs=44218)
                0.016295673 = queryNorm
              0.45248854 = fieldWeight in 4952, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.17991 = idf(docFreq=1838, maxDocs=44218)
                0.0625 = fieldNorm(doc=4952)
          0.21233827 = weight(abstract_txt:sites in 4952) [ClassicSimilarity], result of:
            0.21233827 = score(doc=4952,freq=10.0), product of:
              0.198963 = queryWeight, product of:
                2.2611227 = boost
                5.399778 = idf(docFreq=542, maxDocs=44218)
                0.016295673 = queryNorm
              1.0672249 = fieldWeight in 4952, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                5.399778 = idf(docFreq=542, maxDocs=44218)
                0.0625 = fieldNorm(doc=4952)
          0.095981956 = weight(abstract_txt:more in 4952) [ClassicSimilarity], result of:
            0.095981956 = score(doc=4952,freq=6.0), product of:
              0.18428433 = queryWeight, product of:
                3.324073 = boost
                3.402088 = idf(docFreq=4002, maxDocs=44218)
                0.016295673 = queryNorm
              0.52083623 = fieldWeight in 4952, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.402088 = idf(docFreq=4002, maxDocs=44218)
                0.0625 = fieldNorm(doc=4952)
          0.21247442 = weight(abstract_txt:pages in 4952) [ClassicSimilarity], result of:
            0.21247442 = score(doc=4952,freq=2.0), product of:
              0.42883605 = queryWeight, product of:
                4.6946 = boost
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.016295673 = queryNorm
              0.49546772 = fieldWeight in 4952, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.0625 = fieldNorm(doc=4952)
        0.24 = coord(6/25)
    
  5. Bhavnani, S.K.; Peck, F.A.: Scatter matters : regularities and implications for the scatter of healthcare information on the Web (2010) 0.15
    0.14777622 = sum of:
      0.14777622 = product of:
        0.6157343 = sum of:
          0.020763958 = weight(abstract_txt:content in 3433) [ClassicSimilarity], result of:
            0.020763958 = score(doc=3433,freq=1.0), product of:
              0.079480976 = queryWeight, product of:
                1.1668739 = boost
                4.17991 = idf(docFreq=1838, maxDocs=44218)
                0.016295673 = queryNorm
              0.2612444 = fieldWeight in 3433, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.17991 = idf(docFreq=1838, maxDocs=44218)
                0.0625 = fieldNorm(doc=3433)
          0.023477033 = weight(abstract_txt:article in 3433) [ClassicSimilarity], result of:
            0.023477033 = score(doc=3433,freq=1.0), product of:
              0.09874512 = queryWeight, product of:
                1.5929266 = boost
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.016295673 = queryNorm
              0.23775385 = fieldWeight in 3433, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.0625 = fieldNorm(doc=3433)
          0.042911194 = weight(abstract_txt:existing in 3433) [ClassicSimilarity], result of:
            0.042911194 = score(doc=3433,freq=1.0), product of:
              0.14761616 = queryWeight, product of:
                1.9476231 = boost
                4.6511106 = idf(docFreq=1147, maxDocs=44218)
                0.016295673 = queryNorm
              0.29069442 = fieldWeight in 3433, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6511106 = idf(docFreq=1147, maxDocs=44218)
                0.0625 = fieldNorm(doc=3433)
          0.09496055 = weight(abstract_txt:sites in 3433) [ClassicSimilarity], result of:
            0.09496055 = score(doc=3433,freq=2.0), product of:
              0.198963 = queryWeight, product of:
                2.2611227 = boost
                5.399778 = idf(docFreq=542, maxDocs=44218)
                0.016295673 = queryNorm
              0.47727743 = fieldWeight in 3433, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.399778 = idf(docFreq=542, maxDocs=44218)
                0.0625 = fieldNorm(doc=3433)
          0.17339462 = weight(abstract_txt:page in 3433) [ClassicSimilarity], result of:
            0.17339462 = score(doc=3433,freq=2.0), product of:
              0.32714993 = queryWeight, product of:
                3.3479638 = boost
                5.9964437 = idf(docFreq=298, maxDocs=44218)
                0.016295673 = queryNorm
              0.53001577 = fieldWeight in 3433, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9964437 = idf(docFreq=298, maxDocs=44218)
                0.0625 = fieldNorm(doc=3433)
          0.26022694 = weight(abstract_txt:pages in 3433) [ClassicSimilarity], result of:
            0.26022694 = score(doc=3433,freq=3.0), product of:
              0.42883605 = queryWeight, product of:
                4.6946 = boost
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.016295673 = queryNorm
              0.60682154 = fieldWeight in 3433, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.0625 = fieldNorm(doc=3433)
        0.24 = coord(6/25)
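
The content-similarity scores combine per-term scores (queryWeight × fieldWeight) into a sum that is then scaled by the coord factor, the share of the 25 query terms that matched. The sketch below recomputes the score for entry 2 (Craven, doc 6601) from the boost, freq, and docFreq values shown in its explanation. The helper names are hypothetical, and the tf and idf formulas are the ones that reproduce the numbers in this listing rather than a statement about any particular Lucene version.

```python
import math

MAX_DOCS = 44218          # maxDocs from the listing
QUERY_NORM = 0.016295673  # queryNorm from the listing
FIELD_NORM = 0.09375      # fieldNorm(doc=6601)
QUERY_TERMS = 25          # denominator of coord(5/25)

# term: (boost, freq in abstract, docFreq), copied from the doc 6601 explanation
TERMS = {
    "four": (1.4491609, 1.0, 668),
    "change": (1.4786202, 1.0, 601),
    "more": (3.324073, 1.0, 4002),
    "changes": (3.498098, 2.0, 799),
    "pages": (4.6946, 6.0, 441),
}


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Inverse document frequency as it appears in the listing."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost: float, freq: float, doc_freq: int) -> float:
    """Score of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * FIELD_NORM
    return query_weight * field_weight


def document_score(terms: dict) -> float:
    """Sum of the term scores, scaled by coord = matched terms / query terms."""
    raw = sum(term_score(*values) for values in terms.values())
    coord = len(terms) / QUERY_TERMS
    return raw * coord


if __name__ == "__main__":
    print(round(document_score(TERMS), 4))
```

The result agrees with the listed 0.184741 to about four decimal places; small differences are expected because the listing appears to use single-precision arithmetic.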