Document (#33362)

Author
Luyt, B.
Aaron, T.C.H.
Thian, L.H.
Hong, C.K.
Title
Improving Wikipedia's accuracy : is edit age a solution?
Source
Journal of the American Society for Information Science and Technology. 59(2008) no.2, pp. 318-330
Year
2008
Abstract
Wikipedia is fast becoming a key information source for many despite criticism that it is unreliable and inaccurate. A number of recommendations have been made to sort the chaff from the wheat in Wikipedia, among which is the idea of color-coding article segment edits according to age (Cross, 2006). Using data collected as part of a wider study published in Nature, this article examines the distribution of errors throughout the life of a select group of Wikipedia articles. The survival time of each error edit, in terms of edit count and days, was calculated, and the hypothesis that surviving material added by older edits is more trustworthy was tested. Surprisingly, we find that roughly 20% of errors can be attributed to surviving text added by the first edit, which confirms the existence of a first-mover effect (Viegas, Wattenberg, & Kushal, 2004) whereby material added by early edits is less likely to be removed. We suggest that the sizable number of errors added by early edits is simply a result of more material being added near the beginning of the life of the article. Overall, the results do not provide support for the idea of trusting surviving segments attributed to older edits, because such edits tend to add more material and hence contain more errors, which do not seem to be offset by greater opportunities for error correction by later edits.
Theme
Information resources (Informationsmittel)
Object
Wikipedia

Similar documents (author)

  1. Luyt, B.: Defining the digital divide : the role of e-readiness indicators (2006) 2.15
    2.1474473 = sum of:
      2.1474473 = product of:
        4.2948947 = sum of:
          4.2948947 = weight(author_txt:luyt in 1778) [ClassicSimilarity], result of:
            4.2948947 = score(doc=1778,freq=1.0), product of:
              0.7401874 = queryWeight, product of:
                1.0491965 = boost
                9.283908 = idf(docFreq=10, maxDocs=43556)
                0.07598958 = queryNorm
              5.8024426 = fieldWeight in 1778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.283908 = idf(docFreq=10, maxDocs=43556)
                0.625 = fieldNorm(doc=1778)
        0.5 = coord(1/2)
    
  2. Luyt, B.: Centres of calculation and unruly colonists : the colonial library in Singapore and its users, 1874-1900 (2008) 2.15
    2.1474473 = sum of:
      2.1474473 = product of:
        4.2948947 = sum of:
          4.2948947 = weight(author_txt:luyt in 3893) [ClassicSimilarity], result of:
            4.2948947 = score(doc=3893,freq=1.0), product of:
              0.7401874 = queryWeight, product of:
                1.0491965 = boost
                9.283908 = idf(docFreq=10, maxDocs=43556)
                0.07598958 = queryNorm
              5.8024426 = fieldWeight in 3893, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.283908 = idf(docFreq=10, maxDocs=43556)
                0.625 = fieldNorm(doc=3893)
        0.5 = coord(1/2)
    
  3. Luyt, B.: The nature of historical representation on Wikipedia : dominant or alterative historiography? (2011) 2.15
    2.1474473 = sum of:
      2.1474473 = product of:
        4.2948947 = sum of:
          4.2948947 = weight(author_txt:luyt in 1457) [ClassicSimilarity], result of:
            4.2948947 = score(doc=1457,freq=1.0), product of:
              0.7401874 = queryWeight, product of:
                1.0491965 = boost
                9.283908 = idf(docFreq=10, maxDocs=43556)
                0.07598958 = queryNorm
              5.8024426 = fieldWeight in 1457, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.283908 = idf(docFreq=10, maxDocs=43556)
                0.625 = fieldNorm(doc=1457)
        0.5 = coord(1/2)
    
  4. Luyt, B.: The inclusivity of Wikipedia and the drawing of expert boundaries : an examination of talk pages and reference lists (2012) 2.15
    2.1474473 = sum of:
      2.1474473 = product of:
        4.2948947 = sum of:
          4.2948947 = weight(author_txt:luyt in 2389) [ClassicSimilarity], result of:
            4.2948947 = score(doc=2389,freq=1.0), product of:
              0.7401874 = queryWeight, product of:
                1.0491965 = boost
                9.283908 = idf(docFreq=10, maxDocs=43556)
                0.07598958 = queryNorm
              5.8024426 = fieldWeight in 2389, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.283908 = idf(docFreq=10, maxDocs=43556)
                0.625 = fieldNorm(doc=2389)
        0.5 = coord(1/2)
    
  5. Luyt, B.: History on Wikipedia : In need of a NWICO (New World Information and Communication Order)? the case of Cambodia (2013) 2.15
    2.1474473 = sum of:
      2.1474473 = product of:
        4.2948947 = sum of:
          4.2948947 = weight(author_txt:luyt in 2949) [ClassicSimilarity], result of:
            4.2948947 = score(doc=2949,freq=1.0), product of:
              0.7401874 = queryWeight, product of:
                1.0491965 = boost
                9.283908 = idf(docFreq=10, maxDocs=43556)
                0.07598958 = queryNorm
              5.8024426 = fieldWeight in 2949, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.283908 = idf(docFreq=10, maxDocs=43556)
                0.625 = fieldNorm(doc=2949)
        0.5 = coord(1/2)
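
The author-match trees above all follow the same arithmetic, so one of them can be reproduced by hand. The following is a minimal sketch, assuming the standard Lucene ClassicSimilarity definitions (idf = 1 + ln(maxDocs / (docFreq + 1)), tf = sqrt(freq)). The function names are illustrative and not part of any library; boost, queryNorm and fieldNorm are simply copied from the listing for doc 1778 rather than derived.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency, assuming ClassicSimilarity's formula."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    """Score of one matching term: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq, max_docs)
    tf = math.sqrt(freq)                    # tf(freq=1.0) = 1.0 in the listing
    query_weight = boost * idf * query_norm
    field_weight = tf * idf * field_norm
    return query_weight * field_weight


# Values copied from the explain tree for doc 1778 (author_txt:luyt).
term_score = classic_term_score(
    freq=1.0,
    doc_freq=10,
    max_docs=43556,
    boost=1.0491965,
    query_norm=0.07598958,
    field_norm=0.625,
)
coord = 1 / 2                               # coord(1/2): one of two query clauses matched
final_score = term_score * coord
print(term_score, final_score)
```

The printed values should agree with the listed 4.2948947 and 2.1474473 to roughly six significant digits; small differences are expected because the listing appears to use single-precision arithmetic while Python uses double precision.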
    

Similar documents (content)

  1. Fallis, D.: Toward an epistemology of Wikipedia (2008) 0.14
    0.14110275 = sum of:
      0.14110275 = product of:
        0.5879281 = sum of:
          0.014691517 = weight(abstract_txt:number in 4008) [ClassicSimilarity], result of:
            0.014691517 = score(doc=4008,freq=1.0), product of:
              0.06512836 = queryWeight, product of:
                4.124852 = idf(docFreq=1913, maxDocs=43556)
                0.01578926 = queryNorm
              0.22557786 = fieldWeight in 4008, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.124852 = idf(docFreq=1913, maxDocs=43556)
                0.0546875 = fieldNorm(doc=4008)
          0.017887095 = weight(abstract_txt:that in 4008) [ClassicSimilarity], result of:
            0.017887095 = score(doc=4008,freq=10.0), product of:
              0.043427244 = queryWeight, product of:
                1.1548114 = boost
                2.3817132 = idf(docFreq=10938, maxDocs=43556)
                0.01578926 = queryNorm
              0.41188648 = fieldWeight in 4008, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                2.3817132 = idf(docFreq=10938, maxDocs=43556)
                0.0546875 = fieldNorm(doc=4008)
          0.024755038 = weight(abstract_txt:article in 4008) [ClassicSimilarity], result of:
            0.024755038 = score(doc=4008,freq=2.0), product of:
              0.083789304 = queryWeight, product of:
                1.3891683 = boost
                3.8200758 = idf(docFreq=2595, maxDocs=43556)
                0.01578926 = queryNorm
              0.2954439 = fieldWeight in 4008, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.8200758 = idf(docFreq=2595, maxDocs=43556)
                0.0546875 = fieldNorm(doc=4008)
          0.016604729 = weight(abstract_txt:more in 4008) [ClassicSimilarity], result of:
            0.016604729 = score(doc=4008,freq=1.0), product of:
              0.089034215 = queryWeight, product of:
                1.6535159 = boost
                3.4102545 = idf(docFreq=3910, maxDocs=43556)
                0.01578926 = queryNorm
              0.18649828 = fieldWeight in 4008, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4102545 = idf(docFreq=3910, maxDocs=43556)
                0.0546875 = fieldNorm(doc=4008)
          0.27908698 = weight(abstract_txt:wikipedia in 4008) [ClassicSimilarity], result of:
            0.27908698 = score(doc=4008,freq=13.0), product of:
              0.22573632 = queryWeight, product of:
                2.2801387 = boost
                6.270157 = idf(docFreq=223, maxDocs=43556)
                0.01578926 = queryNorm
              1.2363406 = fieldWeight in 4008, product of:
                3.6055512 = tf(freq=13.0), with freq of:
                  13.0 = termFreq=13.0
                6.270157 = idf(docFreq=223, maxDocs=43556)
                0.0546875 = fieldNorm(doc=4008)
          0.23490278 = weight(abstract_txt:edit in 4008) [ClassicSimilarity], result of:
            0.23490278 = score(doc=4008,freq=1.0), product of:
              0.52078825 = queryWeight, product of:
                3.999084 = boost
                8.247815 = idf(docFreq=30, maxDocs=43556)
                0.01578926 = queryNorm
              0.4510524 = fieldWeight in 4008, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.247815 = idf(docFreq=30, maxDocs=43556)
                0.0546875 = fieldNorm(doc=4008)
        0.24 = coord(6/25)
    
  2. Tsikerdekis, M.: Personal communication networks and their positive effects on online collaboration and outcome quality on Wikipedia : an empirical exploration (2016) 0.12
    0.120948754 = sum of:
      0.120948754 = product of:
        0.50395316 = sum of:
          0.00899035 = weight(abstract_txt:which in 4844) [ClassicSimilarity], result of:
            0.00899035 = score(doc=4844,freq=1.0), product of:
              0.049160082 = queryWeight, product of:
                1.0640618 = boost
                2.9260652 = idf(docFreq=6346, maxDocs=43556)
                0.01578926 = queryNorm
              0.18287908 = fieldWeight in 4844, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9260652 = idf(docFreq=6346, maxDocs=43556)
                0.0625 = fieldNorm(doc=4844)
          0.102060445 = weight(abstract_txt:wikipedia's in 4844) [ClassicSimilarity], result of:
            0.102060445 = score(doc=4844,freq=1.0), product of:
              0.17217077 = queryWeight, product of:
                1.149687 = boost
                9.484578 = idf(docFreq=8, maxDocs=43556)
                0.01578926 = queryNorm
              0.59278613 = fieldWeight in 4844, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.484578 = idf(docFreq=8, maxDocs=43556)
                0.0625 = fieldNorm(doc=4844)
          0.009142117 = weight(abstract_txt:that in 4844) [ClassicSimilarity], result of:
            0.009142117 = score(doc=4844,freq=2.0), product of:
              0.043427244 = queryWeight, product of:
                1.1548114 = boost
                2.3817132 = idf(docFreq=10938, maxDocs=43556)
                0.01578926 = queryNorm
              0.2105157 = fieldWeight in 4844, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3817132 = idf(docFreq=10938, maxDocs=43556)
                0.0625 = fieldNorm(doc=4844)
          0.026837293 = weight(abstract_txt:more in 4844) [ClassicSimilarity], result of:
            0.026837293 = score(doc=4844,freq=2.0), product of:
              0.089034215 = queryWeight, product of:
                1.6535159 = boost
                3.4102545 = idf(docFreq=3910, maxDocs=43556)
                0.01578926 = queryNorm
              0.30142674 = fieldWeight in 4844, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4102545 = idf(docFreq=3910, maxDocs=43556)
                0.0625 = fieldNorm(doc=4844)
          0.088462636 = weight(abstract_txt:wikipedia in 4844) [ClassicSimilarity], result of:
            0.088462636 = score(doc=4844,freq=1.0), product of:
              0.22573632 = queryWeight, product of:
                2.2801387 = boost
                6.270157 = idf(docFreq=223, maxDocs=43556)
                0.01578926 = queryNorm
              0.3918848 = fieldWeight in 4844, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.270157 = idf(docFreq=223, maxDocs=43556)
                0.0625 = fieldNorm(doc=4844)
          0.26846033 = weight(abstract_txt:edit in 4844) [ClassicSimilarity], result of:
            0.26846033 = score(doc=4844,freq=1.0), product of:
              0.52078825 = queryWeight, product of:
                3.999084 = boost
                8.247815 = idf(docFreq=30, maxDocs=43556)
                0.01578926 = queryNorm
              0.51548845 = fieldWeight in 4844, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.247815 = idf(docFreq=30, maxDocs=43556)
                0.0625 = fieldNorm(doc=4844)
        0.24 = coord(6/25)
    
  3. Pope, J.T.; Holley, R.P.: Google Book Search and metadata (2011) 0.12
    0.11554044 = sum of:
      0.11554044 = product of:
        0.4814185 = sum of:
          0.097990364 = weight(abstract_txt:inaccurate in 3885) [ClassicSimilarity], result of:
            0.097990364 = score(doc=3885,freq=1.0), product of:
              0.14440094 = queryWeight, product of:
                1.0528947 = boost
                8.68607 = idf(docFreq=19, maxDocs=43556)
                0.01578926 = queryNorm
              0.67859924 = fieldWeight in 3885, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.68607 = idf(docFreq=19, maxDocs=43556)
                0.078125 = fieldNorm(doc=3885)
          0.011237938 = weight(abstract_txt:which in 3885) [ClassicSimilarity], result of:
            0.011237938 = score(doc=3885,freq=1.0), product of:
              0.049160082 = queryWeight, product of:
                1.0640618 = boost
                2.9260652 = idf(docFreq=6346, maxDocs=43556)
                0.01578926 = queryNorm
              0.22859885 = fieldWeight in 3885, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9260652 = idf(docFreq=6346, maxDocs=43556)
                0.078125 = fieldNorm(doc=3885)
          0.013995949 = weight(abstract_txt:that in 3885) [ClassicSimilarity], result of:
            0.013995949 = score(doc=3885,freq=3.0), product of:
              0.043427244 = queryWeight, product of:
                1.1548114 = boost
                2.3817132 = idf(docFreq=10938, maxDocs=43556)
                0.01578926 = queryNorm
              0.322285 = fieldWeight in 3885, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3817132 = idf(docFreq=10938, maxDocs=43556)
                0.078125 = fieldNorm(doc=3885)
          0.025006365 = weight(abstract_txt:article in 3885) [ClassicSimilarity], result of:
            0.025006365 = score(doc=3885,freq=1.0), product of:
              0.083789304 = queryWeight, product of:
                1.3891683 = boost
                3.8200758 = idf(docFreq=2595, maxDocs=43556)
                0.01578926 = queryNorm
              0.2984434 = fieldWeight in 3885, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8200758 = idf(docFreq=2595, maxDocs=43556)
                0.078125 = fieldNorm(doc=3885)
          0.09593187 = weight(abstract_txt:error in 3885) [ClassicSimilarity], result of:
            0.09593187 = score(doc=3885,freq=1.0), product of:
              0.17937686 = queryWeight, product of:
                1.6595798 = boost
                6.845521 = idf(docFreq=125, maxDocs=43556)
                0.01578926 = queryNorm
              0.5348063 = fieldWeight in 3885, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.845521 = idf(docFreq=125, maxDocs=43556)
                0.078125 = fieldNorm(doc=3885)
          0.237256 = weight(abstract_txt:errors in 3885) [ClassicSimilarity], result of:
            0.237256 = score(doc=3885,freq=2.0), product of:
              0.3280469 = queryWeight, product of:
                3.173934 = boost
                6.5460043 = idf(docFreq=169, maxDocs=43556)
                0.01578926 = queryNorm
              0.7232381 = fieldWeight in 3885, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5460043 = idf(docFreq=169, maxDocs=43556)
                0.078125 = fieldNorm(doc=3885)
        0.24 = coord(6/25)
    
  4. Subelj, L.; Fiala, D.: Publication boost in web of science journals and its effect on citation distributions (2017) 0.11
    0.1103043 = sum of:
      0.1103043 = product of:
        0.39394394 = sum of:
          0.029681345 = weight(abstract_txt:number in 535) [ClassicSimilarity], result of:
            0.029681345 = score(doc=535,freq=2.0), product of:
              0.06512836 = queryWeight, product of:
                4.124852 = idf(docFreq=1913, maxDocs=43556)
                0.01578926 = queryNorm
              0.45573607 = fieldWeight in 535, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.124852 = idf(docFreq=1913, maxDocs=43556)
                0.078125 = fieldNorm(doc=535)
          0.021852288 = weight(abstract_txt:first in 535) [ClassicSimilarity], result of:
            0.021852288 = score(doc=535,freq=1.0), product of:
              0.06690456 = queryWeight, product of:
                1.0135444 = boost
                4.180721 = idf(docFreq=1809, maxDocs=43556)
                0.01578926 = queryNorm
              0.32661882 = fieldWeight in 535, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.180721 = idf(docFreq=1809, maxDocs=43556)
                0.078125 = fieldNorm(doc=535)
          0.011427646 = weight(abstract_txt:that in 535) [ClassicSimilarity], result of:
            0.011427646 = score(doc=535,freq=2.0), product of:
              0.043427244 = queryWeight, product of:
                1.1548114 = boost
                2.3817132 = idf(docFreq=10938, maxDocs=43556)
                0.01578926 = queryNorm
              0.2631446 = fieldWeight in 535, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3817132 = idf(docFreq=10938, maxDocs=43556)
                0.078125 = fieldNorm(doc=535)
          0.03536434 = weight(abstract_txt:article in 535) [ClassicSimilarity], result of:
            0.03536434 = score(doc=535,freq=2.0), product of:
              0.083789304 = queryWeight, product of:
                1.3891683 = boost
                3.8200758 = idf(docFreq=2595, maxDocs=43556)
                0.01578926 = queryNorm
              0.42206272 = fieldWeight in 535, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.8200758 = idf(docFreq=2595, maxDocs=43556)
                0.078125 = fieldNorm(doc=535)
          0.05304186 = weight(abstract_txt:more in 535) [ClassicSimilarity], result of:
            0.05304186 = score(doc=535,freq=5.0), product of:
              0.089034215 = queryWeight, product of:
                1.6535159 = boost
                3.4102545 = idf(docFreq=3910, maxDocs=43556)
                0.01578926 = queryNorm
              0.59574693 = fieldWeight in 535, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4102545 = idf(docFreq=3910, maxDocs=43556)
                0.078125 = fieldNorm(doc=535)
          0.106432036 = weight(abstract_txt:older in 535) [ClassicSimilarity], result of:
            0.106432036 = score(doc=535,freq=1.0), product of:
              0.19223805 = queryWeight, product of:
                1.7180452 = boost
                7.086683 = idf(docFreq=98, maxDocs=43556)
                0.01578926 = queryNorm
              0.5536471 = fieldWeight in 535, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.086683 = idf(docFreq=98, maxDocs=43556)
                0.078125 = fieldNorm(doc=535)
          0.13614443 = weight(abstract_txt:attributed in 535) [ClassicSimilarity], result of:
            0.13614443 = score(doc=535,freq=1.0), product of:
              0.22652929 = queryWeight, product of:
                1.8649926 = boost
                7.6928186 = idf(docFreq=53, maxDocs=43556)
                0.01578926 = queryNorm
              0.60100144 = fieldWeight in 535, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6928186 = idf(docFreq=53, maxDocs=43556)
                0.078125 = fieldNorm(doc=535)
        0.28 = coord(7/25)
    
  5. Tenopir, C.: Common end user errors (1997) 0.11
    0.10603218 = sum of:
      0.10603218 = product of:
        0.66270113 = sum of:
          0.013485525 = weight(abstract_txt:which in 1408) [ClassicSimilarity], result of:
            0.013485525 = score(doc=1408,freq=1.0), product of:
              0.049160082 = queryWeight, product of:
                1.0640618 = boost
                2.9260652 = idf(docFreq=6346, maxDocs=43556)
                0.01578926 = queryNorm
              0.2743186 = fieldWeight in 1408, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9260652 = idf(docFreq=6346, maxDocs=43556)
                0.09375 = fieldNorm(doc=1408)
          0.01679514 = weight(abstract_txt:that in 1408) [ClassicSimilarity], result of:
            0.01679514 = score(doc=1408,freq=3.0), product of:
              0.043427244 = queryWeight, product of:
                1.1548114 = boost
                2.3817132 = idf(docFreq=10938, maxDocs=43556)
                0.01578926 = queryNorm
              0.386742 = fieldWeight in 1408, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3817132 = idf(docFreq=10938, maxDocs=43556)
                0.09375 = fieldNorm(doc=1408)
          0.02846525 = weight(abstract_txt:more in 1408) [ClassicSimilarity], result of:
            0.02846525 = score(doc=1408,freq=1.0), product of:
              0.089034215 = queryWeight, product of:
                1.6535159 = boost
                3.4102545 = idf(docFreq=3910, maxDocs=43556)
                0.01578926 = queryNorm
              0.31971136 = fieldWeight in 1408, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4102545 = idf(docFreq=3910, maxDocs=43556)
                0.09375 = fieldNorm(doc=1408)
          0.6039552 = weight(abstract_txt:errors in 1408) [ClassicSimilarity], result of:
            0.6039552 = score(doc=1408,freq=9.0), product of:
              0.3280469 = queryWeight, product of:
                3.173934 = boost
                6.5460043 = idf(docFreq=169, maxDocs=43556)
                0.01578926 = queryNorm
              1.8410636 = fieldWeight in 1408, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                6.5460043 = idf(docFreq=169, maxDocs=43556)
                0.09375 = fieldNorm(doc=1408)
        0.16 = coord(4/25)
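
The content-match trees combine several terms: each matching term contributes its own score, the contributions are summed, and the sum is scaled by the coord factor (matched terms over query terms). The sketch below recomputes the total for doc 1408 under the same assumed ClassicSimilarity formulas. The tuple layout and helper name are hypothetical; all numbers are copied from the listing.

```python
import math

MAX_DOCS = 43556
QUERY_NORM = 0.01578926
QUERY_TERMS = 25  # denominator of coord(4/25) in the listing

# (term, freq, boost, docFreq, fieldNorm), copied from the tree for doc 1408.
matched_terms = [
    ("which", 1.0, 1.0640618, 6346, 0.09375),
    ("that", 3.0, 1.1548114, 10938, 0.09375),
    ("more", 1.0, 1.6535159, 3910, 0.09375),
    ("errors", 9.0, 3.173934, 169, 0.09375),
]


def term_score(freq, boost, doc_freq, field_norm):
    """queryWeight * fieldWeight for a single term (assumed formulas)."""
    idf = 1.0 + math.log(MAX_DOCS / (doc_freq + 1))
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * field_norm
    return query_weight * field_weight


# Sum the per-term contributions, then apply the coordination factor.
partial_sum = sum(term_score(f, b, df, fn) for _, f, b, df, fn in matched_terms)
coord = len(matched_terms) / QUERY_TERMS
doc_score = partial_sum * coord
print(partial_sum, doc_score)
```

Note how the single term "errors" (freq=9.0, high boost, high idf) dominates the sum, which is why a document matching only 4 of 25 query terms still ranks among the top five.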