Document (#38186)

Author
Arapakis, I.
Lalmas, M.
Ceylan, H.
Donmez, P.
Title
Automatically embedding newsworthy links to articles : from implementation to evaluation
Source
Journal of the Association for Information Science and Technology. 65(2014) no.1, S.129-145
Year
2014
Abstract
News portals are a popular destination for web users. News providers are therefore interested in attaining higher visitor rates and promoting greater engagement with their content. One aspect of engagement deals with keeping users on site longer by allowing them to have enhanced click-through experiences. News portals have invested in ways to embed links within news stories but so far these links have been curated by news editors. Given the manual effort involved, the use of such links is limited to a small scale. In this article, we evaluate a system-based approach that detects newsworthy events in a news article and locates other articles related to these events. Our system does not rely on resources like Wikipedia to identify events, and it was designed to be domain independent. A rigorous evaluation, using Amazon's Mechanical Turk, was performed to assess the system-embedded links against the manually-curated ones. Our findings reveal that our system's performance is comparable with that of professional editors, and that users find the automatically generated highlights interesting and the associated articles worthy of reading. Our evaluation also provides quantitative and qualitative insights into the curation of links, from the perspective of users and professional editors.
Content
Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.22959/abstract.

Similar documents (author)

  1. Lalmas, M.: Logical models in information retrieval : introduction and overview (1998) 5.30
    5.298757 = sum of:
      5.298757 = weight(author_txt:lalmas in 2668) [ClassicSimilarity], result of:
        5.298757 = fieldWeight in 2668, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.625 = fieldNorm(doc=2668)
    
  2. Lalmas, M.: XML information retrieval (2009) 5.30
    5.298757 = sum of:
      5.298757 = weight(author_txt:lalmas in 3880) [ClassicSimilarity], result of:
        5.298757 = fieldWeight in 3880, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.625 = fieldNorm(doc=3880)
    
  3. Lalmas, M.: XML retrieval (2009) 5.30
    5.298757 = sum of:
      5.298757 = weight(author_txt:lalmas in 4998) [ClassicSimilarity], result of:
        5.298757 = fieldWeight in 4998, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.625 = fieldNorm(doc=4998)
    
  4. Lalmas, M.; Ruthven, I.: ¬A model for structured document retrieval : empirical investigations (1997) 4.24
    4.2390056 = sum of:
      4.2390056 = weight(author_txt:lalmas in 727) [ClassicSimilarity], result of:
        4.2390056 = fieldWeight in 727, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.5 = fieldNorm(doc=727)
    
  5. Lalmas, M.; Ruthven, I.: Representing and retrieving structured documents using the Dempster-Shafer theory of evidence : modelling and evaluation (1998) 4.24
    4.2390056 = sum of:
      4.2390056 = weight(author_txt:lalmas in 1076) [ClassicSimilarity], result of:
        4.2390056 = fieldWeight in 1076, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.5 = fieldNorm(doc=1076)
    

Similar documents (content)

  1. Lehmann, J.; Castillo, C.; Lalmas, M.; Baeza-Yates, R.: Story-focused reading in online news and its potential for user engagement (2017) 0.21
    0.21081015 = sum of:
      0.21081015 = product of:
        0.87837565 = sum of:
          0.016498592 = weight(abstract_txt:that in 3529) [ClassicSimilarity], result of:
            0.016498592 = score(doc=3529,freq=6.0), product of:
              0.04548195 = queryWeight, product of:
                1.1798228 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.016269347 = queryNorm
              0.36275032 = fieldWeight in 3529, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=3529)
          0.12456748 = weight(abstract_txt:engagement in 3529) [ClassicSimilarity], result of:
            0.12456748 = score(doc=3529,freq=2.0), product of:
              0.200374 = queryWeight, product of:
                1.7510678 = boost
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.016269347 = queryNorm
              0.62167484 = fieldWeight in 3529, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.0625 = fieldNorm(doc=3529)
          0.05150192 = weight(abstract_txt:users in 3529) [ClassicSimilarity], result of:
            0.05150192 = score(doc=3529,freq=5.0), product of:
              0.103232674 = queryWeight, product of:
                1.7774847 = boost
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.016269347 = queryNorm
              0.49889165 = fieldWeight in 3529, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.0625 = fieldNorm(doc=3529)
          0.07192966 = weight(abstract_txt:articles in 3529) [ClassicSimilarity], result of:
            0.07192966 = score(doc=3529,freq=3.0), product of:
              0.13894522 = queryWeight, product of:
                1.7858695 = boost
                4.7821565 = idf(docFreq=1006, maxDocs=44218)
                0.016269347 = queryNorm
              0.5176836 = fieldWeight in 3529, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7821565 = idf(docFreq=1006, maxDocs=44218)
                0.0625 = fieldNorm(doc=3529)
          0.18910353 = weight(abstract_txt:links in 3529) [ClassicSimilarity], result of:
            0.18910353 = score(doc=3529,freq=3.0), product of:
              0.33346328 = queryWeight, product of:
                3.9126132 = boost
                5.2385488 = idf(docFreq=637, maxDocs=44218)
                0.016269347 = queryNorm
              0.5670895 = fieldWeight in 3529, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.2385488 = idf(docFreq=637, maxDocs=44218)
                0.0625 = fieldNorm(doc=3529)
          0.42477447 = weight(abstract_txt:news in 3529) [ClassicSimilarity], result of:
            0.42477447 = score(doc=3529,freq=7.0), product of:
              0.43121606 = queryWeight, product of:
                4.4492865 = boost
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.016269347 = queryNorm
              0.9850618 = fieldWeight in 3529, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.0625 = fieldNorm(doc=3529)
        0.24 = coord(6/25)
    
  2. O'Brien, H.L.; Lebow, M.: Mixed-methods approach to measuring user experience in online news interactions (2013) 0.19
    0.18771064 = sum of:
      0.18771064 = product of:
        0.67039514 = sum of:
          0.013471044 = weight(abstract_txt:that in 1001) [ClassicSimilarity], result of:
            0.013471044 = score(doc=1001,freq=4.0), product of:
              0.04548195 = queryWeight, product of:
                1.1798228 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.016269347 = queryNorm
              0.2961844 = fieldWeight in 1001, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=1001)
          0.014563238 = weight(abstract_txt:system in 1001) [ClassicSimilarity], result of:
            0.014563238 = score(doc=1001,freq=1.0), product of:
              0.06909564 = queryWeight, product of:
                1.2593696 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.016269347 = queryNorm
              0.21076928 = fieldWeight in 1001, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.0625 = fieldNorm(doc=1001)
          0.034282602 = weight(abstract_txt:evaluation in 1001) [ClassicSimilarity], result of:
            0.034282602 = score(doc=1001,freq=1.0), product of:
              0.12227224 = queryWeight, product of:
                1.675297 = boost
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.016269347 = queryNorm
              0.2803793 = fieldWeight in 1001, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.0625 = fieldNorm(doc=1001)
          0.15256338 = weight(abstract_txt:engagement in 1001) [ClassicSimilarity], result of:
            0.15256338 = score(doc=1001,freq=3.0), product of:
              0.200374 = queryWeight, product of:
                1.7510678 = boost
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.016269347 = queryNorm
              0.7613931 = fieldWeight in 1001, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.0625 = fieldNorm(doc=1001)
          0.023032358 = weight(abstract_txt:users in 1001) [ClassicSimilarity], result of:
            0.023032358 = score(doc=1001,freq=1.0), product of:
              0.103232674 = queryWeight, product of:
                1.7774847 = boost
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.016269347 = queryNorm
              0.22311112 = fieldWeight in 1001, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.0625 = fieldNorm(doc=1001)
          0.15440239 = weight(abstract_txt:links in 1001) [ClassicSimilarity], result of:
            0.15440239 = score(doc=1001,freq=2.0), product of:
              0.33346328 = queryWeight, product of:
                3.9126132 = boost
                5.2385488 = idf(docFreq=637, maxDocs=44218)
                0.016269347 = queryNorm
              0.46302667 = fieldWeight in 1001, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2385488 = idf(docFreq=637, maxDocs=44218)
                0.0625 = fieldNorm(doc=1001)
          0.27808017 = weight(abstract_txt:news in 1001) [ClassicSimilarity], result of:
            0.27808017 = score(doc=1001,freq=3.0), product of:
              0.43121606 = queryWeight, product of:
                4.4492865 = boost
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.016269347 = queryNorm
              0.64487433 = fieldWeight in 1001, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.0625 = fieldNorm(doc=1001)
        0.28 = coord(7/25)
    
  3. Arapakis, I.; Lalmas, M.; Cambazoglu, B.B.; MarcosM.-C.; Jose, J.M.: User engagement in online news : under the scope of sentiment, interest, affect, and gaze (2014) 0.19
    0.18647106 = sum of:
      0.18647106 = product of:
        0.77696276 = sum of:
          0.015061084 = weight(abstract_txt:that in 1497) [ClassicSimilarity], result of:
            0.015061084 = score(doc=1497,freq=5.0), product of:
              0.04548195 = queryWeight, product of:
                1.1798228 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.016269347 = queryNorm
              0.3311442 = fieldWeight in 1497, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=1497)
          0.15256338 = weight(abstract_txt:engagement in 1497) [ClassicSimilarity], result of:
            0.15256338 = score(doc=1497,freq=3.0), product of:
              0.200374 = queryWeight, product of:
                1.7510678 = boost
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.016269347 = queryNorm
              0.7613931 = fieldWeight in 1497, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.0625 = fieldNorm(doc=1497)
          0.09067428 = weight(abstract_txt:portals in 1497) [ClassicSimilarity], result of:
            0.09067428 = score(doc=1497,freq=1.0), product of:
              0.20428555 = queryWeight, product of:
                1.7680767 = boost
                7.1017675 = idf(docFreq=98, maxDocs=44218)
                0.016269347 = queryNorm
              0.44386047 = fieldWeight in 1497, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1017675 = idf(docFreq=98, maxDocs=44218)
                0.0625 = fieldNorm(doc=1497)
          0.023032358 = weight(abstract_txt:users in 1497) [ClassicSimilarity], result of:
            0.023032358 = score(doc=1497,freq=1.0), product of:
              0.103232674 = queryWeight, product of:
                1.7774847 = boost
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.016269347 = queryNorm
              0.22311112 = fieldWeight in 1497, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.0625 = fieldNorm(doc=1497)
          0.041528612 = weight(abstract_txt:articles in 1497) [ClassicSimilarity], result of:
            0.041528612 = score(doc=1497,freq=1.0), product of:
              0.13894522 = queryWeight, product of:
                1.7858695 = boost
                4.7821565 = idf(docFreq=1006, maxDocs=44218)
                0.016269347 = queryNorm
              0.29888478 = fieldWeight in 1497, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7821565 = idf(docFreq=1006, maxDocs=44218)
                0.0625 = fieldNorm(doc=1497)
          0.45410305 = weight(abstract_txt:news in 1497) [ClassicSimilarity], result of:
            0.45410305 = score(doc=1497,freq=8.0), product of:
              0.43121606 = queryWeight, product of:
                4.4492865 = boost
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.016269347 = queryNorm
              1.0530754 = fieldWeight in 1497, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.0625 = fieldNorm(doc=1497)
        0.24 = coord(6/25)
    
  4. Ou, S.; Khoo, C.S.G.; Goh, D.H.: Multi-document summarization of news articles using an event-based framework (2006) 0.18
    0.17530425 = sum of:
      0.17530425 = product of:
        0.7304344 = sum of:
          0.009525466 = weight(abstract_txt:that in 657) [ClassicSimilarity], result of:
            0.009525466 = score(doc=657,freq=2.0), product of:
              0.04548195 = queryWeight, product of:
                1.1798228 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.016269347 = queryNorm
              0.20943399 = fieldWeight in 657, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
          0.014563238 = weight(abstract_txt:system in 657) [ClassicSimilarity], result of:
            0.014563238 = score(doc=657,freq=1.0), product of:
              0.06909564 = queryWeight, product of:
                1.2593696 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.016269347 = queryNorm
              0.21076928 = fieldWeight in 657, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
          0.048482925 = weight(abstract_txt:evaluation in 657) [ClassicSimilarity], result of:
            0.048482925 = score(doc=657,freq=2.0), product of:
              0.12227224 = queryWeight, product of:
                1.675297 = boost
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.016269347 = queryNorm
              0.3965162 = fieldWeight in 657, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
          0.109874375 = weight(abstract_txt:articles in 657) [ClassicSimilarity], result of:
            0.109874375 = score(doc=657,freq=7.0), product of:
              0.13894522 = queryWeight, product of:
                1.7858695 = boost
                4.7821565 = idf(docFreq=1006, maxDocs=44218)
                0.016269347 = queryNorm
              0.79077476 = fieldWeight in 657, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.7821565 = idf(docFreq=1006, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
          0.09388539 = weight(abstract_txt:events in 657) [ClassicSimilarity], result of:
            0.09388539 = score(doc=657,freq=1.0), product of:
              0.23933746 = queryWeight, product of:
                2.3438685 = boost
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.016269347 = queryNorm
              0.39227203 = fieldWeight in 657, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
          0.45410305 = weight(abstract_txt:news in 657) [ClassicSimilarity], result of:
            0.45410305 = score(doc=657,freq=8.0), product of:
              0.43121606 = queryWeight, product of:
                4.4492865 = boost
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.016269347 = queryNorm
              1.0530754 = fieldWeight in 657, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
        0.24 = coord(6/25)
    
  5. Watters, C.R.; Shepherd, M.A.; Burkowski, F.J.: Electronic news delivery project (1998) 0.17
    0.16805808 = sum of:
      0.16805808 = product of:
        0.8402904 = sum of:
          0.006735522 = weight(abstract_txt:that in 444) [ClassicSimilarity], result of:
            0.006735522 = score(doc=444,freq=1.0), product of:
              0.04548195 = queryWeight, product of:
                1.1798228 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.016269347 = queryNorm
              0.1480922 = fieldWeight in 444, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=444)
          0.012496784 = weight(abstract_txt:have in 444) [ClassicSimilarity], result of:
            0.012496784 = score(doc=444,freq=1.0), product of:
              0.062394194 = queryWeight, product of:
                1.1967405 = boost
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.016269347 = queryNorm
              0.20028761 = fieldWeight in 444, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.0625 = fieldNorm(doc=444)
          0.132774 = weight(abstract_txt:events in 444) [ClassicSimilarity], result of:
            0.132774 = score(doc=444,freq=2.0), product of:
              0.23933746 = queryWeight, product of:
                2.3438685 = boost
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.016269347 = queryNorm
              0.5547564 = fieldWeight in 444, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.0625 = fieldNorm(doc=444)
          0.13212377 = weight(abstract_txt:editors in 444) [ClassicSimilarity], result of:
            0.13212377 = score(doc=444,freq=1.0), product of:
              0.300561 = queryWeight, product of:
                2.6266017 = boost
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.016269347 = queryNorm
              0.4395905 = fieldWeight in 444, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.0625 = fieldNorm(doc=444)
          0.55616033 = weight(abstract_txt:news in 444) [ClassicSimilarity], result of:
            0.55616033 = score(doc=444,freq=12.0), product of:
              0.43121606 = queryWeight, product of:
                4.4492865 = boost
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.016269347 = queryNorm
              1.2897487 = fieldWeight in 444, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.0625 = fieldNorm(doc=444)
        0.2 = coord(5/25)