Document (#42094)

Author
Zielinski, K.
Nielek, R.
Wierzbicki, A.
Jatowt, A.
Title
Computing controversy : formal model and algorithms for detecting controversy on Wikipedia and in search queries
Source
Information processing and management. 54(2018) no.1, S.14-36
Year
2018
Abstract
Controversy is a complex concept that has been attracting attention of scholars from diverse fields. In the era of Internet and social media, detecting controversy and controversial concepts by the means of automatic methods is especially important. Web searchers could be alerted when the contents they consume are controversial or when they attempt to acquire information on disputed topics. Presenting users with the indications and explanations of the controversy should offer them chance to see the "wider picture" rather than letting them obtain one-sided views. In this work we first introduce a formal model of controversy as the basis of computational approaches to detecting controversial concepts. Then we propose a classification based method for automatic detection of controversial articles and categories in Wikipedia. Next, we demonstrate how to use the obtained results for the estimation of the controversy level of search queries. The proposed method can be incorporated into search engines as a component responsible for detection of queries related to controversial topics. The method is independent of the search engine's retrieval and search results recommendation algorithms, and is therefore unaffected by a possible filter bubble. Our approach can be also applied in Wikipedia or other knowledge bases for supporting the detection of controversy and content maintenance. Finally, we believe that our results could be useful for social science researchers for understanding the complex nature of controversy and in fostering their studies.
Content
Vgl.: https://doi.org/10.1016/j.ipm.2017.08.005.
Theme
Informationsmittel
Internet
Field
Kommunikationswissenschaften
Object
Wikipedia

Similar documents (author)

  1. Jatowt, A.; Yeung, C.M.A.; Tanaka, K.: Generic method for detecting focus time of documents (2015) 3.71
    3.7144227 = sum of:
      3.7144227 = weight(author_txt:jatowt in 2668) [ClassicSimilarity], result of:
        3.7144227 = fieldWeight in 2668, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.375 = fieldNorm(doc=2668)
    
  2. Joho, H.; Jatowt, A.; Blanco, R.: Temporal information searching behaviour and strategies (2015) 3.71
    3.7144227 = sum of:
      3.7144227 = weight(author_txt:jatowt in 2674) [ClassicSimilarity], result of:
        3.7144227 = fieldWeight in 2674, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.375 = fieldNorm(doc=2674)
    
  3. Chin, J.Y.; Bhowmick, S.S.; Jatowt, A.: On-demand recent personal tweets summarization on mobile devices (2019) 3.71
    3.7144227 = sum of:
      3.7144227 = weight(author_txt:jatowt in 5246) [ClassicSimilarity], result of:
        3.7144227 = fieldWeight in 5246, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.375 = fieldNorm(doc=5246)
    
  4. Lee, J.; Jatowt, A.; Kim, K.-S..: Discovering underlying sensations of human emotions based on social media (2021) 3.71
    3.7144227 = sum of:
      3.7144227 = weight(author_txt:jatowt in 163) [ClassicSimilarity], result of:
        3.7144227 = fieldWeight in 163, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.375 = fieldNorm(doc=163)
    

Similar documents (content)

  1. Hjørland, B.: Evaluation of an information source illustrated by a case study : effect of screening for breast cancer (2011) 0.12
    0.12300832 = sum of:
      0.12300832 = product of:
        1.0250694 = sum of:
          0.031983394 = weight(abstract_txt:method in 4657) [ClassicSimilarity], result of:
            0.031983394 = score(doc=4657,freq=1.0), product of:
              0.07579649 = queryWeight, product of:
                1.5850586 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.010624283 = queryNorm
              0.42196405 = fieldWeight in 4657, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.09375 = fieldNorm(doc=4657)
          0.122128956 = weight(abstract_txt:wikipedia in 4657) [ClassicSimilarity], result of:
            0.122128956 = score(doc=4657,freq=2.0), product of:
              0.1469722 = queryWeight, product of:
                2.207183 = boost
                6.2675414 = idf(docFreq=227, maxDocs=44218)
                0.010624283 = queryNorm
              0.8309664 = fieldWeight in 4657, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2675414 = idf(docFreq=227, maxDocs=44218)
                0.09375 = fieldNorm(doc=4657)
          0.870957 = weight(abstract_txt:controversy in 4657) [ClassicSimilarity], result of:
            0.870957 = score(doc=4657,freq=2.0), product of:
              0.7853459 = queryWeight, product of:
                8.837143 = boost
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.010624283 = queryNorm
              1.1090107 = fieldWeight in 4657, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.09375 = fieldNorm(doc=4657)
        0.12 = coord(3/25)
    
  2. Plotnick, R.: Computers, systems theory, and the making of a wired hospital : a history of Technicon Medical Information System, 1964-1987 (2010) 0.09
    0.093007475 = sum of:
      0.093007475 = product of:
        0.58129674 = sum of:
          0.029952353 = weight(abstract_txt:could in 3473) [ClassicSimilarity], result of:
            0.029952353 = score(doc=3473,freq=2.0), product of:
              0.056806818 = queryWeight, product of:
                1.1204058 = boost
                4.772275 = idf(docFreq=1016, maxDocs=44218)
                0.010624283 = queryNorm
              0.52726686 = fieldWeight in 3473, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.772275 = idf(docFreq=1016, maxDocs=44218)
                0.078125 = fieldNorm(doc=3473)
          0.025783394 = weight(abstract_txt:complex in 3473) [ClassicSimilarity], result of:
            0.025783394 = score(doc=3473,freq=1.0), product of:
              0.064766414 = queryWeight, product of:
                1.1963274 = boost
                5.095657 = idf(docFreq=735, maxDocs=44218)
                0.010624283 = queryNorm
              0.3980982 = fieldWeight in 3473, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.095657 = idf(docFreq=735, maxDocs=44218)
                0.078125 = fieldNorm(doc=3473)
          0.012344544 = weight(abstract_txt:results in 3473) [ClassicSimilarity], result of:
            0.012344544 = score(doc=3473,freq=1.0), product of:
              0.045373637 = queryWeight, product of:
                1.226373 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.010624283 = queryNorm
              0.27206424 = fieldWeight in 3473, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.078125 = fieldNorm(doc=3473)
          0.51321644 = weight(abstract_txt:controversy in 3473) [ClassicSimilarity], result of:
            0.51321644 = score(doc=3473,freq=1.0), product of:
              0.7853459 = queryWeight, product of:
                8.837143 = boost
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.010624283 = queryNorm
              0.6534909 = fieldWeight in 3473, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.078125 = fieldNorm(doc=3473)
        0.16 = coord(4/25)
    
  3. Peters, B.: ¬The search engine democracy : metaphors and Muhammad (2007) 0.09
    0.08907469 = sum of:
      0.08907469 = product of:
        0.7422891 = sum of:
          0.021743314 = weight(abstract_txt:algorithms in 384) [ClassicSimilarity], result of:
            0.021743314 = score(doc=384,freq=1.0), product of:
              0.08126549 = queryWeight, product of:
                1.3400722 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.010624283 = queryNorm
              0.26755902 = fieldWeight in 384, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.046875 = fieldNorm(doc=384)
          0.031993784 = weight(abstract_txt:search in 384) [ClassicSimilarity], result of:
            0.031993784 = score(doc=384,freq=5.0), product of:
              0.0834429 = queryWeight, product of:
                2.1470385 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.010624283 = queryNorm
              0.3834213 = fieldWeight in 384, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.046875 = fieldNorm(doc=384)
          0.688552 = weight(abstract_txt:controversy in 384) [ClassicSimilarity], result of:
            0.688552 = score(doc=384,freq=5.0), product of:
              0.7853459 = queryWeight, product of:
                8.837143 = boost
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.010624283 = queryNorm
              0.87675 = fieldWeight in 384, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.046875 = fieldNorm(doc=384)
        0.12 = coord(3/25)
    
  4. Fonseca, F.: Whether or when : the question on the use of theories in data science (2021) 0.07
    0.07235089 = sum of:
      0.07235089 = product of:
        0.6029241 = sum of:
          0.009875635 = weight(abstract_txt:results in 409) [ClassicSimilarity], result of:
            0.009875635 = score(doc=409,freq=1.0), product of:
              0.045373637 = queryWeight, product of:
                1.226373 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.010624283 = queryNorm
              0.21765138 = fieldWeight in 409, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.0625 = fieldNorm(doc=409)
          0.18247534 = weight(abstract_txt:controversial in 409) [ClassicSimilarity], result of:
            0.18247534 = score(doc=409,freq=1.0), product of:
              0.3759926 = queryWeight, product of:
                4.557585 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.010624283 = queryNorm
              0.48531634 = fieldWeight in 409, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.0625 = fieldNorm(doc=409)
          0.4105731 = weight(abstract_txt:controversy in 409) [ClassicSimilarity], result of:
            0.4105731 = score(doc=409,freq=1.0), product of:
              0.7853459 = queryWeight, product of:
                8.837143 = boost
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.010624283 = queryNorm
              0.5227927 = fieldWeight in 409, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.0625 = fieldNorm(doc=409)
        0.12 = coord(3/25)
    
  5. Buccio, E. Di; Melucci, M.; Moro, F.: Detecting verbose queries and improving information retrieval (2014) 0.07
    0.069035664 = sum of:
      0.069035664 = product of:
        0.24655595 = sum of:
          0.014716332 = weight(abstract_txt:concepts in 2695) [ClassicSimilarity], result of:
            0.014716332 = score(doc=2695,freq=1.0), product of:
              0.05171258 = queryWeight, product of:
                1.068989 = boost
                4.5532694 = idf(docFreq=1265, maxDocs=44218)
                0.010624283 = queryNorm
              0.28457934 = fieldWeight in 2695, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5532694 = idf(docFreq=1265, maxDocs=44218)
                0.0625 = fieldNorm(doc=2695)
          0.02051198 = weight(abstract_txt:topics in 2695) [ClassicSimilarity], result of:
            0.02051198 = score(doc=2695,freq=1.0), product of:
              0.06452602 = queryWeight, product of:
                1.1941051 = boost
                5.086191 = idf(docFreq=742, maxDocs=44218)
                0.010624283 = queryNorm
              0.31788695 = fieldWeight in 2695, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.086191 = idf(docFreq=742, maxDocs=44218)
                0.0625 = fieldNorm(doc=2695)
          0.013966257 = weight(abstract_txt:results in 2695) [ClassicSimilarity], result of:
            0.013966257 = score(doc=2695,freq=2.0), product of:
              0.045373637 = queryWeight, product of:
                1.226373 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.010624283 = queryNorm
              0.30780554 = fieldWeight in 2695, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.0625 = fieldNorm(doc=2695)
          0.028991086 = weight(abstract_txt:algorithms in 2695) [ClassicSimilarity], result of:
            0.028991086 = score(doc=2695,freq=1.0), product of:
              0.08126549 = queryWeight, product of:
                1.3400722 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.010624283 = queryNorm
              0.35674536 = fieldWeight in 2695, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=2695)
          0.07627608 = weight(abstract_txt:queries in 2695) [ClassicSimilarity], result of:
            0.07627608 = score(doc=2695,freq=6.0), product of:
              0.0975668 = queryWeight, product of:
                1.7983398 = boost
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.010624283 = queryNorm
              0.78178316 = fieldWeight in 2695, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.0625 = fieldNorm(doc=2695)
          0.019077405 = weight(abstract_txt:search in 2695) [ClassicSimilarity], result of:
            0.019077405 = score(doc=2695,freq=1.0), product of:
              0.0834429 = queryWeight, product of:
                2.1470385 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.010624283 = queryNorm
              0.22862828 = fieldWeight in 2695, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0625 = fieldNorm(doc=2695)
          0.073016815 = weight(abstract_txt:detection in 2695) [ClassicSimilarity], result of:
            0.073016815 = score(doc=2695,freq=1.0), product of:
              0.17220357 = queryWeight, product of:
                2.3891413 = boost
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.010624283 = queryNorm
              0.4240145 = fieldWeight in 2695, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.0625 = fieldNorm(doc=2695)
        0.28 = coord(7/25)