Document (#38817)

Author
Rubin, V.L.
Lukoianova, T.
Title
Truth and deception at the rhetorical structure level
Source
Journal of the Association for Information Science and Technology. 66(2015) no.5, S.905-917
Year
2015
Abstract
This paper furthers the development of methods to distinguish truth from deception in textual data. We use rhetorical structure theory (RST) as the analytic framework to identify systematic differences between deceptive and truthful stories in terms of their coherence and structure. A sample of 36 elicited personal stories, self-ranked as truthful or deceptive, is manually analyzed by assigning RST discourse relations among each story's constituent parts. A vector space model (VSM) assesses each story's position in multidimensional RST space with respect to its distance from truthful and deceptive centers as measures of the story's level of deception and truthfulness. Ten human judges evaluate independently whether each story is deceptive and assign their confidence levels (360 evaluations total), producing measures of the expected human ability to recognize deception. As a robustness check, a test sample of 18 truthful stories (with 180 additional evaluations) is used to determine the reliability of our RST-VSM method in determining deception. The contribution is in demonstration of the discourse structure analysis as a significant method for automated deception detection and an effective complement to lexicosemantic analysis. The potential is in developing novel discourse-based tools to alert information users to potential deception in computer-mediated texts.
Content
Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23216/abstract.

Similar documents (author)

  1. Rubin, V.L.: Epistemic modality : from uncertainty to certainty in the context of information seeking as interactions with texts (2010) 6.09
    6.094361 = sum of:
      6.094361 = weight(author_txt:rubin in 4241) [ClassicSimilarity], result of:
        6.094361 = fieldWeight in 4241, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.625 = fieldNorm(doc=4241)
    
  2. Rubin, R.: Foundations of library and information science (2010) 6.09
    6.094361 = sum of:
      6.094361 = weight(author_txt:rubin in 4781) [ClassicSimilarity], result of:
        6.094361 = fieldWeight in 4781, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.625 = fieldNorm(doc=4781)
    
  3. Rubin, V.L.: Disinformation and misinformation triangle (2019) 6.09
    6.094361 = sum of:
      6.094361 = weight(author_txt:rubin in 5462) [ClassicSimilarity], result of:
        6.094361 = fieldWeight in 5462, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.625 = fieldNorm(doc=5462)
    
  4. Kwasnik, B.H.; Rubin, V.L.: Stretching conceptual structures in classifications across languages and cultures (2003) 4.88
    4.8754888 = sum of:
      4.8754888 = weight(author_txt:rubin in 5517) [ClassicSimilarity], result of:
        4.8754888 = fieldWeight in 5517, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.5 = fieldNorm(doc=5517)
    
  5. Rubin, R.; Froehlich, T.J.: Ethical aspects of library and information science (2009) 4.88
    4.8754888 = sum of:
      4.8754888 = weight(author_txt:rubin in 3778) [ClassicSimilarity], result of:
        4.8754888 = fieldWeight in 3778, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.5 = fieldNorm(doc=3778)
    

Similar documents (content)

  1. Frohnsdorff, G.: Facts? of publication : cataloging problems posed by deceptive information (1999) 0.14
    0.14317104 = sum of:
      0.14317104 = product of:
        0.894819 = sum of:
          0.01629808 = weight(abstract_txt:potential in 108) [ClassicSimilarity], result of:
            0.01629808 = score(doc=108,freq=1.0), product of:
              0.045028314 = queryWeight, product of:
                1.2266706 = boost
                4.632983 = idf(docFreq=1168, maxDocs=44218)
                0.007923134 = queryNorm
              0.36195183 = fieldWeight in 108, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.632983 = idf(docFreq=1168, maxDocs=44218)
                0.078125 = fieldNorm(doc=108)
          0.017176682 = weight(abstract_txt:each in 108) [ClassicSimilarity], result of:
            0.017176682 = score(doc=108,freq=1.0), product of:
              0.053380746 = queryWeight, product of:
                1.6357732 = boost
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.007923134 = queryNorm
              0.32177672 = fieldWeight in 108, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.078125 = fieldNorm(doc=108)
          0.3038986 = weight(abstract_txt:deceptive in 108) [ClassicSimilarity], result of:
            0.3038986 = score(doc=108,freq=1.0), product of:
              0.39892435 = queryWeight, product of:
                5.1635146 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.007923134 = queryNorm
              0.7617951 = fieldWeight in 108, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.078125 = fieldNorm(doc=108)
          0.55744565 = weight(abstract_txt:deception in 108) [ClassicSimilarity], result of:
            0.55744565 = score(doc=108,freq=1.0), product of:
              0.7203647 = queryWeight, product of:
                9.179 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.007923134 = queryNorm
              0.7738381 = fieldWeight in 108, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.078125 = fieldNorm(doc=108)
        0.16 = coord(4/25)
    
  2. Rubin, V.L.: Disinformation and misinformation triangle (2019) 0.13
    0.12903659 = sum of:
      0.12903659 = product of:
        0.64518297 = sum of:
          0.011931355 = weight(abstract_txt:level in 5462) [ClassicSimilarity], result of:
            0.011931355 = score(doc=5462,freq=1.0), product of:
              0.042441875 = queryWeight, product of:
                1.1909195 = boost
                4.497956 = idf(docFreq=1337, maxDocs=44218)
                0.007923134 = queryNorm
              0.28112224 = fieldWeight in 5462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.497956 = idf(docFreq=1337, maxDocs=44218)
                0.0625 = fieldNorm(doc=5462)
          0.01354315 = weight(abstract_txt:human in 5462) [ClassicSimilarity], result of:
            0.01354315 = score(doc=5462,freq=1.0), product of:
              0.0461829 = queryWeight, product of:
                1.2422979 = boost
                4.692005 = idf(docFreq=1101, maxDocs=44218)
                0.007923134 = queryNorm
              0.29325032 = fieldWeight in 5462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.692005 = idf(docFreq=1101, maxDocs=44218)
                0.0625 = fieldNorm(doc=5462)
          0.021054152 = weight(abstract_txt:measures in 5462) [ClassicSimilarity], result of:
            0.021054152 = score(doc=5462,freq=1.0), product of:
              0.061976433 = queryWeight, product of:
                1.4391247 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.007923134 = queryNorm
              0.33971223 = fieldWeight in 5462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.0625 = fieldNorm(doc=5462)
          0.34382203 = weight(abstract_txt:deceptive in 5462) [ClassicSimilarity], result of:
            0.34382203 = score(doc=5462,freq=2.0), product of:
              0.39892435 = queryWeight, product of:
                5.1635146 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.007923134 = queryNorm
              0.8618728 = fieldWeight in 5462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.0625 = fieldNorm(doc=5462)
          0.2548323 = weight(abstract_txt:truthful in 5462) [ClassicSimilarity], result of:
            0.2548323 = score(doc=5462,freq=1.0), product of:
              0.41163698 = queryWeight, product of:
                5.245143 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.007923134 = queryNorm
              0.6190705 = fieldWeight in 5462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.0625 = fieldNorm(doc=5462)
        0.2 = coord(5/25)
    
  3. Ho, S.M.; Hancock, J.T.; Booth, C.: Ethical dilemma : deception dynamics in computer-mediated group communication (2017) 0.04
    0.043413274 = sum of:
      0.043413274 = product of:
        0.54266596 = sum of:
          0.01629808 = weight(abstract_txt:potential in 3821) [ClassicSimilarity], result of:
            0.01629808 = score(doc=3821,freq=1.0), product of:
              0.045028314 = queryWeight, product of:
                1.2266706 = boost
                4.632983 = idf(docFreq=1168, maxDocs=44218)
                0.007923134 = queryNorm
              0.36195183 = fieldWeight in 3821, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.632983 = idf(docFreq=1168, maxDocs=44218)
                0.078125 = fieldNorm(doc=3821)
          0.5263679 = weight(abstract_txt:deceptive in 3821) [ClassicSimilarity], result of:
            0.5263679 = score(doc=3821,freq=3.0), product of:
              0.39892435 = queryWeight, product of:
                5.1635146 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.007923134 = queryNorm
              1.3194679 = fieldWeight in 3821, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.078125 = fieldNorm(doc=3821)
        0.08 = coord(2/25)
    
  4. Wang, X.; Song, N.; Zhou, H.; Cheng, H.: ¬The representation of argumentation in scientific papers : a comparative analysis of two research areas (2022) 0.04
    0.03919736 = sum of:
      0.03919736 = product of:
        0.19598679 = sum of:
          0.011955198 = weight(abstract_txt:method in 567) [ClassicSimilarity], result of:
            0.011955198 = score(doc=567,freq=1.0), product of:
              0.0424984 = queryWeight, product of:
                1.1917123 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.007923134 = queryNorm
              0.28130937 = fieldWeight in 567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=567)
          0.013741345 = weight(abstract_txt:each in 567) [ClassicSimilarity], result of:
            0.013741345 = score(doc=567,freq=1.0), product of:
              0.053380746 = queryWeight, product of:
                1.6357732 = boost
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.007923134 = queryNorm
              0.25742137 = fieldWeight in 567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.0625 = fieldNorm(doc=567)
          0.10006424 = weight(abstract_txt:rhetorical in 567) [ClassicSimilarity], result of:
            0.10006424 = score(doc=567,freq=2.0), product of:
              0.13905203 = queryWeight, product of:
                2.155628 = boost
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.007923134 = queryNorm
              0.71961725 = fieldWeight in 567, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.0625 = fieldNorm(doc=567)
          0.021703795 = weight(abstract_txt:structure in 567) [ClassicSimilarity], result of:
            0.021703795 = score(doc=567,freq=1.0), product of:
              0.07968352 = queryWeight, product of:
                2.3077269 = boost
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.007923134 = queryNorm
              0.27237496 = fieldWeight in 567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.0625 = fieldNorm(doc=567)
          0.04852222 = weight(abstract_txt:discourse in 567) [ClassicSimilarity], result of:
            0.04852222 = score(doc=567,freq=1.0), product of:
              0.123782404 = queryWeight, product of:
                2.4909225 = boost
                6.2719374 = idf(docFreq=226, maxDocs=44218)
                0.007923134 = queryNorm
              0.3919961 = fieldWeight in 567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2719374 = idf(docFreq=226, maxDocs=44218)
                0.0625 = fieldNorm(doc=567)
        0.2 = coord(5/25)
    
  5. Fox, M.J.: Medical discourse's epistemic influence on gender classification in three editions of the Dewey Decimal Classification (2014) 0.04
    0.037819855 = sum of:
      0.037819855 = product of:
        0.23637411 = sum of:
          0.025698625 = weight(abstract_txt:space in 1427) [ClassicSimilarity], result of:
            0.025698625 = score(doc=1427,freq=1.0), product of:
              0.061000675 = queryWeight, product of:
                1.427751 = boost
                5.3924384 = idf(docFreq=546, maxDocs=44218)
                0.007923134 = queryNorm
              0.42128426 = fieldWeight in 1427, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3924384 = idf(docFreq=546, maxDocs=44218)
                0.078125 = fieldNorm(doc=1427)
          0.017176682 = weight(abstract_txt:each in 1427) [ClassicSimilarity], result of:
            0.017176682 = score(doc=1427,freq=1.0), product of:
              0.053380746 = queryWeight, product of:
                1.6357732 = boost
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.007923134 = queryNorm
              0.32177672 = fieldWeight in 1427, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.078125 = fieldNorm(doc=1427)
          0.08844513 = weight(abstract_txt:rhetorical in 1427) [ClassicSimilarity], result of:
            0.08844513 = score(doc=1427,freq=1.0), product of:
              0.13905203 = queryWeight, product of:
                2.155628 = boost
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.007923134 = queryNorm
              0.6360578 = fieldWeight in 1427, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.078125 = fieldNorm(doc=1427)
          0.10505367 = weight(abstract_txt:discourse in 1427) [ClassicSimilarity], result of:
            0.10505367 = score(doc=1427,freq=3.0), product of:
              0.123782404 = queryWeight, product of:
                2.4909225 = boost
                6.2719374 = idf(docFreq=226, maxDocs=44218)
                0.007923134 = queryNorm
              0.84869635 = fieldWeight in 1427, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.2719374 = idf(docFreq=226, maxDocs=44218)
                0.078125 = fieldNorm(doc=1427)
        0.16 = coord(4/25)