Document (#38818)

Author
Rubin, V.L.
Lukoianova, T.
Title
Truth and deception at the rhetorical structure level
Source
Journal of the Association for Information Science and Technology. 66(2015) no.5, S.905-917
Year
2015
Abstract
This paper furthers the development of methods to distinguish truth from deception in textual data. We use rhetorical structure theory (RST) as the analytic framework to identify systematic differences between deceptive and truthful stories in terms of their coherence and structure. A sample of 36 elicited personal stories, self-ranked as truthful or deceptive, is manually analyzed by assigning RST discourse relations among each story's constituent parts. A vector space model (VSM) assesses each story's position in multidimensional RST space with respect to its distance from truthful and deceptive centers as measures of the story's level of deception and truthfulness. Ten human judges evaluate independently whether each story is deceptive and assign their confidence levels (360 evaluations total), producing measures of the expected human ability to recognize deception. As a robustness check, a test sample of 18 truthful stories (with 180 additional evaluations) is used to determine the reliability of our RST-VSM method in determining deception. The contribution is in demonstration of the discourse structure analysis as a significant method for automated deception detection and an effective complement to lexicosemantic analysis. The potential is in developing novel discourse-based tools to alert information users to potential deception in computer-mediated texts.
Content
Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23216/abstract.

Similar documents (author)

  1. Rubin, V.L.: Epistemic modality : from uncertainty to certainty in the context of information seeking as interactions with texts (2010) 6.07
    6.0731125 = sum of:
      6.0731125 = weight(author_txt:rubin in 1242) [ClassicSimilarity], result of:
        6.0731125 = fieldWeight in 1242, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.71698 = idf(docFreq=6, maxDocs=42740)
          0.625 = fieldNorm(doc=1242)
    
  2. Rubin, R.: Foundations of library and information science (2010) 6.07
    6.0731125 = sum of:
      6.0731125 = weight(author_txt:rubin in 1782) [ClassicSimilarity], result of:
        6.0731125 = fieldWeight in 1782, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.71698 = idf(docFreq=6, maxDocs=42740)
          0.625 = fieldNorm(doc=1782)
    
  3. Rubin, V.L.: Disinformation and misinformation triangle (2019) 6.07
    6.0731125 = sum of:
      6.0731125 = weight(author_txt:rubin in 1463) [ClassicSimilarity], result of:
        6.0731125 = fieldWeight in 1463, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.71698 = idf(docFreq=6, maxDocs=42740)
          0.625 = fieldNorm(doc=1463)
    
  4. Kwasnik, B.H.; Rubin, V.L.: Stretching conceptual structures in classifications across languages and cultures (2003) 4.86
    4.85849 = sum of:
      4.85849 = weight(author_txt:rubin in 518) [ClassicSimilarity], result of:
        4.85849 = fieldWeight in 518, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.71698 = idf(docFreq=6, maxDocs=42740)
          0.5 = fieldNorm(doc=518)
    
  5. Rubin, R.; Froehlich, T.J.: Ethical aspects of library and information science (2009) 4.86
    4.85849 = sum of:
      4.85849 = weight(author_txt:rubin in 779) [ClassicSimilarity], result of:
        4.85849 = fieldWeight in 779, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.71698 = idf(docFreq=6, maxDocs=42740)
          0.5 = fieldNorm(doc=779)
    

Similar documents (content)

  1. Rubin, V.L.: Disinformation and misinformation triangle (2019) 0.11
    0.109279424 = sum of:
      0.109279424 = product of:
        0.6829964 = sum of:
          0.021519564 = weight(abstract_txt:level in 1463) [ClassicSimilarity], result of:
            0.021519564 = score(doc=1463,freq=1.0), product of:
              0.07597389 = queryWeight, product of:
                1.2177236 = boost
                4.5319915 = idf(docFreq=1249, maxDocs=42740)
                0.013766595 = queryNorm
              0.28324947 = fieldWeight in 1463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5319915 = idf(docFreq=1249, maxDocs=42740)
                0.0625 = fieldNorm(doc=1463)
          0.024517637 = weight(abstract_txt:human in 1463) [ClassicSimilarity], result of:
            0.024517637 = score(doc=1463,freq=1.0), product of:
              0.08287581 = queryWeight, product of:
                1.271834 = boost
                4.7333736 = idf(docFreq=1021, maxDocs=42740)
                0.013766595 = queryNorm
              0.29583585 = fieldWeight in 1463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7333736 = idf(docFreq=1021, maxDocs=42740)
                0.0625 = fieldNorm(doc=1463)
          0.03702302 = weight(abstract_txt:measures in 1463) [ClassicSimilarity], result of:
            0.03702302 = score(doc=1463,freq=1.0), product of:
              0.10908288 = queryWeight, product of:
                1.4591329 = boost
                5.4304423 = idf(docFreq=508, maxDocs=42740)
                0.013766595 = queryNorm
              0.33940265 = fieldWeight in 1463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4304423 = idf(docFreq=508, maxDocs=42740)
                0.0625 = fieldNorm(doc=1463)
          0.5999362 = weight(abstract_txt:deceptive in 1463) [ClassicSimilarity], result of:
            0.5999362 = score(doc=1463,freq=2.0), product of:
              0.6985199 = queryWeight, product of:
                5.2218084 = boost
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.013766595 = queryNorm
              0.85886776 = fieldWeight in 1463, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.0625 = fieldNorm(doc=1463)
        0.16 = coord(4/25)
    
  2. Ho, S.M.; Hancock, J.T.; Booth, C.: Ethical dilemma : deception dynamics in computer-mediated group communication (2017) 0.08
    0.07582852 = sum of:
      0.07582852 = product of:
        0.94785655 = sum of:
          0.02939545 = weight(abstract_txt:potential in 5822) [ClassicSimilarity], result of:
            0.02939545 = score(doc=5822,freq=1.0), product of:
              0.080603786 = queryWeight, product of:
                1.2542794 = boost
                4.6680408 = idf(docFreq=1090, maxDocs=42740)
                0.013766595 = queryNorm
              0.3646907 = fieldWeight in 5822, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6680408 = idf(docFreq=1090, maxDocs=42740)
                0.078125 = fieldNorm(doc=5822)
          0.9184611 = weight(abstract_txt:deceptive in 5822) [ClassicSimilarity], result of:
            0.9184611 = score(doc=5822,freq=3.0), product of:
              0.6985199 = queryWeight, product of:
                5.2218084 = boost
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.013766595 = queryNorm
              1.3148675 = fieldWeight in 5822, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.078125 = fieldNorm(doc=5822)
        0.08 = coord(2/25)
    
  3. Frohnsdorff, G.: Facts? of publication : cataloging problems posed by deceptive information (1999) 0.07
    0.07083252 = sum of:
      0.07083252 = product of:
        0.590271 = sum of:
          0.02939545 = weight(abstract_txt:potential in 1234) [ClassicSimilarity], result of:
            0.02939545 = score(doc=1234,freq=1.0), product of:
              0.080603786 = queryWeight, product of:
                1.2542794 = boost
                4.6680408 = idf(docFreq=1090, maxDocs=42740)
                0.013766595 = queryNorm
              0.3646907 = fieldWeight in 1234, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6680408 = idf(docFreq=1090, maxDocs=42740)
                0.078125 = fieldNorm(doc=1234)
          0.030601798 = weight(abstract_txt:each in 1234) [ClassicSimilarity], result of:
            0.030601798 = score(doc=1234,freq=1.0), product of:
              0.09477572 = queryWeight, product of:
                1.6657535 = boost
                4.132947 = idf(docFreq=1862, maxDocs=42740)
                0.013766595 = queryNorm
              0.32288647 = fieldWeight in 1234, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.132947 = idf(docFreq=1862, maxDocs=42740)
                0.078125 = fieldNorm(doc=1234)
          0.53027374 = weight(abstract_txt:deceptive in 1234) [ClassicSimilarity], result of:
            0.53027374 = score(doc=1234,freq=1.0), product of:
              0.6985199 = queryWeight, product of:
                5.2218084 = boost
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.013766595 = queryNorm
              0.75913906 = fieldWeight in 1234, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.078125 = fieldNorm(doc=1234)
        0.12 = coord(3/25)
    
  4. Fox, M.J.: Medical discourse's epistemic influence on gender classification in three editions of the Dewey Decimal Classification (2014) 0.07
    0.06936362 = sum of:
      0.06936362 = product of:
        0.4335226 = sum of:
          0.04539596 = weight(abstract_txt:space in 3428) [ClassicSimilarity], result of:
            0.04539596 = score(doc=3428,freq=1.0), product of:
              0.10769119 = queryWeight, product of:
                1.4497951 = boost
                5.39569 = idf(docFreq=526, maxDocs=42740)
                0.013766595 = queryNorm
              0.4215383 = fieldWeight in 3428, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.39569 = idf(docFreq=526, maxDocs=42740)
                0.078125 = fieldNorm(doc=3428)
          0.030601798 = weight(abstract_txt:each in 3428) [ClassicSimilarity], result of:
            0.030601798 = score(doc=3428,freq=1.0), product of:
              0.09477572 = queryWeight, product of:
                1.6657535 = boost
                4.132947 = idf(docFreq=1862, maxDocs=42740)
                0.013766595 = queryNorm
              0.32288647 = fieldWeight in 3428, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.132947 = idf(docFreq=1862, maxDocs=42740)
                0.078125 = fieldNorm(doc=3428)
          0.16707768 = weight(abstract_txt:rhetorical in 3428) [ClassicSimilarity], result of:
            0.16707768 = score(doc=3428,freq=1.0), product of:
              0.25671288 = queryWeight, product of:
                2.2384138 = boost
                8.330686 = idf(docFreq=27, maxDocs=42740)
                0.013766595 = queryNorm
              0.6508348 = fieldWeight in 3428, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.330686 = idf(docFreq=27, maxDocs=42740)
                0.078125 = fieldNorm(doc=3428)
          0.19044718 = weight(abstract_txt:discourse in 3428) [ClassicSimilarity], result of:
            0.19044718 = score(doc=3428,freq=3.0), product of:
              0.22233528 = queryWeight, product of:
                2.5513284 = boost
                6.3301716 = idf(docFreq=206, maxDocs=42740)
                0.013766595 = queryNorm
              0.8565765 = fieldWeight in 3428, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.3301716 = idf(docFreq=206, maxDocs=42740)
                0.078125 = fieldNorm(doc=3428)
        0.16 = coord(4/25)
    
  5. Can, F.; Kocberber, S.; Baglioglu, O.; Kardas, S.; Ocalan, H.C.; Uyar, E.: New event detection and topic tracking in Turkish (2010) 0.07
    0.066106826 = sum of:
      0.066106826 = product of:
        0.4131677 = sum of:
          0.021372521 = weight(abstract_txt:method in 443) [ClassicSimilarity], result of:
            0.021372521 = score(doc=443,freq=1.0), product of:
              0.07562741 = queryWeight, product of:
                1.2149438 = boost
                4.5216455 = idf(docFreq=1262, maxDocs=42740)
                0.013766595 = queryNorm
              0.28260285 = fieldWeight in 443, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5216455 = idf(docFreq=1262, maxDocs=42740)
                0.0625 = fieldNorm(doc=443)
          0.06412575 = weight(abstract_txt:measures in 443) [ClassicSimilarity], result of:
            0.06412575 = score(doc=443,freq=3.0), product of:
              0.10908288 = queryWeight, product of:
                1.4591329 = boost
                5.4304423 = idf(docFreq=508, maxDocs=42740)
                0.013766595 = queryNorm
              0.5878626 = fieldWeight in 443, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4304423 = idf(docFreq=508, maxDocs=42740)
                0.0625 = fieldNorm(doc=443)
          0.039936766 = weight(abstract_txt:sample in 443) [ClassicSimilarity], result of:
            0.039936766 = score(doc=443,freq=1.0), product of:
              0.11473361 = queryWeight, product of:
                1.4964489 = boost
                5.5693207 = idf(docFreq=442, maxDocs=42740)
                0.013766595 = queryNorm
              0.34808254 = fieldWeight in 443, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5693207 = idf(docFreq=442, maxDocs=42740)
                0.0625 = fieldNorm(doc=443)
          0.28773266 = weight(abstract_txt:stories in 443) [ClassicSimilarity], result of:
            0.28773266 = score(doc=443,freq=4.0), product of:
              0.30863506 = queryWeight, product of:
                3.0059712 = boost
                7.458198 = idf(docFreq=66, maxDocs=42740)
                0.013766595 = queryNorm
              0.93227476 = fieldWeight in 443, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.458198 = idf(docFreq=66, maxDocs=42740)
                0.0625 = fieldNorm(doc=443)
        0.16 = coord(4/25)