Document (#35710)

Author
Bounhas, I.
Elayeb, B.
Evrard, F.
Slimani, Y.
Title
Toward a computer study of the reliability of Arabic stories
Source
Journal of the American Society for Information Science and Technology. 61(2010) no.8, S.1686-1705
Year
2010
Abstract
The Arabic storytelling methodology provides solutions to the problem of information reliability. The reliability of a story depends on the credibility of its narrators. To insure reliability verification, the narrators' names are explicitly cited at the head of the story, which constitute its chain of narrators. Stories were reported from a generation to another to insure the reliable transmission of historical knowledge. We present a set of tools based on the Arabic storytelling methodology. We start by presenting this methodology as a set of principles for information-reliability assessment. Then, we detail an architecture designed to support the study of the reliability of Arabic stories. Indeed, we developed grammars for parsing Arabic full names and chains of narrators of Arabic stories. After that, an intelligent identity recognizer links names found in chains of narrators to the biographies of the corresponding persons. We model this step as a possibilistic information retrieval task. Finally, chains are analyzed through metadata available in biographies to help the user identify sources of unreliability. We propose to identify the class of reliability of a story with a possibilistic classifier. The achieved results in named entity and identity recognition were satisfactory and confirm to the targets set for the precision, recall, and F-measure metrics. The developed tools also are reusable components that can be used to study the reliability of other types of Arabic texts.

Similar documents (content)

  1. Kanan, T.; Fox, E.A.: Automated arabic text classification with P-Stemmer, machine learning, and a tailored news article taxonomy (2016) 0.17
    0.1653422 = sum of:
      0.1653422 = product of:
        0.82671094 = sum of:
          0.003514676 = weight(abstract_txt:information in 3151) [ClassicSimilarity], result of:
            0.003514676 = score(doc=3151,freq=1.0), product of:
              0.023228442 = queryWeight, product of:
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.009594778 = queryNorm
              0.15130915 = fieldWeight in 3151, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=3151)
          0.01725041 = weight(abstract_txt:developed in 3151) [ClassicSimilarity], result of:
            0.01725041 = score(doc=3151,freq=2.0), product of:
              0.04651458 = queryWeight, product of:
                1.1554173 = boost
                4.195805 = idf(docFreq=1809, maxDocs=44218)
                0.009594778 = queryNorm
              0.37086028 = fieldWeight in 3151, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.195805 = idf(docFreq=1809, maxDocs=44218)
                0.0625 = fieldNorm(doc=3151)
          0.00994162 = weight(abstract_txt:study in 3151) [ClassicSimilarity], result of:
            0.00994162 = score(doc=3151,freq=1.0), product of:
              0.0464588 = queryWeight, product of:
                1.4142427 = boost
                3.423806 = idf(docFreq=3916, maxDocs=44218)
                0.009594778 = queryNorm
              0.21398787 = fieldWeight in 3151, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.423806 = idf(docFreq=3916, maxDocs=44218)
                0.0625 = fieldNorm(doc=3151)
          0.13271852 = weight(abstract_txt:stories in 3151) [ClassicSimilarity], result of:
            0.13271852 = score(doc=3151,freq=1.0), product of:
              0.28776005 = queryWeight, product of:
                4.064195 = boost
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.009594778 = queryNorm
              0.46121246 = fieldWeight in 3151, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.0625 = fieldNorm(doc=3151)
          0.66328573 = weight(abstract_txt:arabic in 3151) [ClassicSimilarity], result of:
            0.66328573 = score(doc=3151,freq=7.0), product of:
              0.5298951 = queryWeight, product of:
                7.2958064 = boost
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.009594778 = queryNorm
              1.2517302 = fieldWeight in 3151, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.0625 = fieldNorm(doc=3151)
        0.2 = coord(5/25)
    
  2. Rafferty, P.; Albinfalah, F.: ¬A tale of two images : the quest to create a story-based image indexing system (2014) 0.16
    0.16435055 = sum of:
      0.16435055 = product of:
        0.684794 = sum of:
          0.014139039 = weight(abstract_txt:were in 1777) [ClassicSimilarity], result of:
            0.014139039 = score(doc=1777,freq=3.0), product of:
              0.03558817 = queryWeight, product of:
                1.0106416 = boost
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.009594778 = queryNorm
              0.39729604 = fieldWeight in 1777, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.0625 = fieldNorm(doc=1777)
          0.01988324 = weight(abstract_txt:study in 1777) [ClassicSimilarity], result of:
            0.01988324 = score(doc=1777,freq=4.0), product of:
              0.0464588 = queryWeight, product of:
                1.4142427 = boost
                3.423806 = idf(docFreq=3916, maxDocs=44218)
                0.009594778 = queryNorm
              0.42797574 = fieldWeight in 1777, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.423806 = idf(docFreq=3916, maxDocs=44218)
                0.0625 = fieldNorm(doc=1777)
          0.023741812 = weight(abstract_txt:methodology in 1777) [ClassicSimilarity], result of:
            0.023741812 = score(doc=1777,freq=1.0), product of:
              0.083005294 = queryWeight, product of:
                1.8903527 = boost
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.009594778 = queryNorm
              0.28602767 = fieldWeight in 1777, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.0625 = fieldNorm(doc=1777)
          0.24543752 = weight(abstract_txt:storytelling in 1777) [ClassicSimilarity], result of:
            0.24543752 = score(doc=1777,freq=4.0), product of:
              0.2167738 = queryWeight, product of:
                2.4942944 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.009594778 = queryNorm
              1.1322287 = fieldWeight in 1777, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.0625 = fieldNorm(doc=1777)
          0.19390005 = weight(abstract_txt:story in 1777) [ClassicSimilarity], result of:
            0.19390005 = score(doc=1777,freq=4.0), product of:
              0.21206151 = queryWeight, product of:
                3.021488 = boost
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.009594778 = queryNorm
              0.9143576 = fieldWeight in 1777, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.0625 = fieldNorm(doc=1777)
          0.18769233 = weight(abstract_txt:stories in 1777) [ClassicSimilarity], result of:
            0.18769233 = score(doc=1777,freq=2.0), product of:
              0.28776005 = queryWeight, product of:
                4.064195 = boost
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.009594778 = queryNorm
              0.6522529 = fieldWeight in 1777, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.0625 = fieldNorm(doc=1777)
        0.24 = coord(6/25)
    
  3. McDowell, K.: Storytelling wisdom : story, information, and DIKW (2021) 0.15
    0.14635597 = sum of:
      0.14635597 = product of:
        0.9147248 = sum of:
          0.011623696 = weight(abstract_txt:information in 350) [ClassicSimilarity], result of:
            0.011623696 = score(doc=350,freq=7.0), product of:
              0.023228442 = queryWeight, product of:
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.009594778 = queryNorm
              0.50040793 = fieldWeight in 350, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.078125 = fieldNorm(doc=350)
          0.40585414 = weight(abstract_txt:storytelling in 350) [ClassicSimilarity], result of:
            0.40585414 = score(doc=350,freq=7.0), product of:
              0.2167738 = queryWeight, product of:
                2.4942944 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.009594778 = queryNorm
              1.8722472 = fieldWeight in 350, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.078125 = fieldNorm(doc=350)
          0.20990296 = weight(abstract_txt:story in 350) [ClassicSimilarity], result of:
            0.20990296 = score(doc=350,freq=3.0), product of:
              0.21206151 = queryWeight, product of:
                3.021488 = boost
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.009594778 = queryNorm
              0.9898211 = fieldWeight in 350, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.078125 = fieldNorm(doc=350)
          0.287344 = weight(abstract_txt:stories in 350) [ClassicSimilarity], result of:
            0.287344 = score(doc=350,freq=3.0), product of:
              0.28776005 = queryWeight, product of:
                4.064195 = boost
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.009594778 = queryNorm
              0.9985542 = fieldWeight in 350, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.078125 = fieldNorm(doc=350)
        0.16 = coord(4/25)
    
  4. Rubin, V.L.; Lukoianova, T.: Truth and deception at the rhetorical structure level (2015) 0.14
    0.1385659 = sum of:
      0.1385659 = product of:
        0.57735795 = sum of:
          0.003514676 = weight(abstract_txt:information in 1816) [ClassicSimilarity], result of:
            0.003514676 = score(doc=1816,freq=1.0), product of:
              0.023228442 = queryWeight, product of:
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.009594778 = queryNorm
              0.15130915 = fieldWeight in 1816, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=1816)
          0.01456395 = weight(abstract_txt:tools in 1816) [ClassicSimilarity], result of:
            0.01456395 = score(doc=1816,freq=1.0), product of:
              0.05235027 = queryWeight, product of:
                1.2257553 = boost
                4.451232 = idf(docFreq=1401, maxDocs=44218)
                0.009594778 = queryNorm
              0.278202 = fieldWeight in 1816, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.451232 = idf(docFreq=1401, maxDocs=44218)
                0.0625 = fieldNorm(doc=1816)
          0.019893026 = weight(abstract_txt:identify in 1816) [ClassicSimilarity], result of:
            0.019893026 = score(doc=1816,freq=1.0), product of:
              0.064446606 = queryWeight, product of:
                1.3600171 = boost
                4.9387927 = idf(docFreq=860, maxDocs=44218)
                0.009594778 = queryNorm
              0.30867454 = fieldWeight in 1816, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9387927 = idf(docFreq=860, maxDocs=44218)
                0.0625 = fieldNorm(doc=1816)
          0.096950024 = weight(abstract_txt:story in 1816) [ClassicSimilarity], result of:
            0.096950024 = score(doc=1816,freq=1.0), product of:
              0.21206151 = queryWeight, product of:
                3.021488 = boost
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.009594778 = queryNorm
              0.4571788 = fieldWeight in 1816, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.0625 = fieldNorm(doc=1816)
          0.2298752 = weight(abstract_txt:stories in 1816) [ClassicSimilarity], result of:
            0.2298752 = score(doc=1816,freq=3.0), product of:
              0.28776005 = queryWeight, product of:
                4.064195 = boost
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.009594778 = queryNorm
              0.7988434 = fieldWeight in 1816, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.0625 = fieldNorm(doc=1816)
          0.21256106 = weight(abstract_txt:reliability in 1816) [ClassicSimilarity], result of:
            0.21256106 = score(doc=1816,freq=1.0), product of:
              0.49629733 = queryWeight, product of:
                7.5482326 = boost
                6.8527 = idf(docFreq=126, maxDocs=44218)
                0.009594778 = queryNorm
              0.42829376 = fieldWeight in 1816, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8527 = idf(docFreq=126, maxDocs=44218)
                0.0625 = fieldNorm(doc=1816)
        0.24 = coord(6/25)
    
  5. Bilal, D.; Bachir, I.: Children's interaction with cross-cultural and multilingual digital libraries : II. Information seeking, success, and affective experience (2007) 0.13
    0.1341675 = sum of:
      0.1341675 = product of:
        0.6708375 = sum of:
          0.0062131276 = weight(abstract_txt:information in 895) [ClassicSimilarity], result of:
            0.0062131276 = score(doc=895,freq=2.0), product of:
              0.023228442 = queryWeight, product of:
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.009594778 = queryNorm
              0.2674793 = fieldWeight in 895, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.078125 = fieldNorm(doc=895)
          0.010203973 = weight(abstract_txt:were in 895) [ClassicSimilarity], result of:
            0.010203973 = score(doc=895,freq=1.0), product of:
              0.03558817 = queryWeight, product of:
                1.0106416 = boost
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.009594778 = queryNorm
              0.28672373 = fieldWeight in 895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.078125 = fieldNorm(doc=895)
          0.0152473515 = weight(abstract_txt:developed in 895) [ClassicSimilarity], result of:
            0.0152473515 = score(doc=895,freq=1.0), product of:
              0.04651458 = queryWeight, product of:
                1.1554173 = boost
                4.195805 = idf(docFreq=1809, maxDocs=44218)
                0.009594778 = queryNorm
              0.32779726 = fieldWeight in 895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.195805 = idf(docFreq=1809, maxDocs=44218)
                0.078125 = fieldNorm(doc=895)
          0.012427025 = weight(abstract_txt:study in 895) [ClassicSimilarity], result of:
            0.012427025 = score(doc=895,freq=1.0), product of:
              0.0464588 = queryWeight, product of:
                1.4142427 = boost
                3.423806 = idf(docFreq=3916, maxDocs=44218)
                0.009594778 = queryNorm
              0.26748484 = fieldWeight in 895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.423806 = idf(docFreq=3916, maxDocs=44218)
                0.078125 = fieldNorm(doc=895)
          0.62674606 = weight(abstract_txt:arabic in 895) [ClassicSimilarity], result of:
            0.62674606 = score(doc=895,freq=4.0), product of:
              0.5298951 = queryWeight, product of:
                7.2958064 = boost
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.009594778 = queryNorm
              1.1827738 = fieldWeight in 895, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.078125 = fieldNorm(doc=895)
        0.2 = coord(5/25)