Document (#35711)

Author
Bounhas, I.
Elayeb, B.
Evrard, F.
Slimani, Y.
Title
Toward a computer study of the reliability of Arabic stories
Source
Journal of the American Society for Information Science and Technology. 61(2010) no.8, p.1686-1705
Year
2010
Abstract
The Arabic storytelling methodology provides solutions to the problem of information reliability. The reliability of a story depends on the credibility of its narrators. To ensure that reliability can be verified, the narrators' names are explicitly cited at the head of the story, where they constitute its chain of narrators. Stories were reported from one generation to the next to ensure the reliable transmission of historical knowledge. We present a set of tools based on the Arabic storytelling methodology. We start by presenting this methodology as a set of principles for information-reliability assessment. Then, we detail an architecture designed to support the study of the reliability of Arabic stories. Specifically, we developed grammars for parsing Arabic full names and the chains of narrators of Arabic stories. After that, an intelligent identity recognizer links names found in chains of narrators to the biographies of the corresponding persons. We model this step as a possibilistic information retrieval task. Finally, chains are analyzed through metadata available in biographies to help the user identify sources of unreliability. We propose to identify the reliability class of a story with a possibilistic classifier. The results achieved in named entity and identity recognition were satisfactory and conform to the targets set for the precision, recall, and F-measure metrics. The developed tools are also reusable components that can be used to study the reliability of other types of Arabic texts.

Similar documents (content)

  1. Rafferty, P.; Albinfalah, F.: A tale of two images : the quest to create a story-based image indexing system (2014) 0.17
    0.17393823 = sum of:
      0.17393823 = product of:
        0.72474265 = sum of:
          0.014239283 = weight(abstract_txt:were in 3778) [ClassicSimilarity], result of:
            0.014239283 = score(doc=3778,freq=3.0), product of:
              0.035532936 = queryWeight, product of:
                1.0147593 = boost
                3.7018294 = idf(docFreq=2837, maxDocs=42306)
                0.00945914 = queryNorm
              0.40073478 = fieldWeight in 3778, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.7018294 = idf(docFreq=2837, maxDocs=42306)
                0.0625 = fieldNorm(doc=3778)
          0.02058914 = weight(abstract_txt:study in 3778) [ClassicSimilarity], result of:
            0.02058914 = score(doc=3778,freq=4.0), product of:
              0.047255095 = queryWeight, product of:
                1.4332352 = boost
                3.485616 = idf(docFreq=3522, maxDocs=42306)
                0.00945914 = queryNorm
              0.435702 = fieldWeight in 3778, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.485616 = idf(docFreq=3522, maxDocs=42306)
                0.0625 = fieldNorm(doc=3778)
          0.024507415 = weight(abstract_txt:methodology in 3778) [ClassicSimilarity], result of:
            0.024507415 = score(doc=3778,freq=1.0), product of:
              0.08425096 = queryWeight, product of:
                1.9137294 = boost
                4.6541743 = idf(docFreq=1094, maxDocs=42306)
                0.00945914 = queryNorm
              0.2908859 = fieldWeight in 3778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6541743 = idf(docFreq=1094, maxDocs=42306)
                0.0625 = fieldNorm(doc=3778)
          0.27400357 = weight(abstract_txt:storytelling in 3778) [ClassicSimilarity], result of:
            0.27400357 = score(doc=3778,freq=4.0), product of:
              0.23182675 = queryWeight, product of:
                2.591966 = boost
                9.45546 = idf(docFreq=8, maxDocs=42306)
                0.00945914 = queryNorm
              1.1819324 = fieldWeight in 3778, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.45546 = idf(docFreq=8, maxDocs=42306)
                0.0625 = fieldNorm(doc=3778)
          0.20087092 = weight(abstract_txt:story in 3778) [ClassicSimilarity], result of:
            0.20087092 = score(doc=3778,freq=4.0), product of:
              0.21575849 = queryWeight, product of:
                3.062507 = boost
                7.4479914 = idf(docFreq=66, maxDocs=42306)
                0.00945914 = queryNorm
              0.9309989 = fieldWeight in 3778, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.4479914 = idf(docFreq=66, maxDocs=42306)
                0.0625 = fieldNorm(doc=3778)
          0.19053236 = weight(abstract_txt:stories in 3778) [ClassicSimilarity], result of:
            0.19053236 = score(doc=3778,freq=2.0), product of:
              0.28884083 = queryWeight, product of:
                4.091587 = boost
                7.4630294 = idf(docFreq=65, maxDocs=42306)
                0.00945914 = queryNorm
              0.65964484 = fieldWeight in 3778, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.4630294 = idf(docFreq=65, maxDocs=42306)
                0.0625 = fieldNorm(doc=3778)
        0.24 = coord(6/25)
    
  2. Kanan, T.; Fox, E.A.: Automated arabic text classification with P-Stemmer, machine learning, and a tailored news article taxonomy (2016) 0.16
    0.16358556 = sum of:
      0.16358556 = product of:
        0.8179278 = sum of:
          0.003513413 = weight(abstract_txt:information in 70) [ClassicSimilarity], result of:
            0.003513413 = score(doc=70,freq=1.0), product of:
              0.023077885 = queryWeight, product of:
                1.0015926 = boost
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.00945914 = queryNorm
              0.15224156 = fieldWeight in 70, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.0625 = fieldNorm(doc=70)
          0.01706132 = weight(abstract_txt:developed in 70) [ClassicSimilarity], result of:
            0.01706132 = score(doc=70,freq=2.0), product of:
              0.04588575 = queryWeight, product of:
                1.1531516 = boost
                4.2066827 = idf(docFreq=1712, maxDocs=42306)
                0.00945914 = queryNorm
              0.37182173 = fieldWeight in 70, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2066827 = idf(docFreq=1712, maxDocs=42306)
                0.0625 = fieldNorm(doc=70)
          0.01029457 = weight(abstract_txt:study in 70) [ClassicSimilarity], result of:
            0.01029457 = score(doc=70,freq=1.0), product of:
              0.047255095 = queryWeight, product of:
                1.4332352 = boost
                3.485616 = idf(docFreq=3522, maxDocs=42306)
                0.00945914 = queryNorm
              0.217851 = fieldWeight in 70, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.485616 = idf(docFreq=3522, maxDocs=42306)
                0.0625 = fieldNorm(doc=70)
          0.13472672 = weight(abstract_txt:stories in 70) [ClassicSimilarity], result of:
            0.13472672 = score(doc=70,freq=1.0), product of:
              0.28884083 = queryWeight, product of:
                4.091587 = boost
                7.4630294 = idf(docFreq=65, maxDocs=42306)
                0.00945914 = queryNorm
              0.46643934 = fieldWeight in 70, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4630294 = idf(docFreq=65, maxDocs=42306)
                0.0625 = fieldNorm(doc=70)
          0.65233177 = weight(abstract_txt:arabic in 70) [ClassicSimilarity], result of:
            0.65233177 = score(doc=70,freq=7.0), product of:
              0.520773 = queryWeight, product of:
                7.2678466 = boost
                7.5751467 = idf(docFreq=58, maxDocs=42306)
                0.00945914 = queryNorm
              1.2526221 = fieldWeight in 70, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.5751467 = idf(docFreq=58, maxDocs=42306)
                0.0625 = fieldNorm(doc=70)
        0.2 = coord(5/25)
    
  3. Rubin, V.L.; Lukoianova, T.: Truth and deception at the rhetorical structure level (2015) 0.14
    0.14093335 = sum of:
      0.14093335 = product of:
        0.5872223 = sum of:
          0.003513413 = weight(abstract_txt:information in 3817) [ClassicSimilarity], result of:
            0.003513413 = score(doc=3817,freq=1.0), product of:
              0.023077885 = queryWeight, product of:
                1.0015926 = boost
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.00945914 = queryNorm
              0.15224156 = fieldWeight in 3817, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.0625 = fieldNorm(doc=3817)
          0.014499822 = weight(abstract_txt:tools in 3817) [ClassicSimilarity], result of:
            0.014499822 = score(doc=3817,freq=1.0), product of:
              0.051870592 = queryWeight, product of:
                1.2260498 = boost
                4.4726143 = idf(docFreq=1312, maxDocs=42306)
                0.00945914 = queryNorm
              0.2795384 = fieldWeight in 3817, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4726143 = idf(docFreq=1312, maxDocs=42306)
                0.0625 = fieldNorm(doc=3817)
          0.020115439 = weight(abstract_txt:identify in 3817) [ClassicSimilarity], result of:
            0.020115439 = score(doc=3817,freq=1.0), product of:
              0.0645207 = queryWeight, product of:
                1.3674046 = boost
                4.988275 = idf(docFreq=783, maxDocs=42306)
                0.00945914 = queryNorm
              0.3117672 = fieldWeight in 3817, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.988275 = idf(docFreq=783, maxDocs=42306)
                0.0625 = fieldNorm(doc=3817)
          0.10043546 = weight(abstract_txt:story in 3817) [ClassicSimilarity], result of:
            0.10043546 = score(doc=3817,freq=1.0), product of:
              0.21575849 = queryWeight, product of:
                3.062507 = boost
                7.4479914 = idf(docFreq=66, maxDocs=42306)
                0.00945914 = queryNorm
              0.46549946 = fieldWeight in 3817, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4479914 = idf(docFreq=66, maxDocs=42306)
                0.0625 = fieldNorm(doc=3817)
          0.23335353 = weight(abstract_txt:stories in 3817) [ClassicSimilarity], result of:
            0.23335353 = score(doc=3817,freq=3.0), product of:
              0.28884083 = queryWeight, product of:
                4.091587 = boost
                7.4630294 = idf(docFreq=65, maxDocs=42306)
                0.00945914 = queryNorm
              0.8078966 = fieldWeight in 3817, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.4630294 = idf(docFreq=65, maxDocs=42306)
                0.0625 = fieldNorm(doc=3817)
          0.21530463 = weight(abstract_txt:reliability in 3817) [ClassicSimilarity], result of:
            0.21530463 = score(doc=3817,freq=1.0), product of:
              0.49743345 = queryWeight, product of:
                7.5935526 = boost
                6.9252963 = idf(docFreq=112, maxDocs=42306)
                0.00945914 = queryNorm
              0.43283102 = fieldWeight in 3817, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9252963 = idf(docFreq=112, maxDocs=42306)
                0.0625 = fieldNorm(doc=3817)
        0.24 = coord(6/25)
    
  4. Bilal, D.; Bachir, I.: Children's interaction with cross-cultural and multilingual digital libraries : II. Information seeking, success, and affective experience (2007) 0.13
    0.13216625 = sum of:
      0.13216625 = product of:
        0.6608312 = sum of:
          0.0062108953 = weight(abstract_txt:information in 2896) [ClassicSimilarity], result of:
            0.0062108953 = score(doc=2896,freq=2.0), product of:
              0.023077885 = queryWeight, product of:
                1.0015926 = boost
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.00945914 = queryNorm
              0.26912758 = fieldWeight in 2896, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.078125 = fieldNorm(doc=2896)
          0.0102763185 = weight(abstract_txt:were in 2896) [ClassicSimilarity], result of:
            0.0102763185 = score(doc=2896,freq=1.0), product of:
              0.035532936 = queryWeight, product of:
                1.0147593 = boost
                3.7018294 = idf(docFreq=2837, maxDocs=42306)
                0.00945914 = queryNorm
              0.28920543 = fieldWeight in 2896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7018294 = idf(docFreq=2837, maxDocs=42306)
                0.078125 = fieldNorm(doc=2896)
          0.015080217 = weight(abstract_txt:developed in 2896) [ClassicSimilarity], result of:
            0.015080217 = score(doc=2896,freq=1.0), product of:
              0.04588575 = queryWeight, product of:
                1.1531516 = boost
                4.2066827 = idf(docFreq=1712, maxDocs=42306)
                0.00945914 = queryNorm
              0.32864708 = fieldWeight in 2896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2066827 = idf(docFreq=1712, maxDocs=42306)
                0.078125 = fieldNorm(doc=2896)
          0.012868212 = weight(abstract_txt:study in 2896) [ClassicSimilarity], result of:
            0.012868212 = score(doc=2896,freq=1.0), product of:
              0.047255095 = queryWeight, product of:
                1.4332352 = boost
                3.485616 = idf(docFreq=3522, maxDocs=42306)
                0.00945914 = queryNorm
              0.27231374 = fieldWeight in 2896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.485616 = idf(docFreq=3522, maxDocs=42306)
                0.078125 = fieldNorm(doc=2896)
          0.6163956 = weight(abstract_txt:arabic in 2896) [ClassicSimilarity], result of:
            0.6163956 = score(doc=2896,freq=4.0), product of:
              0.520773 = queryWeight, product of:
                7.2678466 = boost
                7.5751467 = idf(docFreq=58, maxDocs=42306)
                0.00945914 = queryNorm
              1.1836166 = fieldWeight in 2896, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.5751467 = idf(docFreq=58, maxDocs=42306)
                0.078125 = fieldNorm(doc=2896)
        0.2 = coord(5/25)
    
  5. Fattah, M. Abdel; Ren, F.: English-Arabic proper-noun transliteration-pairs creation (2008) 0.12
    0.12156578 = sum of:
      0.12156578 = product of:
        0.7597861 = sum of:
          0.003513413 = weight(abstract_txt:information in 4000) [ClassicSimilarity], result of:
            0.003513413 = score(doc=4000,freq=1.0), product of:
              0.023077885 = queryWeight, product of:
                1.0015926 = boost
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.00945914 = queryNorm
              0.15224156 = fieldWeight in 4000, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.0625 = fieldNorm(doc=4000)
          0.01029457 = weight(abstract_txt:study in 4000) [ClassicSimilarity], result of:
            0.01029457 = score(doc=4000,freq=1.0), product of:
              0.047255095 = queryWeight, product of:
                1.4332352 = boost
                3.485616 = idf(docFreq=3522, maxDocs=42306)
                0.00945914 = queryNorm
              0.217851 = fieldWeight in 4000, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.485616 = idf(docFreq=3522, maxDocs=42306)
                0.0625 = fieldNorm(doc=4000)
          0.048606154 = weight(abstract_txt:names in 4000) [ClassicSimilarity], result of:
            0.048606154 = score(doc=4000,freq=1.0), product of:
              0.13299562 = queryWeight, product of:
                2.4044282 = boost
                5.8475494 = idf(docFreq=331, maxDocs=42306)
                0.00945914 = queryNorm
              0.36547184 = fieldWeight in 4000, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8475494 = idf(docFreq=331, maxDocs=42306)
                0.0625 = fieldNorm(doc=4000)
          0.697372 = weight(abstract_txt:arabic in 4000) [ClassicSimilarity], result of:
            0.697372 = score(doc=4000,freq=8.0), product of:
              0.520773 = queryWeight, product of:
                7.2678466 = boost
                7.5751467 = idf(docFreq=58, maxDocs=42306)
                0.00945914 = queryNorm
              1.3391094 = fieldWeight in 4000, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                7.5751467 = idf(docFreq=58, maxDocs=42306)
                0.0625 = fieldNorm(doc=4000)
        0.16 = coord(4/25)
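
The score traces above follow Lucene's ClassicSimilarity (TF-IDF) breakdown: per term, tf = sqrt(termFreq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = boost * idf * queryNorm, fieldWeight = tf * idf * fieldNorm, and the term score is queryWeight * fieldWeight. The term scores are summed and multiplied by the coord factor (matched terms / query terms). The following is a minimal sketch that recomputes the first entry (doc 3778) from the values printed in its trace. The function and variable names are illustrative assumptions, not part of the catalog software, and small differences from the listed figures are to be expected because Lucene works in 32-bit floats.

```python
import math

def idf(doc_freq, max_docs):
    # ClassicSimilarity inverse document frequency.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    # Score contribution of one query term in one document.
    tf = math.sqrt(freq)
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = tf * term_idf * field_norm
    return query_weight * field_weight

# Constants copied from the trace of entry 1 (doc 3778).
MAX_DOCS = 42306
QUERY_NORM = 0.00945914
FIELD_NORM = 0.0625

# (term, termFreq, docFreq, boost) as printed in the trace.
terms = [
    ("were", 3.0, 2837, 1.0147593),
    ("study", 4.0, 3522, 1.4332352),
    ("methodology", 1.0, 1094, 1.9137294),
    ("storytelling", 4.0, 8, 2.591966),
    ("story", 4.0, 66, 3.062507),
    ("stories", 2.0, 65, 4.091587),
]

raw_sum = sum(
    term_score(freq, doc_freq, MAX_DOCS, boost, QUERY_NORM, FIELD_NORM)
    for _, freq, doc_freq, boost in terms
)
coord = len(terms) / 25  # 6 of the 25 query terms matched
total = raw_sum * coord

print(f"sum={raw_sum:.6f} coord={coord:.2f} total={total:.6f}")
```

The same function reproduces the other entries once their termFreq, docFreq, boost, fieldNorm, and coord values are substituted. Entry 4 (doc 2896), for instance, uses fieldNorm 0.078125 rather than 0.0625.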