Document (#36739)

Author
Anizi, M.
Dichy, J.
Title
Improving information retrieval in Arabic through a multi-agent approach and a rich lexical resource
Source
Knowledge organization. 38(2011) no.5, S.405-413
Year
2011
Abstract
This paper addresses the optimization of information retrieval in Arabic. The results derived from the expanding development of sites in Arabic are often spectacular. Nevertheless, several observations indicate that the responses remain disappointing, particularly upon comparing users' requests and quality of responses. One of the problems encountered by users is the loss of time when navigating between different URLs to find adequate responses. This, in many cases, is due to the absence of forms morphologically related to the research keyword. Such problems can be approached through a morphological analyzer drawing on the DIINAR.1 morpho-lexical resource. A second problem concerns the formulation of the query, which may prove ambiguous, as in everyday language. We then focus on contextual disambiguation based on a rich lexical resource that includes collocations and set expressions. The overall scheme of such a resource will only be hinted at here. Our approach leads to the elaboration of a multi-agent system, motivated by a need to solve problems encountered when using conventional methods of analysis, and to improve the results of queries thanks to a better collaboration between different levels of analysis. We suggest resorting to four agents: morphological, morpho-lexical, contextualization, and an interface agent. These agents 'negotiate' and 'cooperate' throughout the analysis process, starting from the submission of the initial query, and going on until an adequate query is obtained.
Content
Beitrag innerhalb einer Special Section: Knowledge Organization, Competitive Intelligence, and Information Systems - Papers from 4th International Conference on "Information Systems & Economic Intelligence," February 17-19th, 2011. Marrakech - Morocco.
Footnote
Vgl.: http://www.ergon-verlag.de/isko_ko/downloads/ko_38_2011_5d.pdf.
Theme
Computerlinguistik

Similar documents (content)

  1. Fautsch, C.; Savoy, J.: Algorithmic stemmers or morphological analysis? : an evaluation (2009) 0.11
    0.11297226 = sum of:
      0.11297226 = product of:
        0.5648613 = sum of:
          0.021613317 = weight(abstract_txt:when in 2950) [ClassicSimilarity], result of:
            0.021613317 = score(doc=2950,freq=1.0), product of:
              0.06668958 = queryWeight, product of:
                1.004086 = boost
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.016010823 = queryNorm
              0.32408836 = fieldWeight in 2950, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.078125 = fieldNorm(doc=2950)
          0.11833964 = weight(abstract_txt:analyzer in 2950) [ClassicSimilarity], result of:
            0.11833964 = score(doc=2950,freq=1.0), product of:
              0.16443232 = queryWeight, product of:
                1.1148604 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.016010823 = queryNorm
              0.71968603 = fieldWeight in 2950, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.078125 = fieldNorm(doc=2950)
          0.121724725 = weight(abstract_txt:morphologically in 2950) [ClassicSimilarity], result of:
            0.121724725 = score(doc=2950,freq=1.0), product of:
              0.16755326 = queryWeight, product of:
                1.1253908 = boost
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.016010823 = queryNorm
              0.72648376 = fieldWeight in 2950, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.078125 = fieldNorm(doc=2950)
          0.031322096 = weight(abstract_txt:analysis in 2950) [ClassicSimilarity], result of:
            0.031322096 = score(doc=2950,freq=2.0), product of:
              0.07759457 = queryWeight, product of:
                1.3264877 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.016010823 = queryNorm
              0.40366352 = fieldWeight in 2950, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.078125 = fieldNorm(doc=2950)
          0.27186155 = weight(abstract_txt:morphological in 2950) [ClassicSimilarity], result of:
            0.27186155 = score(doc=2950,freq=3.0), product of:
              0.25009316 = queryWeight, product of:
                1.9444323 = boost
                8.033325 = idf(docFreq=38, maxDocs=44218)
                0.016010823 = queryNorm
              1.0870411 = fieldWeight in 2950, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.033325 = idf(docFreq=38, maxDocs=44218)
                0.078125 = fieldNorm(doc=2950)
        0.2 = coord(5/25)
    
  2. Dumais, S.T.: Latent semantic analysis (2003) 0.10
    0.104468964 = sum of:
      0.104468964 = product of:
        0.37310344 = sum of:
          0.008645327 = weight(abstract_txt:when in 2462) [ClassicSimilarity], result of:
            0.008645327 = score(doc=2462,freq=1.0), product of:
              0.06668958 = queryWeight, product of:
                1.004086 = boost
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.016010823 = queryNorm
              0.12963535 = fieldWeight in 2462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.0688579 = weight(abstract_txt:morphologically in 2462) [ClassicSimilarity], result of:
            0.0688579 = score(doc=2462,freq=2.0), product of:
              0.16755326 = queryWeight, product of:
                1.1253908 = boost
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.016010823 = queryNorm
              0.41096127 = fieldWeight in 2462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.017718455 = weight(abstract_txt:analysis in 2462) [ClassicSimilarity], result of:
            0.017718455 = score(doc=2462,freq=4.0), product of:
              0.07759457 = queryWeight, product of:
                1.3264877 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.016010823 = queryNorm
              0.22834657 = fieldWeight in 2462, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.028835453 = weight(abstract_txt:problems in 2462) [ClassicSimilarity], result of:
            0.028835453 = score(doc=2462,freq=4.0), product of:
              0.1073574 = queryWeight, product of:
                1.5602835 = boost
                4.297489 = idf(docFreq=1634, maxDocs=44218)
                0.016010823 = queryNorm
              0.26859307 = fieldWeight in 2462, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.297489 = idf(docFreq=1634, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.043636553 = weight(abstract_txt:query in 2462) [ClassicSimilarity], result of:
            0.043636553 = score(doc=2462,freq=5.0), product of:
              0.13136442 = queryWeight, product of:
                1.7259429 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.016010823 = queryNorm
              0.3321794 = fieldWeight in 2462, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.08878961 = weight(abstract_txt:morphological in 2462) [ClassicSimilarity], result of:
            0.08878961 = score(doc=2462,freq=2.0), product of:
              0.25009316 = queryWeight, product of:
                1.9444323 = boost
                8.033325 = idf(docFreq=38, maxDocs=44218)
                0.016010823 = queryNorm
              0.35502616 = fieldWeight in 2462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.033325 = idf(docFreq=38, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.11662013 = weight(abstract_txt:lexical in 2462) [ClassicSimilarity], result of:
            0.11662013 = score(doc=2462,freq=3.0), product of:
              0.33013302 = queryWeight, product of:
                3.1593766 = boost
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.016010823 = queryNorm
              0.35325193 = fieldWeight in 2462, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
        0.28 = coord(7/25)
    
  3. Bicchieri, C.: ¬The potential for the evolution of co-operation among web agents (1998) 0.09
    0.093915075 = sum of:
      0.093915075 = product of:
        0.46957538 = sum of:
          0.021613317 = weight(abstract_txt:when in 2297) [ClassicSimilarity], result of:
            0.021613317 = score(doc=2297,freq=1.0), product of:
              0.06668958 = queryWeight, product of:
                1.004086 = boost
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.016010823 = queryNorm
              0.32408836 = fieldWeight in 2297, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.078125 = fieldNorm(doc=2297)
          0.038361575 = weight(abstract_txt:analysis in 2297) [ClassicSimilarity], result of:
            0.038361575 = score(doc=2297,freq=3.0), product of:
              0.07759457 = queryWeight, product of:
                1.3264877 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.016010823 = queryNorm
              0.4943848 = fieldWeight in 2297, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.078125 = fieldNorm(doc=2297)
          0.12466954 = weight(abstract_txt:agents in 2297) [ClassicSimilarity], result of:
            0.12466954 = score(doc=2297,freq=2.0), product of:
              0.17024483 = queryWeight, product of:
                1.6042752 = boost
                6.627983 = idf(docFreq=158, maxDocs=44218)
                0.016010823 = queryNorm
              0.7322956 = fieldWeight in 2297, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.627983 = idf(docFreq=158, maxDocs=44218)
                0.078125 = fieldNorm(doc=2297)
          0.124309584 = weight(abstract_txt:responses in 2297) [ClassicSimilarity], result of:
            0.124309584 = score(doc=2297,freq=1.0), product of:
              0.2450627 = queryWeight, product of:
                2.3573613 = boost
                6.4928803 = idf(docFreq=181, maxDocs=44218)
                0.016010823 = queryNorm
              0.50725627 = fieldWeight in 2297, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4928803 = idf(docFreq=181, maxDocs=44218)
                0.078125 = fieldNorm(doc=2297)
          0.16062137 = weight(abstract_txt:agent in 2297) [ClassicSimilarity], result of:
            0.16062137 = score(doc=2297,freq=1.0), product of:
              0.29072097 = queryWeight, product of:
                2.5675902 = boost
                7.071914 = idf(docFreq=101, maxDocs=44218)
                0.016010823 = queryNorm
              0.5524933 = fieldWeight in 2297, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.071914 = idf(docFreq=101, maxDocs=44218)
                0.078125 = fieldNorm(doc=2297)
        0.2 = coord(5/25)
    
  4. AI-Sughaiyer, I.A.; AI-Kharashi, I.A.: Arabic morphological analysis techniques : a comprehensive survey (2004) 0.09
    0.09279001 = sum of:
      0.09279001 = product of:
        0.7732501 = sum of:
          0.037586518 = weight(abstract_txt:analysis in 2206) [ClassicSimilarity], result of:
            0.037586518 = score(doc=2206,freq=2.0), product of:
              0.07759457 = queryWeight, product of:
                1.3264877 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.016010823 = queryNorm
              0.48439622 = fieldWeight in 2206, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.09375 = fieldNorm(doc=2206)
          0.3262339 = weight(abstract_txt:morphological in 2206) [ClassicSimilarity], result of:
            0.3262339 = score(doc=2206,freq=3.0), product of:
              0.25009316 = queryWeight, product of:
                1.9444323 = boost
                8.033325 = idf(docFreq=38, maxDocs=44218)
                0.016010823 = queryNorm
              1.3044494 = fieldWeight in 2206, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.033325 = idf(docFreq=38, maxDocs=44218)
                0.09375 = fieldNorm(doc=2206)
          0.4094297 = weight(abstract_txt:arabic in 2206) [ClassicSimilarity], result of:
            0.4094297 = score(doc=2206,freq=3.0), product of:
              0.3330932 = queryWeight, product of:
                2.74834 = boost
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.016010823 = queryNorm
              1.2291746 = fieldWeight in 2206, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.09375 = fieldNorm(doc=2206)
        0.12 = coord(3/25)
    
  5. Galitsky, B.: Can many agents answer questions better than one? (2005) 0.09
    0.09178152 = sum of:
      0.09178152 = product of:
        0.4589076 = sum of:
          0.021613317 = weight(abstract_txt:when in 3094) [ClassicSimilarity], result of:
            0.021613317 = score(doc=3094,freq=1.0), product of:
              0.06668958 = queryWeight, product of:
                1.004086 = boost
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.016010823 = queryNorm
              0.32408836 = fieldWeight in 3094, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.078125 = fieldNorm(doc=3094)
          0.022148067 = weight(abstract_txt:analysis in 3094) [ClassicSimilarity], result of:
            0.022148067 = score(doc=3094,freq=1.0), product of:
              0.07759457 = queryWeight, product of:
                1.3264877 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.016010823 = queryNorm
              0.2854332 = fieldWeight in 3094, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.078125 = fieldNorm(doc=3094)
          0.08815467 = weight(abstract_txt:agents in 3094) [ClassicSimilarity], result of:
            0.08815467 = score(doc=3094,freq=1.0), product of:
              0.17024483 = queryWeight, product of:
                1.6042752 = boost
                6.627983 = idf(docFreq=158, maxDocs=44218)
                0.016010823 = queryNorm
              0.5178112 = fieldWeight in 3094, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.627983 = idf(docFreq=158, maxDocs=44218)
                0.078125 = fieldNorm(doc=3094)
          0.04878715 = weight(abstract_txt:query in 3094) [ClassicSimilarity], result of:
            0.04878715 = score(doc=3094,freq=1.0), product of:
              0.13136442 = queryWeight, product of:
                1.7259429 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.016010823 = queryNorm
              0.37138787 = fieldWeight in 3094, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.078125 = fieldNorm(doc=3094)
          0.2782044 = weight(abstract_txt:agent in 3094) [ClassicSimilarity], result of:
            0.2782044 = score(doc=3094,freq=3.0), product of:
              0.29072097 = queryWeight, product of:
                2.5675902 = boost
                7.071914 = idf(docFreq=101, maxDocs=44218)
                0.016010823 = queryNorm
              0.9569465 = fieldWeight in 3094, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.071914 = idf(docFreq=101, maxDocs=44218)
                0.078125 = fieldNorm(doc=3094)
        0.2 = coord(5/25)