Document (#36361)

Author
Rushdi-Saleh, M.
Martín-Valdivia, M.T.
Ureña-López, L.A.
Perea-Ortega, J.M.
Title
OCA: Opinion corpus for Arabic
Source
Journal of the American Society for Information Science and Technology. 62(2011) no.10, S.2045-2054
Year
2011
Abstract
Sentiment analysis is a challenging new task related to text mining and natural language processing. Although there are, at present, several studies related to this theme, most of these focus mainly on English texts. The resources available for opinion mining (OM) in other languages are still limited. In this article, we present a new Arabic corpus for the OM task that has been made available to the scientific community for research purposes. The corpus contains 500 movie reviews collected from different web pages and blogs in Arabic, 250 of them considered as positive reviews, and the other 250 as negative opinions. Furthermore, different experiments have been carried out on this corpus, using machine learning algorithms such as support vector machines and Nave Bayes. The results obtained are very promising and we are encouraged to continue this line of research.

Similar documents (author)

  1. Perea-Ortega, J.M.; Martín-Valdivia, M.T.; Ureña-López, L.A.; Martínez-Cámara, E.: Improving polarity classification of bilingual parallel corpora combining machine learning and semantic orientation approaches (2013) 4.95
    4.9549966 = sum of:
      4.9549966 = sum of:
        0.6418292 = weight(author_txt:lópez in 1045) [ClassicSimilarity], result of:
          0.6418292 = score(doc=1045,freq=1.0), product of:
            0.33307588 = queryWeight, product of:
              7.7079034 = idf(docFreq=53, maxDocs=44218)
              0.043212254 = queryNorm
            1.9269758 = fieldWeight in 1045, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.7079034 = idf(docFreq=53, maxDocs=44218)
              0.25 = fieldNorm(doc=1045)
        0.9760049 = weight(author_txt:martín in 1045) [ClassicSimilarity], result of:
          0.9760049 = score(doc=1045,freq=1.0), product of:
            0.44045162 = queryWeight, product of:
              1.1499462 = boost
              8.863674 = idf(docFreq=16, maxDocs=44218)
              0.043212254 = queryNorm
            2.2159185 = fieldWeight in 1045, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.863674 = idf(docFreq=16, maxDocs=44218)
              0.25 = fieldNorm(doc=1045)
        0.9961687 = weight(author_txt:ortega in 1045) [ClassicSimilarity], result of:
          0.9961687 = score(doc=1045,freq=1.0), product of:
            0.44649726 = queryWeight, product of:
              1.1578114 = boost
              8.924298 = idf(docFreq=15, maxDocs=44218)
              0.043212254 = queryNorm
            2.2310746 = fieldWeight in 1045, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.924298 = idf(docFreq=15, maxDocs=44218)
              0.25 = fieldNorm(doc=1045)
        1.0415573 = weight(author_txt:valdivia in 1045) [ClassicSimilarity], result of:
          1.0415573 = score(doc=1045,freq=1.0), product of:
            0.45995885 = queryWeight, product of:
              1.1751354 = boost
              9.05783 = idf(docFreq=13, maxDocs=44218)
              0.043212254 = queryNorm
            2.2644575 = fieldWeight in 1045, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.05783 = idf(docFreq=13, maxDocs=44218)
              0.25 = fieldNorm(doc=1045)
        1.2994367 = weight(author_txt:ureña in 1045) [ClassicSimilarity], result of:
          1.2994367 = score(doc=1045,freq=1.0), product of:
            0.5330488 = queryWeight, product of:
              1.2650622 = boost
              9.7509775 = idf(docFreq=6, maxDocs=44218)
              0.043212254 = queryNorm
            2.4377444 = fieldWeight in 1045, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.7509775 = idf(docFreq=6, maxDocs=44218)
              0.25 = fieldNorm(doc=1045)
    
  2. Martín-Valdivia, M.T.; Díaz-Galiano, M.C.; Montejo-Raez, A.; Ureña-López, L.A.: Using information gain to improve multi-modal information retrieval systems (2008) 3.17
    3.1670625 = sum of:
      3.1670625 = product of:
        3.958828 = sum of:
          0.6418292 = weight(author_txt:lópez in 2086) [ClassicSimilarity], result of:
            0.6418292 = score(doc=2086,freq=1.0), product of:
              0.33307588 = queryWeight, product of:
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.043212254 = queryNorm
              1.9269758 = fieldWeight in 2086, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.25 = fieldNorm(doc=2086)
          0.9760049 = weight(author_txt:martín in 2086) [ClassicSimilarity], result of:
            0.9760049 = score(doc=2086,freq=1.0), product of:
              0.44045162 = queryWeight, product of:
                1.1499462 = boost
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.043212254 = queryNorm
              2.2159185 = fieldWeight in 2086, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.25 = fieldNorm(doc=2086)
          1.0415573 = weight(author_txt:valdivia in 2086) [ClassicSimilarity], result of:
            1.0415573 = score(doc=2086,freq=1.0), product of:
              0.45995885 = queryWeight, product of:
                1.1751354 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.043212254 = queryNorm
              2.2644575 = fieldWeight in 2086, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.25 = fieldNorm(doc=2086)
          1.2994367 = weight(author_txt:ureña in 2086) [ClassicSimilarity], result of:
            1.2994367 = score(doc=2086,freq=1.0), product of:
              0.5330488 = queryWeight, product of:
                1.2650622 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.043212254 = queryNorm
              2.4377444 = fieldWeight in 2086, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.25 = fieldNorm(doc=2086)
        0.8 = coord(4/5)
    
  3. Montejo-Ráez, A.; Martínez-Cámara, E.; Martín-Valdivia, M.T.; Ureña-López, L.A.: ¬A knowledge-based approach for polarity classification in Twitter (2014) 3.17
    3.1670625 = sum of:
      3.1670625 = product of:
        3.958828 = sum of:
          0.6418292 = weight(author_txt:lópez in 1204) [ClassicSimilarity], result of:
            0.6418292 = score(doc=1204,freq=1.0), product of:
              0.33307588 = queryWeight, product of:
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.043212254 = queryNorm
              1.9269758 = fieldWeight in 1204, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.25 = fieldNorm(doc=1204)
          0.9760049 = weight(author_txt:martín in 1204) [ClassicSimilarity], result of:
            0.9760049 = score(doc=1204,freq=1.0), product of:
              0.44045162 = queryWeight, product of:
                1.1499462 = boost
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.043212254 = queryNorm
              2.2159185 = fieldWeight in 1204, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.25 = fieldNorm(doc=1204)
          1.0415573 = weight(author_txt:valdivia in 1204) [ClassicSimilarity], result of:
            1.0415573 = score(doc=1204,freq=1.0), product of:
              0.45995885 = queryWeight, product of:
                1.1751354 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.043212254 = queryNorm
              2.2644575 = fieldWeight in 1204, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.25 = fieldNorm(doc=1204)
          1.2994367 = weight(author_txt:ureña in 1204) [ClassicSimilarity], result of:
            1.2994367 = score(doc=1204,freq=1.0), product of:
              0.5330488 = queryWeight, product of:
                1.2650622 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.043212254 = queryNorm
              2.4377444 = fieldWeight in 1204, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.25 = fieldNorm(doc=1204)
        0.8 = coord(4/5)
    
  4. Delgado-Quirós, L.; Aguillo, I.F.; Martín-Martín, A.; López-Cózar, E.D.; Orduña-Malea, E.; Ortega, J.L.: Why are these publications missing? : uncovering the reasons behind the exclusion of documents in free-access scholarly databases (2024) 1.81
    1.8109664 = sum of:
      1.8109664 = product of:
        3.0182772 = sum of:
          0.6418292 = weight(author_txt:lópez in 1201) [ClassicSimilarity], result of:
            0.6418292 = score(doc=1201,freq=1.0), product of:
              0.33307588 = queryWeight, product of:
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.043212254 = queryNorm
              1.9269758 = fieldWeight in 1201, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.25 = fieldNorm(doc=1201)
          1.3802793 = weight(author_txt:martín in 1201) [ClassicSimilarity], result of:
            1.3802793 = score(doc=1201,freq=2.0), product of:
              0.44045162 = queryWeight, product of:
                1.1499462 = boost
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.043212254 = queryNorm
              3.133782 = fieldWeight in 1201, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.25 = fieldNorm(doc=1201)
          0.9961687 = weight(author_txt:ortega in 1201) [ClassicSimilarity], result of:
            0.9961687 = score(doc=1201,freq=1.0), product of:
              0.44649726 = queryWeight, product of:
                1.1578114 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.043212254 = queryNorm
              2.2310746 = fieldWeight in 1201, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.25 = fieldNorm(doc=1201)
        0.6 = coord(3/5)
    
  5. García Cumbreras, M.A.; Perea-Ortega, J.M.; García Vega, M.; Ureña López, L.A.: Information retrieval with geographical references : relevant documents filtering vs. query expansion (2009) 1.76
    1.7624608 = sum of:
      1.7624608 = product of:
        2.9374347 = sum of:
          0.6418292 = weight(author_txt:lópez in 4222) [ClassicSimilarity], result of:
            0.6418292 = score(doc=4222,freq=1.0), product of:
              0.33307588 = queryWeight, product of:
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.043212254 = queryNorm
              1.9269758 = fieldWeight in 4222, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.25 = fieldNorm(doc=4222)
          0.9961687 = weight(author_txt:ortega in 4222) [ClassicSimilarity], result of:
            0.9961687 = score(doc=4222,freq=1.0), product of:
              0.44649726 = queryWeight, product of:
                1.1578114 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.043212254 = queryNorm
              2.2310746 = fieldWeight in 4222, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.25 = fieldNorm(doc=4222)
          1.2994367 = weight(author_txt:ureña in 4222) [ClassicSimilarity], result of:
            1.2994367 = score(doc=4222,freq=1.0), product of:
              0.5330488 = queryWeight, product of:
                1.2650622 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.043212254 = queryNorm
              2.4377444 = fieldWeight in 4222, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.25 = fieldNorm(doc=4222)
        0.6 = coord(3/5)
    

Similar documents (content)

  1. Kanaan, G.; Al-Shalabi, R.; Ghwanmeh, S.; Al-Ma'adeed, H.: ¬A comparison of text-classification techniques applied to Arabic text (2009) 0.49
    0.4899841 = sum of:
      0.4899841 = product of:
        1.5312003 = sum of:
          0.02542561 = weight(abstract_txt:research in 3096) [ClassicSimilarity], result of:
            0.02542561 = score(doc=3096,freq=2.0), product of:
              0.06048944 = queryWeight, product of:
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.019079808 = queryNorm
              0.4203314 = fieldWeight in 3096, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.09375 = fieldNorm(doc=3096)
          0.08587928 = weight(abstract_txt:challenging in 3096) [ClassicSimilarity], result of:
            0.08587928 = score(doc=3096,freq=1.0), product of:
              0.13617297 = queryWeight, product of:
                1.0609397 = boost
                6.727074 = idf(docFreq=143, maxDocs=44218)
                0.019079808 = queryNorm
              0.6306632 = fieldWeight in 3096, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.727074 = idf(docFreq=143, maxDocs=44218)
                0.09375 = fieldNorm(doc=3096)
          0.046265196 = weight(abstract_txt:been in 3096) [ClassicSimilarity], result of:
            0.046265196 = score(doc=3096,freq=3.0), product of:
              0.07875978 = queryWeight, product of:
                1.1410705 = boost
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.019079808 = queryNorm
              0.5874216 = fieldWeight in 3096, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.09375 = fieldNorm(doc=3096)
          0.027786897 = weight(abstract_txt:different in 3096) [ClassicSimilarity], result of:
            0.027786897 = score(doc=3096,freq=1.0), product of:
              0.080860294 = queryWeight, product of:
                1.1561865 = boost
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.019079808 = queryNorm
              0.3436408 = fieldWeight in 3096, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.09375 = fieldNorm(doc=3096)
          0.2466409 = weight(abstract_txt:bayes in 3096) [ClassicSimilarity], result of:
            0.2466409 = score(doc=3096,freq=2.0), product of:
              0.21837288 = queryWeight, product of:
                1.3435214 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.019079808 = queryNorm
              1.1294484 = fieldWeight in 3096, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.09375 = fieldNorm(doc=3096)
          0.022421574 = weight(abstract_txt:this in 3096) [ClassicSimilarity], result of:
            0.022421574 = score(doc=3096,freq=2.0), product of:
              0.07008408 = queryWeight, product of:
                1.5222462 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.019079808 = queryNorm
              0.31992394 = fieldWeight in 3096, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.09375 = fieldNorm(doc=3096)
          0.8208448 = weight(abstract_txt:arabic in 3096) [ClassicSimilarity], result of:
            0.8208448 = score(doc=3096,freq=5.0), product of:
              0.51727694 = queryWeight, product of:
                3.5815203 = boost
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.019079808 = queryNorm
              1.5868576 = fieldWeight in 3096, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.09375 = fieldNorm(doc=3096)
          0.25593603 = weight(abstract_txt:corpus in 3096) [ClassicSimilarity], result of:
            0.25593603 = score(doc=3096,freq=1.0), product of:
              0.447651 = queryWeight, product of:
                3.847202 = boost
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.019079808 = queryNorm
              0.57173115 = fieldWeight in 3096, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.09375 = fieldNorm(doc=3096)
        0.32 = coord(8/25)
    
  2. Pang, B.; Lee, L.: Opinion mining and sentiment analysis (2008) 0.42
    0.4230476 = sum of:
      0.4230476 = product of:
        0.8813492 = sum of:
          0.010487529 = weight(abstract_txt:research in 1171) [ClassicSimilarity], result of:
            0.010487529 = score(doc=1171,freq=1.0), product of:
              0.06048944 = queryWeight, product of:
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.019079808 = queryNorm
              0.17337786 = fieldWeight in 1171, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1171)
          0.0787855 = weight(abstract_txt:opinions in 1171) [ClassicSimilarity], result of:
            0.0787855 = score(doc=1171,freq=2.0), product of:
              0.14616442 = queryWeight, product of:
                1.099173 = boost
                6.9694996 = idf(docFreq=112, maxDocs=44218)
                0.019079808 = queryNorm
              0.5390197 = fieldWeight in 1171, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9694996 = idf(docFreq=112, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1171)
          0.0143604465 = weight(abstract_txt:other in 1171) [ClassicSimilarity], result of:
            0.0143604465 = score(doc=1171,freq=1.0), product of:
              0.07458922 = queryWeight, product of:
                1.1104481 = boost
                3.5204957 = idf(docFreq=3555, maxDocs=44218)
                0.019079808 = queryNorm
              0.1925271 = fieldWeight in 1171, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5204957 = idf(docFreq=3555, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1171)
          0.015581548 = weight(abstract_txt:been in 1171) [ClassicSimilarity], result of:
            0.015581548 = score(doc=1171,freq=1.0), product of:
              0.07875978 = queryWeight, product of:
                1.1410705 = boost
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.019079808 = queryNorm
              0.19783635 = fieldWeight in 1171, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1171)
          0.06408114 = weight(abstract_txt:blogs in 1171) [ClassicSimilarity], result of:
            0.06408114 = score(doc=1171,freq=1.0), product of:
              0.16046277 = queryWeight, product of:
                1.1516814 = boost
                7.3024383 = idf(docFreq=80, maxDocs=44218)
                0.019079808 = queryNorm
              0.3993521 = fieldWeight in 1171, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3024383 = idf(docFreq=80, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1171)
          0.15859875 = weight(abstract_txt:sentiment in 1171) [ClassicSimilarity], result of:
            0.15859875 = score(doc=1171,freq=5.0), product of:
              0.1716975 = queryWeight, product of:
                1.1913166 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.019079808 = queryNorm
              0.92371035 = fieldWeight in 1171, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1171)
          0.026188765 = weight(abstract_txt:available in 1171) [ClassicSimilarity], result of:
            0.026188765 = score(doc=1171,freq=1.0), product of:
              0.11133733 = queryWeight, product of:
                1.3566899 = boost
                4.3011656 = idf(docFreq=1628, maxDocs=44218)
                0.019079808 = queryNorm
              0.23521999 = fieldWeight in 1171, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3011656 = idf(docFreq=1628, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1171)
          0.027071211 = weight(abstract_txt:present in 1171) [ClassicSimilarity], result of:
            0.027071211 = score(doc=1171,freq=1.0), product of:
              0.11382454 = queryWeight, product of:
                1.3717601 = boost
                4.348943 = idf(docFreq=1552, maxDocs=44218)
                0.019079808 = queryNorm
              0.23783283 = fieldWeight in 1171, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.348943 = idf(docFreq=1552, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1171)
          0.009248428 = weight(abstract_txt:this in 1171) [ClassicSimilarity], result of:
            0.009248428 = score(doc=1171,freq=1.0), product of:
              0.07008408 = queryWeight, product of:
                1.5222462 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.019079808 = queryNorm
              0.1319619 = fieldWeight in 1171, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1171)
          0.039986763 = weight(abstract_txt:reviews in 1171) [ClassicSimilarity], result of:
            0.039986763 = score(doc=1171,freq=1.0), product of:
              0.14763011 = queryWeight, product of:
                1.5622398 = boost
                4.952828 = idf(docFreq=848, maxDocs=44218)
                0.019079808 = queryNorm
              0.27085778 = fieldWeight in 1171, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.952828 = idf(docFreq=848, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1171)
          0.13425116 = weight(abstract_txt:mining in 1171) [ClassicSimilarity], result of:
            0.13425116 = score(doc=1171,freq=3.0), product of:
              0.2295104 = queryWeight, product of:
                1.9478765 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.019079808 = queryNorm
              0.58494586 = fieldWeight in 1171, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1171)
          0.30270797 = weight(abstract_txt:opinion in 1171) [ClassicSimilarity], result of:
            0.30270797 = score(doc=1171,freq=8.0), product of:
              0.28458807 = queryWeight, product of:
                2.169045 = boost
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.019079808 = queryNorm
              1.0636706 = fieldWeight in 1171, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1171)
        0.48 = coord(12/25)
    
  3. Perea-Ortega, J.M.; Martín-Valdivia, M.T.; Ureña-López, L.A.; Martínez-Cámara, E.: Improving polarity classification of bilingual parallel corpora combining machine learning and semantic orientation approaches (2013) 0.37
    0.37210974 = sum of:
      0.37210974 = product of:
        0.93027437 = sum of:
          0.0636683 = weight(abstract_txt:opinions in 1045) [ClassicSimilarity], result of:
            0.0636683 = score(doc=1045,freq=1.0), product of:
              0.14616442 = queryWeight, product of:
                1.099173 = boost
                6.9694996 = idf(docFreq=112, maxDocs=44218)
                0.019079808 = queryNorm
              0.43559372 = fieldWeight in 1045, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9694996 = idf(docFreq=112, maxDocs=44218)
                0.0625 = fieldNorm(doc=1045)
          0.017807484 = weight(abstract_txt:been in 1045) [ClassicSimilarity], result of:
            0.017807484 = score(doc=1045,freq=1.0), product of:
              0.07875978 = queryWeight, product of:
                1.1410705 = boost
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.019079808 = queryNorm
              0.22609869 = fieldWeight in 1045, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.0625 = fieldNorm(doc=1045)
          0.08106002 = weight(abstract_txt:sentiment in 1045) [ClassicSimilarity], result of:
            0.08106002 = score(doc=1045,freq=1.0), product of:
              0.1716975 = queryWeight, product of:
                1.1913166 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.019079808 = queryNorm
              0.47210953 = fieldWeight in 1045, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.0625 = fieldNorm(doc=1045)
          0.027960738 = weight(abstract_txt:related in 1045) [ClassicSimilarity], result of:
            0.027960738 = score(doc=1045,freq=1.0), product of:
              0.10639843 = queryWeight, product of:
                1.3262575 = boost
                4.2046843 = idf(docFreq=1793, maxDocs=44218)
                0.019079808 = queryNorm
              0.26279277 = fieldWeight in 1045, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2046843 = idf(docFreq=1793, maxDocs=44218)
                0.0625 = fieldNorm(doc=1045)
          0.018307138 = weight(abstract_txt:this in 1045) [ClassicSimilarity], result of:
            0.018307138 = score(doc=1045,freq=3.0), product of:
              0.07008408 = queryWeight, product of:
                1.5222462 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.019079808 = queryNorm
              0.2612168 = fieldWeight in 1045, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0625 = fieldNorm(doc=1045)
          0.04455924 = weight(abstract_txt:task in 1045) [ClassicSimilarity], result of:
            0.04455924 = score(doc=1045,freq=1.0), product of:
              0.1451648 = queryWeight, product of:
                1.5491408 = boost
                4.9112997 = idf(docFreq=884, maxDocs=44218)
                0.019079808 = queryNorm
              0.30695623 = fieldWeight in 1045, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9112997 = idf(docFreq=884, maxDocs=44218)
                0.0625 = fieldNorm(doc=1045)
          0.088582784 = weight(abstract_txt:mining in 1045) [ClassicSimilarity], result of:
            0.088582784 = score(doc=1045,freq=1.0), product of:
              0.2295104 = queryWeight, product of:
                1.9478765 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.019079808 = queryNorm
              0.38596416 = fieldWeight in 1045, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.0625 = fieldNorm(doc=1045)
          0.17297599 = weight(abstract_txt:opinion in 1045) [ClassicSimilarity], result of:
            0.17297599 = score(doc=1045,freq=2.0), product of:
              0.28458807 = queryWeight, product of:
                2.169045 = boost
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.019079808 = queryNorm
              0.6078118 = fieldWeight in 1045, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.0625 = fieldNorm(doc=1045)
          0.24472865 = weight(abstract_txt:arabic in 1045) [ClassicSimilarity], result of:
            0.24472865 = score(doc=1045,freq=1.0), product of:
              0.51727694 = queryWeight, product of:
                3.5815203 = boost
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.019079808 = queryNorm
              0.47310954 = fieldWeight in 1045, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.0625 = fieldNorm(doc=1045)
          0.170624 = weight(abstract_txt:corpus in 1045) [ClassicSimilarity], result of:
            0.170624 = score(doc=1045,freq=1.0), product of:
              0.447651 = queryWeight, product of:
                3.847202 = boost
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.019079808 = queryNorm
              0.3811541 = fieldWeight in 1045, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.0625 = fieldNorm(doc=1045)
        0.4 = coord(10/25)
    
  4. Belbachir, F.; Boughanem, M.: Using language models to improve opinion detection (2018) 0.26
    0.25956857 = sum of:
      0.25956857 = product of:
        0.7210238 = sum of:
          0.014831605 = weight(abstract_txt:research in 5044) [ClassicSimilarity], result of:
            0.014831605 = score(doc=5044,freq=2.0), product of:
              0.06048944 = queryWeight, product of:
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.019079808 = queryNorm
              0.2451933 = fieldWeight in 5044, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5044)
          0.06408114 = weight(abstract_txt:blogs in 5044) [ClassicSimilarity], result of:
            0.06408114 = score(doc=5044,freq=1.0), product of:
              0.16046277 = queryWeight, product of:
                1.1516814 = boost
                7.3024383 = idf(docFreq=80, maxDocs=44218)
                0.019079808 = queryNorm
              0.3993521 = fieldWeight in 5044, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3024383 = idf(docFreq=80, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5044)
          0.032418046 = weight(abstract_txt:different in 5044) [ClassicSimilarity], result of:
            0.032418046 = score(doc=5044,freq=4.0), product of:
              0.080860294 = queryWeight, product of:
                1.1561865 = boost
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.019079808 = queryNorm
              0.40091425 = fieldWeight in 5044, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5044)
          0.095103756 = weight(abstract_txt:movie in 5044) [ClassicSimilarity], result of:
            0.095103756 = score(doc=5044,freq=1.0), product of:
              0.20877855 = queryWeight, product of:
                1.3136756 = boost
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.019079808 = queryNorm
              0.45552456 = fieldWeight in 5044, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5044)
          0.026188765 = weight(abstract_txt:available in 5044) [ClassicSimilarity], result of:
            0.026188765 = score(doc=5044,freq=1.0), product of:
              0.11133733 = queryWeight, product of:
                1.3566899 = boost
                4.3011656 = idf(docFreq=1628, maxDocs=44218)
                0.019079808 = queryNorm
              0.23521999 = fieldWeight in 5044, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3011656 = idf(docFreq=1628, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5044)
          0.027071211 = weight(abstract_txt:present in 5044) [ClassicSimilarity], result of:
            0.027071211 = score(doc=5044,freq=1.0), product of:
              0.11382454 = queryWeight, product of:
                1.3717601 = boost
                4.348943 = idf(docFreq=1552, maxDocs=44218)
                0.019079808 = queryNorm
              0.23783283 = fieldWeight in 5044, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.348943 = idf(docFreq=1552, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5044)
          0.013079253 = weight(abstract_txt:this in 5044) [ClassicSimilarity], result of:
            0.013079253 = score(doc=5044,freq=2.0), product of:
              0.07008408 = queryWeight, product of:
                1.5222462 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.019079808 = queryNorm
              0.1866223 = fieldWeight in 5044, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5044)
          0.07750994 = weight(abstract_txt:mining in 5044) [ClassicSimilarity], result of:
            0.07750994 = score(doc=5044,freq=1.0), product of:
              0.2295104 = queryWeight, product of:
                1.9478765 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.019079808 = queryNorm
              0.33771864 = fieldWeight in 5044, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5044)
          0.37074006 = weight(abstract_txt:opinion in 5044) [ClassicSimilarity], result of:
            0.37074006 = score(doc=5044,freq=12.0), product of:
              0.28458807 = queryWeight, product of:
                2.169045 = boost
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.019079808 = queryNorm
              1.3027252 = fieldWeight in 5044, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5044)
        0.36 = coord(9/25)
    
  5. Abdelali, A.: Localization in modern standard Arabic (2004) 0.25
    0.2452417 = sum of:
      0.2452417 = product of:
        1.0218405 = sum of:
          0.03147948 = weight(abstract_txt:been in 2066) [ClassicSimilarity], result of:
            0.03147948 = score(doc=2066,freq=2.0), product of:
              0.07875978 = queryWeight, product of:
                1.1410705 = boost
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.019079808 = queryNorm
              0.3996898 = fieldWeight in 2066, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.078125 = fieldNorm(doc=2066)
          0.032747168 = weight(abstract_txt:different in 2066) [ClassicSimilarity], result of:
            0.032747168 = score(doc=2066,freq=2.0), product of:
              0.080860294 = queryWeight, product of:
                1.1561865 = boost
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.019079808 = queryNorm
              0.40498453 = fieldWeight in 2066, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.078125 = fieldNorm(doc=2066)
          0.037412524 = weight(abstract_txt:available in 2066) [ClassicSimilarity], result of:
            0.037412524 = score(doc=2066,freq=1.0), product of:
              0.11133733 = queryWeight, product of:
                1.3566899 = boost
                4.3011656 = idf(docFreq=1628, maxDocs=44218)
                0.019079808 = queryNorm
              0.33602858 = fieldWeight in 2066, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3011656 = idf(docFreq=1628, maxDocs=44218)
                0.078125 = fieldNorm(doc=2066)
          0.022883922 = weight(abstract_txt:this in 2066) [ClassicSimilarity], result of:
            0.022883922 = score(doc=2066,freq=3.0), product of:
              0.07008408 = queryWeight, product of:
                1.5222462 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.019079808 = queryNorm
              0.32652098 = fieldWeight in 2066, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.078125 = fieldNorm(doc=2066)
          0.6840374 = weight(abstract_txt:arabic in 2066) [ClassicSimilarity], result of:
            0.6840374 = score(doc=2066,freq=5.0), product of:
              0.51727694 = queryWeight, product of:
                3.5815203 = boost
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.019079808 = queryNorm
              1.3223814 = fieldWeight in 2066, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.078125 = fieldNorm(doc=2066)
          0.21328 = weight(abstract_txt:corpus in 2066) [ClassicSimilarity], result of:
            0.21328 = score(doc=2066,freq=1.0), product of:
              0.447651 = queryWeight, product of:
                3.847202 = boost
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.019079808 = queryNorm
              0.4764426 = fieldWeight in 2066, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.078125 = fieldNorm(doc=2066)
        0.24 = coord(6/25)