Document (#36362)

Author
Rushdi-Saleh, M.
Martín-Valdivia, M.T.
Ureña-López, L.A.
Perea-Ortega, J.M.
Title
OCA: Opinion corpus for Arabic
Source
Journal of the American Society for Information Science and Technology. 62(2011) no.10, pp.2045-2054
Year
2011
Abstract
Sentiment analysis is a challenging new task related to text mining and natural language processing. Although there are, at present, several studies related to this theme, most of these focus mainly on English texts. The resources available for opinion mining (OM) in other languages are still limited. In this article, we present a new Arabic corpus for the OM task that has been made available to the scientific community for research purposes. The corpus contains 500 movie reviews collected from different web pages and blogs in Arabic, 250 of them considered as positive reviews, and the other 250 as negative opinions. Furthermore, different experiments have been carried out on this corpus, using machine learning algorithms such as support vector machines and Naïve Bayes. The results obtained are very promising and we are encouraged to continue this line of research.

Similar documents (author)

  1. Perea-Ortega, J.M.; Martín-Valdivia, M.T.; Ureña-López, L.A.; Martínez-Cámara, E.: Improving polarity classification of bilingual parallel corpora combining machine learning and semantic orientation approaches (2013) 4.97
    4.973597 = sum of:
      4.973597 = sum of:
        0.64856684 = weight(author_txt:lópez in 3046) [ClassicSimilarity], result of:
          0.64856684 = score(doc=3046,freq=1.0), product of:
            0.33470672 = queryWeight, product of:
              7.7508674 = idf(docFreq=49, maxDocs=42740)
              0.043183133 = queryNorm
            1.9377168 = fieldWeight in 3046, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.7508674 = idf(docFreq=49, maxDocs=42740)
              0.25 = fieldNorm(doc=3046)
        1.0001783 = weight(author_txt:martín in 3046) [ClassicSimilarity], result of:
          1.0001783 = score(doc=3046,freq=1.0), product of:
            0.44676542 = queryWeight, product of:
              1.155334 = boost
              8.954841 = idf(docFreq=14, maxDocs=42740)
              0.043183133 = queryNorm
            2.2387102 = fieldWeight in 3046, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.954841 = idf(docFreq=14, maxDocs=42740)
              0.25 = fieldNorm(doc=3046)
        1.0234746 = weight(author_txt:valdivia in 3046) [ClassicSimilarity], result of:
          1.0234746 = score(doc=3046,freq=1.0), product of:
            0.45367616 = queryWeight, product of:
              1.1642352 = boost
              9.023833 = idf(docFreq=13, maxDocs=42740)
              0.043183133 = queryNorm
            2.2559583 = fieldWeight in 3046, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.023833 = idf(docFreq=13, maxDocs=42740)
              0.25 = fieldNorm(doc=3046)
        1.0234746 = weight(author_txt:ortega in 3046) [ClassicSimilarity], result of:
          1.0234746 = score(doc=3046,freq=1.0), product of:
            0.45367616 = queryWeight, product of:
              1.1642352 = boost
              9.023833 = idf(docFreq=13, maxDocs=42740)
              0.043183133 = queryNorm
            2.2559583 = fieldWeight in 3046, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.023833 = idf(docFreq=13, maxDocs=42740)
              0.25 = fieldNorm(doc=3046)
        1.2779027 = weight(author_txt:ureña in 3046) [ClassicSimilarity], result of:
          1.2779027 = score(doc=3046,freq=1.0), product of:
            0.5260493 = queryWeight, product of:
              1.2536635 = boost
              9.71698 = idf(docFreq=6, maxDocs=42740)
              0.043183133 = queryNorm
            2.429245 = fieldWeight in 3046, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.71698 = idf(docFreq=6, maxDocs=42740)
              0.25 = fieldNorm(doc=3046)
    
  2. Martín-Valdivia, M.T.; Díaz-Galiano, M.C.; Montejo-Raez, A.; Ureña-López, L.A.: Using information gain to improve multi-modal information retrieval systems (2008) 3.16
    3.1600978 = sum of:
      3.1600978 = product of:
        3.9501224 = sum of:
          0.64856684 = weight(author_txt:lópez in 4087) [ClassicSimilarity], result of:
            0.64856684 = score(doc=4087,freq=1.0), product of:
              0.33470672 = queryWeight, product of:
                7.7508674 = idf(docFreq=49, maxDocs=42740)
                0.043183133 = queryNorm
              1.9377168 = fieldWeight in 4087, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7508674 = idf(docFreq=49, maxDocs=42740)
                0.25 = fieldNorm(doc=4087)
          1.0001783 = weight(author_txt:martín in 4087) [ClassicSimilarity], result of:
            1.0001783 = score(doc=4087,freq=1.0), product of:
              0.44676542 = queryWeight, product of:
                1.155334 = boost
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.043183133 = queryNorm
              2.2387102 = fieldWeight in 4087, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.25 = fieldNorm(doc=4087)
          1.0234746 = weight(author_txt:valdivia in 4087) [ClassicSimilarity], result of:
            1.0234746 = score(doc=4087,freq=1.0), product of:
              0.45367616 = queryWeight, product of:
                1.1642352 = boost
                9.023833 = idf(docFreq=13, maxDocs=42740)
                0.043183133 = queryNorm
              2.2559583 = fieldWeight in 4087, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.023833 = idf(docFreq=13, maxDocs=42740)
                0.25 = fieldNorm(doc=4087)
          1.2779027 = weight(author_txt:ureña in 4087) [ClassicSimilarity], result of:
            1.2779027 = score(doc=4087,freq=1.0), product of:
              0.5260493 = queryWeight, product of:
                1.2536635 = boost
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.043183133 = queryNorm
              2.429245 = fieldWeight in 4087, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.25 = fieldNorm(doc=4087)
        0.8 = coord(4/5)
    
  3. Montejo-Ráez, A.; Martínez-Cámara, E.; Martín-Valdivia, M.T.; Ureña-López, L.A.: A knowledge-based approach for polarity classification in Twitter (2014) 3.16
    3.1600978 = sum of:
      3.1600978 = product of:
        3.9501224 = sum of:
          0.64856684 = weight(author_txt:lópez in 3205) [ClassicSimilarity], result of:
            0.64856684 = score(doc=3205,freq=1.0), product of:
              0.33470672 = queryWeight, product of:
                7.7508674 = idf(docFreq=49, maxDocs=42740)
                0.043183133 = queryNorm
              1.9377168 = fieldWeight in 3205, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7508674 = idf(docFreq=49, maxDocs=42740)
                0.25 = fieldNorm(doc=3205)
          1.0001783 = weight(author_txt:martín in 3205) [ClassicSimilarity], result of:
            1.0001783 = score(doc=3205,freq=1.0), product of:
              0.44676542 = queryWeight, product of:
                1.155334 = boost
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.043183133 = queryNorm
              2.2387102 = fieldWeight in 3205, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.25 = fieldNorm(doc=3205)
          1.0234746 = weight(author_txt:valdivia in 3205) [ClassicSimilarity], result of:
            1.0234746 = score(doc=3205,freq=1.0), product of:
              0.45367616 = queryWeight, product of:
                1.1642352 = boost
                9.023833 = idf(docFreq=13, maxDocs=42740)
                0.043183133 = queryNorm
              2.2559583 = fieldWeight in 3205, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.023833 = idf(docFreq=13, maxDocs=42740)
                0.25 = fieldNorm(doc=3205)
          1.2779027 = weight(author_txt:ureña in 3205) [ClassicSimilarity], result of:
            1.2779027 = score(doc=3205,freq=1.0), product of:
              0.5260493 = queryWeight, product of:
                1.2536635 = boost
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.043183133 = queryNorm
              2.429245 = fieldWeight in 3205, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.25 = fieldNorm(doc=3205)
        0.8 = coord(4/5)
    
  4. García Cumbreras, M.A.; Perea-Ortega, J.M.; García Vega, M.; Ureña López, L.A.: Information retrieval with geographical references : relevant documents filtering vs. query expansion (2009) 1.77
    1.7699665 = sum of:
      1.7699665 = product of:
        2.949944 = sum of:
          0.64856684 = weight(author_txt:lópez in 1223) [ClassicSimilarity], result of:
            0.64856684 = score(doc=1223,freq=1.0), product of:
              0.33470672 = queryWeight, product of:
                7.7508674 = idf(docFreq=49, maxDocs=42740)
                0.043183133 = queryNorm
              1.9377168 = fieldWeight in 1223, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7508674 = idf(docFreq=49, maxDocs=42740)
                0.25 = fieldNorm(doc=1223)
          1.0234746 = weight(author_txt:ortega in 1223) [ClassicSimilarity], result of:
            1.0234746 = score(doc=1223,freq=1.0), product of:
              0.45367616 = queryWeight, product of:
                1.1642352 = boost
                9.023833 = idf(docFreq=13, maxDocs=42740)
                0.043183133 = queryNorm
              2.2559583 = fieldWeight in 1223, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.023833 = idf(docFreq=13, maxDocs=42740)
                0.25 = fieldNorm(doc=1223)
          1.2779027 = weight(author_txt:ureña in 1223) [ClassicSimilarity], result of:
            1.2779027 = score(doc=1223,freq=1.0), product of:
              0.5260493 = queryWeight, product of:
                1.2536635 = boost
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.043183133 = queryNorm
              2.429245 = fieldWeight in 1223, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.25 = fieldNorm(doc=1223)
        0.6 = coord(3/5)
    
  5. Martínez, F.; Martín, M.T.; Rivas, V.M.; Díaz, M.C.; Ureña, L.A.: Using neural networks for multiword recognition in IR (2003) 1.14
    1.1390406 = sum of:
      1.1390406 = product of:
        2.8476014 = sum of:
          1.2502229 = weight(author_txt:martín in 3778) [ClassicSimilarity], result of:
            1.2502229 = score(doc=3778,freq=1.0), product of:
              0.44676542 = queryWeight, product of:
                1.155334 = boost
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.043183133 = queryNorm
              2.7983878 = fieldWeight in 3778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.3125 = fieldNorm(doc=3778)
          1.5973784 = weight(author_txt:ureña in 3778) [ClassicSimilarity], result of:
            1.5973784 = score(doc=3778,freq=1.0), product of:
              0.5260493 = queryWeight, product of:
                1.2536635 = boost
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.043183133 = queryNorm
              3.0365562 = fieldWeight in 3778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.3125 = fieldNorm(doc=3778)
        0.4 = coord(2/5)
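
Note on reading the score trees: the nesting above appears to follow the classic TF-IDF scoring used by Lucene's ClassicSimilarity. Each term score is queryWeight (boost x idf x queryNorm) times fieldWeight (tf x idf x fieldNorm), and the per-term scores are summed and scaled by coord (matched terms / query terms). The sketch below recomputes result 2 (doc 4087) from the numbers shown. The function names are illustrative only and are not part of any library; the idf formula 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq) are assumptions inferred from the listed values, which they reproduce to within rounding.

```python
import math

def idf(doc_freq: int, max_docs: int) -> float:
    # Assumed inverse document frequency: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    # queryWeight = boost * idf * queryNorm
    # fieldWeight = tf * idf * fieldNorm, with tf = sqrt(freq)
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight

def document_score(term_scores, query_terms):
    # coord rewards documents that match more of the query terms.
    coord = len(term_scores) / query_terms
    return coord * sum(term_scores)

# Values copied from the explanation tree for result 2 (doc 4087).
MAX_DOCS = 42740
QUERY_NORM = 0.043183133
FIELD_NORM = 0.25

# (docFreq, boost) for lópez, martín, valdivia, ureña; each occurs once (freq=1).
matched = [(49, 1.0), (14, 1.155334), (13, 1.1642352), (6, 1.2536635)]
scores = [
    term_score(1.0, df, MAX_DOCS, QUERY_NORM, FIELD_NORM, boost=b)
    for df, b in matched
]
total = document_score(scores, query_terms=5)  # 4 of 5 author terms matched

print(scores[0], total)
```

Under these assumptions the first term score agrees with the listed 0.64856684 and the total with the listed 3.1600978 up to small floating-point differences (the listing seems to use single-precision arithmetic). Result 1 matches all five terms, so its coord factor is 1 and no coord line appears in its tree.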
    

Similar documents (content)

  1. Kanaan, G.; Al-Shalabi, R.; Ghwanmeh, S.; Al-Ma'adeed, H.: A comparison of text-classification techniques applied to Arabic text (2009) 0.49
    0.49004972 = sum of:
      0.49004972 = product of:
        1.5314054 = sum of:
          0.0262108 = weight(abstract_txt:research in 97) [ClassicSimilarity], result of:
            0.0262108 = score(doc=97,freq=2.0), product of:
              0.061510798 = queryWeight, product of:
                3.2139761 = idf(docFreq=4669, maxDocs=42740)
                0.019138535 = queryNorm
              0.42611706 = fieldWeight in 97, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.2139761 = idf(docFreq=4669, maxDocs=42740)
                0.09375 = fieldNorm(doc=97)
          0.08788682 = weight(abstract_txt:challenging in 97) [ClassicSimilarity], result of:
            0.08788682 = score(doc=97,freq=1.0), product of:
              0.1377993 = queryWeight, product of:
                1.0583586 = boost
                6.803078 = idf(docFreq=128, maxDocs=42740)
                0.019138535 = queryNorm
              0.6377886 = fieldWeight in 97, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.803078 = idf(docFreq=128, maxDocs=42740)
                0.09375 = fieldNorm(doc=97)
          0.04650801 = weight(abstract_txt:been in 97) [ClassicSimilarity], result of:
            0.04650801 = score(doc=97,freq=3.0), product of:
              0.078756414 = queryWeight, product of:
                1.1315331 = boost
                3.6367204 = idf(docFreq=3059, maxDocs=42740)
                0.019138535 = queryNorm
              0.5905298 = fieldWeight in 97, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6367204 = idf(docFreq=3059, maxDocs=42740)
                0.09375 = fieldNorm(doc=97)
          0.028153306 = weight(abstract_txt:different in 97) [ClassicSimilarity], result of:
            0.028153306 = score(doc=97,freq=1.0), product of:
              0.081281945 = queryWeight, product of:
                1.1495328 = boost
                3.694571 = idf(docFreq=2887, maxDocs=42740)
                0.019138535 = queryNorm
              0.34636605 = fieldWeight in 97, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.694571 = idf(docFreq=2887, maxDocs=42740)
                0.09375 = fieldNorm(doc=97)
          0.24477758 = weight(abstract_txt:bayes in 97) [ClassicSimilarity], result of:
            0.24477758 = score(doc=97,freq=2.0), product of:
              0.2165055 = queryWeight, product of:
                1.3266116 = boost
                8.527396 = idf(docFreq=22, maxDocs=42740)
                0.019138535 = queryNorm
              1.1305836 = fieldWeight in 97, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.527396 = idf(docFreq=22, maxDocs=42740)
                0.09375 = fieldNorm(doc=97)
          0.023022402 = weight(abstract_txt:this in 97) [ClassicSimilarity], result of:
            0.023022402 = score(doc=97,freq=2.0), product of:
              0.071079046 = queryWeight, product of:
                1.5202328 = boost
                2.442996 = idf(docFreq=10095, maxDocs=42740)
                0.019138535 = queryNorm
              0.32389858 = fieldWeight in 97, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.442996 = idf(docFreq=10095, maxDocs=42740)
                0.09375 = fieldNorm(doc=97)
          0.81722367 = weight(abstract_txt:arabic in 97) [ClassicSimilarity], result of:
            0.81722367 = score(doc=97,freq=5.0), product of:
              0.5139358 = queryWeight, product of:
                3.540172 = boost
                7.585353 = idf(docFreq=58, maxDocs=42740)
                0.019138535 = queryNorm
              1.590128 = fieldWeight in 97, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.585353 = idf(docFreq=58, maxDocs=42740)
                0.09375 = fieldNorm(doc=97)
          0.25762278 = weight(abstract_txt:corpus in 97) [ClassicSimilarity], result of:
            0.25762278 = score(doc=97,freq=1.0), product of:
              0.44803023 = queryWeight, product of:
                3.8167436 = boost
                6.1334615 = idf(docFreq=251, maxDocs=42740)
                0.019138535 = queryNorm
              0.575012 = fieldWeight in 97, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1334615 = idf(docFreq=251, maxDocs=42740)
                0.09375 = fieldNorm(doc=97)
        0.32 = coord(8/25)
    
  2. Pang, B.; Lee, L.: Opinion mining and sentiment analysis (2008) 0.43
    0.43094784 = sum of:
      0.43094784 = product of:
        0.897808 = sum of:
          0.010811403 = weight(abstract_txt:research in 3172) [ClassicSimilarity], result of:
            0.010811403 = score(doc=3172,freq=1.0), product of:
              0.061510798 = queryWeight, product of:
                3.2139761 = idf(docFreq=4669, maxDocs=42740)
                0.019138535 = queryNorm
              0.17576432 = fieldWeight in 3172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2139761 = idf(docFreq=4669, maxDocs=42740)
                0.0546875 = fieldNorm(doc=3172)
          0.08095289 = weight(abstract_txt:opinions in 3172) [ClassicSimilarity], result of:
            0.08095289 = score(doc=3172,freq=2.0), product of:
              0.14830811 = queryWeight, product of:
                1.0979733 = boost
                7.05772 = idf(docFreq=99, maxDocs=42740)
                0.019138535 = queryNorm
              0.54584265 = fieldWeight in 3172, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.05772 = idf(docFreq=99, maxDocs=42740)
                0.0546875 = fieldNorm(doc=3172)
          0.0143841375 = weight(abstract_txt:other in 3172) [ClassicSimilarity], result of:
            0.0143841375 = score(doc=3172,freq=1.0), product of:
              0.07440792 = queryWeight, product of:
                1.0998511 = boost
                3.5348954 = idf(docFreq=3387, maxDocs=42740)
                0.019138535 = queryNorm
              0.1933146 = fieldWeight in 3172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5348954 = idf(docFreq=3387, maxDocs=42740)
                0.0546875 = fieldNorm(doc=3172)
          0.015663324 = weight(abstract_txt:been in 3172) [ClassicSimilarity], result of:
            0.015663324 = score(doc=3172,freq=1.0), product of:
              0.078756414 = queryWeight, product of:
                1.1315331 = boost
                3.6367204 = idf(docFreq=3059, maxDocs=42740)
                0.019138535 = queryNorm
              0.19888315 = fieldWeight in 3172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6367204 = idf(docFreq=3059, maxDocs=42740)
                0.0546875 = fieldNorm(doc=3172)
          0.06350318 = weight(abstract_txt:blogs in 3172) [ClassicSimilarity], result of:
            0.06350318 = score(doc=3172,freq=1.0), product of:
              0.15893406 = queryWeight, product of:
                1.1366266 = boost
                7.306182 = idf(docFreq=77, maxDocs=42740)
                0.019138535 = queryNorm
              0.39955682 = fieldWeight in 3172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.306182 = idf(docFreq=77, maxDocs=42740)
                0.0546875 = fieldNorm(doc=3172)
          0.16453512 = weight(abstract_txt:sentiment in 3172) [ClassicSimilarity], result of:
            0.16453512 = score(doc=3172,freq=5.0), product of:
              0.17533517 = queryWeight, product of:
                1.1938337 = boost
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.019138535 = queryNorm
              0.93840337 = fieldWeight in 3172, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.0546875 = fieldNorm(doc=3172)
          0.025976565 = weight(abstract_txt:available in 3172) [ClassicSimilarity], result of:
            0.025976565 = score(doc=3172,freq=1.0), product of:
              0.1103446 = queryWeight, product of:
                1.3393679 = boost
                4.3046966 = idf(docFreq=1568, maxDocs=42740)
                0.019138535 = queryNorm
              0.23541309 = fieldWeight in 3172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3046966 = idf(docFreq=1568, maxDocs=42740)
                0.0546875 = fieldNorm(doc=3172)
          0.027314806 = weight(abstract_txt:present in 3172) [ClassicSimilarity], result of:
            0.027314806 = score(doc=3172,freq=1.0), product of:
              0.11410256 = queryWeight, product of:
                1.361984 = boost
                4.377384 = idf(docFreq=1458, maxDocs=42740)
                0.019138535 = queryNorm
              0.2393882 = fieldWeight in 3172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.377384 = idf(docFreq=1458, maxDocs=42740)
                0.0546875 = fieldNorm(doc=3172)
          0.009496256 = weight(abstract_txt:this in 3172) [ClassicSimilarity], result of:
            0.009496256 = score(doc=3172,freq=1.0), product of:
              0.071079046 = queryWeight, product of:
                1.5202328 = boost
                2.442996 = idf(docFreq=10095, maxDocs=42740)
                0.019138535 = queryNorm
              0.13360134 = fieldWeight in 3172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.442996 = idf(docFreq=10095, maxDocs=42740)
                0.0546875 = fieldNorm(doc=3172)
          0.039379995 = weight(abstract_txt:reviews in 3172) [ClassicSimilarity], result of:
            0.039379995 = score(doc=3172,freq=1.0), product of:
              0.14561756 = queryWeight, product of:
                1.5386194 = boost
                4.945086 = idf(docFreq=826, maxDocs=42740)
                0.019138535 = queryNorm
              0.27043438 = fieldWeight in 3172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.945086 = idf(docFreq=826, maxDocs=42740)
                0.0546875 = fieldNorm(doc=3172)
          0.134921 = weight(abstract_txt:mining in 3172) [ClassicSimilarity], result of:
            0.134921 = score(doc=3172,freq=3.0), product of:
              0.2294612 = queryWeight, product of:
                1.9314299 = boost
                6.2075696 = idf(docFreq=233, maxDocs=42740)
                0.019138535 = queryNorm
              0.58799046 = fieldWeight in 3172, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.2075696 = idf(docFreq=233, maxDocs=42740)
                0.0546875 = fieldNorm(doc=3172)
          0.31086934 = weight(abstract_txt:opinion in 3172) [ClassicSimilarity], result of:
            0.31086934 = score(doc=3172,freq=8.0), product of:
              0.2886591 = queryWeight, product of:
                2.1662917 = boost
                6.96241 = idf(docFreq=109, maxDocs=42740)
                0.019138535 = queryNorm
              1.0769428 = fieldWeight in 3172, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                6.96241 = idf(docFreq=109, maxDocs=42740)
                0.0546875 = fieldNorm(doc=3172)
        0.48 = coord(12/25)
    
  3. Perea-Ortega, J.M.; Martín-Valdivia, M.T.; Ureña-López, L.A.; Martínez-Cámara, E.: Improving polarity classification of bilingual parallel corpora combining machine learning and semantic orientation approaches (2013) 0.38
    0.37655756 = sum of:
      0.37655756 = product of:
        0.94139385 = sum of:
          0.06541982 = weight(abstract_txt:opinions in 3046) [ClassicSimilarity], result of:
            0.06541982 = score(doc=3046,freq=1.0), product of:
              0.14830811 = queryWeight, product of:
                1.0979733 = boost
                7.05772 = idf(docFreq=99, maxDocs=42740)
                0.019138535 = queryNorm
              0.4411075 = fieldWeight in 3046, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.05772 = idf(docFreq=99, maxDocs=42740)
                0.0625 = fieldNorm(doc=3046)
          0.017900942 = weight(abstract_txt:been in 3046) [ClassicSimilarity], result of:
            0.017900942 = score(doc=3046,freq=1.0), product of:
              0.078756414 = queryWeight, product of:
                1.1315331 = boost
                3.6367204 = idf(docFreq=3059, maxDocs=42740)
                0.019138535 = queryNorm
              0.22729503 = fieldWeight in 3046, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6367204 = idf(docFreq=3059, maxDocs=42740)
                0.0625 = fieldNorm(doc=3046)
          0.08409411 = weight(abstract_txt:sentiment in 3046) [ClassicSimilarity], result of:
            0.08409411 = score(doc=3046,freq=1.0), product of:
              0.17533517 = queryWeight, product of:
                1.1938337 = boost
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.019138535 = queryNorm
              0.47961915 = fieldWeight in 3046, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.0625 = fieldNorm(doc=3046)
          0.028343396 = weight(abstract_txt:related in 3046) [ClassicSimilarity], result of:
            0.028343396 = score(doc=3046,freq=1.0), product of:
              0.10698838 = queryWeight, product of:
                1.3188416 = boost
                4.238725 = idf(docFreq=1675, maxDocs=42740)
                0.019138535 = queryNorm
              0.26492032 = fieldWeight in 3046, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.238725 = idf(docFreq=1675, maxDocs=42740)
                0.0625 = fieldNorm(doc=3046)
          0.018797712 = weight(abstract_txt:this in 3046) [ClassicSimilarity], result of:
            0.018797712 = score(doc=3046,freq=3.0), product of:
              0.071079046 = queryWeight, product of:
                1.5202328 = boost
                2.442996 = idf(docFreq=10095, maxDocs=42740)
                0.019138535 = queryNorm
              0.26446208 = fieldWeight in 3046, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.442996 = idf(docFreq=10095, maxDocs=42740)
                0.0625 = fieldNorm(doc=3046)
          0.044775963 = weight(abstract_txt:task in 3046) [ClassicSimilarity], result of:
            0.044775963 = score(doc=3046,freq=1.0), product of:
              0.14512157 = queryWeight, product of:
                1.5359968 = boost
                4.936657 = idf(docFreq=833, maxDocs=42740)
                0.019138535 = queryNorm
              0.30854106 = fieldWeight in 3046, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.936657 = idf(docFreq=833, maxDocs=42740)
                0.0625 = fieldNorm(doc=3046)
          0.08902477 = weight(abstract_txt:mining in 3046) [ClassicSimilarity], result of:
            0.08902477 = score(doc=3046,freq=1.0), product of:
              0.2294612 = queryWeight, product of:
                1.9314299 = boost
                6.2075696 = idf(docFreq=233, maxDocs=42740)
                0.019138535 = queryNorm
              0.3879731 = fieldWeight in 3046, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2075696 = idf(docFreq=233, maxDocs=42740)
                0.0625 = fieldNorm(doc=3046)
          0.17763962 = weight(abstract_txt:opinion in 3046) [ClassicSimilarity], result of:
            0.17763962 = score(doc=3046,freq=2.0), product of:
              0.2886591 = queryWeight, product of:
                2.1662917 = boost
                6.96241 = idf(docFreq=109, maxDocs=42740)
                0.019138535 = queryNorm
              0.6153959 = fieldWeight in 3046, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.96241 = idf(docFreq=109, maxDocs=42740)
                0.0625 = fieldNorm(doc=3046)
          0.24364902 = weight(abstract_txt:arabic in 3046) [ClassicSimilarity], result of:
            0.24364902 = score(doc=3046,freq=1.0), product of:
              0.5139358 = queryWeight, product of:
                3.540172 = boost
                7.585353 = idf(docFreq=58, maxDocs=42740)
                0.019138535 = queryNorm
              0.47408456 = fieldWeight in 3046, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.585353 = idf(docFreq=58, maxDocs=42740)
                0.0625 = fieldNorm(doc=3046)
          0.1717485 = weight(abstract_txt:corpus in 3046) [ClassicSimilarity], result of:
            0.1717485 = score(doc=3046,freq=1.0), product of:
              0.44803023 = queryWeight, product of:
                3.8167436 = boost
                6.1334615 = idf(docFreq=251, maxDocs=42740)
                0.019138535 = queryNorm
              0.38334134 = fieldWeight in 3046, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1334615 = idf(docFreq=251, maxDocs=42740)
                0.0625 = fieldNorm(doc=3046)
        0.4 = coord(10/25)
    
  4. Belbachir, F.; Boughanem, M.: Using language models to improve opinion detection (2018) 0.26
    0.26365262 = sum of:
      0.26365262 = product of:
        0.73236835 = sum of:
          0.015289634 = weight(abstract_txt:research in 1045) [ClassicSimilarity], result of:
            0.015289634 = score(doc=1045,freq=2.0), product of:
              0.061510798 = queryWeight, product of:
                3.2139761 = idf(docFreq=4669, maxDocs=42740)
                0.019138535 = queryNorm
              0.24856828 = fieldWeight in 1045, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.2139761 = idf(docFreq=4669, maxDocs=42740)
                0.0546875 = fieldNorm(doc=1045)
          0.06350318 = weight(abstract_txt:blogs in 1045) [ClassicSimilarity], result of:
            0.06350318 = score(doc=1045,freq=1.0), product of:
              0.15893406 = queryWeight, product of:
                1.1366266 = boost
                7.306182 = idf(docFreq=77, maxDocs=42740)
                0.019138535 = queryNorm
              0.39955682 = fieldWeight in 1045, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.306182 = idf(docFreq=77, maxDocs=42740)
                0.0546875 = fieldNorm(doc=1045)
          0.032845523 = weight(abstract_txt:different in 1045) [ClassicSimilarity], result of:
            0.032845523 = score(doc=1045,freq=4.0), product of:
              0.081281945 = queryWeight, product of:
                1.1495328 = boost
                3.694571 = idf(docFreq=2887, maxDocs=42740)
                0.019138535 = queryNorm
              0.4040937 = fieldWeight in 1045, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.694571 = idf(docFreq=2887, maxDocs=42740)
                0.0546875 = fieldNorm(doc=1045)
          0.0953766 = weight(abstract_txt:movie in 1045) [ClassicSimilarity], result of:
            0.0953766 = score(doc=1045,freq=1.0), product of:
              0.20844007 = queryWeight, product of:
                1.3016671 = boost
                8.367054 = idf(docFreq=26, maxDocs=42740)
                0.019138535 = queryNorm
              0.45757326 = fieldWeight in 1045, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.367054 = idf(docFreq=26, maxDocs=42740)
                0.0546875 = fieldNorm(doc=1045)
          0.025976565 = weight(abstract_txt:available in 1045) [ClassicSimilarity], result of:
            0.025976565 = score(doc=1045,freq=1.0), product of:
              0.1103446 = queryWeight, product of:
                1.3393679 = boost
                4.3046966 = idf(docFreq=1568, maxDocs=42740)
                0.019138535 = queryNorm
              0.23541309 = fieldWeight in 1045, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3046966 = idf(docFreq=1568, maxDocs=42740)
                0.0546875 = fieldNorm(doc=1045)
          0.027314806 = weight(abstract_txt:present in 1045) [ClassicSimilarity], result of:
            0.027314806 = score(doc=1045,freq=1.0), product of:
              0.11410256 = queryWeight, product of:
                1.361984 = boost
                4.377384 = idf(docFreq=1458, maxDocs=42740)
                0.019138535 = queryNorm
              0.2393882 = fieldWeight in 1045, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.377384 = idf(docFreq=1458, maxDocs=42740)
                0.0546875 = fieldNorm(doc=1045)
          0.013429735 = weight(abstract_txt:this in 1045) [ClassicSimilarity], result of:
            0.013429735 = score(doc=1045,freq=2.0), product of:
              0.071079046 = queryWeight, product of:
                1.5202328 = boost
                2.442996 = idf(docFreq=10095, maxDocs=42740)
                0.019138535 = queryNorm
              0.18894084 = fieldWeight in 1045, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.442996 = idf(docFreq=10095, maxDocs=42740)
                0.0546875 = fieldNorm(doc=1045)
          0.07789668 = weight(abstract_txt:mining in 1045) [ClassicSimilarity], result of:
            0.07789668 = score(doc=1045,freq=1.0), product of:
              0.2294612 = queryWeight, product of:
                1.9314299 = boost
                6.2075696 = idf(docFreq=233, maxDocs=42740)
                0.019138535 = queryNorm
              0.33947647 = fieldWeight in 1045, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2075696 = idf(docFreq=233, maxDocs=42740)
                0.0546875 = fieldNorm(doc=1045)
          0.38073564 = weight(abstract_txt:opinion in 1045) [ClassicSimilarity], result of:
            0.38073564 = score(doc=1045,freq=12.0), product of:
              0.2886591 = queryWeight, product of:
                2.1662917 = boost
                6.96241 = idf(docFreq=109, maxDocs=42740)
                0.019138535 = queryNorm
              1.3189802 = fieldWeight in 1045, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                6.96241 = idf(docFreq=109, maxDocs=42740)
                0.0546875 = fieldNorm(doc=1045)
        0.36 = coord(9/25)
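
    Note on reading these trees: each term line is the product of a queryWeight (boost x idf x queryNorm) and a fieldWeight (tf x idf x fieldNorm). The term scores are summed and then scaled by coord (matched query terms / total query terms). The sketch below recomputes two of the values from entry 4 under that reading. It assumes the ClassicSimilarity conventions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which agree with the figures printed above; the function names are illustrative and not part of Lucene. Lucene works in 32-bit floats, so the last digits may differ slightly.

```python
import math

def idf(doc_freq, max_docs):
    # Assumed ClassicSimilarity form: 1 + ln(maxDocs / (docFreq + 1))
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    # queryWeight = boost * idf * queryNorm
    # fieldWeight = sqrt(freq) * idf * fieldNorm
    i = idf(doc_freq, max_docs)
    query_weight = boost * i * query_norm
    field_weight = math.sqrt(freq) * i * field_norm
    return query_weight * field_weight

def coord(matched, total):
    # Fraction of query terms that matched the document
    return matched / total

# Values copied from the explain tree for doc 1045 (entry 4)
QUERY_NORM = 0.019138535
MAX_DOCS = 42740

# abstract_txt:research (freq=2, docFreq=4669, no boost shown)
research = term_score(2.0, 4669, MAX_DOCS, QUERY_NORM, 0.0546875)

# abstract_txt:opinion (freq=12, docFreq=109, boost=2.1662917)
opinion = term_score(12.0, 109, MAX_DOCS, QUERY_NORM, 0.0546875,
                     boost=2.1662917)

# Final score: sum of the nine term scores (taken from the tree) times coord
final = 0.73236835 * coord(9, 25)

print(research, opinion, final)
```

    The three printed values should land close to the 0.015289634, 0.38073564 and 0.26365262 shown in the tree for entry 4.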
    
  5. Sleem-Amer, M.; Bigorgne, I.; Brizard, S.; Santos, L.D.P.D.; Bouhairi, Y. El; Goujon, B.; Lorin, S.; Martineau, C.; Rigouste, L.; Varga, L.: Intelligent semantic search engines for opinion and sentiment mining (2012) 0.25
    0.24711825 = sum of:
      0.24711825 = product of:
        0.6864396 = sum of:
          0.017473867 = weight(abstract_txt:research in 2101) [ClassicSimilarity], result of:
            0.017473867 = score(doc=2101,freq=2.0), product of:
              0.061510798 = queryWeight, product of:
                3.2139761 = idf(docFreq=4669, maxDocs=42740)
                0.019138535 = queryNorm
              0.28407803 = fieldWeight in 2101, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.2139761 = idf(docFreq=4669, maxDocs=42740)
                0.0625 = fieldNorm(doc=2101)
          0.09251759 = weight(abstract_txt:opinions in 2101) [ClassicSimilarity], result of:
            0.09251759 = score(doc=2101,freq=2.0), product of:
              0.14830811 = queryWeight, product of:
                1.0979733 = boost
                7.05772 = idf(docFreq=99, maxDocs=42740)
                0.019138535 = queryNorm
              0.6238202 = fieldWeight in 2101, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.05772 = idf(docFreq=99, maxDocs=42740)
                0.0625 = fieldNorm(doc=2101)
          0.07257507 = weight(abstract_txt:blogs in 2101) [ClassicSimilarity], result of:
            0.07257507 = score(doc=2101,freq=1.0), product of:
              0.15893406 = queryWeight, product of:
                1.1366266 = boost
                7.306182 = idf(docFreq=77, maxDocs=42740)
                0.019138535 = queryNorm
              0.45663637 = fieldWeight in 2101, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.306182 = idf(docFreq=77, maxDocs=42740)
                0.0625 = fieldNorm(doc=2101)
          0.11892702 = weight(abstract_txt:sentiment in 2101) [ClassicSimilarity], result of:
            0.11892702 = score(doc=2101,freq=2.0), product of:
              0.17533517 = queryWeight, product of:
                1.1938337 = boost
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.019138535 = queryNorm
              0.6782839 = fieldWeight in 2101, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.0625 = fieldNorm(doc=2101)
          0.028343396 = weight(abstract_txt:related in 2101) [ClassicSimilarity], result of:
            0.028343396 = score(doc=2101,freq=1.0), product of:
              0.10698838 = queryWeight, product of:
                1.3188416 = boost
                4.238725 = idf(docFreq=1675, maxDocs=42740)
                0.019138535 = queryNorm
              0.26492032 = fieldWeight in 2101, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.238725 = idf(docFreq=1675, maxDocs=42740)
                0.0625 = fieldNorm(doc=2101)
          0.031216921 = weight(abstract_txt:present in 2101) [ClassicSimilarity], result of:
            0.031216921 = score(doc=2101,freq=1.0), product of:
              0.11410256 = queryWeight, product of:
                1.361984 = boost
                4.377384 = idf(docFreq=1458, maxDocs=42740)
                0.019138535 = queryNorm
              0.2735865 = fieldWeight in 2101, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.377384 = idf(docFreq=1458, maxDocs=42740)
                0.0625 = fieldNorm(doc=2101)
          0.018797712 = weight(abstract_txt:this in 2101) [ClassicSimilarity], result of:
            0.018797712 = score(doc=2101,freq=3.0), product of:
              0.071079046 = queryWeight, product of:
                1.5202328 = boost
                2.442996 = idf(docFreq=10095, maxDocs=42740)
                0.019138535 = queryNorm
              0.26446208 = fieldWeight in 2101, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.442996 = idf(docFreq=10095, maxDocs=42740)
                0.0625 = fieldNorm(doc=2101)
          0.08902477 = weight(abstract_txt:mining in 2101) [ClassicSimilarity], result of:
            0.08902477 = score(doc=2101,freq=1.0), product of:
              0.2294612 = queryWeight, product of:
                1.9314299 = boost
                6.2075696 = idf(docFreq=233, maxDocs=42740)
                0.019138535 = queryNorm
              0.3879731 = fieldWeight in 2101, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2075696 = idf(docFreq=233, maxDocs=42740)
                0.0625 = fieldNorm(doc=2101)
          0.21756323 = weight(abstract_txt:opinion in 2101) [ClassicSimilarity], result of:
            0.21756323 = score(doc=2101,freq=3.0), product of:
              0.2886591 = queryWeight, product of:
                2.1662917 = boost
                6.96241 = idf(docFreq=109, maxDocs=42740)
                0.019138535 = queryNorm
              0.753703 = fieldWeight in 2101, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.96241 = idf(docFreq=109, maxDocs=42740)
                0.0625 = fieldNorm(doc=2101)
        0.36 = coord(9/25)