Document (#42877)

Author
Wei, W.
Liu, Y.-P.
Wei, L.-R.
Title
Feature-level sentiment analysis based on rules and fine-grained domain ontology
Source
Knowledge organization. 47(2020) no.2, pp.105-121
Year
2020
Abstract
Mining product reviews and sentiment analysis are of great significance, whether for academic research purposes or optimizing business strategies. We propose a feature-level sentiment analysis framework based on rules parsing and fine-grained domain ontology for Chinese reviews. Fine-grained ontology is used to describe synonymous expressions of product features, which are reflected in word changes in online reviews. First, a semiautomatic construction method is developed by using Word2Vec for fine-grained ontology. Then, feature-level sentiment analysis that combines rules parsing and the fine-grained domain ontology is conducted to extract explicit and implicit features from product reviews. Finally, the domain sentiment dictionary and context sentiment dictionary are established to identify sentiment polarities for the extracted feature-sentiment combinations. An experiment is conducted on the basis of product reviews crawled from Chinese e-commerce websites. The results demonstrate the effectiveness of our approach.
Content
DOI:10.5771/0943-7444-2020-2-105.
Theme
Knowledge representation (Wissensrepräsentation)
Field
Economics (Wirtschaftswissenschaften)

Similar documents (content)

  1. Miao, Q.; Li, Q.; Zeng, D.: Fine-grained opinion mining by integrating multiple review sources (2010) 0.48
    0.47655204 = sum of:
      0.47655204 = product of:
        1.4892251 = sum of:
          0.023880973 = weight(abstract_txt:features in 4104) [ClassicSimilarity], result of:
            0.023880973 = score(doc=4104,freq=1.0), product of:
              0.056118503 = queryWeight, product of:
                1.3366933 = boost
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.009249103 = queryNorm
              0.42554545 = fieldWeight in 4104, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.09375 = fieldNorm(doc=4104)
          0.111294426 = weight(abstract_txt:feature in 4104) [ClassicSimilarity], result of:
            0.111294426 = score(doc=4104,freq=2.0), product of:
              0.14225714 = queryWeight, product of:
                2.606522 = boost
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.009249103 = queryNorm
              0.78234684 = fieldWeight in 4104, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.09375 = fieldNorm(doc=4104)
          0.07670005 = weight(abstract_txt:domain in 4104) [ClassicSimilarity], result of:
            0.07670005 = score(doc=4104,freq=2.0), product of:
              0.12216172 = queryWeight, product of:
                2.789082 = boost
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.009249103 = queryNorm
              0.6278566 = fieldWeight in 4104, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.09375 = fieldNorm(doc=4104)
          0.15494703 = weight(abstract_txt:product in 4104) [ClassicSimilarity], result of:
            0.15494703 = score(doc=4104,freq=2.0), product of:
              0.19522116 = queryWeight, product of:
                3.525796 = boost
                5.98646 = idf(docFreq=301, maxDocs=44218)
                0.009249103 = queryNorm
              0.7937 = fieldWeight in 4104, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.98646 = idf(docFreq=301, maxDocs=44218)
                0.09375 = fieldNorm(doc=4104)
          0.13433474 = weight(abstract_txt:reviews in 4104) [ClassicSimilarity], result of:
            0.13433474 = score(doc=4104,freq=3.0), product of:
              0.16703334 = queryWeight, product of:
                3.6462827 = boost
                4.952828 = idf(docFreq=848, maxDocs=44218)
                0.009249103 = queryNorm
              0.80423903 = fieldWeight in 4104, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.952828 = idf(docFreq=848, maxDocs=44218)
                0.09375 = fieldNorm(doc=4104)
          0.2368834 = weight(abstract_txt:fine in 4104) [ClassicSimilarity], result of:
            0.2368834 = score(doc=4104,freq=1.0), product of:
              0.35162073 = queryWeight, product of:
                5.290369 = boost
                7.1860275 = idf(docFreq=90, maxDocs=44218)
                0.009249103 = queryNorm
              0.6736901 = fieldWeight in 4104, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1860275 = idf(docFreq=90, maxDocs=44218)
                0.09375 = fieldNorm(doc=4104)
          0.310958 = weight(abstract_txt:grained in 4104) [ClassicSimilarity], result of:
            0.310958 = score(doc=4104,freq=1.0), product of:
              0.42155337 = queryWeight, product of:
                5.7926197 = boost
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.009249103 = queryNorm
              0.737648 = fieldWeight in 4104, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.09375 = fieldNorm(doc=4104)
          0.4402265 = weight(abstract_txt:sentiment in 4104) [ClassicSimilarity], result of:
            0.4402265 = score(doc=4104,freq=1.0), product of:
              0.6216446 = queryWeight, product of:
                8.897742 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.009249103 = queryNorm
              0.7081643 = fieldWeight in 4104, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.09375 = fieldNorm(doc=4104)
        0.32 = coord(8/25)
    
  2. Chen, Z.; Huang, Y.; Tian, J.; Liu, X.; Fu, K.; Huang, T.: Joint model for subsentence-level sentiment analysis with Markov logic (2015) 0.42
    0.42119956 = sum of:
      0.42119956 = product of:
        1.5042841 = sum of:
          0.021907724 = weight(abstract_txt:level in 2210) [ClassicSimilarity], result of:
            0.021907724 = score(doc=2210,freq=2.0), product of:
              0.0551045 = queryWeight, product of:
                1.324562 = boost
                4.497956 = idf(docFreq=1337, maxDocs=44218)
                0.009249103 = queryNorm
              0.39756688 = fieldWeight in 2210, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.497956 = idf(docFreq=1337, maxDocs=44218)
                0.0625 = fieldNorm(doc=2210)
          0.042631675 = weight(abstract_txt:chinese in 2210) [ClassicSimilarity], result of:
            0.042631675 = score(doc=2210,freq=1.0), product of:
              0.10821492 = queryWeight, product of:
                1.8561895 = boost
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.009249103 = queryNorm
              0.39395374 = fieldWeight in 2210, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.0625 = fieldNorm(doc=2210)
          0.040671088 = weight(abstract_txt:analysis in 2210) [ClassicSimilarity], result of:
            0.040671088 = score(doc=2210,freq=6.0), product of:
              0.0727136 = queryWeight, product of:
                2.1517982 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.009249103 = queryNorm
              0.5593326 = fieldWeight in 2210, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0625 = fieldNorm(doc=2210)
          0.052464694 = weight(abstract_txt:feature in 2210) [ClassicSimilarity], result of:
            0.052464694 = score(doc=2210,freq=1.0), product of:
              0.14225714 = queryWeight, product of:
                2.606522 = boost
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.009249103 = queryNorm
              0.36880183 = fieldWeight in 2210, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.0625 = fieldNorm(doc=2210)
          0.22333582 = weight(abstract_txt:fine in 2210) [ClassicSimilarity], result of:
            0.22333582 = score(doc=2210,freq=2.0), product of:
              0.35162073 = queryWeight, product of:
                5.290369 = boost
                7.1860275 = idf(docFreq=90, maxDocs=44218)
                0.009249103 = queryNorm
              0.6351611 = fieldWeight in 2210, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1860275 = idf(docFreq=90, maxDocs=44218)
                0.0625 = fieldNorm(doc=2210)
          0.29317403 = weight(abstract_txt:grained in 2210) [ClassicSimilarity], result of:
            0.29317403 = score(doc=2210,freq=2.0), product of:
              0.42155337 = queryWeight, product of:
                5.7926197 = boost
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.009249103 = queryNorm
              0.6954612 = fieldWeight in 2210, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.0625 = fieldNorm(doc=2210)
          0.8300991 = weight(abstract_txt:sentiment in 2210) [ClassicSimilarity], result of:
            0.8300991 = score(doc=2210,freq=8.0), product of:
              0.6216446 = queryWeight, product of:
                8.897742 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.009249103 = queryNorm
              1.3353274 = fieldWeight in 2210, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.0625 = fieldNorm(doc=2210)
        0.28 = coord(7/25)
    
  3. Ma, X.; Xue, P.; Matta, N.; Chen, Q.: Fine-grained ontology reconstruction for crisis knowledge based on integrated analysis of temporal-spatial factors (2021) 0.30
    0.29890078 = sum of:
      0.29890078 = product of:
        0.934065 = sum of:
          0.015491101 = weight(abstract_txt:level in 232) [ClassicSimilarity], result of:
            0.015491101 = score(doc=232,freq=1.0), product of:
              0.0551045 = queryWeight, product of:
                1.324562 = boost
                4.497956 = idf(docFreq=1337, maxDocs=44218)
                0.009249103 = queryNorm
              0.28112224 = fieldWeight in 232, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.497956 = idf(docFreq=1337, maxDocs=44218)
                0.0625 = fieldNorm(doc=232)
          0.022515198 = weight(abstract_txt:features in 232) [ClassicSimilarity], result of:
            0.022515198 = score(doc=232,freq=2.0), product of:
              0.056118503 = queryWeight, product of:
                1.3366933 = boost
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.009249103 = queryNorm
              0.4012081 = fieldWeight in 232, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.0625 = fieldNorm(doc=232)
          0.020138498 = weight(abstract_txt:conducted in 232) [ClassicSimilarity], result of:
            0.020138498 = score(doc=232,freq=1.0), product of:
              0.06563724 = queryWeight, product of:
                1.4456191 = boost
                4.9090424 = idf(docFreq=886, maxDocs=44218)
                0.009249103 = queryNorm
              0.30681515 = fieldWeight in 232, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9090424 = idf(docFreq=886, maxDocs=44218)
                0.0625 = fieldNorm(doc=232)
          0.0166039 = weight(abstract_txt:analysis in 232) [ClassicSimilarity], result of:
            0.0166039 = score(doc=232,freq=1.0), product of:
              0.0727136 = queryWeight, product of:
                2.1517982 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.009249103 = queryNorm
              0.22834657 = fieldWeight in 232, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0625 = fieldNorm(doc=232)
          0.03615675 = weight(abstract_txt:domain in 232) [ClassicSimilarity], result of:
            0.03615675 = score(doc=232,freq=1.0), product of:
              0.12216172 = queryWeight, product of:
                2.789082 = boost
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.009249103 = queryNorm
              0.29597446 = fieldWeight in 232, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.0625 = fieldNorm(doc=232)
          0.19056682 = weight(abstract_txt:ontology in 232) [ClassicSimilarity], result of:
            0.19056682 = score(doc=232,freq=7.0), product of:
              0.20834245 = queryWeight, product of:
                4.07228 = boost
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.009249103 = queryNorm
              0.9146807 = fieldWeight in 232, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.0625 = fieldNorm(doc=232)
          0.27352938 = weight(abstract_txt:fine in 232) [ClassicSimilarity], result of:
            0.27352938 = score(doc=232,freq=3.0), product of:
              0.35162073 = queryWeight, product of:
                5.290369 = boost
                7.1860275 = idf(docFreq=90, maxDocs=44218)
                0.009249103 = queryNorm
              0.7779103 = fieldWeight in 232, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.1860275 = idf(docFreq=90, maxDocs=44218)
                0.0625 = fieldNorm(doc=232)
          0.35906336 = weight(abstract_txt:grained in 232) [ClassicSimilarity], result of:
            0.35906336 = score(doc=232,freq=3.0), product of:
              0.42155337 = queryWeight, product of:
                5.7926197 = boost
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.009249103 = queryNorm
              0.85176253 = fieldWeight in 232, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.0625 = fieldNorm(doc=232)
        0.32 = coord(8/25)
    
  4. Sinha, A.; Kedas, S.; Kumar, R.; Malo, P.: SEntFiN 1.0 : Entity-aware sentiment analysis for financial news (2022) 0.27
    0.27040893 = sum of:
      0.27040893 = product of:
        1.1267039 = sum of:
          0.0166039 = weight(abstract_txt:analysis in 652) [ClassicSimilarity], result of:
            0.0166039 = score(doc=652,freq=1.0), product of:
              0.0727136 = queryWeight, product of:
                2.1517982 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.009249103 = queryNorm
              0.22834657 = fieldWeight in 652, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0625 = fieldNorm(doc=652)
          0.052464694 = weight(abstract_txt:feature in 652) [ClassicSimilarity], result of:
            0.052464694 = score(doc=652,freq=1.0), product of:
              0.14225714 = queryWeight, product of:
                2.606522 = boost
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.009249103 = queryNorm
              0.36880183 = fieldWeight in 652, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.0625 = fieldNorm(doc=652)
          0.03615675 = weight(abstract_txt:domain in 652) [ClassicSimilarity], result of:
            0.03615675 = score(doc=652,freq=1.0), product of:
              0.12216172 = queryWeight, product of:
                2.789082 = boost
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.009249103 = queryNorm
              0.29597446 = fieldWeight in 652, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.0625 = fieldNorm(doc=652)
          0.15792227 = weight(abstract_txt:fine in 652) [ClassicSimilarity], result of:
            0.15792227 = score(doc=652,freq=1.0), product of:
              0.35162073 = queryWeight, product of:
                5.290369 = boost
                7.1860275 = idf(docFreq=90, maxDocs=44218)
                0.009249103 = queryNorm
              0.44912672 = fieldWeight in 652, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1860275 = idf(docFreq=90, maxDocs=44218)
                0.0625 = fieldNorm(doc=652)
          0.20730534 = weight(abstract_txt:grained in 652) [ClassicSimilarity], result of:
            0.20730534 = score(doc=652,freq=1.0), product of:
              0.42155337 = queryWeight, product of:
                5.7926197 = boost
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.009249103 = queryNorm
              0.49176535 = fieldWeight in 652, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.0625 = fieldNorm(doc=652)
          0.6562509 = weight(abstract_txt:sentiment in 652) [ClassicSimilarity], result of:
            0.6562509 = score(doc=652,freq=5.0), product of:
              0.6216446 = queryWeight, product of:
                8.897742 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.009249103 = queryNorm
              1.055669 = fieldWeight in 652, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.0625 = fieldNorm(doc=652)
        0.24 = coord(6/25)
    
  5. Nguyen, T.T.; Tho Thanh Quan, T.T.; Tuoi Thi Phan, T.T.: Sentiment search : an emerging trend on social media monitoring systems (2014) 0.21
    0.2099576 = sum of:
      0.2099576 = product of:
        1.049788 = sum of:
          0.023481462 = weight(abstract_txt:analysis in 1625) [ClassicSimilarity], result of:
            0.023481462 = score(doc=1625,freq=2.0), product of:
              0.0727136 = queryWeight, product of:
                2.1517982 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.009249103 = queryNorm
              0.3229308 = fieldWeight in 1625, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0625 = fieldNorm(doc=1625)
          0.05186644 = weight(abstract_txt:rules in 1625) [ClassicSimilarity], result of:
            0.05186644 = score(doc=1625,freq=2.0), product of:
              0.112049595 = queryWeight, product of:
                2.313287 = boost
                5.236983 = idf(docFreq=638, maxDocs=44218)
                0.009249103 = queryNorm
              0.46288824 = fieldWeight in 1625, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.236983 = idf(docFreq=638, maxDocs=44218)
                0.0625 = fieldNorm(doc=1625)
          0.0723135 = weight(abstract_txt:domain in 1625) [ClassicSimilarity], result of:
            0.0723135 = score(doc=1625,freq=4.0), product of:
              0.12216172 = queryWeight, product of:
                2.789082 = boost
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.009249103 = queryNorm
              0.5919489 = fieldWeight in 1625, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.0625 = fieldNorm(doc=1625)
          0.07202749 = weight(abstract_txt:ontology in 1625) [ClassicSimilarity], result of:
            0.07202749 = score(doc=1625,freq=1.0), product of:
              0.20834245 = queryWeight, product of:
                4.07228 = boost
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.009249103 = queryNorm
              0.34571683 = fieldWeight in 1625, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.0625 = fieldNorm(doc=1625)
          0.8300991 = weight(abstract_txt:sentiment in 1625) [ClassicSimilarity], result of:
            0.8300991 = score(doc=1625,freq=8.0), product of:
              0.6216446 = queryWeight, product of:
                8.897742 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.009249103 = queryNorm
              1.3353274 = fieldWeight in 1625, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.0625 = fieldNorm(doc=1625)
        0.2 = coord(5/25)
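
Note on the score traces: each trace above follows the classic Lucene TF-IDF pattern, which can be read off the numbers themselves: tf is the square root of the term frequency, idf is 1 + ln(maxDocs / (docFreq + 1)), a term's weight is queryWeight (boost × idf × queryNorm) times fieldWeight (tf × idf × fieldNorm), and the document score is the sum of the term weights times coord (matched terms / query terms). The Python sketch below reproduces that arithmetic. The function names are illustrative and are not part of any Lucene API, and because the traces were computed in single precision, the results should agree only to about six significant digits.

```python
import math


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as shown in the traces: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def tf(freq: float) -> float:
    """Term-frequency factor as shown in the traces: sqrt(termFreq)."""
    return math.sqrt(freq)


def term_weight(freq: float, doc_freq: int, max_docs: int,
                boost: float, query_norm: float, field_norm: float) -> float:
    """Weight of one query term in one document: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = tf(freq) * term_idf * field_norm
    return query_weight * field_weight


def document_score(term_weights: list[float], matched: int, total_terms: int) -> float:
    """Final score: sum of the term weights scaled by coord = matched / total_terms."""
    return sum(term_weights) * (matched / total_terms)


# Values copied from the trace for entry 1 (doc=4104), term "features";
# the trace reports this weight as 0.023880973.
features_weight = term_weight(
    freq=1.0, doc_freq=1283, max_docs=44218,
    boost=1.3366933, query_norm=0.009249103, field_norm=0.09375,
)
print(features_weight)

# The eight term weights listed for entry 1, combined with coord(8/25);
# the trace reports the final score as 0.47655204.
entry1_weights = [
    0.023880973, 0.111294426, 0.07670005, 0.15494703,
    0.13433474, 0.2368834, 0.310958, 0.4402265,
]
print(document_score(entry1_weights, matched=8, total_terms=25))
```

Read this way, the ranking is easy to interpret: entry 2 has the largest raw sum (1.5042841) but matches only 7 of the 25 query terms, so its coord factor of 0.28 places it below entry 1, which matches 8 terms.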