Document (#36106)

Author
Miao, Q.
Li, Q.
Zeng, D.
Title
Fine-grained opinion mining by integrating multiple review sources
Source
Journal of the American Society for Information Science and Technology. 61(2010) no.11, S.2288-2299
Year
2010
Abstract
With the rapid development of Web 2.0, online reviews have become extremely valuable sources for mining customers' opinions. Fine-grained opinion mining has attracted more and more attention of both applied and theoretical research. In this article, the authors study how to automatically mine product features and opinions from multiple review sources. Specifically, they propose an integration strategy to solve the issue. Within the integration strategy, the authors mine domain knowledge from semistructured reviews and then exploit the domain knowledge to assist product feature extraction and sentiment orientation identification from unstructured reviews. Finally, feature-opinion tuples are generated. Experimental results on real-world datasets show that the proposed approach is effective.
Theme
Data Mining

Similar documents (author)

  1. Zeng, L.: ¬An introduction to thesauri and classification systems in the People's Republic of China (1986) 4.74
    4.740845 = sum of:
      4.740845 = weight(author_txt:zeng in 1731) [ClassicSimilarity], result of:
        4.740845 = score(doc=1731,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.585353 = idf(docFreq=58, maxDocs=42740)
            0.13183302 = queryNorm
          4.7408457 = fieldWeight in 1731, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.585353 = idf(docFreq=58, maxDocs=42740)
            0.625 = fieldNorm(doc=1731)
    
  2. Zeng, L.: Achieving compatibility of indexing languages in online access environment (1992) 4.74
    4.740845 = sum of:
      4.740845 = weight(author_txt:zeng in 1353) [ClassicSimilarity], result of:
        4.740845 = score(doc=1353,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.585353 = idf(docFreq=58, maxDocs=42740)
            0.13183302 = queryNorm
          4.7408457 = fieldWeight in 1353, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.585353 = idf(docFreq=58, maxDocs=42740)
            0.625 = fieldNorm(doc=1353)
    
  3. Zeng, L.: Automatic indexing for Chinese text : problems and progress (1992) 4.74
    4.740845 = sum of:
      4.740845 = weight(author_txt:zeng in 1358) [ClassicSimilarity], result of:
        4.740845 = score(doc=1358,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.585353 = idf(docFreq=58, maxDocs=42740)
            0.13183302 = queryNorm
          4.7408457 = fieldWeight in 1358, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.585353 = idf(docFreq=58, maxDocs=42740)
            0.625 = fieldNorm(doc=1358)
    
  4. Zeng, M.L.: Towards a unified medical langugae in a diverse cultural environment (1996) 4.74
    4.740845 = sum of:
      4.740845 = weight(author_txt:zeng in 5225) [ClassicSimilarity], result of:
        4.740845 = score(doc=5225,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.585353 = idf(docFreq=58, maxDocs=42740)
            0.13183302 = queryNorm
          4.7408457 = fieldWeight in 5225, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.585353 = idf(docFreq=58, maxDocs=42740)
            0.625 = fieldNorm(doc=5225)
    
  5. Zeng, M.L.: Developing control mechanisms for discipline-based virtual libraries : a study of the process (1995) 4.74
    4.740845 = sum of:
      4.740845 = weight(author_txt:zeng in 6906) [ClassicSimilarity], result of:
        4.740845 = score(doc=6906,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.585353 = idf(docFreq=58, maxDocs=42740)
            0.13183302 = queryNorm
          4.7408457 = fieldWeight in 6906, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.585353 = idf(docFreq=58, maxDocs=42740)
            0.625 = fieldNorm(doc=6906)
    

Similar documents (content)

  1. Varathan, K.D.; Giachanou, A.; Crestani, F.: Comparative opinion mining : a review (2017) 0.25
    0.25475892 = sum of:
      0.25475892 = product of:
        1.0614955 = sum of:
          0.07086674 = weight(abstract_txt:sentiment in 5541) [ClassicSimilarity], result of:
            0.07086674 = score(doc=5541,freq=1.0), product of:
              0.14775628 = queryWeight, product of:
                1.0689417 = boost
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.01801256 = queryNorm
              0.47961915 = fieldWeight in 5541, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.0625 = fieldNorm(doc=5541)
          0.017672207 = weight(abstract_txt:from in 5541) [ClassicSimilarity], result of:
            0.017672207 = score(doc=5541,freq=3.0), product of:
              0.058538944 = queryWeight, product of:
                1.1653707 = boost
                2.7887225 = idf(docFreq=7144, maxDocs=42740)
                0.01801256 = queryNorm
              0.30188805 = fieldWeight in 5541, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.7887225 = idf(docFreq=7144, maxDocs=42740)
                0.0625 = fieldNorm(doc=5541)
          0.051857788 = weight(abstract_txt:review in 5541) [ClassicSimilarity], result of:
            0.051857788 = score(doc=5541,freq=2.0), product of:
              0.119985014 = queryWeight, product of:
                1.3622586 = boost
                4.88981 = idf(docFreq=873, maxDocs=42740)
                0.01801256 = queryNorm
              0.43220222 = fieldWeight in 5541, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.88981 = idf(docFreq=873, maxDocs=42740)
                0.0625 = fieldNorm(doc=5541)
          0.056889992 = weight(abstract_txt:reviews in 5541) [ClassicSimilarity], result of:
            0.056889992 = score(doc=5541,freq=1.0), product of:
              0.18406957 = queryWeight, product of:
                2.066487 = boost
                4.945086 = idf(docFreq=826, maxDocs=42740)
                0.01801256 = queryNorm
              0.30906788 = fieldWeight in 5541, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.945086 = idf(docFreq=826, maxDocs=42740)
                0.0625 = fieldNorm(doc=5541)
          0.3375984 = weight(abstract_txt:mining in 5541) [ClassicSimilarity], result of:
            0.3375984 = score(doc=5541,freq=9.0), product of:
              0.2900531 = queryWeight, product of:
                2.5940623 = boost
                6.2075696 = idf(docFreq=233, maxDocs=42740)
                0.01801256 = queryNorm
              1.1639193 = fieldWeight in 5541, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                6.2075696 = idf(docFreq=233, maxDocs=42740)
                0.0625 = fieldNorm(doc=5541)
          0.5266104 = weight(abstract_txt:opinion in 5541) [ClassicSimilarity], result of:
            0.5266104 = score(doc=5541,freq=11.0), product of:
              0.36488286 = queryWeight, product of:
                2.9095004 = boost
                6.96241 = idf(docFreq=109, maxDocs=42740)
                0.01801256 = queryNorm
              1.4432313 = fieldWeight in 5541, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                6.96241 = idf(docFreq=109, maxDocs=42740)
                0.0625 = fieldNorm(doc=5541)
        0.24 = coord(6/25)
    
  2. Huang, H.-H.; Wang, J.-J.; Chen, H.-H.: Implicit opinion analysis : extraction and polarity labelling (2017) 0.23
    0.2304243 = sum of:
      0.2304243 = product of:
        0.96010125 = sum of:
          0.021757634 = weight(abstract_txt:knowledge in 5821) [ClassicSimilarity], result of:
            0.021757634 = score(doc=5821,freq=1.0), product of:
              0.06465586 = queryWeight, product of:
                3.5894876 = idf(docFreq=3207, maxDocs=42740)
                0.01801256 = queryNorm
              0.33651447 = fieldWeight in 5821, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5894876 = idf(docFreq=3207, maxDocs=42740)
                0.09375 = fieldNorm(doc=5821)
          0.10630011 = weight(abstract_txt:sentiment in 5821) [ClassicSimilarity], result of:
            0.10630011 = score(doc=5821,freq=1.0), product of:
              0.14775628 = queryWeight, product of:
                1.0689417 = boost
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.01801256 = queryNorm
              0.7194287 = fieldWeight in 5821, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.09375 = fieldNorm(doc=5821)
          0.16538937 = weight(abstract_txt:opinions in 5821) [ClassicSimilarity], result of:
            0.16538937 = score(doc=5821,freq=1.0), product of:
              0.24996078 = queryWeight, product of:
                1.9662194 = boost
                7.05772 = idf(docFreq=99, maxDocs=42740)
                0.01801256 = queryNorm
              0.66166127 = fieldWeight in 5821, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.05772 = idf(docFreq=99, maxDocs=42740)
                0.09375 = fieldNorm(doc=5821)
          0.085334994 = weight(abstract_txt:reviews in 5821) [ClassicSimilarity], result of:
            0.085334994 = score(doc=5821,freq=1.0), product of:
              0.18406957 = queryWeight, product of:
                2.066487 = boost
                4.945086 = idf(docFreq=826, maxDocs=42740)
                0.01801256 = queryNorm
              0.46360183 = fieldWeight in 5821, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.945086 = idf(docFreq=826, maxDocs=42740)
                0.09375 = fieldNorm(doc=5821)
          0.1687992 = weight(abstract_txt:mining in 5821) [ClassicSimilarity], result of:
            0.1687992 = score(doc=5821,freq=1.0), product of:
              0.2900531 = queryWeight, product of:
                2.5940623 = boost
                6.2075696 = idf(docFreq=233, maxDocs=42740)
                0.01801256 = queryNorm
              0.58195966 = fieldWeight in 5821, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2075696 = idf(docFreq=233, maxDocs=42740)
                0.09375 = fieldNorm(doc=5821)
          0.41251993 = weight(abstract_txt:opinion in 5821) [ClassicSimilarity], result of:
            0.41251993 = score(doc=5821,freq=3.0), product of:
              0.36488286 = queryWeight, product of:
                2.9095004 = boost
                6.96241 = idf(docFreq=109, maxDocs=42740)
                0.01801256 = queryNorm
              1.1305544 = fieldWeight in 5821, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.96241 = idf(docFreq=109, maxDocs=42740)
                0.09375 = fieldNorm(doc=5821)
        0.24 = coord(6/25)
    
  3. Ku, L.-W.; Chen, H.-H.: Mining opinions from the Web : beyond relevance retrieval (2007) 0.23
    0.2304003 = sum of:
      0.2304003 = product of:
        0.96000123 = sum of:
          0.06844164 = weight(abstract_txt:customers in 2606) [ClassicSimilarity], result of:
            0.06844164 = score(doc=2606,freq=1.0), product of:
              0.14436589 = queryWeight, product of:
                1.0566067 = boost
                7.585353 = idf(docFreq=58, maxDocs=42740)
                0.01801256 = queryNorm
              0.47408456 = fieldWeight in 2606, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.585353 = idf(docFreq=58, maxDocs=42740)
                0.0625 = fieldNorm(doc=2606)
          0.1002207 = weight(abstract_txt:sentiment in 2606) [ClassicSimilarity], result of:
            0.1002207 = score(doc=2606,freq=2.0), product of:
              0.14775628 = queryWeight, product of:
                1.0689417 = boost
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.01801256 = queryNorm
              0.6782839 = fieldWeight in 2606, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.0625 = fieldNorm(doc=2606)
          0.020406108 = weight(abstract_txt:from in 2606) [ClassicSimilarity], result of:
            0.020406108 = score(doc=2606,freq=4.0), product of:
              0.058538944 = queryWeight, product of:
                1.1653707 = boost
                2.7887225 = idf(docFreq=7144, maxDocs=42740)
                0.01801256 = queryNorm
              0.3485903 = fieldWeight in 2606, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.7887225 = idf(docFreq=7144, maxDocs=42740)
                0.0625 = fieldNorm(doc=2606)
          0.15593058 = weight(abstract_txt:opinions in 2606) [ClassicSimilarity], result of:
            0.15593058 = score(doc=2606,freq=2.0), product of:
              0.24996078 = queryWeight, product of:
                1.9662194 = boost
                7.05772 = idf(docFreq=99, maxDocs=42740)
                0.01801256 = queryNorm
              0.6238202 = fieldWeight in 2606, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.05772 = idf(docFreq=99, maxDocs=42740)
                0.0625 = fieldNorm(doc=2606)
          0.19491252 = weight(abstract_txt:mining in 2606) [ClassicSimilarity], result of:
            0.19491252 = score(doc=2606,freq=3.0), product of:
              0.2900531 = queryWeight, product of:
                2.5940623 = boost
                6.2075696 = idf(docFreq=233, maxDocs=42740)
                0.01801256 = queryNorm
              0.6719891 = fieldWeight in 2606, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.2075696 = idf(docFreq=233, maxDocs=42740)
                0.0625 = fieldNorm(doc=2606)
          0.42008975 = weight(abstract_txt:opinion in 2606) [ClassicSimilarity], result of:
            0.42008975 = score(doc=2606,freq=7.0), product of:
              0.36488286 = queryWeight, product of:
                2.9095004 = boost
                6.96241 = idf(docFreq=109, maxDocs=42740)
                0.01801256 = queryNorm
              1.1513003 = fieldWeight in 2606, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.96241 = idf(docFreq=109, maxDocs=42740)
                0.0625 = fieldNorm(doc=2606)
        0.24 = coord(6/25)
    
  4. Sleem-Amer, M.; Bigorgne, I.; Brizard, S.; Santos, L.D.P.D.; Bouhairi, Y. El; Goujon, B.; Lorin, S.; Martineau, C.; Rigouste, L.; Varga, L.: Intelligent semantic search engines for opinion and sentiment mining (2012) 0.23
    0.22817872 = sum of:
      0.22817872 = product of:
        0.814924 = sum of:
          0.058180004 = weight(abstract_txt:unstructured in 2101) [ClassicSimilarity], result of:
            0.058180004 = score(doc=2101,freq=1.0), product of:
              0.12954883 = queryWeight, product of:
                1.0009164 = boost
                7.1855536 = idf(docFreq=87, maxDocs=42740)
                0.01801256 = queryNorm
              0.4490971 = fieldWeight in 2101, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1855536 = idf(docFreq=87, maxDocs=42740)
                0.0625 = fieldNorm(doc=2101)
          0.1002207 = weight(abstract_txt:sentiment in 2101) [ClassicSimilarity], result of:
            0.1002207 = score(doc=2101,freq=2.0), product of:
              0.14775628 = queryWeight, product of:
                1.0689417 = boost
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.01801256 = queryNorm
              0.6782839 = fieldWeight in 2101, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.0625 = fieldNorm(doc=2101)
          0.045304075 = weight(abstract_txt:authors in 2101) [ClassicSimilarity], result of:
            0.045304075 = score(doc=2101,freq=2.0), product of:
              0.109650135 = queryWeight, product of:
                1.3022687 = boost
                4.6744776 = idf(docFreq=1083, maxDocs=42740)
                0.01801256 = queryNorm
              0.41316935 = fieldWeight in 2101, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6744776 = idf(docFreq=1083, maxDocs=42740)
                0.0625 = fieldNorm(doc=2101)
          0.06774255 = weight(abstract_txt:product in 2101) [ClassicSimilarity], result of:
            0.06774255 = score(doc=2101,freq=1.0), product of:
              0.18064891 = queryWeight, product of:
                1.6715282 = boost
                5.99993 = idf(docFreq=287, maxDocs=42740)
                0.01801256 = queryNorm
              0.37499562 = fieldWeight in 2101, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.99993 = idf(docFreq=287, maxDocs=42740)
                0.0625 = fieldNorm(doc=2101)
          0.15593058 = weight(abstract_txt:opinions in 2101) [ClassicSimilarity], result of:
            0.15593058 = score(doc=2101,freq=2.0), product of:
              0.24996078 = queryWeight, product of:
                1.9662194 = boost
                7.05772 = idf(docFreq=99, maxDocs=42740)
                0.01801256 = queryNorm
              0.6238202 = fieldWeight in 2101, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.05772 = idf(docFreq=99, maxDocs=42740)
                0.0625 = fieldNorm(doc=2101)
          0.1125328 = weight(abstract_txt:mining in 2101) [ClassicSimilarity], result of:
            0.1125328 = score(doc=2101,freq=1.0), product of:
              0.2900531 = queryWeight, product of:
                2.5940623 = boost
                6.2075696 = idf(docFreq=233, maxDocs=42740)
                0.01801256 = queryNorm
              0.3879731 = fieldWeight in 2101, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2075696 = idf(docFreq=233, maxDocs=42740)
                0.0625 = fieldNorm(doc=2101)
          0.2750133 = weight(abstract_txt:opinion in 2101) [ClassicSimilarity], result of:
            0.2750133 = score(doc=2101,freq=3.0), product of:
              0.36488286 = queryWeight, product of:
                2.9095004 = boost
                6.96241 = idf(docFreq=109, maxDocs=42740)
                0.01801256 = queryNorm
              0.753703 = fieldWeight in 2101, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.96241 = idf(docFreq=109, maxDocs=42740)
                0.0625 = fieldNorm(doc=2101)
        0.28 = coord(7/25)
    
  5. Pang, B.; Lee, L.: Opinion mining and sentiment analysis (2008) 0.22
    0.22091153 = sum of:
      0.22091153 = product of:
        0.92046475 = sum of:
          0.13865499 = weight(abstract_txt:sentiment in 3172) [ClassicSimilarity], result of:
            0.13865499 = score(doc=3172,freq=5.0), product of:
              0.14775628 = queryWeight, product of:
                1.0689417 = boost
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.01801256 = queryNorm
              0.93840337 = fieldWeight in 3172, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.0546875 = fieldNorm(doc=3172)
          0.032085374 = weight(abstract_txt:review in 3172) [ClassicSimilarity], result of:
            0.032085374 = score(doc=3172,freq=1.0), product of:
              0.119985014 = queryWeight, product of:
                1.3622586 = boost
                4.88981 = idf(docFreq=873, maxDocs=42740)
                0.01801256 = queryNorm
              0.2674115 = fieldWeight in 3172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.88981 = idf(docFreq=873, maxDocs=42740)
                0.0546875 = fieldNorm(doc=3172)
          0.13643925 = weight(abstract_txt:opinions in 3172) [ClassicSimilarity], result of:
            0.13643925 = score(doc=3172,freq=2.0), product of:
              0.24996078 = queryWeight, product of:
                1.9662194 = boost
                7.05772 = idf(docFreq=99, maxDocs=42740)
                0.01801256 = queryNorm
              0.54584265 = fieldWeight in 3172, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.05772 = idf(docFreq=99, maxDocs=42740)
                0.0546875 = fieldNorm(doc=3172)
          0.04977874 = weight(abstract_txt:reviews in 3172) [ClassicSimilarity], result of:
            0.04977874 = score(doc=3172,freq=1.0), product of:
              0.18406957 = queryWeight, product of:
                2.066487 = boost
                4.945086 = idf(docFreq=826, maxDocs=42740)
                0.01801256 = queryNorm
              0.27043438 = fieldWeight in 3172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.945086 = idf(docFreq=826, maxDocs=42740)
                0.0546875 = fieldNorm(doc=3172)
          0.17054845 = weight(abstract_txt:mining in 3172) [ClassicSimilarity], result of:
            0.17054845 = score(doc=3172,freq=3.0), product of:
              0.2900531 = queryWeight, product of:
                2.5940623 = boost
                6.2075696 = idf(docFreq=233, maxDocs=42740)
                0.01801256 = queryNorm
              0.58799046 = fieldWeight in 3172, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.2075696 = idf(docFreq=233, maxDocs=42740)
                0.0546875 = fieldNorm(doc=3172)
          0.39295796 = weight(abstract_txt:opinion in 3172) [ClassicSimilarity], result of:
            0.39295796 = score(doc=3172,freq=8.0), product of:
              0.36488286 = queryWeight, product of:
                2.9095004 = boost
                6.96241 = idf(docFreq=109, maxDocs=42740)
                0.01801256 = queryNorm
              1.0769428 = fieldWeight in 3172, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                6.96241 = idf(docFreq=109, maxDocs=42740)
                0.0546875 = fieldNorm(doc=3172)
        0.24 = coord(6/25)