Document (#37494)

Author
Guo, L.
Wan, X.
Title
Exploiting syntactic and semantic relationships between terms for opinion retrieval
Source
Journal of the American Society for Information Science and Technology. 63(2012) no.11, S.2269-2282
Year
2012
Abstract
Opinion retrieval is the task of finding documents that express an opinion about a given query. A key challenge in opinion retrieval is to capture the query-related opinion score of a document. Existing methods rely mainly on the proximity information between the opinion terms and the query terms to address the key challenge. In this study, we propose to incorporate the syntactic and semantic information of terms into a probabilistic model to capture the query-related opinion score more accurately. The syntactic tree structure of a sentence is used to evaluate the modifying probability between an opinion term and a noun within the sentence with a tree kernel method. Moreover, WordNet and the probabilistic topic model are used to evaluate the semantic relatedness between any noun and the given query. The experimental results over standard TREC baselines on the benchmark BLOG06 collection demonstrate the effectiveness of our proposed method, in comparison with the proximity-based method and other baselines.

Similar documents (content)

  1. Fang, L.; Tuan, L.A.; Hui, S.C.; Wu, L.: Syntactic based approach for grammar question retrieval (2018) 0.26
    0.26385674 = sum of:
      0.26385674 = product of:
        0.73293537 = sum of:
          0.017002663 = weight(abstract_txt:related in 87) [ClassicSimilarity], result of:
            0.017002663 = score(doc=87,freq=1.0), product of:
              0.06440122 = queryWeight, product of:
                1.0910947 = boost
                4.224184 = idf(docFreq=1720, maxDocs=43254)
                0.013972972 = queryNorm
              0.2640115 = fieldWeight in 87, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.224184 = idf(docFreq=1720, maxDocs=43254)
                0.0625 = fieldNorm(doc=87)
          0.11811612 = weight(abstract_txt:kernel in 87) [ClassicSimilarity], result of:
            0.11811612 = score(doc=87,freq=3.0), product of:
              0.1290343 = queryWeight, product of:
                1.0920764 = boost
                8.455969 = idf(docFreq=24, maxDocs=43254)
                0.013972972 = queryNorm
              0.9153855 = fieldWeight in 87, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.455969 = idf(docFreq=24, maxDocs=43254)
                0.0625 = fieldNorm(doc=87)
          0.019991506 = weight(abstract_txt:retrieval in 87) [ClassicSimilarity], result of:
            0.019991506 = score(doc=87,freq=2.0), product of:
              0.06518288 = queryWeight, product of:
                1.3443979 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.013972972 = queryNorm
              0.3066987 = fieldWeight in 87, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.0625 = fieldNorm(doc=87)
          0.058347423 = weight(abstract_txt:capture in 87) [ClassicSimilarity], result of:
            0.058347423 = score(doc=87,freq=1.0), product of:
              0.14652011 = queryWeight, product of:
                1.6457508 = boost
                6.37154 = idf(docFreq=200, maxDocs=43254)
                0.013972972 = queryNorm
              0.39822125 = fieldWeight in 87, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.37154 = idf(docFreq=200, maxDocs=43254)
                0.0625 = fieldNorm(doc=87)
          0.12495211 = weight(abstract_txt:tree in 87) [ClassicSimilarity], result of:
            0.12495211 = score(doc=87,freq=3.0), product of:
              0.16878666 = queryWeight, product of:
                1.7663814 = boost
                6.838563 = idf(docFreq=125, maxDocs=43254)
                0.013972972 = queryNorm
              0.7402961 = fieldWeight in 87, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.838563 = idf(docFreq=125, maxDocs=43254)
                0.0625 = fieldNorm(doc=87)
          0.07239361 = weight(abstract_txt:sentence in 87) [ClassicSimilarity], result of:
            0.07239361 = score(doc=87,freq=1.0), product of:
              0.16918023 = queryWeight, product of:
                1.7684397 = boost
                6.8465314 = idf(docFreq=124, maxDocs=43254)
                0.013972972 = queryNorm
              0.4279082 = fieldWeight in 87, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8465314 = idf(docFreq=124, maxDocs=43254)
                0.0625 = fieldNorm(doc=87)
          0.026864674 = weight(abstract_txt:between in 87) [ClassicSimilarity], result of:
            0.026864674 = score(doc=87,freq=2.0), product of:
              0.08736494 = queryWeight, product of:
                1.7972108 = boost
                3.4789596 = idf(docFreq=3625, maxDocs=43254)
                0.013972972 = queryNorm
              0.30749947 = fieldWeight in 87, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4789596 = idf(docFreq=3625, maxDocs=43254)
                0.0625 = fieldNorm(doc=87)
          0.21041441 = weight(abstract_txt:syntactic in 87) [ClassicSimilarity], result of:
            0.21041441 = score(doc=87,freq=5.0), product of:
              0.23066066 = queryWeight, product of:
                2.528994 = boost
                6.5273504 = idf(docFreq=171, maxDocs=43254)
                0.013972972 = queryNorm
              0.91222495 = fieldWeight in 87, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.5273504 = idf(docFreq=171, maxDocs=43254)
                0.0625 = fieldNorm(doc=87)
          0.0848529 = weight(abstract_txt:query in 87) [ClassicSimilarity], result of:
            0.0848529 = score(doc=87,freq=2.0), product of:
              0.20259587 = queryWeight, product of:
                3.0598543 = boost
                4.738502 = idf(docFreq=1028, maxDocs=43254)
                0.013972972 = queryNorm
              0.41882837 = fieldWeight in 87, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.738502 = idf(docFreq=1028, maxDocs=43254)
                0.0625 = fieldNorm(doc=87)
        0.36 = coord(9/25)
    
  2. Belbachir, F.; Boughanem, M.: Using language models to improve opinion detection (2018) 0.26
    0.26123467 = sum of:
      0.26123467 = product of:
        1.0884778 = sum of:
          0.030298015 = weight(abstract_txt:retrieval in 45) [ClassicSimilarity], result of:
            0.030298015 = score(doc=45,freq=6.0), product of:
              0.06518288 = queryWeight, product of:
                1.3443979 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.013972972 = queryNorm
              0.46481553 = fieldWeight in 45, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.0546875 = fieldNorm(doc=45)
          0.01662167 = weight(abstract_txt:between in 45) [ClassicSimilarity], result of:
            0.01662167 = score(doc=45,freq=1.0), product of:
              0.08736494 = queryWeight, product of:
                1.7972108 = boost
                3.4789596 = idf(docFreq=3625, maxDocs=43254)
                0.013972972 = queryNorm
              0.1902556 = fieldWeight in 45, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4789596 = idf(docFreq=3625, maxDocs=43254)
                0.0546875 = fieldNorm(doc=45)
          0.06915765 = weight(abstract_txt:score in 45) [ClassicSimilarity], result of:
            0.06915765 = score(doc=45,freq=1.0), product of:
              0.17937872 = queryWeight, product of:
                1.8209621 = boost
                7.0498724 = idf(docFreq=101, maxDocs=43254)
                0.013972972 = queryNorm
              0.3855399 = fieldWeight in 45, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0498724 = idf(docFreq=101, maxDocs=43254)
                0.0546875 = fieldNorm(doc=45)
          0.026380608 = weight(abstract_txt:terms in 45) [ClassicSimilarity], result of:
            0.026380608 = score(doc=45,freq=1.0), product of:
              0.118871376 = queryWeight, product of:
                2.0963755 = boost
                4.058069 = idf(docFreq=2031, maxDocs=43254)
                0.013972972 = queryNorm
              0.22192566 = fieldWeight in 45, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.058069 = idf(docFreq=2031, maxDocs=43254)
                0.0546875 = fieldNorm(doc=45)
          0.05250005 = weight(abstract_txt:query in 45) [ClassicSimilarity], result of:
            0.05250005 = score(doc=45,freq=1.0), product of:
              0.20259587 = queryWeight, product of:
                3.0598543 = boost
                4.738502 = idf(docFreq=1028, maxDocs=43254)
                0.013972972 = queryNorm
              0.25913683 = fieldWeight in 45, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.738502 = idf(docFreq=1028, maxDocs=43254)
                0.0546875 = fieldNorm(doc=45)
          0.8935199 = weight(abstract_txt:opinion in 45) [ClassicSimilarity], result of:
            0.8935199 = score(doc=45,freq=12.0), product of:
              0.6848148 = queryWeight, product of:
                7.1159353 = boost
                6.8873534 = idf(docFreq=119, maxDocs=43254)
                0.013972972 = queryNorm
              1.3047613 = fieldWeight in 45, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                6.8873534 = idf(docFreq=119, maxDocs=43254)
                0.0546875 = fieldNorm(doc=45)
        0.24 = coord(6/25)
    
  3. Fernández, R.T.; Losada, D.E.: Effective sentence retrieval based on query-independent evidence (2012) 0.23
    0.22896351 = sum of:
      0.22896351 = product of:
        0.95401466 = sum of:
          0.017002663 = weight(abstract_txt:related in 4193) [ClassicSimilarity], result of:
            0.017002663 = score(doc=4193,freq=1.0), product of:
              0.06440122 = queryWeight, product of:
                1.0910947 = boost
                4.224184 = idf(docFreq=1720, maxDocs=43254)
                0.013972972 = queryNorm
              0.2640115 = fieldWeight in 4193, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.224184 = idf(docFreq=1720, maxDocs=43254)
                0.0625 = fieldNorm(doc=4193)
          0.03740068 = weight(abstract_txt:retrieval in 4193) [ClassicSimilarity], result of:
            0.03740068 = score(doc=4193,freq=7.0), product of:
              0.06518288 = queryWeight, product of:
                1.3443979 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.013972972 = queryNorm
              0.5737808 = fieldWeight in 4193, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.0625 = fieldNorm(doc=4193)
          0.031089673 = weight(abstract_txt:method in 4193) [ClassicSimilarity], result of:
            0.031089673 = score(doc=4193,freq=1.0), product of:
              0.11023614 = queryWeight, product of:
                1.7483286 = boost
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.013972972 = queryNorm
              0.28202796 = fieldWeight in 4193, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.0625 = fieldNorm(doc=4193)
          0.14478722 = weight(abstract_txt:sentence in 4193) [ClassicSimilarity], result of:
            0.14478722 = score(doc=4193,freq=4.0), product of:
              0.16918023 = queryWeight, product of:
                1.7684397 = boost
                6.8465314 = idf(docFreq=124, maxDocs=43254)
                0.013972972 = queryNorm
              0.8558164 = fieldWeight in 4193, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.8465314 = idf(docFreq=124, maxDocs=43254)
                0.0625 = fieldNorm(doc=4193)
          0.13416421 = weight(abstract_txt:query in 4193) [ClassicSimilarity], result of:
            0.13416421 = score(doc=4193,freq=5.0), product of:
              0.20259587 = queryWeight, product of:
                3.0598543 = boost
                4.738502 = idf(docFreq=1028, maxDocs=43254)
                0.013972972 = queryNorm
              0.6622258 = fieldWeight in 4193, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.738502 = idf(docFreq=1028, maxDocs=43254)
                0.0625 = fieldNorm(doc=4193)
          0.5895702 = weight(abstract_txt:opinion in 4193) [ClassicSimilarity], result of:
            0.5895702 = score(doc=4193,freq=4.0), product of:
              0.6848148 = queryWeight, product of:
                7.1159353 = boost
                6.8873534 = idf(docFreq=119, maxDocs=43254)
                0.013972972 = queryNorm
              0.8609192 = fieldWeight in 4193, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.8873534 = idf(docFreq=119, maxDocs=43254)
                0.0625 = fieldNorm(doc=4193)
        0.24 = coord(6/25)
    
  4. Zhang, M.; Zhou, G.D.; Aw, A.: Exploring syntactic structured features over parse trees for relation extraction using kernel methods (2008) 0.22
    0.22406854 = sum of:
      0.22406854 = product of:
        0.7002142 = sum of:
          0.014520104 = weight(abstract_txt:model in 4056) [ClassicSimilarity], result of:
            0.014520104 = score(doc=4056,freq=1.0), product of:
              0.05796902 = queryWeight, product of:
                1.0351741 = boost
                4.0076866 = idf(docFreq=2136, maxDocs=43254)
                0.013972972 = queryNorm
              0.2504804 = fieldWeight in 4056, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0076866 = idf(docFreq=2136, maxDocs=43254)
                0.0625 = fieldNorm(doc=4056)
          0.19288282 = weight(abstract_txt:kernel in 4056) [ClassicSimilarity], result of:
            0.19288282 = score(doc=4056,freq=8.0), product of:
              0.1290343 = queryWeight, product of:
                1.0920764 = boost
                8.455969 = idf(docFreq=24, maxDocs=43254)
                0.013972972 = queryNorm
              1.4948182 = fieldWeight in 4056, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                8.455969 = idf(docFreq=24, maxDocs=43254)
                0.0625 = fieldNorm(doc=4056)
          0.058347423 = weight(abstract_txt:capture in 4056) [ClassicSimilarity], result of:
            0.058347423 = score(doc=4056,freq=1.0), product of:
              0.14652011 = queryWeight, product of:
                1.6457508 = boost
                6.37154 = idf(docFreq=200, maxDocs=43254)
                0.013972972 = queryNorm
              0.39822125 = fieldWeight in 4056, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.37154 = idf(docFreq=200, maxDocs=43254)
                0.0625 = fieldNorm(doc=4056)
          0.030524224 = weight(abstract_txt:semantic in 4056) [ClassicSimilarity], result of:
            0.030524224 = score(doc=4056,freq=1.0), product of:
              0.10889543 = queryWeight, product of:
                1.7376643 = boost
                4.484923 = idf(docFreq=1325, maxDocs=43254)
                0.013972972 = queryNorm
              0.28030768 = fieldWeight in 4056, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.484923 = idf(docFreq=1325, maxDocs=43254)
                0.0625 = fieldNorm(doc=4056)
          0.031089673 = weight(abstract_txt:method in 4056) [ClassicSimilarity], result of:
            0.031089673 = score(doc=4056,freq=1.0), product of:
              0.11023614 = queryWeight, product of:
                1.7483286 = boost
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.013972972 = queryNorm
              0.28202796 = fieldWeight in 4056, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.0625 = fieldNorm(doc=4056)
          0.1908675 = weight(abstract_txt:tree in 4056) [ClassicSimilarity], result of:
            0.1908675 = score(doc=4056,freq=7.0), product of:
              0.16878666 = queryWeight, product of:
                1.7663814 = boost
                6.838563 = idf(docFreq=125, maxDocs=43254)
                0.013972972 = queryNorm
              1.130821 = fieldWeight in 4056, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.838563 = idf(docFreq=125, maxDocs=43254)
                0.0625 = fieldNorm(doc=4056)
          0.018996194 = weight(abstract_txt:between in 4056) [ClassicSimilarity], result of:
            0.018996194 = score(doc=4056,freq=1.0), product of:
              0.08736494 = queryWeight, product of:
                1.7972108 = boost
                3.4789596 = idf(docFreq=3625, maxDocs=43254)
                0.013972972 = queryNorm
              0.21743497 = fieldWeight in 4056, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4789596 = idf(docFreq=3625, maxDocs=43254)
                0.0625 = fieldNorm(doc=4056)
          0.1629863 = weight(abstract_txt:syntactic in 4056) [ClassicSimilarity], result of:
            0.1629863 = score(doc=4056,freq=3.0), product of:
              0.23066066 = queryWeight, product of:
                2.528994 = boost
                6.5273504 = idf(docFreq=171, maxDocs=43254)
                0.013972972 = queryNorm
              0.7066064 = fieldWeight in 4056, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.5273504 = idf(docFreq=171, maxDocs=43254)
                0.0625 = fieldNorm(doc=4056)
        0.32 = coord(8/25)
    
  5. Li, D.; Tang, J.; Ding, Y.; Shuai, X.; Chambers, T.; Sun, G.; Luo, Z.; Zhang, J.: Topic-level opinion influence model (TOIM) : an investigation using tencent microblogging (2015) 0.21
    0.20751767 = sum of:
      0.20751767 = product of:
        1.0375884 = sum of:
          0.025149558 = weight(abstract_txt:model in 3810) [ClassicSimilarity], result of:
            0.025149558 = score(doc=3810,freq=3.0), product of:
              0.05796902 = queryWeight, product of:
                1.0351741 = boost
                4.0076866 = idf(docFreq=2136, maxDocs=43254)
                0.013972972 = queryNorm
              0.4338448 = fieldWeight in 3810, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0076866 = idf(docFreq=2136, maxDocs=43254)
                0.0625 = fieldNorm(doc=3810)
          0.023425208 = weight(abstract_txt:given in 3810) [ClassicSimilarity], result of:
            0.023425208 = score(doc=3810,freq=1.0), product of:
              0.07973918 = queryWeight, product of:
                1.2140912 = boost
                4.700366 = idf(docFreq=1068, maxDocs=43254)
                0.013972972 = queryNorm
              0.29377288 = fieldWeight in 3810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.700366 = idf(docFreq=1068, maxDocs=43254)
                0.0625 = fieldNorm(doc=3810)
          0.03444783 = weight(abstract_txt:evaluate in 3810) [ClassicSimilarity], result of:
            0.03444783 = score(doc=3810,freq=1.0), product of:
              0.10311552 = queryWeight, product of:
                1.3806305 = boost
                5.3451242 = idf(docFreq=560, maxDocs=43254)
                0.013972972 = queryNorm
              0.33407027 = fieldWeight in 3810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3451242 = idf(docFreq=560, maxDocs=43254)
                0.0625 = fieldNorm(doc=3810)
          0.07021047 = weight(abstract_txt:probabilistic in 3810) [ClassicSimilarity], result of:
            0.07021047 = score(doc=3810,freq=1.0), product of:
              0.16576165 = queryWeight, product of:
                1.7504812 = boost
                6.777005 = idf(docFreq=133, maxDocs=43254)
                0.013972972 = queryNorm
              0.42356282 = fieldWeight in 3810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.777005 = idf(docFreq=133, maxDocs=43254)
                0.0625 = fieldNorm(doc=3810)
          0.8843553 = weight(abstract_txt:opinion in 3810) [ClassicSimilarity], result of:
            0.8843553 = score(doc=3810,freq=9.0), product of:
              0.6848148 = queryWeight, product of:
                7.1159353 = boost
                6.8873534 = idf(docFreq=119, maxDocs=43254)
                0.013972972 = queryNorm
              1.2913787 = fieldWeight in 3810, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                6.8873534 = idf(docFreq=119, maxDocs=43254)
                0.0625 = fieldNorm(doc=3810)
        0.2 = coord(5/25)