Document (#38228)

Author
Malo, P.
Sinha, A.
Korhonen, P.
Wallenius, J.
Takala, P.
Title
Good debt or bad debt : detecting semantic orientations in economic texts
Source
Journal of the Association for Information Science and Technology. 65(2014) no.4, S.782-796
Year
2014
Abstract
The use of robo-readers to analyze news texts is an emerging technology trend in computational finance. Recent research has developed sophisticated financial polarity lexicons for investigating how financial sentiments relate to future company performance. However, based on experience from fields that commonly analyze sentiment, it is well known that the overall semantic orientation of a sentence may differ from that of individual words. This article investigates how semantic orientations can be better detected in financial and economic news by accommodating the overall phrase-structure information and domain-specific use of language. Our three main contributions are the following: (a) a human-annotated finance phrase bank that can be used for training and evaluating alternative models; (b) a technique to enhance financial lexicons with attributes that help to identify expected direction of events that affect sentiment; and (c) a linearized phrase-structure model for detecting contextual semantic orientations in economic texts. The relevance of the newly added lexicon features and the benefit of using the proposed learning algorithm are demonstrated in a comparative study against general sentiment models as well as the popular word frequency models used in recent financial studies. The proposed framework is parsimonious and avoids the explosion in feature space caused by the use of conventional n-gram features.
Theme
Computerlinguistik
Field
Wirtschaftswissenschaften

Similar documents (author)

  1. Malo, M.: 1. Tagung der deutschsprachigen MetaLib/SFXAnwender (SMUG-DACH) in Berlin (2005) 5.99
    5.989656 = sum of:
      5.989656 = weight(author_txt:malo in 4123) [ClassicSimilarity], result of:
        5.989656 = fieldWeight in 4123, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.583449 = idf(docFreq=7, maxDocs=42740)
          0.625 = fieldNorm(doc=4123)
    
  2. Malo, M.: Vermittlung von Informationskompetenz an der UB Stuttgart (2006) 5.99
    5.989656 = sum of:
      5.989656 = weight(author_txt:malo in 30) [ClassicSimilarity], result of:
        5.989656 = fieldWeight in 30, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.583449 = idf(docFreq=7, maxDocs=42740)
          0.625 = fieldNorm(doc=30)
    
  3. Malo, M.: Wiki als Werkzeug für das Wissensmanagement in Bibliotheken (2006) 5.99
    5.989656 = sum of:
      5.989656 = weight(author_txt:malo in 1745) [ClassicSimilarity], result of:
        5.989656 = fieldWeight in 1745, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.583449 = idf(docFreq=7, maxDocs=42740)
          0.625 = fieldNorm(doc=1745)
    
  4. Imhof, A.; Malo, M.: Aufbau einer verteilten Fachredaktion für freie Internet-Ressourcen im KOBV-Portal (2005) 4.79
    4.7917247 = sum of:
      4.7917247 = weight(author_txt:malo in 4406) [ClassicSimilarity], result of:
        4.7917247 = fieldWeight in 4406, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.583449 = idf(docFreq=7, maxDocs=42740)
          0.5 = fieldNorm(doc=4406)
    
  5. Hodoroaba, L.; Imhof, A.; Malo, M.: ¬Das Profil des KOBV-Portals (2005) 3.59
    3.5937934 = sum of:
      3.5937934 = weight(author_txt:malo in 4211) [ClassicSimilarity], result of:
        3.5937934 = fieldWeight in 4211, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.583449 = idf(docFreq=7, maxDocs=42740)
          0.375 = fieldNorm(doc=4211)
    

Similar documents (content)

  1. Saif, H.; He, Y.; Fernandez, M.; Alani, H.: Contextual semantics for sentiment analysis of Twitter (2016) 0.20
    0.19741069 = sum of:
      0.19741069 = product of:
        0.8225446 = sum of:
          0.098950595 = weight(abstract_txt:polarity in 4668) [ClassicSimilarity], result of:
            0.098950595 = score(doc=4668,freq=2.0), product of:
              0.13060173 = queryWeight, product of:
                1.0715709 = boost
                8.571848 = idf(docFreq=21, maxDocs=42740)
                0.014218493 = queryNorm
              0.75765145 = fieldWeight in 4668, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.571848 = idf(docFreq=21, maxDocs=42740)
                0.0625 = fieldNorm(doc=4668)
          0.08163065 = weight(abstract_txt:sentiments in 4668) [ClassicSimilarity], result of:
            0.08163065 = score(doc=4668,freq=1.0), product of:
              0.14473787 = queryWeight, product of:
                1.1280738 = boost
                9.023833 = idf(docFreq=13, maxDocs=42740)
                0.014218493 = queryNorm
              0.5639896 = fieldWeight in 4668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.023833 = idf(docFreq=13, maxDocs=42740)
                0.0625 = fieldNorm(doc=4668)
          0.022208568 = weight(abstract_txt:proposed in 4668) [ClassicSimilarity], result of:
            0.022208568 = score(doc=4668,freq=1.0), product of:
              0.07656619 = queryWeight, product of:
                1.1603258 = boost
                4.640914 = idf(docFreq=1120, maxDocs=42740)
                0.014218493 = queryNorm
              0.29005712 = fieldWeight in 4668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.640914 = idf(docFreq=1120, maxDocs=42740)
                0.0625 = fieldNorm(doc=4668)
          0.009152971 = weight(abstract_txt:that in 4668) [ClassicSimilarity], result of:
            0.009152971 = score(doc=4668,freq=1.0), product of:
              0.06115592 = queryWeight, product of:
                1.7961446 = boost
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.014218493 = queryNorm
              0.14966616 = fieldWeight in 4668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.0625 = fieldNorm(doc=4668)
          0.21212797 = weight(abstract_txt:lexicons in 4668) [ClassicSimilarity], result of:
            0.21212797 = score(doc=4668,freq=2.0), product of:
              0.27357638 = queryWeight, product of:
                2.1933136 = boost
                8.772519 = idf(docFreq=17, maxDocs=42740)
                0.014218493 = queryNorm
              0.7753885 = fieldWeight in 4668, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.772519 = idf(docFreq=17, maxDocs=42740)
                0.0625 = fieldNorm(doc=4668)
          0.39847377 = weight(abstract_txt:sentiment in 4668) [ClassicSimilarity], result of:
            0.39847377 = score(doc=4668,freq=7.0), product of:
              0.31401777 = queryWeight, product of:
                2.8779562 = boost
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.014218493 = queryNorm
              1.268953 = fieldWeight in 4668, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.0625 = fieldNorm(doc=4668)
        0.24 = coord(6/25)
    
  2. Xing, F.Z.; Pallucchini, F.; Cambria, E.: Cognitive-inspired domain adaptation of sentiment lexicons (2019) 0.18
    0.17731424 = sum of:
      0.17731424 = product of:
        0.88657117 = sum of:
          0.12118924 = weight(abstract_txt:polarity in 1105) [ClassicSimilarity], result of:
            0.12118924 = score(doc=1105,freq=3.0), product of:
              0.13060173 = queryWeight, product of:
                1.0715709 = boost
                8.571848 = idf(docFreq=21, maxDocs=42740)
                0.014218493 = queryNorm
              0.92792976 = fieldWeight in 1105, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.571848 = idf(docFreq=21, maxDocs=42740)
                0.0625 = fieldNorm(doc=1105)
          0.021095356 = weight(abstract_txt:features in 1105) [ClassicSimilarity], result of:
            0.021095356 = score(doc=1105,freq=1.0), product of:
              0.07398572 = queryWeight, product of:
                1.1406052 = boost
                4.5620384 = idf(docFreq=1212, maxDocs=42740)
                0.014218493 = queryNorm
              0.2851274 = fieldWeight in 1105, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5620384 = idf(docFreq=1212, maxDocs=42740)
                0.0625 = fieldNorm(doc=1105)
          0.018305942 = weight(abstract_txt:that in 1105) [ClassicSimilarity], result of:
            0.018305942 = score(doc=1105,freq=4.0), product of:
              0.06115592 = queryWeight, product of:
                1.7961446 = boost
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.014218493 = queryNorm
              0.29933232 = fieldWeight in 1105, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.0625 = fieldNorm(doc=1105)
          0.29999426 = weight(abstract_txt:lexicons in 1105) [ClassicSimilarity], result of:
            0.29999426 = score(doc=1105,freq=4.0), product of:
              0.27357638 = queryWeight, product of:
                2.1933136 = boost
                8.772519 = idf(docFreq=17, maxDocs=42740)
                0.014218493 = queryNorm
              1.0965649 = fieldWeight in 1105, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.772519 = idf(docFreq=17, maxDocs=42740)
                0.0625 = fieldNorm(doc=1105)
          0.42598638 = weight(abstract_txt:sentiment in 1105) [ClassicSimilarity], result of:
            0.42598638 = score(doc=1105,freq=8.0), product of:
              0.31401777 = queryWeight, product of:
                2.8779562 = boost
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.014218493 = queryNorm
              1.3565677 = fieldWeight in 1105, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.0625 = fieldNorm(doc=1105)
        0.2 = coord(5/25)
    
  3. Chen, Z.; Huang, Y.; Tian, J.; Liu, X.; Fu, K.; Huang, T.: Joint model for subsentence-level sentiment analysis with Markov logic (2015) 0.17
    0.1703908 = sum of:
      0.1703908 = product of:
        0.70996165 = sum of:
          0.098950595 = weight(abstract_txt:polarity in 4211) [ClassicSimilarity], result of:
            0.098950595 = score(doc=4211,freq=2.0), product of:
              0.13060173 = queryWeight, product of:
                1.0715709 = boost
                8.571848 = idf(docFreq=21, maxDocs=42740)
                0.014218493 = queryNorm
              0.75765145 = fieldWeight in 4211, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.571848 = idf(docFreq=21, maxDocs=42740)
                0.0625 = fieldNorm(doc=4211)
          0.11544317 = weight(abstract_txt:sentiments in 4211) [ClassicSimilarity], result of:
            0.11544317 = score(doc=4211,freq=2.0), product of:
              0.14473787 = queryWeight, product of:
                1.1280738 = boost
                9.023833 = idf(docFreq=13, maxDocs=42740)
                0.014218493 = queryNorm
              0.7976017 = fieldWeight in 4211, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.023833 = idf(docFreq=13, maxDocs=42740)
                0.0625 = fieldNorm(doc=4211)
          0.022208568 = weight(abstract_txt:proposed in 4211) [ClassicSimilarity], result of:
            0.022208568 = score(doc=4211,freq=1.0), product of:
              0.07656619 = queryWeight, product of:
                1.1603258 = boost
                4.640914 = idf(docFreq=1120, maxDocs=42740)
                0.014218493 = queryNorm
              0.29005712 = fieldWeight in 4211, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.640914 = idf(docFreq=1120, maxDocs=42740)
                0.0625 = fieldNorm(doc=4211)
          0.034428645 = weight(abstract_txt:models in 4211) [ClassicSimilarity], result of:
            0.034428645 = score(doc=4211,freq=1.0), product of:
              0.117399715 = queryWeight, product of:
                1.7597078 = boost
                4.6921606 = idf(docFreq=1064, maxDocs=42740)
                0.014218493 = queryNorm
              0.29326004 = fieldWeight in 4211, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6921606 = idf(docFreq=1064, maxDocs=42740)
                0.0625 = fieldNorm(doc=4211)
          0.012944256 = weight(abstract_txt:that in 4211) [ClassicSimilarity], result of:
            0.012944256 = score(doc=4211,freq=2.0), product of:
              0.06115592 = queryWeight, product of:
                1.7961446 = boost
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.014218493 = queryNorm
              0.21165991 = fieldWeight in 4211, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.0625 = fieldNorm(doc=4211)
          0.42598638 = weight(abstract_txt:sentiment in 4211) [ClassicSimilarity], result of:
            0.42598638 = score(doc=4211,freq=8.0), product of:
              0.31401777 = queryWeight, product of:
                2.8779562 = boost
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.014218493 = queryNorm
              1.3565677 = fieldWeight in 4211, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.0625 = fieldNorm(doc=4211)
        0.24 = coord(6/25)
    
  4. Na, J.-C.; Sui, H.; Khoo, C.; Chan, S.; Zhou, Y.: Effectiveness of simple linguistic processing in automatic sentiment classification of product reviews (2004) 0.15
    0.15004349 = sum of:
      0.15004349 = product of:
        0.6251812 = sum of:
          0.08163065 = weight(abstract_txt:sentiments in 3625) [ClassicSimilarity], result of:
            0.08163065 = score(doc=3625,freq=1.0), product of:
              0.14473787 = queryWeight, product of:
                1.1280738 = boost
                9.023833 = idf(docFreq=13, maxDocs=42740)
                0.014218493 = queryNorm
              0.5639896 = fieldWeight in 3625, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.023833 = idf(docFreq=13, maxDocs=42740)
                0.0625 = fieldNorm(doc=3625)
          0.021095356 = weight(abstract_txt:features in 3625) [ClassicSimilarity], result of:
            0.021095356 = score(doc=3625,freq=1.0), product of:
              0.07398572 = queryWeight, product of:
                1.1406052 = boost
                4.5620384 = idf(docFreq=1212, maxDocs=42740)
                0.014218493 = queryNorm
              0.2851274 = fieldWeight in 3625, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5620384 = idf(docFreq=1212, maxDocs=42740)
                0.0625 = fieldNorm(doc=3625)
          0.012944256 = weight(abstract_txt:that in 3625) [ClassicSimilarity], result of:
            0.012944256 = score(doc=3625,freq=2.0), product of:
              0.06115592 = queryWeight, product of:
                1.7961446 = boost
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.014218493 = queryNorm
              0.21165991 = fieldWeight in 3625, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.0625 = fieldNorm(doc=3625)
          0.040443406 = weight(abstract_txt:semantic in 3625) [ClassicSimilarity], result of:
            0.040443406 = score(doc=3625,freq=1.0), product of:
              0.14385727 = queryWeight, product of:
                2.2492738 = boost
                4.4981704 = idf(docFreq=1292, maxDocs=42740)
                0.014218493 = queryNorm
              0.28113565 = fieldWeight in 3625, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4981704 = idf(docFreq=1292, maxDocs=42740)
                0.0625 = fieldNorm(doc=3625)
          0.16784963 = weight(abstract_txt:phrase in 3625) [ClassicSimilarity], result of:
            0.16784963 = score(doc=3625,freq=2.0), product of:
              0.26791108 = queryWeight, product of:
                2.6582901 = boost
                7.0881796 = idf(docFreq=96, maxDocs=42740)
                0.014218493 = queryNorm
              0.62651247 = fieldWeight in 3625, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.0881796 = idf(docFreq=96, maxDocs=42740)
                0.0625 = fieldNorm(doc=3625)
          0.30121788 = weight(abstract_txt:sentiment in 3625) [ClassicSimilarity], result of:
            0.30121788 = score(doc=3625,freq=4.0), product of:
              0.31401777 = queryWeight, product of:
                2.8779562 = boost
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.014218493 = queryNorm
              0.9592383 = fieldWeight in 3625, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.0625 = fieldNorm(doc=3625)
        0.24 = coord(6/25)
    
  5. Thelwall, M.; Buckley, K.; Paltoglou, G.: Sentiment strength detection for the social web (2012) 0.13
    0.12884735 = sum of:
      0.12884735 = product of:
        0.6442367 = sum of:
          0.03695707 = weight(abstract_txt:overall in 1973) [ClassicSimilarity], result of:
            0.03695707 = score(doc=1973,freq=1.0), product of:
              0.107519776 = queryWeight, product of:
                1.3750092 = boost
                5.4995756 = idf(docFreq=474, maxDocs=42740)
                0.014218493 = queryNorm
              0.34372348 = fieldWeight in 1973, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4995756 = idf(docFreq=474, maxDocs=42740)
                0.0625 = fieldNorm(doc=1973)
          0.049829498 = weight(abstract_txt:news in 1973) [ClassicSimilarity], result of:
            0.049829498 = score(doc=1973,freq=1.0), product of:
              0.13122432 = queryWeight, product of:
                1.5190378 = boost
                6.0756416 = idf(docFreq=266, maxDocs=42740)
                0.014218493 = queryNorm
              0.3797276 = fieldWeight in 1973, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0756416 = idf(docFreq=266, maxDocs=42740)
                0.0625 = fieldNorm(doc=1973)
          0.020466667 = weight(abstract_txt:that in 1973) [ClassicSimilarity], result of:
            0.020466667 = score(doc=1973,freq=5.0), product of:
              0.06115592 = queryWeight, product of:
                1.7961446 = boost
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.014218493 = queryNorm
              0.33466372 = fieldWeight in 1973, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.0625 = fieldNorm(doc=1973)
          0.060716163 = weight(abstract_txt:texts in 1973) [ClassicSimilarity], result of:
            0.060716163 = score(doc=1973,freq=1.0), product of:
              0.17136545 = queryWeight, product of:
                2.1260266 = boost
                5.668929 = idf(docFreq=400, maxDocs=42740)
                0.014218493 = queryNorm
              0.35430807 = fieldWeight in 1973, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.668929 = idf(docFreq=400, maxDocs=42740)
                0.0625 = fieldNorm(doc=1973)
          0.47626728 = weight(abstract_txt:sentiment in 1973) [ClassicSimilarity], result of:
            0.47626728 = score(doc=1973,freq=10.0), product of:
              0.31401777 = queryWeight, product of:
                2.8779562 = boost
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.014218493 = queryNorm
              1.516689 = fieldWeight in 1973, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.0625 = fieldNorm(doc=1973)
        0.2 = coord(5/25)