Document (#43394)

Author
Xiang, R.
Chersoni, E.
Lu, Q.
Huang, C.-R.
Li, W.
Long, Y.
Title
Lexical data augmentation for sentiment analysis
Source
Journal of the Association for Information Science and Technology. 72(2021) no.11, S.1432-1447
Year
2021
Abstract
Machine learning methods, especially deep learning models, have achieved impressive performance in various natural language processing tasks including sentiment analysis. However, deep learning models are more demanding for training data. Data augmentation techniques are widely used to generate new instances based on modifications to existing data or relying on external knowledge bases to address annotated data scarcity, which hinders the full potential of machine learning techniques. This paper presents our work using part-of-speech (POS) focused lexical substitution for data augmentation (PLSDA) to enhance the performance of machine learning algorithms in sentiment analysis. We exploit POS information to identify words to be replaced and investigate different augmentation strategies to find semantically related substitutions when generating new instances. The choice of POS tags as well as a variety of strategies such as semantic-based substitution methods and sampling methods are discussed in detail. Performance evaluation focuses on the comparison between PLSDA and two previous lexical substitution-based data augmentation methods, one of which is thesaurus-based, and the other is lexicon manipulation based. Our approach is tested on five English sentiment analysis benchmarks: SST-2, MR, IMDB, Twitter, and AirRecord. Hyperparameters such as the candidate similarity threshold and number of newly generated instances are optimized. Results show that six classifiers (SVM, LSTM, BiLSTM-AT, bidirectional encoder representations from transformers [BERT], XLNet, and RoBERTa) trained with PLSDA achieve accuracy improvement of more than 0.6% comparing to two previous lexical substitution methods averaged on five benchmarks. Introducing POS constraint and well-designed augmentation strategies can improve the reliability of lexical data augmentation methods. Consequently, PLSDA significantly improves the performance of sentiment analysis algorithms.
Content
Vgl.: https://asistdl.onlinelibrary.wiley.com/doi/10.1002/asi.24493.
Theme
Computerlinguistik

Similar documents (author)

  1. Long, C.A.: Making information available to partially sighted and blind clients (1993) 2.46
    2.4571695 = sum of:
      2.4571695 = product of:
        4.914339 = sum of:
          4.914339 = weight(author_txt:long in 7030) [ClassicSimilarity], result of:
            4.914339 = score(doc=7030,freq=1.0), product of:
              0.8535561 = queryWeight, product of:
                1.2799612 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.07239061 = queryNorm
              5.7574883 = fieldWeight in 7030, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.625 = fieldNorm(doc=7030)
        0.5 = coord(1/2)
    
  2. Long, C.E.: ¬The Internet's value to catalogers : results of a survey (1997) 2.46
    2.4571695 = sum of:
      2.4571695 = product of:
        4.914339 = sum of:
          4.914339 = weight(author_txt:long in 494) [ClassicSimilarity], result of:
            4.914339 = score(doc=494,freq=1.0), product of:
              0.8535561 = queryWeight, product of:
                1.2799612 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.07239061 = queryNorm
              5.7574883 = fieldWeight in 494, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.625 = fieldNorm(doc=494)
        0.5 = coord(1/2)
    
  3. Long, C.E.: Improving subject searching in Web-based OPACs : evaluation of the problem and guidelines for design (2000) 2.46
    2.4571695 = sum of:
      2.4571695 = product of:
        4.914339 = sum of:
          4.914339 = weight(author_txt:long in 6110) [ClassicSimilarity], result of:
            4.914339 = score(doc=6110,freq=1.0), product of:
              0.8535561 = queryWeight, product of:
                1.2799612 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.07239061 = queryNorm
              5.7574883 = fieldWeight in 6110, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.625 = fieldNorm(doc=6110)
        0.5 = coord(1/2)
    
  4. Long, J.: Google hacking (2005) 2.46
    2.4571695 = sum of:
      2.4571695 = product of:
        4.914339 = sum of:
          4.914339 = weight(author_txt:long in 4551) [ClassicSimilarity], result of:
            4.914339 = score(doc=4551,freq=1.0), product of:
              0.8535561 = queryWeight, product of:
                1.2799612 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.07239061 = queryNorm
              5.7574883 = fieldWeight in 4551, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.625 = fieldNorm(doc=4551)
        0.5 = coord(1/2)
    
  5. Long, J.: Google hacking (2008) 2.46
    2.4571695 = sum of:
      2.4571695 = product of:
        4.914339 = sum of:
          4.914339 = weight(author_txt:long in 2925) [ClassicSimilarity], result of:
            4.914339 = score(doc=2925,freq=1.0), product of:
              0.8535561 = queryWeight, product of:
                1.2799612 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.07239061 = queryNorm
              5.7574883 = fieldWeight in 2925, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.625 = fieldNorm(doc=2925)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Yu, N.: Exploring co-training strategies for opinion detection (2014) 0.31
    0.30986154 = sum of:
      0.30986154 = product of:
        0.77465385 = sum of:
          0.020455288 = weight(abstract_txt:previous in 1503) [ClassicSimilarity], result of:
            0.020455288 = score(doc=1503,freq=1.0), product of:
              0.062532194 = queryWeight, product of:
                1.1462753 = boost
                5.2338576 = idf(docFreq=640, maxDocs=44218)
                0.010423002 = queryNorm
              0.3271161 = fieldWeight in 1503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2338576 = idf(docFreq=640, maxDocs=44218)
                0.0625 = fieldNorm(doc=1503)
          0.026532294 = weight(abstract_txt:algorithms in 1503) [ClassicSimilarity], result of:
            0.026532294 = score(doc=1503,freq=1.0), product of:
              0.0743732 = queryWeight, product of:
                1.2501017 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.010423002 = queryNorm
              0.35674536 = fieldWeight in 1503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=1503)
          0.04069789 = weight(abstract_txt:strategies in 1503) [ClassicSimilarity], result of:
            0.04069789 = score(doc=1503,freq=2.0), product of:
              0.089874186 = queryWeight, product of:
                1.6830624 = boost
                5.123207 = idf(docFreq=715, maxDocs=44218)
                0.010423002 = queryNorm
              0.4528318 = fieldWeight in 1503, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.123207 = idf(docFreq=715, maxDocs=44218)
                0.0625 = fieldNorm(doc=1503)
          0.044512633 = weight(abstract_txt:machine in 1503) [ClassicSimilarity], result of:
            0.044512633 = score(doc=1503,freq=2.0), product of:
              0.09540604 = queryWeight, product of:
                1.7340862 = boost
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.010423002 = queryNorm
              0.4665599 = fieldWeight in 1503, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.0625 = fieldNorm(doc=1503)
          0.016342567 = weight(abstract_txt:based in 1503) [ClassicSimilarity], result of:
            0.016342567 = score(doc=1503,freq=2.0), product of:
              0.057998504 = queryWeight, product of:
                1.7454826 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.010423002 = queryNorm
              0.28177565 = fieldWeight in 1503, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=1503)
          0.042608764 = weight(abstract_txt:analysis in 1503) [ClassicSimilarity], result of:
            0.042608764 = score(doc=1503,freq=6.0), product of:
              0.076177865 = queryWeight, product of:
                2.0004215 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.010423002 = queryNorm
              0.5593326 = fieldWeight in 1503, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0625 = fieldNorm(doc=1503)
          0.06624617 = weight(abstract_txt:learning in 1503) [ClassicSimilarity], result of:
            0.06624617 = score(doc=1503,freq=3.0), product of:
              0.12880915 = queryWeight, product of:
                2.6012404 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.010423002 = queryNorm
              0.51429707 = fieldWeight in 1503, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.0625 = fieldNorm(doc=1503)
          0.030520055 = weight(abstract_txt:methods in 1503) [ClassicSimilarity], result of:
            0.030520055 = score(doc=1503,freq=1.0), product of:
              0.11775985 = queryWeight, product of:
                2.72456 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.010423002 = queryNorm
              0.259172 = fieldWeight in 1503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.0625 = fieldNorm(doc=1503)
          0.051914595 = weight(abstract_txt:data in 1503) [ClassicSimilarity], result of:
            0.051914595 = score(doc=1503,freq=6.0), product of:
              0.101639494 = queryWeight, product of:
                2.922795 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.010423002 = queryNorm
              0.5107719 = fieldWeight in 1503, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=1503)
          0.43482357 = weight(abstract_txt:sentiment in 1503) [ClassicSimilarity], result of:
            0.43482357 = score(doc=1503,freq=8.0), product of:
              0.3256307 = queryWeight, product of:
                4.1358976 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.010423002 = queryNorm
              1.3353274 = fieldWeight in 1503, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.0625 = fieldNorm(doc=1503)
        0.4 = coord(10/25)
    
  2. Thelwall, M.; Buckley, K.: Topic-based sentiment analysis for the social web : the role of mood and issue-related words (2013) 0.21
    0.21036234 = sum of:
      0.21036234 = product of:
        0.7512941 = sum of:
          0.03316537 = weight(abstract_txt:algorithms in 1004) [ClassicSimilarity], result of:
            0.03316537 = score(doc=1004,freq=1.0), product of:
              0.0743732 = queryWeight, product of:
                1.2501017 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.010423002 = queryNorm
              0.4459317 = fieldWeight in 1004, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.078125 = fieldNorm(doc=1004)
          0.03766118 = weight(abstract_txt:analysis in 1004) [ClassicSimilarity], result of:
            0.03766118 = score(doc=1004,freq=3.0), product of:
              0.076177865 = queryWeight, product of:
                2.0004215 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.010423002 = queryNorm
              0.4943848 = fieldWeight in 1004, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.078125 = fieldNorm(doc=1004)
          0.03541124 = weight(abstract_txt:performance in 1004) [ClassicSimilarity], result of:
            0.03541124 = score(doc=1004,freq=1.0), product of:
              0.097888276 = queryWeight, product of:
                2.0282311 = boost
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.010423002 = queryNorm
              0.3617516 = fieldWeight in 1004, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.078125 = fieldNorm(doc=1004)
          0.053952347 = weight(abstract_txt:methods in 1004) [ClassicSimilarity], result of:
            0.053952347 = score(doc=1004,freq=2.0), product of:
              0.11775985 = queryWeight, product of:
                2.72456 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.010423002 = queryNorm
              0.4581557 = fieldWeight in 1004, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.078125 = fieldNorm(doc=1004)
          0.03746613 = weight(abstract_txt:data in 1004) [ClassicSimilarity], result of:
            0.03746613 = score(doc=1004,freq=2.0), product of:
              0.101639494 = queryWeight, product of:
                2.922795 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.010423002 = queryNorm
              0.36861783 = fieldWeight in 1004, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.078125 = fieldNorm(doc=1004)
          0.123940066 = weight(abstract_txt:lexical in 1004) [ClassicSimilarity], result of:
            0.123940066 = score(doc=1004,freq=1.0), product of:
              0.24307917 = queryWeight, product of:
                3.5733948 = boost
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.010423002 = queryNorm
              0.5098753 = fieldWeight in 1004, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.078125 = fieldNorm(doc=1004)
          0.42969775 = weight(abstract_txt:sentiment in 1004) [ClassicSimilarity], result of:
            0.42969775 = score(doc=1004,freq=5.0), product of:
              0.3256307 = queryWeight, product of:
                4.1358976 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.010423002 = queryNorm
              1.3195862 = fieldWeight in 1004, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.078125 = fieldNorm(doc=1004)
        0.28 = coord(7/25)
    
  3. Zhang, L.; Wang, S.; Liu, B.: Deep learning for sentiment analysis : a survey (2018) 0.19
    0.19390488 = sum of:
      0.19390488 = product of:
        0.807937 = sum of:
          0.14257891 = weight(abstract_txt:deep in 4092) [ClassicSimilarity], result of:
            0.14257891 = score(doc=4092,freq=4.0), product of:
              0.09898243 = queryWeight, product of:
                1.4421691 = boost
                6.5848994 = idf(docFreq=165, maxDocs=44218)
                0.010423002 = queryNorm
              1.4404467 = fieldWeight in 4092, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.5848994 = idf(docFreq=165, maxDocs=44218)
                0.109375 = fieldNorm(doc=4092)
          0.055081572 = weight(abstract_txt:machine in 4092) [ClassicSimilarity], result of:
            0.055081572 = score(doc=4092,freq=1.0), product of:
              0.09540604 = queryWeight, product of:
                1.7340862 = boost
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.010423002 = queryNorm
              0.5773384 = fieldWeight in 4092, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.109375 = fieldNorm(doc=4092)
          0.043050315 = weight(abstract_txt:analysis in 4092) [ClassicSimilarity], result of:
            0.043050315 = score(doc=4092,freq=2.0), product of:
              0.076177865 = queryWeight, product of:
                2.0004215 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.010423002 = queryNorm
              0.5651289 = fieldWeight in 4092, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.109375 = fieldNorm(doc=4092)
          0.14966603 = weight(abstract_txt:learning in 4092) [ClassicSimilarity], result of:
            0.14966603 = score(doc=4092,freq=5.0), product of:
              0.12880915 = queryWeight, product of:
                2.6012404 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.010423002 = queryNorm
              1.1619208 = fieldWeight in 4092, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.109375 = fieldNorm(doc=4092)
          0.03708958 = weight(abstract_txt:data in 4092) [ClassicSimilarity], result of:
            0.03708958 = score(doc=4092,freq=1.0), product of:
              0.101639494 = queryWeight, product of:
                2.922795 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.010423002 = queryNorm
              0.36491305 = fieldWeight in 4092, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.109375 = fieldNorm(doc=4092)
          0.38047063 = weight(abstract_txt:sentiment in 4092) [ClassicSimilarity], result of:
            0.38047063 = score(doc=4092,freq=2.0), product of:
              0.3256307 = queryWeight, product of:
                4.1358976 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.010423002 = queryNorm
              1.1684115 = fieldWeight in 4092, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.109375 = fieldNorm(doc=4092)
        0.24 = coord(6/25)
    
  4. Abdi, A.; Shamsuddin, S.M.; Aliguliyev, R.M.: QMOS: Query-based multi-documents opinion-oriented summarization (2018) 0.17
    0.17055012 = sum of:
      0.17055012 = product of:
        0.60910755 = sum of:
          0.025180535 = weight(abstract_txt:strategies in 5089) [ClassicSimilarity], result of:
            0.025180535 = score(doc=5089,freq=1.0), product of:
              0.089874186 = queryWeight, product of:
                1.6830624 = boost
                5.123207 = idf(docFreq=715, maxDocs=44218)
                0.010423002 = queryNorm
              0.2801754 = fieldWeight in 5089, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.123207 = idf(docFreq=715, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5089)
          0.017513542 = weight(abstract_txt:based in 5089) [ClassicSimilarity], result of:
            0.017513542 = score(doc=5089,freq=3.0), product of:
              0.057998504 = queryWeight, product of:
                1.7454826 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.010423002 = queryNorm
              0.3019654 = fieldWeight in 5089, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5089)
          0.026362825 = weight(abstract_txt:analysis in 5089) [ClassicSimilarity], result of:
            0.026362825 = score(doc=5089,freq=3.0), product of:
              0.076177865 = queryWeight, product of:
                2.0004215 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.010423002 = queryNorm
              0.34606937 = fieldWeight in 5089, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5089)
          0.03505534 = weight(abstract_txt:performance in 5089) [ClassicSimilarity], result of:
            0.03505534 = score(doc=5089,freq=2.0), product of:
              0.097888276 = queryWeight, product of:
                2.0282311 = boost
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.010423002 = queryNorm
              0.3581158 = fieldWeight in 5089, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5089)
          0.037766643 = weight(abstract_txt:methods in 5089) [ClassicSimilarity], result of:
            0.037766643 = score(doc=5089,freq=2.0), product of:
              0.11775985 = queryWeight, product of:
                2.72456 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.010423002 = queryNorm
              0.320709 = fieldWeight in 5089, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5089)
          0.08675804 = weight(abstract_txt:lexical in 5089) [ClassicSimilarity], result of:
            0.08675804 = score(doc=5089,freq=1.0), product of:
              0.24307917 = queryWeight, product of:
                3.5733948 = boost
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.010423002 = queryNorm
              0.35691267 = fieldWeight in 5089, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5089)
          0.38047063 = weight(abstract_txt:sentiment in 5089) [ClassicSimilarity], result of:
            0.38047063 = score(doc=5089,freq=8.0), product of:
              0.3256307 = queryWeight, product of:
                4.1358976 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.010423002 = queryNorm
              1.1684115 = fieldWeight in 5089, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5089)
        0.28 = coord(7/25)
    
  5. Thelwall, M.; Buckley, K.; Paltoglou, G.: Sentiment strength detection for the social web (2012) 0.16
    0.1568386 = sum of:
      0.1568386 = product of:
        0.6534942 = sum of:
          0.037522327 = weight(abstract_txt:algorithms in 4972) [ClassicSimilarity], result of:
            0.037522327 = score(doc=4972,freq=2.0), product of:
              0.0743732 = queryWeight, product of:
                1.2501017 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.010423002 = queryNorm
              0.5045141 = fieldWeight in 4972, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=4972)
          0.031475183 = weight(abstract_txt:machine in 4972) [ClassicSimilarity], result of:
            0.031475183 = score(doc=4972,freq=1.0), product of:
              0.09540604 = queryWeight, product of:
                1.7340862 = boost
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.010423002 = queryNorm
              0.32990766 = fieldWeight in 4972, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.0625 = fieldNorm(doc=4972)
          0.030128943 = weight(abstract_txt:analysis in 4972) [ClassicSimilarity], result of:
            0.030128943 = score(doc=4972,freq=3.0), product of:
              0.076177865 = queryWeight, product of:
                2.0004215 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.010423002 = queryNorm
              0.39550784 = fieldWeight in 4972, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0625 = fieldNorm(doc=4972)
          0.038247246 = weight(abstract_txt:learning in 4972) [ClassicSimilarity], result of:
            0.038247246 = score(doc=4972,freq=1.0), product of:
              0.12880915 = queryWeight, product of:
                2.6012404 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.010423002 = queryNorm
              0.29692957 = fieldWeight in 4972, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.0625 = fieldNorm(doc=4972)
          0.029972905 = weight(abstract_txt:data in 4972) [ClassicSimilarity], result of:
            0.029972905 = score(doc=4972,freq=2.0), product of:
              0.101639494 = queryWeight, product of:
                2.922795 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.010423002 = queryNorm
              0.29489428 = fieldWeight in 4972, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=4972)
          0.48614755 = weight(abstract_txt:sentiment in 4972) [ClassicSimilarity], result of:
            0.48614755 = score(doc=4972,freq=10.0), product of:
              0.3256307 = queryWeight, product of:
                4.1358976 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.010423002 = queryNorm
              1.4929414 = fieldWeight in 4972, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.0625 = fieldNorm(doc=4972)
        0.24 = coord(6/25)