Document (#39678)

Author
Kar, M.
Nunes, S.
Ribeiro, C.
Title
Summarization of changes in dynamic text collections using Latent Dirichlet Allocation model
Source
Information processing and management. 51(2015) no.6, S.809-833
Year
2015
Abstract
In the area of Information Retrieval, the task of automatic text summarization usually assumes a static underlying collection of documents, disregarding the temporal dimension of each document. However, in real world settings, collections and individual documents rarely stay unchanged over time. The World Wide Web is a prime example of a collection where information changes both frequently and significantly over time, with documents being added, modified or just deleted at different times. In this context, previous work addressing the summarization of web documents has simply discarded the dynamic nature of the web, considering only the latest published version of each individual document. This paper proposes and addresses a new challenge - the automatic summarization of changes in dynamic text collections. In standard text summarization, retrieval techniques present a summary to the user by capturing the major points expressed in the most recent version of an entire document in a condensed form. In this new task, the goal is to obtain a summary that describes the most significant changes made to a document during a given period. In other words, the idea is to have a summary of the revisions made to a document over a specific period of time. This paper proposes different approaches to generate summaries using extractive summarization techniques. First, individual terms are scored and then this information is used to rank and select sentences to produce the final summary. A system based on Latent Dirichlet Allocation model (LDA) is used to find the hidden topic structures of changes. The purpose of using the LDA model is to identify separate topics where the changed terms from each topic are likely to carry at least one significant change. The different approaches are then compared with the previous work in this area. A collection of articles from Wikipedia, including their revision history, is used to evaluate the proposed system. For each article, a temporal interval and a reference summary from the article's content are selected manually. The articles and intervals in which a significant event occurred are carefully selected. The summaries produced by each of the approaches are evaluated comparatively to the manual summaries using ROUGE metrics. It is observed that the approach using the LDA model outperforms all the other approaches. Statistical tests reveal that the differences in ROUGE scores for the LDA-based approach is statistically significant at 99% over baseline.
Content
Vgl.: 10.1016/j.ipm.2015.06.002.
Footnote
Beitrag in einem Themenschwerpunkt "Time and information retrieval"
Object
Latent semantic analysis

Similar documents (author)

  1. Nunes, S.; Ribeiro, C.; David, G.: Term weighting based on document revision history (2011) 4.75
    4.7518578 = sum of:
      4.7518578 = sum of:
        2.1934702 = weight(author_txt:ribeiro in 1411) [ClassicSimilarity], result of:
          2.1934702 = score(doc=1411,freq=1.0), product of:
            0.6699865 = queryWeight, product of:
              8.730406 = idf(docFreq=18, maxDocs=43254)
              0.07674173 = queryNorm
            3.2739022 = fieldWeight in 1411, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.730406 = idf(docFreq=18, maxDocs=43254)
              0.375 = fieldNorm(doc=1411)
        2.5583873 = weight(author_txt:nunes in 1411) [ClassicSimilarity], result of:
          2.5583873 = score(doc=1411,freq=1.0), product of:
            0.7423734 = queryWeight, product of:
              1.0526359 = boost
              9.189939 = idf(docFreq=11, maxDocs=43254)
              0.07674173 = queryNorm
            3.446227 = fieldWeight in 1411, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.189939 = idf(docFreq=11, maxDocs=43254)
              0.375 = fieldNorm(doc=1411)
    
  2. Nunes, L.F.A.: Portugal: ten years of information development (1995) 2.13
    2.1319892 = sum of:
      2.1319892 = product of:
        4.2639785 = sum of:
          4.2639785 = weight(author_txt:nunes in 4381) [ClassicSimilarity], result of:
            4.2639785 = score(doc=4381,freq=1.0), product of:
              0.7423734 = queryWeight, product of:
                1.0526359 = boost
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.07674173 = queryNorm
              5.7437115 = fieldWeight in 4381, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.625 = fieldNorm(doc=4381)
        0.5 = coord(1/2)
    
  3. Ribeiro, F.: Subject indexing and authority control in archives : the need for subject indexing in archives and for an indexing policy using controlled language (1996) 1.83
    1.827892 = sum of:
      1.827892 = product of:
        3.655784 = sum of:
          3.655784 = weight(author_txt:ribeiro in 646) [ClassicSimilarity], result of:
            3.655784 = score(doc=646,freq=1.0), product of:
              0.6699865 = queryWeight, product of:
                8.730406 = idf(docFreq=18, maxDocs=43254)
                0.07674173 = queryNorm
              5.456504 = fieldWeight in 646, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.730406 = idf(docFreq=18, maxDocs=43254)
                0.625 = fieldNorm(doc=646)
        0.5 = coord(1/2)
    
  4. Ribeiro, F.: ¬The use of classification in archives as a means of organization, representation and retrieval of information (2014) 1.83
    1.827892 = sum of:
      1.827892 = product of:
        3.655784 = sum of:
          3.655784 = weight(author_txt:ribeiro in 2862) [ClassicSimilarity], result of:
            3.655784 = score(doc=2862,freq=1.0), product of:
              0.6699865 = queryWeight, product of:
                8.730406 = idf(docFreq=18, maxDocs=43254)
                0.07674173 = queryNorm
              5.456504 = fieldWeight in 2862, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.730406 = idf(docFreq=18, maxDocs=43254)
                0.625 = fieldNorm(doc=2862)
        0.5 = coord(1/2)
    
  5. Amaral, L.A. Nunes -> Nunes Amaral, L.A.: 1.81
    1.8090528 = sum of:
      1.8090528 = product of:
        3.6181056 = sum of:
          3.6181056 = weight(author_txt:nunes in 4143) [ClassicSimilarity], result of:
            3.6181056 = score(doc=4143,freq=2.0), product of:
              0.7423734 = queryWeight, product of:
                1.0526359 = boost
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.07674173 = queryNorm
              4.8737006 = fieldWeight in 4143, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.375 = fieldNorm(doc=4143)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Sankarasubramaniam, Y.; Ramanathan, K.; Ghosh, S.: Text summarization using Wikipedia (2014) 0.54
    0.5353239 = sum of:
      0.5353239 = product of:
        1.2166452 = sum of:
          0.023134403 = weight(abstract_txt:time in 4158) [ClassicSimilarity], result of:
            0.023134403 = score(doc=4158,freq=1.0), product of:
              0.089000806 = queryWeight, product of:
                1.1062301 = boost
                4.158956 = idf(docFreq=1836, maxDocs=43254)
                0.019344795 = queryNorm
              0.25993475 = fieldWeight in 4158, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.158956 = idf(docFreq=1836, maxDocs=43254)
                0.0625 = fieldNorm(doc=4158)
          0.01307525 = weight(abstract_txt:this in 4158) [ClassicSimilarity], result of:
            0.01307525 = score(doc=4158,freq=2.0), product of:
              0.060839895 = queryWeight, product of:
                1.2934741 = boost
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.019344795 = queryNorm
              0.21491244 = fieldWeight in 4158, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.0625 = fieldNorm(doc=4158)
          0.027601032 = weight(abstract_txt:model in 4158) [ClassicSimilarity], result of:
            0.027601032 = score(doc=4158,freq=1.0), product of:
              0.11019237 = queryWeight, product of:
                1.4213258 = boost
                4.0076866 = idf(docFreq=2136, maxDocs=43254)
                0.019344795 = queryNorm
              0.2504804 = fieldWeight in 4158, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0076866 = idf(docFreq=2136, maxDocs=43254)
                0.0625 = fieldNorm(doc=4158)
          0.049327083 = weight(abstract_txt:text in 4158) [ClassicSimilarity], result of:
            0.049327083 = score(doc=4158,freq=3.0), product of:
              0.112516925 = queryWeight, product of:
                1.4362392 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.019344795 = queryNorm
              0.438397 = fieldWeight in 4158, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.0625 = fieldNorm(doc=4158)
          0.03889541 = weight(abstract_txt:using in 4158) [ClassicSimilarity], result of:
            0.03889541 = score(doc=4158,freq=3.0), product of:
              0.103449844 = queryWeight, product of:
                1.539706 = boost
                3.4731848 = idf(docFreq=3646, maxDocs=43254)
                0.019344795 = queryNorm
              0.37598327 = fieldWeight in 4158, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4731848 = idf(docFreq=3646, maxDocs=43254)
                0.0625 = fieldNorm(doc=4158)
          0.16208929 = weight(abstract_txt:rouge in 4158) [ClassicSimilarity], result of:
            0.16208929 = score(doc=4158,freq=1.0), product of:
              0.28468257 = queryWeight, product of:
                1.615412 = boost
                9.109896 = idf(docFreq=12, maxDocs=43254)
                0.019344795 = queryNorm
              0.5693685 = fieldWeight in 4158, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.109896 = idf(docFreq=12, maxDocs=43254)
                0.0625 = fieldNorm(doc=4158)
          0.04284759 = weight(abstract_txt:approaches in 4158) [ClassicSimilarity], result of:
            0.04284759 = score(doc=4158,freq=1.0), product of:
              0.14773574 = queryWeight, product of:
                1.6457379 = boost
                4.640457 = idf(docFreq=1134, maxDocs=43254)
                0.019344795 = queryNorm
              0.29002857 = fieldWeight in 4158, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.640457 = idf(docFreq=1134, maxDocs=43254)
                0.0625 = fieldNorm(doc=4158)
          0.037627075 = weight(abstract_txt:each in 4158) [ClassicSimilarity], result of:
            0.037627075 = score(doc=4158,freq=1.0), product of:
              0.1459391 = queryWeight, product of:
                1.8287684 = boost
                4.125236 = idf(docFreq=1899, maxDocs=43254)
                0.019344795 = queryNorm
              0.25782725 = fieldWeight in 4158, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.125236 = idf(docFreq=1899, maxDocs=43254)
                0.0625 = fieldNorm(doc=4158)
          0.04214214 = weight(abstract_txt:document in 4158) [ClassicSimilarity], result of:
            0.04214214 = score(doc=4158,freq=1.0), product of:
              0.1573919 = queryWeight, product of:
                1.899171 = boost
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.019344795 = queryNorm
              0.26775292 = fieldWeight in 4158, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.0625 = fieldNorm(doc=4158)
          0.20590153 = weight(abstract_txt:summary in 4158) [ClassicSimilarity], result of:
            0.20590153 = score(doc=4158,freq=2.0), product of:
              0.359695 = queryWeight, product of:
                2.8710456 = boost
                6.476348 = idf(docFreq=180, maxDocs=43254)
                0.019344795 = queryNorm
              0.5724337 = fieldWeight in 4158, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.476348 = idf(docFreq=180, maxDocs=43254)
                0.0625 = fieldNorm(doc=4158)
          0.57400435 = weight(abstract_txt:summarization in 4158) [ClassicSimilarity], result of:
            0.57400435 = score(doc=4158,freq=6.0), product of:
              0.52495825 = queryWeight, product of:
                3.799495 = boost
                7.1422453 = idf(docFreq=92, maxDocs=43254)
                0.019344795 = queryNorm
              1.0934286 = fieldWeight in 4158, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.1422453 = idf(docFreq=92, maxDocs=43254)
                0.0625 = fieldNorm(doc=4158)
        0.44 = coord(11/25)
    
  2. Dunlavy, D.M.; O'Leary, D.P.; Conroy, J.M.; Schlesinger, J.D.: QCS: A system for querying, clustering and summarizing documents (2007) 0.46
    0.45780873 = sum of:
      0.45780873 = product of:
        1.0404744 = sum of:
          0.06545782 = weight(abstract_txt:latent in 2948) [ClassicSimilarity], result of:
            0.06545782 = score(doc=2948,freq=1.0), product of:
              0.17001751 = queryWeight, product of:
                1.2483883 = boost
                7.040116 = idf(docFreq=102, maxDocs=43254)
                0.019344795 = queryNorm
              0.38500634 = fieldWeight in 2948, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.040116 = idf(docFreq=102, maxDocs=43254)
                0.0546875 = fieldNorm(doc=2948)
          0.011440844 = weight(abstract_txt:this in 2948) [ClassicSimilarity], result of:
            0.011440844 = score(doc=2948,freq=2.0), product of:
              0.060839895 = queryWeight, product of:
                1.2934741 = boost
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.019344795 = queryNorm
              0.18804839 = fieldWeight in 2948, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.0546875 = fieldNorm(doc=2948)
          0.024150902 = weight(abstract_txt:model in 2948) [ClassicSimilarity], result of:
            0.024150902 = score(doc=2948,freq=1.0), product of:
              0.11019237 = queryWeight, product of:
                1.4213258 = boost
                4.0076866 = idf(docFreq=2136, maxDocs=43254)
                0.019344795 = queryNorm
              0.21917036 = fieldWeight in 2948, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0076866 = idf(docFreq=2136, maxDocs=43254)
                0.0546875 = fieldNorm(doc=2948)
          0.024919128 = weight(abstract_txt:text in 2948) [ClassicSimilarity], result of:
            0.024919128 = score(doc=2948,freq=1.0), product of:
              0.112516925 = queryWeight, product of:
                1.4362392 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.019344795 = queryNorm
              0.22147004 = fieldWeight in 2948, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.0546875 = fieldNorm(doc=2948)
          0.037008125 = weight(abstract_txt:documents in 2948) [ClassicSimilarity], result of:
            0.037008125 = score(doc=2948,freq=2.0), product of:
              0.1162476 = queryWeight, product of:
                1.4598556 = boost
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.019344795 = queryNorm
              0.31835604 = fieldWeight in 2948, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.0546875 = fieldNorm(doc=2948)
          0.019649243 = weight(abstract_txt:using in 2948) [ClassicSimilarity], result of:
            0.019649243 = score(doc=2948,freq=1.0), product of:
              0.103449844 = queryWeight, product of:
                1.539706 = boost
                3.4731848 = idf(docFreq=3646, maxDocs=43254)
                0.019344795 = queryNorm
              0.1899398 = fieldWeight in 2948, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4731848 = idf(docFreq=3646, maxDocs=43254)
                0.0546875 = fieldNorm(doc=2948)
          0.14182813 = weight(abstract_txt:rouge in 2948) [ClassicSimilarity], result of:
            0.14182813 = score(doc=2948,freq=1.0), product of:
              0.28468257 = queryWeight, product of:
                1.615412 = boost
                9.109896 = idf(docFreq=12, maxDocs=43254)
                0.019344795 = queryNorm
              0.49819744 = fieldWeight in 2948, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.109896 = idf(docFreq=12, maxDocs=43254)
                0.0546875 = fieldNorm(doc=2948)
          0.07361962 = weight(abstract_txt:each in 2948) [ClassicSimilarity], result of:
            0.07361962 = score(doc=2948,freq=5.0), product of:
              0.1459391 = queryWeight, product of:
                1.8287684 = boost
                4.125236 = idf(docFreq=1899, maxDocs=43254)
                0.019344795 = queryNorm
              0.5044544 = fieldWeight in 2948, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.125236 = idf(docFreq=1899, maxDocs=43254)
                0.0546875 = fieldNorm(doc=2948)
          0.052148238 = weight(abstract_txt:document in 2948) [ClassicSimilarity], result of:
            0.052148238 = score(doc=2948,freq=2.0), product of:
              0.1573919 = queryWeight, product of:
                1.899171 = boost
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.019344795 = queryNorm
              0.33132732 = fieldWeight in 2948, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.0546875 = fieldNorm(doc=2948)
          0.18016386 = weight(abstract_txt:summary in 2948) [ClassicSimilarity], result of:
            0.18016386 = score(doc=2948,freq=2.0), product of:
              0.359695 = queryWeight, product of:
                2.8710456 = boost
                6.476348 = idf(docFreq=180, maxDocs=43254)
                0.019344795 = queryNorm
              0.5008795 = fieldWeight in 2948, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.476348 = idf(docFreq=180, maxDocs=43254)
                0.0546875 = fieldNorm(doc=2948)
          0.4100885 = weight(abstract_txt:summarization in 2948) [ClassicSimilarity], result of:
            0.4100885 = score(doc=2948,freq=4.0), product of:
              0.52495825 = queryWeight, product of:
                3.799495 = boost
                7.1422453 = idf(docFreq=92, maxDocs=43254)
                0.019344795 = queryNorm
              0.78118306 = fieldWeight in 2948, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.1422453 = idf(docFreq=92, maxDocs=43254)
                0.0546875 = fieldNorm(doc=2948)
        0.44 = coord(11/25)
    
  3. Yang, C.C.; Wang, F.L.: Hierarchical summarization of large documents (2008) 0.45
    0.44513136 = sum of:
      0.44513136 = product of:
        1.236476 = sum of:
          0.009245599 = weight(abstract_txt:this in 3720) [ClassicSimilarity], result of:
            0.009245599 = score(doc=3720,freq=1.0), product of:
              0.060839895 = queryWeight, product of:
                1.2934741 = boost
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.019344795 = queryNorm
              0.15196605 = fieldWeight in 3720, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.0625 = fieldNorm(doc=3720)
          0.039033752 = weight(abstract_txt:model in 3720) [ClassicSimilarity], result of:
            0.039033752 = score(doc=3720,freq=2.0), product of:
              0.11019237 = queryWeight, product of:
                1.4213258 = boost
                4.0076866 = idf(docFreq=2136, maxDocs=43254)
                0.019344795 = queryNorm
              0.3542328 = fieldWeight in 3720, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0076866 = idf(docFreq=2136, maxDocs=43254)
                0.0625 = fieldNorm(doc=3720)
          0.028479004 = weight(abstract_txt:text in 3720) [ClassicSimilarity], result of:
            0.028479004 = score(doc=3720,freq=1.0), product of:
              0.112516925 = queryWeight, product of:
                1.4362392 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.019344795 = queryNorm
              0.25310862 = fieldWeight in 3720, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.0625 = fieldNorm(doc=3720)
          0.042294998 = weight(abstract_txt:documents in 3720) [ClassicSimilarity], result of:
            0.042294998 = score(doc=3720,freq=2.0), product of:
              0.1162476 = queryWeight, product of:
                1.4598556 = boost
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.019344795 = queryNorm
              0.36383545 = fieldWeight in 3720, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.0625 = fieldNorm(doc=3720)
          0.022456277 = weight(abstract_txt:using in 3720) [ClassicSimilarity], result of:
            0.022456277 = score(doc=3720,freq=1.0), product of:
              0.103449844 = queryWeight, product of:
                1.539706 = boost
                3.4731848 = idf(docFreq=3646, maxDocs=43254)
                0.019344795 = queryNorm
              0.21707405 = fieldWeight in 3720, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4731848 = idf(docFreq=3646, maxDocs=43254)
                0.0625 = fieldNorm(doc=3720)
          0.11129631 = weight(abstract_txt:summaries in 3720) [ClassicSimilarity], result of:
            0.11129631 = score(doc=3720,freq=1.0), product of:
              0.25363484 = queryWeight, product of:
                1.867467 = boost
                7.0208845 = idf(docFreq=104, maxDocs=43254)
                0.019344795 = queryNorm
              0.43880528 = fieldWeight in 3720, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0208845 = idf(docFreq=104, maxDocs=43254)
                0.0625 = fieldNorm(doc=3720)
          0.11149762 = weight(abstract_txt:document in 3720) [ClassicSimilarity], result of:
            0.11149762 = score(doc=3720,freq=7.0), product of:
              0.1573919 = queryWeight, product of:
                1.899171 = boost
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.019344795 = queryNorm
              0.7084076 = fieldWeight in 3720, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.0625 = fieldNorm(doc=3720)
          0.25217682 = weight(abstract_txt:summary in 3720) [ClassicSimilarity], result of:
            0.25217682 = score(doc=3720,freq=3.0), product of:
              0.359695 = queryWeight, product of:
                2.8710456 = boost
                6.476348 = idf(docFreq=180, maxDocs=43254)
                0.019344795 = queryNorm
              0.7010852 = fieldWeight in 3720, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.476348 = idf(docFreq=180, maxDocs=43254)
                0.0625 = fieldNorm(doc=3720)
          0.61999553 = weight(abstract_txt:summarization in 3720) [ClassicSimilarity], result of:
            0.61999553 = score(doc=3720,freq=7.0), product of:
              0.52495825 = queryWeight, product of:
                3.799495 = boost
                7.1422453 = idf(docFreq=92, maxDocs=43254)
                0.019344795 = queryNorm
              1.1810378 = fieldWeight in 3720, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.1422453 = idf(docFreq=92, maxDocs=43254)
                0.0625 = fieldNorm(doc=3720)
        0.36 = coord(9/25)
    
  4. Yulianti, E.; Huspi, S.; Sanderson, M.: Tweet-biased summarization (2016) 0.42
    0.41594803 = sum of:
      0.41594803 = product of:
        1.1554111 = sum of:
          0.009245599 = weight(abstract_txt:this in 4391) [ClassicSimilarity], result of:
            0.009245599 = score(doc=4391,freq=1.0), product of:
              0.060839895 = queryWeight, product of:
                1.2934741 = boost
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.019344795 = queryNorm
              0.15196605 = fieldWeight in 4391, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.0625 = fieldNorm(doc=4391)
          0.029907081 = weight(abstract_txt:documents in 4391) [ClassicSimilarity], result of:
            0.029907081 = score(doc=4391,freq=1.0), product of:
              0.1162476 = queryWeight, product of:
                1.4598556 = boost
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.019344795 = queryNorm
              0.25727051 = fieldWeight in 4391, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.0625 = fieldNorm(doc=4391)
          0.033071373 = weight(abstract_txt:over in 4391) [ClassicSimilarity], result of:
            0.033071373 = score(doc=4391,freq=1.0), product of:
              0.12430907 = queryWeight, product of:
                1.5096257 = boost
                4.2566643 = idf(docFreq=1665, maxDocs=43254)
                0.019344795 = queryNorm
              0.26604152 = fieldWeight in 4391, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2566643 = idf(docFreq=1665, maxDocs=43254)
                0.0625 = fieldNorm(doc=4391)
          0.022456277 = weight(abstract_txt:using in 4391) [ClassicSimilarity], result of:
            0.022456277 = score(doc=4391,freq=1.0), product of:
              0.103449844 = queryWeight, product of:
                1.539706 = boost
                3.4731848 = idf(docFreq=3646, maxDocs=43254)
                0.019344795 = queryNorm
              0.21707405 = fieldWeight in 4391, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4731848 = idf(docFreq=3646, maxDocs=43254)
                0.0625 = fieldNorm(doc=4391)
          0.16208929 = weight(abstract_txt:rouge in 4391) [ClassicSimilarity], result of:
            0.16208929 = score(doc=4391,freq=1.0), product of:
              0.28468257 = queryWeight, product of:
                1.615412 = boost
                9.109896 = idf(docFreq=12, maxDocs=43254)
                0.019344795 = queryNorm
              0.5693685 = fieldWeight in 4391, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.109896 = idf(docFreq=12, maxDocs=43254)
                0.0625 = fieldNorm(doc=4391)
          0.15739673 = weight(abstract_txt:summaries in 4391) [ClassicSimilarity], result of:
            0.15739673 = score(doc=4391,freq=2.0), product of:
              0.25363484 = queryWeight, product of:
                1.867467 = boost
                7.0208845 = idf(docFreq=104, maxDocs=43254)
                0.019344795 = queryNorm
              0.62056434 = fieldWeight in 4391, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.0208845 = idf(docFreq=104, maxDocs=43254)
                0.0625 = fieldNorm(doc=4391)
          0.08428428 = weight(abstract_txt:document in 4391) [ClassicSimilarity], result of:
            0.08428428 = score(doc=4391,freq=4.0), product of:
              0.1573919 = queryWeight, product of:
                1.899171 = boost
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.019344795 = queryNorm
              0.53550583 = fieldWeight in 4391, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.0625 = fieldNorm(doc=4391)
          0.3255589 = weight(abstract_txt:summary in 4391) [ClassicSimilarity], result of:
            0.3255589 = score(doc=4391,freq=5.0), product of:
              0.359695 = queryWeight, product of:
                2.8710456 = boost
                6.476348 = idf(docFreq=180, maxDocs=43254)
                0.019344795 = queryNorm
              0.9050971 = fieldWeight in 4391, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.476348 = idf(docFreq=180, maxDocs=43254)
                0.0625 = fieldNorm(doc=4391)
          0.33140156 = weight(abstract_txt:summarization in 4391) [ClassicSimilarity], result of:
            0.33140156 = score(doc=4391,freq=2.0), product of:
              0.52495825 = queryWeight, product of:
                3.799495 = boost
                7.1422453 = idf(docFreq=92, maxDocs=43254)
                0.019344795 = queryNorm
              0.6312913 = fieldWeight in 4391, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1422453 = idf(docFreq=92, maxDocs=43254)
                0.0625 = fieldNorm(doc=4391)
        0.36 = coord(9/25)
    
  5. Galgani, F.; Compton, P.; Hoffmann, A.: Summarization based on bi-directional citation analysis (2015) 0.41
    0.4117325 = sum of:
      0.4117325 = product of:
        1.1437013 = sum of:
          0.01307525 = weight(abstract_txt:this in 4150) [ClassicSimilarity], result of:
            0.01307525 = score(doc=4150,freq=2.0), product of:
              0.060839895 = queryWeight, product of:
                1.2934741 = boost
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.019344795 = queryNorm
              0.21491244 = fieldWeight in 4150, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.0625 = fieldNorm(doc=4150)
          0.049327083 = weight(abstract_txt:text in 4150) [ClassicSimilarity], result of:
            0.049327083 = score(doc=4150,freq=3.0), product of:
              0.112516925 = queryWeight, product of:
                1.4362392 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.019344795 = queryNorm
              0.438397 = fieldWeight in 4150, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.0625 = fieldNorm(doc=4150)
          0.051800583 = weight(abstract_txt:documents in 4150) [ClassicSimilarity], result of:
            0.051800583 = score(doc=4150,freq=3.0), product of:
              0.1162476 = queryWeight, product of:
                1.4598556 = boost
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.019344795 = queryNorm
              0.4456056 = fieldWeight in 4150, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.0625 = fieldNorm(doc=4150)
          0.022456277 = weight(abstract_txt:using in 4150) [ClassicSimilarity], result of:
            0.022456277 = score(doc=4150,freq=1.0), product of:
              0.103449844 = queryWeight, product of:
                1.539706 = boost
                3.4731848 = idf(docFreq=3646, maxDocs=43254)
                0.019344795 = queryNorm
              0.21707405 = fieldWeight in 4150, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4731848 = idf(docFreq=3646, maxDocs=43254)
                0.0625 = fieldNorm(doc=4150)
          0.04284759 = weight(abstract_txt:approaches in 4150) [ClassicSimilarity], result of:
            0.04284759 = score(doc=4150,freq=1.0), product of:
              0.14773574 = queryWeight, product of:
                1.6457379 = boost
                4.640457 = idf(docFreq=1134, maxDocs=43254)
                0.019344795 = queryNorm
              0.29002857 = fieldWeight in 4150, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.640457 = idf(docFreq=1134, maxDocs=43254)
                0.0625 = fieldNorm(doc=4150)
          0.11129631 = weight(abstract_txt:summaries in 4150) [ClassicSimilarity], result of:
            0.11129631 = score(doc=4150,freq=1.0), product of:
              0.25363484 = queryWeight, product of:
                1.867467 = boost
                7.0208845 = idf(docFreq=104, maxDocs=43254)
                0.019344795 = queryNorm
              0.43880528 = fieldWeight in 4150, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0208845 = idf(docFreq=104, maxDocs=43254)
                0.0625 = fieldNorm(doc=4150)
          0.07299233 = weight(abstract_txt:document in 4150) [ClassicSimilarity], result of:
            0.07299233 = score(doc=4150,freq=3.0), product of:
              0.1573919 = queryWeight, product of:
                1.899171 = boost
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.019344795 = queryNorm
              0.46376166 = fieldWeight in 4150, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.0625 = fieldNorm(doc=4150)
          0.20590153 = weight(abstract_txt:summary in 4150) [ClassicSimilarity], result of:
            0.20590153 = score(doc=4150,freq=2.0), product of:
              0.359695 = queryWeight, product of:
                2.8710456 = boost
                6.476348 = idf(docFreq=180, maxDocs=43254)
                0.019344795 = queryNorm
              0.5724337 = fieldWeight in 4150, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.476348 = idf(docFreq=180, maxDocs=43254)
                0.0625 = fieldNorm(doc=4150)
          0.57400435 = weight(abstract_txt:summarization in 4150) [ClassicSimilarity], result of:
            0.57400435 = score(doc=4150,freq=6.0), product of:
              0.52495825 = queryWeight, product of:
                3.799495 = boost
                7.1422453 = idf(docFreq=92, maxDocs=43254)
                0.019344795 = queryNorm
              1.0934286 = fieldWeight in 4150, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.1422453 = idf(docFreq=92, maxDocs=43254)
                0.0625 = fieldNorm(doc=4150)
        0.36 = coord(9/25)