Document (#42822)

Author
Xiong, C.
Title
Knowledge based text representations for information retrieval
Imprint
Pittsburgh, PA : Carnegie Mellon University, School of Computer Science, Language Technologies Institute
Year
2016
Pages
iii, 82 pp.
Abstract
The successes of information retrieval (IR) in recent decades were built upon bag-of-words representations. Effective as it is, bag-of-words is only a shallow text understanding; there is a limited amount of information for document ranking in the word space. This dissertation goes beyond words and builds knowledge based text representations, which embed the external and carefully curated information from knowledge bases, and provide richer and structured evidence for more advanced information retrieval systems. This thesis research first builds query representations with entities associated with the query. Entities' descriptions are used by query expansion techniques that enrich the query with explanation terms. Then we present a general framework that represents a query with entities that appear in the query, are retrieved by the query, or frequently show up in the top retrieved documents. A latent space model is developed to jointly learn the connections from query to entities and the ranking of documents, modeling the external evidence from knowledge bases and internal ranking features cooperatively. To further improve the quality of relevant entities, a defining factor of our query representations, we introduce learning to rank to entity search and retrieve better entities from knowledge bases. In the document representation part, this thesis research also moves one step forward with a bag-of-entities model, in which documents are represented by their automatic entity annotations, and the ranking is performed in the entity space.
This proposal includes plans to improve the quality of relevant entities with a co-learning framework that learns from both entity labels and document labels. We also plan to develop a hybrid ranking system that combines word based and entity based representations while accounting for their uncertainties. Finally, we plan to enrich the text representations with connections between entities. We propose several ways to infer entity graph representations for texts, and to rank documents using their structure representations. This dissertation overcomes the limitation of word based representations with external and carefully curated information from knowledge bases. We believe this thesis research is a solid start towards the new generation of intelligent, semantic, and structured information retrieval.
Content
Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Language and Information Technologies. Cf.: https://www.cs.cmu.edu/~cx/papers/knowledge_based_text_representation.pdf.
Theme
Knowledge representation

Similar documents (content)

  1. Han, B.; Chen, L.; Tian, X.: Knowledge based collection selection for distributed information retrieval (2018) 0.43
    0.433 = sum of:
      0.433 = product of:
        0.9840909 = sum of:
          0.0072853477 = weight(abstract_txt:this in 4754) [ClassicSimilarity], result of:
            0.0072853477 = score(doc=4754,freq=1.0), product of:
              0.047940627 = queryWeight, product of:
                1.0258595 = boost
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.019219818 = queryNorm
              0.15196605 = fieldWeight in 4754, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.0625 = fieldNorm(doc=4754)
          0.07172706 = weight(abstract_txt:enrich in 4754) [ClassicSimilarity], result of:
            0.07172706 = score(doc=4754,freq=1.0), product of:
              0.15269275 = queryWeight, product of:
                1.0570234 = boost
                7.515962 = idf(docFreq=63, maxDocs=43254)
                0.019219818 = queryNorm
              0.46974763 = fieldWeight in 4754, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.515962 = idf(docFreq=63, maxDocs=43254)
                0.0625 = fieldNorm(doc=4754)
          0.024016116 = weight(abstract_txt:based in 4754) [ClassicSimilarity], result of:
            0.024016116 = score(doc=4754,freq=3.0), product of:
              0.06928478 = queryWeight, product of:
                1.125808 = boost
                3.2020218 = idf(docFreq=4782, maxDocs=43254)
                0.019219818 = queryNorm
              0.34662902 = fieldWeight in 4754, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.2020218 = idf(docFreq=4782, maxDocs=43254)
                0.0625 = fieldNorm(doc=4754)
          0.06736512 = weight(abstract_txt:words in 4754) [ClassicSimilarity], result of:
            0.06736512 = score(doc=4754,freq=3.0), product of:
              0.116227746 = queryWeight, product of:
                1.1294732 = boost
                5.354077 = idf(docFreq=555, maxDocs=43254)
                0.019219818 = queryNorm
              0.5795958 = fieldWeight in 4754, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.354077 = idf(docFreq=555, maxDocs=43254)
                0.0625 = fieldNorm(doc=4754)
          0.033327624 = weight(abstract_txt:documents in 4754) [ClassicSimilarity], result of:
            0.033327624 = score(doc=4754,freq=2.0), product of:
              0.09160081 = queryWeight, product of:
                1.1578174 = boost
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.019219818 = queryNorm
              0.36383545 = fieldWeight in 4754, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.0625 = fieldNorm(doc=4754)
          0.015409009 = weight(abstract_txt:from in 4754) [ClassicSimilarity], result of:
            0.015409009 = score(doc=4754,freq=2.0), product of:
              0.0626965 = queryWeight, product of:
                1.1731611 = boost
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.019219818 = queryNorm
              0.24577142 = fieldWeight in 4754, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.0625 = fieldNorm(doc=4754)
          0.032873902 = weight(abstract_txt:knowledge in 4754) [ClassicSimilarity], result of:
            0.032873902 = score(doc=4754,freq=2.0), product of:
              0.103902906 = queryWeight, product of:
                1.5102535 = boost
                3.5795512 = idf(docFreq=3278, maxDocs=43254)
                0.019219818 = queryNorm
              0.3163906 = fieldWeight in 4754, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5795512 = idf(docFreq=3278, maxDocs=43254)
                0.0625 = fieldNorm(doc=4754)
          0.18078308 = weight(abstract_txt:entity in 4754) [ClassicSimilarity], result of:
            0.18078308 = score(doc=4754,freq=2.0), product of:
              0.3237169 = queryWeight, product of:
                2.6657457 = boost
                6.318259 = idf(docFreq=211, maxDocs=43254)
                0.019219818 = queryNorm
              0.5584604 = fieldWeight in 4754, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.318259 = idf(docFreq=211, maxDocs=43254)
                0.0625 = fieldNorm(doc=4754)
          0.21400014 = weight(abstract_txt:query in 4754) [ClassicSimilarity], result of:
            0.21400014 = score(doc=4754,freq=7.0), product of:
              0.273114 = queryWeight, product of:
                2.9988422 = boost
                4.738502 = idf(docFreq=1028, maxDocs=43254)
                0.019219818 = queryNorm
              0.7835561 = fieldWeight in 4754, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.738502 = idf(docFreq=1028, maxDocs=43254)
                0.0625 = fieldNorm(doc=4754)
          0.15280259 = weight(abstract_txt:entities in 4754) [ClassicSimilarity], result of:
            0.15280259 = score(doc=4754,freq=1.0), product of:
              0.41736984 = queryWeight, product of:
                3.7071674 = boost
                5.8577337 = idf(docFreq=335, maxDocs=43254)
                0.019219818 = queryNorm
              0.36610836 = fieldWeight in 4754, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8577337 = idf(docFreq=335, maxDocs=43254)
                0.0625 = fieldNorm(doc=4754)
          0.18450095 = weight(abstract_txt:representations in 4754) [ClassicSimilarity], result of:
            0.18450095 = score(doc=4754,freq=1.0), product of:
              0.4901761 = queryWeight, product of:
                4.2348347 = boost
                6.022356 = idf(docFreq=284, maxDocs=43254)
                0.019219818 = queryNorm
              0.37639725 = fieldWeight in 4754, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.022356 = idf(docFreq=284, maxDocs=43254)
                0.0625 = fieldNorm(doc=4754)
        0.44 = coord(11/25)
    
  2. Vechtomova, O.; Robertson, S.E.: A domain-independent approach to finding related entities (2012) 0.33
    0.33429208 = sum of:
      0.33429208 = product of:
        1.1939003 = sum of:
          0.0091066845 = weight(abstract_txt:this in 4198) [ClassicSimilarity], result of:
            0.0091066845 = score(doc=4198,freq=1.0), product of:
              0.047940627 = queryWeight, product of:
                1.0258595 = boost
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.019219818 = queryNorm
              0.18995756 = fieldWeight in 4198, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.078125 = fieldNorm(doc=4198)
          0.029457737 = weight(abstract_txt:documents in 4198) [ClassicSimilarity], result of:
            0.029457737 = score(doc=4198,freq=1.0), product of:
              0.09160081 = queryWeight, product of:
                1.1578174 = boost
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.019219818 = queryNorm
              0.32158816 = fieldWeight in 4198, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.078125 = fieldNorm(doc=4198)
          0.023590129 = weight(abstract_txt:from in 4198) [ClassicSimilarity], result of:
            0.023590129 = score(doc=4198,freq=3.0), product of:
              0.0626965 = queryWeight, product of:
                1.1731611 = boost
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.019219818 = queryNorm
              0.3762591 = fieldWeight in 4198, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.078125 = fieldNorm(doc=4198)
          0.015038761 = weight(abstract_txt:with in 4198) [ClassicSimilarity], result of:
            0.015038761 = score(doc=4198,freq=1.0), product of:
              0.076671734 = queryWeight, product of:
                1.58891 = boost
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.019219818 = queryNorm
              0.19614479 = fieldWeight in 4198, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.078125 = fieldNorm(doc=4198)
          0.42276773 = weight(abstract_txt:entity in 4198) [ClassicSimilarity], result of:
            0.42276773 = score(doc=4198,freq=7.0), product of:
              0.3237169 = queryWeight, product of:
                2.6657457 = boost
                6.318259 = idf(docFreq=211, maxDocs=43254)
                0.019219818 = queryNorm
              1.3059797 = fieldWeight in 4198, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.318259 = idf(docFreq=211, maxDocs=43254)
                0.078125 = fieldNorm(doc=4198)
          0.22607891 = weight(abstract_txt:query in 4198) [ClassicSimilarity], result of:
            0.22607891 = score(doc=4198,freq=5.0), product of:
              0.273114 = queryWeight, product of:
                2.9988422 = boost
                4.738502 = idf(docFreq=1028, maxDocs=43254)
                0.019219818 = queryNorm
              0.8277822 = fieldWeight in 4198, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.738502 = idf(docFreq=1028, maxDocs=43254)
                0.078125 = fieldNorm(doc=4198)
          0.46786046 = weight(abstract_txt:entities in 4198) [ClassicSimilarity], result of:
            0.46786046 = score(doc=4198,freq=6.0), product of:
              0.41736984 = queryWeight, product of:
                3.7071674 = boost
                5.8577337 = idf(docFreq=335, maxDocs=43254)
                0.019219818 = queryNorm
              1.1209733 = fieldWeight in 4198, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.8577337 = idf(docFreq=335, maxDocs=43254)
                0.078125 = fieldNorm(doc=4198)
        0.28 = coord(7/25)
    
  3. Soulier, L.; Jabeur, L.B.; Tamine, L.; Bahsoun, W.: On ranking relevant entities in heterogeneous networks using a language-based model (2013) 0.31
    0.3091143 = sum of:
      0.3091143 = product of:
        0.7727858 = sum of:
          0.012618592 = weight(abstract_txt:this in 2129) [ClassicSimilarity], result of:
            0.012618592 = score(doc=2129,freq=3.0), product of:
              0.047940627 = queryWeight, product of:
                1.0258595 = boost
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.019219818 = queryNorm
              0.26321292 = fieldWeight in 2129, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.0625 = fieldNorm(doc=2129)
          0.024016116 = weight(abstract_txt:based in 2129) [ClassicSimilarity], result of:
            0.024016116 = score(doc=2129,freq=3.0), product of:
              0.06928478 = queryWeight, product of:
                1.125808 = boost
                3.2020218 = idf(docFreq=4782, maxDocs=43254)
                0.019219818 = queryNorm
              0.34662902 = fieldWeight in 2129, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.2020218 = idf(docFreq=4782, maxDocs=43254)
                0.0625 = fieldNorm(doc=2129)
          0.023566188 = weight(abstract_txt:documents in 2129) [ClassicSimilarity], result of:
            0.023566188 = score(doc=2129,freq=1.0), product of:
              0.09160081 = queryWeight, product of:
                1.1578174 = boost
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.019219818 = queryNorm
              0.25727051 = fieldWeight in 2129, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.0625 = fieldNorm(doc=2129)
          0.010895815 = weight(abstract_txt:from in 2129) [ClassicSimilarity], result of:
            0.010895815 = score(doc=2129,freq=1.0), product of:
              0.0626965 = queryWeight, product of:
                1.1731611 = boost
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.019219818 = queryNorm
              0.17378664 = fieldWeight in 2129, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.0625 = fieldNorm(doc=2129)
          0.014639435 = weight(abstract_txt:information in 2129) [ClassicSimilarity], result of:
            0.014639435 = score(doc=2129,freq=3.0), product of:
              0.0557222 = queryWeight, product of:
                1.1946028 = boost
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.019219818 = queryNorm
              0.26272178 = fieldWeight in 2129, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.0625 = fieldNorm(doc=2129)
          0.0120310085 = weight(abstract_txt:with in 2129) [ClassicSimilarity], result of:
            0.0120310085 = score(doc=2129,freq=1.0), product of:
              0.076671734 = queryWeight, product of:
                1.58891 = boost
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.019219818 = queryNorm
              0.15691583 = fieldWeight in 2129, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.0625 = fieldNorm(doc=2129)
          0.07424244 = weight(abstract_txt:ranking in 2129) [ClassicSimilarity], result of:
            0.07424244 = score(doc=2129,freq=1.0), product of:
              0.21205309 = queryWeight, product of:
                1.9695531 = boost
                5.6018004 = idf(docFreq=433, maxDocs=43254)
                0.019219818 = queryNorm
              0.35011253 = fieldWeight in 2129, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6018004 = idf(docFreq=433, maxDocs=43254)
                0.0625 = fieldNorm(doc=2129)
          0.18078308 = weight(abstract_txt:entity in 2129) [ClassicSimilarity], result of:
            0.18078308 = score(doc=2129,freq=2.0), product of:
              0.3237169 = queryWeight, product of:
                2.6657457 = boost
                6.318259 = idf(docFreq=211, maxDocs=43254)
                0.019219818 = queryNorm
              0.5584604 = fieldWeight in 2129, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.318259 = idf(docFreq=211, maxDocs=43254)
                0.0625 = fieldNorm(doc=2129)
          0.11438789 = weight(abstract_txt:query in 2129) [ClassicSimilarity], result of:
            0.11438789 = score(doc=2129,freq=2.0), product of:
              0.273114 = queryWeight, product of:
                2.9988422 = boost
                4.738502 = idf(docFreq=1028, maxDocs=43254)
                0.019219818 = queryNorm
              0.41882837 = fieldWeight in 2129, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.738502 = idf(docFreq=1028, maxDocs=43254)
                0.0625 = fieldNorm(doc=2129)
          0.30560517 = weight(abstract_txt:entities in 2129) [ClassicSimilarity], result of:
            0.30560517 = score(doc=2129,freq=4.0), product of:
              0.41736984 = queryWeight, product of:
                3.7071674 = boost
                5.8577337 = idf(docFreq=335, maxDocs=43254)
                0.019219818 = queryNorm
              0.7322167 = fieldWeight in 2129, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.8577337 = idf(docFreq=335, maxDocs=43254)
                0.0625 = fieldNorm(doc=2129)
        0.4 = coord(10/25)
    
  4. Zhao, G.; Wu, J.; Wang, D.; Li, T.: Entity disambiguation to Wikipedia using collective ranking (2016) 0.30
    0.29624867 = sum of:
      0.29624867 = product of:
        0.822913 = sum of:
          0.012878796 = weight(abstract_txt:this in 4731) [ClassicSimilarity], result of:
            0.012878796 = score(doc=4731,freq=2.0), product of:
              0.047940627 = queryWeight, product of:
                1.0258595 = boost
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.019219818 = queryNorm
              0.26864055 = fieldWeight in 4731, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.078125 = fieldNorm(doc=4731)
          0.017332138 = weight(abstract_txt:based in 4731) [ClassicSimilarity], result of:
            0.017332138 = score(doc=4731,freq=1.0), product of:
              0.06928478 = queryWeight, product of:
                1.125808 = boost
                3.2020218 = idf(docFreq=4782, maxDocs=43254)
                0.019219818 = queryNorm
              0.25015795 = fieldWeight in 4731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2020218 = idf(docFreq=4782, maxDocs=43254)
                0.078125 = fieldNorm(doc=4731)
          0.048585955 = weight(abstract_txt:text in 4731) [ClassicSimilarity], result of:
            0.048585955 = score(doc=4731,freq=3.0), product of:
              0.08866111 = queryWeight, product of:
                1.1390872 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.019219818 = queryNorm
              0.5479962 = fieldWeight in 4731, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.078125 = fieldNorm(doc=4731)
          0.010565103 = weight(abstract_txt:information in 4731) [ClassicSimilarity], result of:
            0.010565103 = score(doc=4731,freq=1.0), product of:
              0.0557222 = queryWeight, product of:
                1.1946028 = boost
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.019219818 = queryNorm
              0.18960312 = fieldWeight in 4731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.078125 = fieldNorm(doc=4731)
          0.029056702 = weight(abstract_txt:knowledge in 4731) [ClassicSimilarity], result of:
            0.029056702 = score(doc=4731,freq=1.0), product of:
              0.103902906 = queryWeight, product of:
                1.5102535 = boost
                3.5795512 = idf(docFreq=3278, maxDocs=43254)
                0.019219818 = queryNorm
              0.27965245 = fieldWeight in 4731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5795512 = idf(docFreq=3278, maxDocs=43254)
                0.078125 = fieldNorm(doc=4731)
          0.09280305 = weight(abstract_txt:ranking in 4731) [ClassicSimilarity], result of:
            0.09280305 = score(doc=4731,freq=1.0), product of:
              0.21205309 = queryWeight, product of:
                1.9695531 = boost
                5.6018004 = idf(docFreq=433, maxDocs=43254)
                0.019219818 = queryNorm
              0.43764067 = fieldWeight in 4731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6018004 = idf(docFreq=433, maxDocs=43254)
                0.078125 = fieldNorm(doc=4731)
          0.31958237 = weight(abstract_txt:entity in 4731) [ClassicSimilarity], result of:
            0.31958237 = score(doc=4731,freq=4.0), product of:
              0.3237169 = queryWeight, product of:
                2.6657457 = boost
                6.318259 = idf(docFreq=211, maxDocs=43254)
                0.019219818 = queryNorm
              0.9872279 = fieldWeight in 4731, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.318259 = idf(docFreq=211, maxDocs=43254)
                0.078125 = fieldNorm(doc=4731)
          0.10110556 = weight(abstract_txt:query in 4731) [ClassicSimilarity], result of:
            0.10110556 = score(doc=4731,freq=1.0), product of:
              0.273114 = queryWeight, product of:
                2.9988422 = boost
                4.738502 = idf(docFreq=1028, maxDocs=43254)
                0.019219818 = queryNorm
              0.37019548 = fieldWeight in 4731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.738502 = idf(docFreq=1028, maxDocs=43254)
                0.078125 = fieldNorm(doc=4731)
          0.19100325 = weight(abstract_txt:entities in 4731) [ClassicSimilarity], result of:
            0.19100325 = score(doc=4731,freq=1.0), product of:
              0.41736984 = queryWeight, product of:
                3.7071674 = boost
                5.8577337 = idf(docFreq=335, maxDocs=43254)
                0.019219818 = queryNorm
              0.45763546 = fieldWeight in 4731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8577337 = idf(docFreq=335, maxDocs=43254)
                0.078125 = fieldNorm(doc=4731)
        0.36 = coord(9/25)
    
  5. Aker, A.; Gaizauskas, R.: Generating descriptive multi-document summaries of geo-located entities using entity type models (2015) 0.29
    0.29097986 = sum of:
      0.29097986 = product of:
        0.9093121 = sum of:
          0.0072853477 = weight(abstract_txt:this in 3191) [ClassicSimilarity], result of:
            0.0072853477 = score(doc=3191,freq=1.0), product of:
              0.047940627 = queryWeight, product of:
                1.0258595 = boost
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.019219818 = queryNorm
              0.15196605 = fieldWeight in 3191, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.0625 = fieldNorm(doc=3191)
          0.055003386 = weight(abstract_txt:words in 3191) [ClassicSimilarity], result of:
            0.055003386 = score(doc=3191,freq=2.0), product of:
              0.116227746 = queryWeight, product of:
                1.1294732 = boost
                5.354077 = idf(docFreq=555, maxDocs=43254)
                0.019219818 = queryNorm
              0.473238 = fieldWeight in 3191, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.354077 = idf(docFreq=555, maxDocs=43254)
                0.0625 = fieldNorm(doc=3191)
          0.031736214 = weight(abstract_txt:text in 3191) [ClassicSimilarity], result of:
            0.031736214 = score(doc=3191,freq=2.0), product of:
              0.08866111 = queryWeight, product of:
                1.1390872 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.019219818 = queryNorm
              0.35794964 = fieldWeight in 3191, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.0625 = fieldNorm(doc=3191)
          0.010895815 = weight(abstract_txt:from in 3191) [ClassicSimilarity], result of:
            0.010895815 = score(doc=3191,freq=1.0), product of:
              0.0626965 = queryWeight, product of:
                1.1731611 = boost
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.019219818 = queryNorm
              0.17378664 = fieldWeight in 3191, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.0625 = fieldNorm(doc=3191)
          0.017014416 = weight(abstract_txt:with in 3191) [ClassicSimilarity], result of:
            0.017014416 = score(doc=3191,freq=2.0), product of:
              0.076671734 = queryWeight, product of:
                1.58891 = boost
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.019219818 = queryNorm
              0.22191249 = fieldWeight in 3191, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.0625 = fieldNorm(doc=3191)
          0.3382142 = weight(abstract_txt:entity in 3191) [ClassicSimilarity], result of:
            0.3382142 = score(doc=3191,freq=7.0), product of:
              0.3237169 = queryWeight, product of:
                2.6657457 = boost
                6.318259 = idf(docFreq=211, maxDocs=43254)
                0.019219818 = queryNorm
              1.0447838 = fieldWeight in 3191, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.318259 = idf(docFreq=211, maxDocs=43254)
                0.0625 = fieldNorm(doc=3191)
          0.26466185 = weight(abstract_txt:entities in 3191) [ClassicSimilarity], result of:
            0.26466185 = score(doc=3191,freq=3.0), product of:
              0.41736984 = queryWeight, product of:
                3.7071674 = boost
                5.8577337 = idf(docFreq=335, maxDocs=43254)
                0.019219818 = queryNorm
              0.63411826 = fieldWeight in 3191, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.8577337 = idf(docFreq=335, maxDocs=43254)
                0.0625 = fieldNorm(doc=3191)
          0.18450095 = weight(abstract_txt:representations in 3191) [ClassicSimilarity], result of:
            0.18450095 = score(doc=3191,freq=1.0), product of:
              0.4901761 = queryWeight, product of:
                4.2348347 = boost
                6.022356 = idf(docFreq=284, maxDocs=43254)
                0.019219818 = queryNorm
              0.37639725 = fieldWeight in 3191, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.022356 = idf(docFreq=284, maxDocs=43254)
                0.0625 = fieldNorm(doc=3191)
        0.32 = coord(8/25)
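
The score trees above can be recomputed by hand. The following is a minimal Python sketch, not the catalog's actual code: the function names (`idf`, `term_score`, `document_score`) are hypothetical, and the formulas assume the standard Lucene ClassicSimilarity definitions, tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which appear consistent with the figures listed in this record.

```python
import math


def idf(doc_freq, max_docs):
    # Inverse document frequency, assuming the ClassicSimilarity definition.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    # One "weight(abstract_txt:term in doc)" node: queryWeight * fieldWeight.
    tf = math.sqrt(freq)
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = tf * term_idf * field_norm
    return query_weight * field_weight


def document_score(term_scores, query_term_count):
    # Sum of the matching term scores, scaled by coord(matched / total).
    coord = len(term_scores) / query_term_count
    return sum(term_scores) * coord


# Values copied from the "enrich" node of the first similar document (doc 4754).
enrich = term_score(freq=1.0, doc_freq=63, max_docs=43254,
                    boost=1.0570234, query_norm=0.019219818,
                    field_norm=0.0625)
print(enrich)
```

With these inputs the result agrees with the listed 0.07172706 up to floating-point rounding (Lucene computes in single precision). Multiplying the sum of the eleven term scores of the first entry, 0.9840909, by coord(11/25) = 0.44 likewise gives the displayed total of 0.433.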