Document (#38443)

Author
Mesquita, L.A.P.
Souza, R.R.
Baracho Porto, R.M.A.
Title
Noun phrases in automatic indexing : a structural analysis of the distribution of relevant terms in doctoral theses
Source
Knowledge organization in the 21st century: between historical patterns and future prospects. Proceedings of the Thirteenth International ISKO Conference 19-22 May 2014, Kraków, Poland. Ed.: Wieslaw Babik
Imprint
Würzburg : Ergon Verlag
Year
2014
Pages
327-334
Series
Advances in knowledge organization; vol. 14
Abstract
The main objective of this research was to analyze whether relevant terms show a characteristic distribution over a scientific text that could serve as a criterion for automatic indexing. The only terms considered in this study were full noun phrases contained in the texts themselves. The corpus comprised 98 doctoral theses from eight areas of knowledge at a single university. Initially, 20 full noun phrases were automatically extracted from each text as candidates for the most relevant terms, and the author of each text assigned each of the 20 noun phrases a relevance value from 0 (not relevant) to 6 (highly relevant). Only 22.1 % of the noun phrases were considered not relevant. The relevance values assigned by the authors were then associated with the positions of the terms in the text. Each full noun phrase found in the text counted as one valid linear position. The results show how relevance is distributed under two types of position: linear, with values consolidated into ten equal consecutive parts; and structural, based on parts of the text (such as introduction, development and conclusion). As a result of considerable importance, all areas of knowledge related to the Natural Sciences showed one characteristic distribution of relevant terms, and all areas related to the Social Sciences showed another, shared among themselves but distinct from that of the Natural Sciences. The difference in distribution behavior between the Natural and Social Sciences can be clearly visualized through graphs. All behaviors, including the general behavior of all areas of knowledge together, were characterized by polynomial equations and can be applied in the future as criteria for automatic indexing.
To date, this work is unprecedented for two reasons: it presents a method for characterizing the distribution of relevant terms in a scientific text and, through this method, it points out a quantitative difference between the Natural and Social Sciences.
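The linear consolidation described in the abstract can be illustrated with a minimal sketch. It assumes that each rated noun phrase has a 1-based linear position and that the values in each of the ten parts are averaged; the abstract does not say how values were consolidated, so the averaging, the function name and the sample data are all hypothetical.

```python
def decile_profile(positions, relevances, total_phrases, parts=10):
    """Average relevance per equal-width slice of a text.

    positions:     1-based linear positions of the rated noun phrases
    relevances:    author-assigned ratings, 0 (not relevant) to 6 (highly relevant)
    total_phrases: number of full noun phrases in the text
    Averaging is an assumption; the source only says "consolidated".
    """
    sums = [0.0] * parts
    counts = [0] * parts
    for pos, rel in zip(positions, relevances):
        # Map position 1..total_phrases onto slice index 0..parts-1.
        idx = min((pos - 1) * parts // total_phrases, parts - 1)
        sums[idx] += rel
        counts[idx] += 1
    # Slices without any rated phrase are reported as None.
    return [s / c if c else None for s, c in zip(sums, counts)]


# Hypothetical data: five rated phrases in a text with 100 noun phrases.
profile = decile_profile([1, 5, 50, 95, 100], [6, 4, 0, 3, 5], total_phrases=100)
```

A polynomial could then be fitted to such a profile for each area of knowledge, which is the characterization the abstract reports.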
Content
Cf.: http://www.ergon-verlag.de/isko_ko/downloads/aiko_vol_14_2014_45.pdf.
Theme
Automatic indexing

Similar documents (author)

  1. Almeida, M.B.; Souza, R.R.; Porto, R.B.: Looking for the identity of information science in the age of big data, computing clouds and social networks (2015) 4.75
    4.7456546 = sum of:
      4.7456546 = sum of:
        1.7460054 = weight(author_txt:souza in 3453) [ClassicSimilarity], result of:
          1.7460054 = score(doc=3453,freq=1.0), product of:
            0.5718838 = queryWeight, product of:
              8.14154 = idf(docFreq=34, maxDocs=44218)
              0.07024271 = queryNorm
            3.0530772 = fieldWeight in 3453, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.14154 = idf(docFreq=34, maxDocs=44218)
              0.375 = fieldNorm(doc=3453)
        2.9996493 = weight(author_txt:porto in 3453) [ClassicSimilarity], result of:
          2.9996493 = score(doc=3453,freq=1.0), product of:
            0.8203346 = queryWeight, product of:
              1.1976823 = boost
              9.7509775 = idf(docFreq=6, maxDocs=44218)
              0.07024271 = queryNorm
            3.6566167 = fieldWeight in 3453, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.7509775 = idf(docFreq=6, maxDocs=44218)
              0.375 = fieldNorm(doc=3453)
    
  2. Porto, R. => Porto, R.B.: 2.83
    2.8280964 = sum of:
      2.8280964 = product of:
        5.656193 = sum of:
          5.656193 = weight(author_txt:porto in 3386) [ClassicSimilarity], result of:
            5.656193 = score(doc=3386,freq=2.0), product of:
              0.8203346 = queryWeight, product of:
                1.1976823 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.07024271 = queryNorm
              6.8949823 = fieldWeight in 3386, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.5 = fieldNorm(doc=3386)
        0.5 = coord(1/2)
    
  3. Gomez, I. Porto- => Porto-Gomez, I.: 2.12
    2.1210723 = sum of:
      2.1210723 = product of:
        4.2421446 = sum of:
          4.2421446 = weight(author_txt:porto in 5372) [ClassicSimilarity], result of:
            4.2421446 = score(doc=5372,freq=2.0), product of:
              0.8203346 = queryWeight, product of:
                1.1976823 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.07024271 = queryNorm
              5.171237 = fieldWeight in 5372, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.375 = fieldNorm(doc=5372)
        0.5 = coord(1/2)
    
  4. Dal Porto, S.; Marchitelli, A.: ¬The functionality and flexibility of traditional classification schemes applied to a Content Management System (CMS) : facets, DDC, JITA (2006) 1.75
    1.7497953 = sum of:
      1.7497953 = product of:
        3.4995906 = sum of:
          3.4995906 = weight(author_txt:porto in 174) [ClassicSimilarity], result of:
            3.4995906 = score(doc=174,freq=1.0), product of:
              0.8203346 = queryWeight, product of:
                1.1976823 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.07024271 = queryNorm
              4.2660527 = fieldWeight in 174, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.4375 = fieldNorm(doc=174)
        0.5 = coord(1/2)
    
  5. Souza, S.d.: Informacion : utopia y realidad de la bibliotelogia (1996) 1.46
    1.4550046 = sum of:
      1.4550046 = product of:
        2.9100091 = sum of:
          2.9100091 = weight(author_txt:souza in 824) [ClassicSimilarity], result of:
            2.9100091 = score(doc=824,freq=1.0), product of:
              0.5718838 = queryWeight, product of:
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.07024271 = queryNorm
              5.0884624 = fieldWeight in 824, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.625 = fieldNorm(doc=824)
        0.5 = coord(1/2)
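The per-term scores in the explanations above can be reproduced with a short sketch. It assumes the classic TF-IDF factors that match the printed numbers: tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = boost * idf * queryNorm, and fieldWeight = tf * idf * fieldNorm. The function names are illustrative and are not part of any library API.

```python
import math


def classic_idf(doc_freq, max_docs):
    # Reproduces the idf values printed above, e.g. 8.14154 for docFreq=34.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    idf = classic_idf(doc_freq, max_docs)
    tf = math.sqrt(freq)                      # tf(freq=2.0) = 1.4142135
    query_weight = boost * idf * query_norm   # "queryWeight, product of:"
    field_weight = tf * idf * field_norm      # "fieldWeight in <doc>, product of:"
    return query_weight * field_weight


# Values taken from entry 1 (doc=3453) of the author list.
souza = term_score(1.0, 34, 44218, 0.07024271, 0.375)
porto = term_score(1.0, 6, 44218, 0.07024271, 0.375, boost=1.1976823)
total = souza + porto  # both query terms match, so no coord reduction applies
```

The results agree with the printed 1.7460054, 2.9996493 and 4.7456546 to within single-precision rounding.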
    

Similar documents (content)

  1. Souza, R.R.; Raghavan, K.S.: ¬A methodology for noun phrase-based automatic indexing (2006) 0.32
    0.32026297 = sum of:
      0.32026297 = product of:
        1.1437963 = sum of:
          0.027197296 = weight(abstract_txt:indexing in 173) [ClassicSimilarity], result of:
            0.027197296 = score(doc=173,freq=1.0), product of:
              0.08003642 = queryWeight, product of:
                1.0558218 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.017428057 = queryNorm
              0.3398115 = fieldWeight in 173, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.078125 = fieldNorm(doc=173)
          0.019761888 = weight(abstract_txt:knowledge in 173) [ClassicSimilarity], result of:
            0.019761888 = score(doc=173,freq=1.0), product of:
              0.0711982 = queryWeight, product of:
                1.149875 = boost
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.017428057 = queryNorm
              0.2775616 = fieldWeight in 173, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.078125 = fieldNorm(doc=173)
          0.050997492 = weight(abstract_txt:text in 173) [ClassicSimilarity], result of:
            0.050997492 = score(doc=173,freq=1.0), product of:
              0.16142172 = queryWeight, product of:
                2.2904253 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.017428057 = queryNorm
              0.3159271 = fieldWeight in 173, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=173)
          0.050997492 = weight(abstract_txt:terms in 173) [ClassicSimilarity], result of:
            0.050997492 = score(doc=173,freq=1.0), product of:
              0.16142172 = queryWeight, product of:
                2.2904253 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.017428057 = queryNorm
              0.3159271 = fieldWeight in 173, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=173)
          0.087791994 = weight(abstract_txt:relevant in 173) [ClassicSimilarity], result of:
            0.087791994 = score(doc=173,freq=1.0), product of:
              0.24241714 = queryWeight, product of:
                3.0006325 = boost
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.017428057 = queryNorm
              0.36215258 = fieldWeight in 173, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.078125 = fieldNorm(doc=173)
          0.3709982 = weight(abstract_txt:phrases in 173) [ClassicSimilarity], result of:
            0.3709982 = score(doc=173,freq=3.0), product of:
              0.399167 = queryWeight, product of:
                3.3345642 = boost
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.017428057 = queryNorm
              0.9294311 = fieldWeight in 173, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.078125 = fieldNorm(doc=173)
          0.536052 = weight(abstract_txt:noun in 173) [ClassicSimilarity], result of:
            0.536052 = score(doc=173,freq=3.0), product of:
              0.5101658 = queryWeight, product of:
                3.7697926 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.017428057 = queryNorm
              1.0507407 = fieldWeight in 173, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.078125 = fieldNorm(doc=173)
        0.28 = coord(7/25)
    
  2. Kim, W.; Wilbur, W.J.: Corpus-based statistical screening for content-bearing terms (2001) 0.26
    0.25755748 = sum of:
      0.25755748 = product of:
        0.7154374 = sum of:
          0.054945186 = weight(abstract_txt:values in 5188) [ClassicSimilarity], result of:
            0.054945186 = score(doc=5188,freq=2.0), product of:
              0.14270726 = queryWeight, product of:
                1.40984 = boost
                5.808009 = idf(docFreq=360, maxDocs=44218)
                0.017428057 = queryNorm
              0.38502026 = fieldWeight in 5188, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.808009 = idf(docFreq=360, maxDocs=44218)
                0.046875 = fieldNorm(doc=5188)
          0.033470236 = weight(abstract_txt:considered in 5188) [ClassicSimilarity], result of:
            0.033470236 = score(doc=5188,freq=1.0), product of:
              0.14220725 = queryWeight, product of:
                1.6250885 = boost
                5.021064 = idf(docFreq=792, maxDocs=44218)
                0.017428057 = queryNorm
              0.23536237 = fieldWeight in 5188, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.021064 = idf(docFreq=792, maxDocs=44218)
                0.046875 = fieldNorm(doc=5188)
          0.034652174 = weight(abstract_txt:natural in 5188) [ClassicSimilarity], result of:
            0.034652174 = score(doc=5188,freq=1.0), product of:
              0.14553571 = queryWeight, product of:
                1.6439966 = boost
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.017428057 = queryNorm
              0.23810083 = fieldWeight in 5188, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.046875 = fieldNorm(doc=5188)
          0.05656541 = weight(abstract_txt:each in 5188) [ClassicSimilarity], result of:
            0.05656541 = score(doc=5188,freq=6.0), product of:
              0.11961054 = queryWeight, product of:
                1.6663103 = boost
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.017428057 = queryNorm
              0.47291327 = fieldWeight in 5188, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.046875 = fieldNorm(doc=5188)
          0.022873383 = weight(abstract_txt:were in 5188) [ClassicSimilarity], result of:
            0.022873383 = score(doc=5188,freq=1.0), product of:
              0.13295832 = queryWeight, product of:
                2.0787053 = boost
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.017428057 = queryNorm
              0.17203423 = fieldWeight in 5188, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.046875 = fieldNorm(doc=5188)
          0.030598493 = weight(abstract_txt:text in 5188) [ClassicSimilarity], result of:
            0.030598493 = score(doc=5188,freq=1.0), product of:
              0.16142172 = queryWeight, product of:
                2.2904253 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.017428057 = queryNorm
              0.18955624 = fieldWeight in 5188, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.046875 = fieldNorm(doc=5188)
          0.061196987 = weight(abstract_txt:terms in 5188) [ClassicSimilarity], result of:
            0.061196987 = score(doc=5188,freq=4.0), product of:
              0.16142172 = queryWeight, product of:
                2.2904253 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.017428057 = queryNorm
              0.37911248 = fieldWeight in 5188, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.046875 = fieldNorm(doc=5188)
          0.0811101 = weight(abstract_txt:distribution in 5188) [ClassicSimilarity], result of:
            0.0811101 = score(doc=5188,freq=1.0), product of:
              0.30918035 = queryWeight, product of:
                3.1698675 = boost
                5.596568 = idf(docFreq=445, maxDocs=44218)
                0.017428057 = queryNorm
              0.26233912 = fieldWeight in 5188, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.596568 = idf(docFreq=445, maxDocs=44218)
                0.046875 = fieldNorm(doc=5188)
          0.34002548 = weight(abstract_txt:phrases in 5188) [ClassicSimilarity], result of:
            0.34002548 = score(doc=5188,freq=7.0), product of:
              0.399167 = queryWeight, product of:
                3.3345642 = boost
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.017428057 = queryNorm
              0.85183764 = fieldWeight in 5188, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.046875 = fieldNorm(doc=5188)
        0.36 = coord(9/25)
    
  3. Vlachidis, A.; Tudhope, D.: ¬A knowledge-based approach to information extraction for semantic interoperability in the archaeology domain (2016) 0.21
    0.21115074 = sum of:
      0.21115074 = product of:
        0.65984607 = sum of:
          0.03077023 = weight(abstract_txt:indexing in 2895) [ClassicSimilarity], result of:
            0.03077023 = score(doc=2895,freq=2.0), product of:
              0.08003642 = queryWeight, product of:
                1.0558218 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.017428057 = queryNorm
              0.38445285 = fieldWeight in 2895, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.0625 = fieldNorm(doc=2895)
          0.01580951 = weight(abstract_txt:knowledge in 2895) [ClassicSimilarity], result of:
            0.01580951 = score(doc=2895,freq=1.0), product of:
              0.0711982 = queryWeight, product of:
                1.149875 = boost
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.017428057 = queryNorm
              0.2220493 = fieldWeight in 2895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.0625 = fieldNorm(doc=2895)
          0.037083276 = weight(abstract_txt:automatic in 2895) [ClassicSimilarity], result of:
            0.037083276 = score(doc=2895,freq=1.0), product of:
              0.114199065 = queryWeight, product of:
                1.2611828 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.017428057 = queryNorm
              0.32472485 = fieldWeight in 2895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0625 = fieldNorm(doc=2895)
          0.046202898 = weight(abstract_txt:natural in 2895) [ClassicSimilarity], result of:
            0.046202898 = score(doc=2895,freq=1.0), product of:
              0.14553571 = queryWeight, product of:
                1.6439966 = boost
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.017428057 = queryNorm
              0.31746778 = fieldWeight in 2895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.0625 = fieldNorm(doc=2895)
          0.040797994 = weight(abstract_txt:text in 2895) [ClassicSimilarity], result of:
            0.040797994 = score(doc=2895,freq=1.0), product of:
              0.16142172 = queryWeight, product of:
                2.2904253 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.017428057 = queryNorm
              0.25274166 = fieldWeight in 2895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=2895)
          0.07023359 = weight(abstract_txt:relevant in 2895) [ClassicSimilarity], result of:
            0.07023359 = score(doc=2895,freq=1.0), product of:
              0.24241714 = queryWeight, product of:
                3.0006325 = boost
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.017428057 = queryNorm
              0.28972206 = fieldWeight in 2895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.0625 = fieldNorm(doc=2895)
          0.17135675 = weight(abstract_txt:phrases in 2895) [ClassicSimilarity], result of:
            0.17135675 = score(doc=2895,freq=1.0), product of:
              0.399167 = queryWeight, product of:
                3.3345642 = boost
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.017428057 = queryNorm
              0.42928585 = fieldWeight in 2895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.0625 = fieldNorm(doc=2895)
          0.24759181 = weight(abstract_txt:noun in 2895) [ClassicSimilarity], result of:
            0.24759181 = score(doc=2895,freq=1.0), product of:
              0.5101658 = queryWeight, product of:
                3.7697926 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.017428057 = queryNorm
              0.48531634 = fieldWeight in 2895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.0625 = fieldNorm(doc=2895)
        0.32 = coord(8/25)
    
  4. Salles, T.; Rocha, L.; Gonçalves, M.A.; Almeida, J.M.; Mourão, F.; Meira Jr., W.; Viegas, F.: ¬A quantitative analysis of the temporal effects on automatic text classification (2016) 0.19
    0.18858945 = sum of:
      0.18858945 = product of:
        0.52385956 = sum of:
          0.031540744 = weight(abstract_txt:full in 3014) [ClassicSimilarity], result of:
            0.031540744 = score(doc=3014,freq=1.0), product of:
              0.10251603 = queryWeight, product of:
                1.1949306 = boost
                4.922663 = idf(docFreq=874, maxDocs=44218)
                0.017428057 = queryNorm
              0.30766645 = fieldWeight in 3014, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.922663 = idf(docFreq=874, maxDocs=44218)
                0.0625 = fieldNorm(doc=3014)
          0.037083276 = weight(abstract_txt:automatic in 3014) [ClassicSimilarity], result of:
            0.037083276 = score(doc=3014,freq=1.0), product of:
              0.114199065 = queryWeight, product of:
                1.2611828 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.017428057 = queryNorm
              0.32472485 = fieldWeight in 3014, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0625 = fieldNorm(doc=3014)
          0.04462698 = weight(abstract_txt:considered in 3014) [ClassicSimilarity], result of:
            0.04462698 = score(doc=3014,freq=1.0), product of:
              0.14220725 = queryWeight, product of:
                1.6250885 = boost
                5.021064 = idf(docFreq=792, maxDocs=44218)
                0.017428057 = queryNorm
              0.3138165 = fieldWeight in 3014, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.021064 = idf(docFreq=792, maxDocs=44218)
                0.0625 = fieldNorm(doc=3014)
          0.043544073 = weight(abstract_txt:each in 3014) [ClassicSimilarity], result of:
            0.043544073 = score(doc=3014,freq=2.0), product of:
              0.11961054 = queryWeight, product of:
                1.6663103 = boost
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.017428057 = queryNorm
              0.36404878 = fieldWeight in 3014, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.0625 = fieldNorm(doc=3014)
          0.062292274 = weight(abstract_txt:behavior in 3014) [ClassicSimilarity], result of:
            0.062292274 = score(doc=3014,freq=1.0), product of:
              0.19132991 = queryWeight, product of:
                2.1074758 = boost
                5.2092032 = idf(docFreq=656, maxDocs=44218)
                0.017428057 = queryNorm
              0.3255752 = fieldWeight in 3014, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2092032 = idf(docFreq=656, maxDocs=44218)
                0.0625 = fieldNorm(doc=3014)
          0.040797994 = weight(abstract_txt:text in 3014) [ClassicSimilarity], result of:
            0.040797994 = score(doc=3014,freq=1.0), product of:
              0.16142172 = queryWeight, product of:
                2.2904253 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.017428057 = queryNorm
              0.25274166 = fieldWeight in 3014, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=3014)
          0.040797994 = weight(abstract_txt:terms in 3014) [ClassicSimilarity], result of:
            0.040797994 = score(doc=3014,freq=1.0), product of:
              0.16142172 = queryWeight, product of:
                2.2904253 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.017428057 = queryNorm
              0.25274166 = fieldWeight in 3014, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=3014)
          0.07023359 = weight(abstract_txt:relevant in 3014) [ClassicSimilarity], result of:
            0.07023359 = score(doc=3014,freq=1.0), product of:
              0.24241714 = queryWeight, product of:
                3.0006325 = boost
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.017428057 = queryNorm
              0.28972206 = fieldWeight in 3014, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.0625 = fieldNorm(doc=3014)
          0.15294267 = weight(abstract_txt:distribution in 3014) [ClassicSimilarity], result of:
            0.15294267 = score(doc=3014,freq=2.0), product of:
              0.30918035 = queryWeight, product of:
                3.1698675 = boost
                5.596568 = idf(docFreq=445, maxDocs=44218)
                0.017428057 = queryNorm
              0.4946714 = fieldWeight in 3014, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.596568 = idf(docFreq=445, maxDocs=44218)
                0.0625 = fieldNorm(doc=3014)
        0.36 = coord(9/25)
    
  5. Spitkovsky, V.; Norvig, P.: From words to concepts and back : dictionaries for linking text, entities and ideas (2012) 0.18
    0.1848996 = sum of:
      0.1848996 = product of:
        0.57781124 = sum of:
          0.030627567 = weight(abstract_txt:areas in 337) [ClassicSimilarity], result of:
            0.030627567 = score(doc=337,freq=1.0), product of:
              0.13403685 = queryWeight, product of:
                1.577714 = boost
                4.87469 = idf(docFreq=917, maxDocs=44218)
                0.017428057 = queryNorm
              0.2285011 = fieldWeight in 337, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.87469 = idf(docFreq=917, maxDocs=44218)
                0.046875 = fieldNorm(doc=337)
          0.04900557 = weight(abstract_txt:natural in 337) [ClassicSimilarity], result of:
            0.04900557 = score(doc=337,freq=2.0), product of:
              0.14553571 = queryWeight, product of:
                1.6439966 = boost
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.017428057 = queryNorm
              0.3367254 = fieldWeight in 337, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.046875 = fieldNorm(doc=337)
          0.039997786 = weight(abstract_txt:each in 337) [ClassicSimilarity], result of:
            0.039997786 = score(doc=337,freq=3.0), product of:
              0.11961054 = queryWeight, product of:
                1.6663103 = boost
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.017428057 = queryNorm
              0.33440018 = fieldWeight in 337, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.046875 = fieldNorm(doc=337)
          0.022873383 = weight(abstract_txt:were in 337) [ClassicSimilarity], result of:
            0.022873383 = score(doc=337,freq=1.0), product of:
              0.13295832 = queryWeight, product of:
                2.0787053 = boost
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.017428057 = queryNorm
              0.17203423 = fieldWeight in 337, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.046875 = fieldNorm(doc=337)
          0.06842031 = weight(abstract_txt:text in 337) [ClassicSimilarity], result of:
            0.06842031 = score(doc=337,freq=5.0), product of:
              0.16142172 = queryWeight, product of:
                2.2904253 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.017428057 = queryNorm
              0.42386067 = fieldWeight in 337, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.046875 = fieldNorm(doc=337)
          0.05267519 = weight(abstract_txt:relevant in 337) [ClassicSimilarity], result of:
            0.05267519 = score(doc=337,freq=1.0), product of:
              0.24241714 = queryWeight, product of:
                3.0006325 = boost
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.017428057 = queryNorm
              0.21729153 = fieldWeight in 337, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.046875 = fieldNorm(doc=337)
          0.12851755 = weight(abstract_txt:phrases in 337) [ClassicSimilarity], result of:
            0.12851755 = score(doc=337,freq=1.0), product of:
              0.399167 = queryWeight, product of:
                3.3345642 = boost
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.017428057 = queryNorm
              0.32196438 = fieldWeight in 337, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.046875 = fieldNorm(doc=337)
          0.18569386 = weight(abstract_txt:noun in 337) [ClassicSimilarity], result of:
            0.18569386 = score(doc=337,freq=1.0), product of:
              0.5101658 = queryWeight, product of:
                3.7697926 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.017428057 = queryNorm
              0.36398727 = fieldWeight in 337, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.046875 = fieldNorm(doc=337)
        0.32 = coord(8/25)
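The final content-similarity score is the sum of the matching term scores multiplied by the coordination factor, that is, the share of query terms found in the document. The sketch below recomputes the score of entry 1 (doc=173) from the term scores printed in its explanation; the variable names are illustrative.

```python
# Per-term scores for doc=173, copied from the explanation of entry 1.
term_scores = [
    0.027197296,  # indexing
    0.019761888,  # knowledge
    0.050997492,  # text
    0.050997492,  # terms
    0.087791994,  # relevant
    0.3709982,    # phrases
    0.536052,     # noun
]

matched_terms = len(term_scores)      # 7 of the query terms occur in the abstract
query_terms = 25                      # total number of query terms
coord = matched_terms / query_terms   # 0.28 = coord(7/25)

score = sum(term_scores) * coord      # about 0.3203, shown as 0.32 in the list
```

The same factor explains why a document with a large single-term score can still rank low: with coord(1/2), as in the author list, its sum is halved.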