Document (#27679)

Author
Jones, I.
Cunliffe, D.
Tudhope, D.
Title
Natural language processing and knowledge organization systems as an aid to retrieval
Source
Knowledge organization and the global information society: Proceedings of the 8th International ISKO Conference 13-16 July 2004, London, UK. Ed.: I.C. McIlwaine
Imprint
Würzburg : Ergon Verlag
Year
2004
Pages
S.351-356
Series
Advances in knowledge organization; vol.9
Abstract
This paper discusses research that employs methods from Natural Language Processing (NLP) in exploiting the intellectual resources of Knowledge Organization Systems (KOS), particularly in the retrieval of information. A technique for the disambiguation of homographs and nominal compounds in free text, where these are known ambiguous terms in the KOS itself, is described. The use of Roget's Thesaurus as an intermediary in the process is also reported. A short review of the relevant literature in the field is given. Design considerations, results and conclusions are presented from the implementation of a prototype system. The linguistic techniques are applied at two complementary levels, namely an a free text string used as an entry point to the KOS, and an the underlying controlled vocabulary itself.
Content
1. Introduction The need for research into the application of linguistic techniques in Information Retrieval (IR) in general, and a similar need in faceted Knowledge Organization Systems (KOS) has been indicated by various authors. Smeaton (1997) points out the inherent limitations of conventional approaches to IR based an "bags of words", mainly difficulties caused by lexical ambiguity in the words concerned, and goes an to suggest the possibility of using Natural Language Processing (NLP) in query formulation. Past experience with a faceted retrieval system highlighted the need for integrating the linguistic perspective in order to fully utilise the potential of a KOS (Tudhope et al." 2002). The present research seeks to address some of these needs in using NLP to improve the efficacy of KOS tools in query and retrieval systems. Syntactic parsing and part-of-speech tagging can substantially reduce lexical ambiguity through homograph disambiguation. Given the two strings "1 fable the motion" and "I put the motion an the fable", for instance, the parser used in this research clearly indicates that 'fable' in the first string is a verb, while 'table' in the second string is a noun, a distinction that would be missed in the "bag of words" approach. This syntactic disambiguation enables a more precise matching from free text to the controlled vocabulary of a KOS and vice versa. The use of a general linguistic resource, namely Roget's Thesaurus of English Words and Phrases (RTEWP), as an intermediary in this process, is investigated. The adaptation of the Link parser (Sleator & Temperley, 1993) to the purposes of the research is reported. The design and implementation of the early practical stages of the project are described, and the results of the initial experiments are presented and evaluated. Applications of the techniques developed are foreseen in the areas of query disambiguation, information retrieval and automatic indexing. In the first section of the paper a brief review of the literature and relevant current work in the field is presented. The second section includes reports an the development of algorithms, the construction of data sets and theoretical and experimental work undertaken to date. The third section evaluates the results obtained, and outlines directions for future research.
Theme
Computerlinguistik

Similar documents (author)

  1. Blocks, D.; Cunliffe, D.; Tudhope, D.: ¬A reference model for user-system interaction in thesaurus-based searching (2006) 2.84
    2.835551 = sum of:
      2.835551 = product of:
        4.2533264 = sum of:
          1.6297897 = weight(author_txt:tudhope in 1328) [ClassicSimilarity], result of:
            1.6297897 = score(doc=1328,freq=1.0), product of:
              0.53915215 = queryWeight, product of:
                1.1593271 = boost
                8.061001 = idf(docFreq=35, maxDocs=41962)
                0.057692103 = queryNorm
              3.0228753 = fieldWeight in 1328, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.061001 = idf(docFreq=35, maxDocs=41962)
                0.375 = fieldNorm(doc=1328)
          2.6235366 = weight(author_txt:cunliffe in 1328) [ClassicSimilarity], result of:
            2.6235366 = score(doc=1328,freq=1.0), product of:
              0.7405398 = queryWeight, product of:
                1.358703 = boost
                9.447295 = idf(docFreq=8, maxDocs=41962)
                0.057692103 = queryNorm
              3.5427356 = fieldWeight in 1328, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.447295 = idf(docFreq=8, maxDocs=41962)
                0.375 = fieldNorm(doc=1328)
        0.6666667 = coord(2/3)
    
  2. Tudhope, D.; Blocks, D.; Cunliffe, D.; Binding, C.: Query expansion via conceptual distance in thesaurus indexed collections (2006) 2.36
    2.3629594 = sum of:
      2.3629594 = product of:
        3.5444388 = sum of:
          1.3581581 = weight(author_txt:tudhope in 3216) [ClassicSimilarity], result of:
            1.3581581 = score(doc=3216,freq=1.0), product of:
              0.53915215 = queryWeight, product of:
                1.1593271 = boost
                8.061001 = idf(docFreq=35, maxDocs=41962)
                0.057692103 = queryNorm
              2.5190628 = fieldWeight in 3216, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.061001 = idf(docFreq=35, maxDocs=41962)
                0.3125 = fieldNorm(doc=3216)
          2.1862807 = weight(author_txt:cunliffe in 3216) [ClassicSimilarity], result of:
            2.1862807 = score(doc=3216,freq=1.0), product of:
              0.7405398 = queryWeight, product of:
                1.358703 = boost
                9.447295 = idf(docFreq=8, maxDocs=41962)
                0.057692103 = queryNorm
              2.9522798 = fieldWeight in 3216, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.447295 = idf(docFreq=8, maxDocs=41962)
                0.3125 = fieldNorm(doc=3216)
        0.6666667 = coord(2/3)
    
  3. Tudhope, D.; Binding, C.; Blocks, D.; Cunliffe, D.: Compound descriptors in context : a matching function for classifications and thesauri (2002) 2.36
    2.3629594 = sum of:
      2.3629594 = product of:
        3.5444388 = sum of:
          1.3581581 = weight(author_txt:tudhope in 4180) [ClassicSimilarity], result of:
            1.3581581 = score(doc=4180,freq=1.0), product of:
              0.53915215 = queryWeight, product of:
                1.1593271 = boost
                8.061001 = idf(docFreq=35, maxDocs=41962)
                0.057692103 = queryNorm
              2.5190628 = fieldWeight in 4180, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.061001 = idf(docFreq=35, maxDocs=41962)
                0.3125 = fieldNorm(doc=4180)
          2.1862807 = weight(author_txt:cunliffe in 4180) [ClassicSimilarity], result of:
            2.1862807 = score(doc=4180,freq=1.0), product of:
              0.7405398 = queryWeight, product of:
                1.358703 = boost
                9.447295 = idf(docFreq=8, maxDocs=41962)
                0.057692103 = queryNorm
              2.9522798 = fieldWeight in 4180, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.447295 = idf(docFreq=8, maxDocs=41962)
                0.3125 = fieldNorm(doc=4180)
        0.6666667 = coord(2/3)
    
  4. Tudhope, D.; Binding, C.; Blocks, D.; Cunliffe, D.: FACET: thesaurus retrieval with semantic term expansion (2002) 2.36
    2.3629594 = sum of:
      2.3629594 = product of:
        3.5444388 = sum of:
          1.3581581 = weight(author_txt:tudhope in 2176) [ClassicSimilarity], result of:
            1.3581581 = score(doc=2176,freq=1.0), product of:
              0.53915215 = queryWeight, product of:
                1.1593271 = boost
                8.061001 = idf(docFreq=35, maxDocs=41962)
                0.057692103 = queryNorm
              2.5190628 = fieldWeight in 2176, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.061001 = idf(docFreq=35, maxDocs=41962)
                0.3125 = fieldNorm(doc=2176)
          2.1862807 = weight(author_txt:cunliffe in 2176) [ClassicSimilarity], result of:
            2.1862807 = score(doc=2176,freq=1.0), product of:
              0.7405398 = queryWeight, product of:
                1.358703 = boost
                9.447295 = idf(docFreq=8, maxDocs=41962)
                0.057692103 = queryNorm
              2.9522798 = fieldWeight in 2176, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.447295 = idf(docFreq=8, maxDocs=41962)
                0.3125 = fieldNorm(doc=2176)
        0.6666667 = coord(2/3)
    
  5. Tudhope, D.; Alani, H.; Jones, C.: Augmenting thesaurus relationships : possibilities for retrieval (2001) 1.78
    1.7838306 = sum of:
      1.7838306 = product of:
        2.675746 = sum of:
          1.0459564 = weight(author_txt:jones in 3521) [ClassicSimilarity], result of:
            1.0459564 = score(doc=3521,freq=1.0), product of:
              0.4011431 = queryWeight, product of:
                6.9531717 = idf(docFreq=108, maxDocs=41962)
                0.057692103 = queryNorm
              2.6074395 = fieldWeight in 3521, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9531717 = idf(docFreq=108, maxDocs=41962)
                0.375 = fieldNorm(doc=3521)
          1.6297897 = weight(author_txt:tudhope in 3521) [ClassicSimilarity], result of:
            1.6297897 = score(doc=3521,freq=1.0), product of:
              0.53915215 = queryWeight, product of:
                1.1593271 = boost
                8.061001 = idf(docFreq=35, maxDocs=41962)
                0.057692103 = queryNorm
              3.0228753 = fieldWeight in 3521, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.061001 = idf(docFreq=35, maxDocs=41962)
                0.375 = fieldNorm(doc=3521)
        0.6666667 = coord(2/3)
    

Similar documents (content)

  1. Nagy T., I.: Detecting multiword expressions and named entities in natural language texts (2014) 0.26
    0.2637342 = sum of:
      0.2637342 = product of:
        0.73259497 = sum of:
          0.038136564 = weight(abstract_txt:namely in 3537) [ClassicSimilarity], result of:
            0.038136564 = score(doc=3537,freq=1.0), product of:
              0.15222727 = queryWeight, product of:
                1.0885046 = boost
                6.413411 = idf(docFreq=186, maxDocs=41962)
                0.021805855 = queryNorm
              0.25052387 = fieldWeight in 3537, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.413411 = idf(docFreq=186, maxDocs=41962)
                0.0390625 = fieldNorm(doc=3537)
          0.011991135 = weight(abstract_txt:retrieval in 3537) [ClassicSimilarity], result of:
            0.011991135 = score(doc=3537,freq=1.0), product of:
              0.0886846 = queryWeight, product of:
                1.1749601 = boost
                3.4614017 = idf(docFreq=3579, maxDocs=41962)
                0.021805855 = queryNorm
              0.135211 = fieldWeight in 3537, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4614017 = idf(docFreq=3579, maxDocs=41962)
                0.0390625 = fieldNorm(doc=3537)
          0.013616133 = weight(abstract_txt:knowledge in 3537) [ClassicSimilarity], result of:
            0.013616133 = score(doc=3537,freq=1.0), product of:
              0.09652591 = queryWeight, product of:
                1.2258039 = boost
                3.6111858 = idf(docFreq=3081, maxDocs=41962)
                0.021805855 = queryNorm
              0.14106195 = fieldWeight in 3537, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6111858 = idf(docFreq=3081, maxDocs=41962)
                0.0390625 = fieldNorm(doc=3537)
          0.019288503 = weight(abstract_txt:text in 3537) [ClassicSimilarity], result of:
            0.019288503 = score(doc=3537,freq=1.0), product of:
              0.12175133 = queryWeight, product of:
                1.3766891 = boost
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.021805855 = queryNorm
              0.15842539 = fieldWeight in 3537, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.0390625 = fieldNorm(doc=3537)
          0.04288735 = weight(abstract_txt:language in 3537) [ClassicSimilarity], result of:
            0.04288735 = score(doc=3537,freq=4.0), product of:
              0.13065946 = queryWeight, product of:
                1.4261639 = boost
                4.2014413 = idf(docFreq=1707, maxDocs=41962)
                0.021805855 = queryNorm
              0.3282376 = fieldWeight in 3537, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.2014413 = idf(docFreq=1707, maxDocs=41962)
                0.0390625 = fieldNorm(doc=3537)
          0.2149099 = weight(abstract_txt:compounds in 3537) [ClassicSimilarity], result of:
            0.2149099 = score(doc=3537,freq=6.0), product of:
              0.2652886 = queryWeight, product of:
                1.4369555 = boost
                8.466466 = idf(docFreq=23, maxDocs=41962)
                0.021805855 = queryNorm
              0.8100985 = fieldWeight in 3537, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                8.466466 = idf(docFreq=23, maxDocs=41962)
                0.0390625 = fieldNorm(doc=3537)
          0.27934533 = weight(abstract_txt:nominal in 3537) [ClassicSimilarity], result of:
            0.27934533 = score(doc=3537,freq=7.0), product of:
              0.3001417 = queryWeight, product of:
                1.5284357 = boost
                9.005463 = idf(docFreq=13, maxDocs=41962)
                0.021805855 = queryNorm
              0.9307115 = fieldWeight in 3537, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                9.005463 = idf(docFreq=13, maxDocs=41962)
                0.0390625 = fieldNorm(doc=3537)
          0.03509415 = weight(abstract_txt:processing in 3537) [ClassicSimilarity], result of:
            0.03509415 = score(doc=3537,freq=1.0), product of:
              0.18145315 = queryWeight, product of:
                1.6806655 = boost
                4.951196 = idf(docFreq=806, maxDocs=41962)
                0.021805855 = queryNorm
              0.1934061 = fieldWeight in 3537, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.951196 = idf(docFreq=806, maxDocs=41962)
                0.0390625 = fieldNorm(doc=3537)
          0.077325955 = weight(abstract_txt:natural in 3537) [ClassicSimilarity], result of:
            0.077325955 = score(doc=3537,freq=4.0), product of:
              0.19355524 = queryWeight, product of:
                1.7358072 = boost
                5.113642 = idf(docFreq=685, maxDocs=41962)
                0.021805855 = queryNorm
              0.3995033 = fieldWeight in 3537, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.113642 = idf(docFreq=685, maxDocs=41962)
                0.0390625 = fieldNorm(doc=3537)
        0.36 = coord(9/25)
    
  2. Guglielmo, E.J.; Rowe, N.C.: Natural-language retrieval of images based on descriptive captions (1996) 0.22
    0.22325096 = sum of:
      0.22325096 = product of:
        0.69765925 = sum of:
          0.047311947 = weight(abstract_txt:prototype in 6693) [ClassicSimilarity], result of:
            0.047311947 = score(doc=6693,freq=1.0), product of:
              0.12847894 = queryWeight, product of:
                5.8919473 = idf(docFreq=314, maxDocs=41962)
                0.021805855 = queryNorm
              0.3682467 = fieldWeight in 6693, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8919473 = idf(docFreq=314, maxDocs=41962)
                0.0625 = fieldNorm(doc=6693)
          0.019185815 = weight(abstract_txt:retrieval in 6693) [ClassicSimilarity], result of:
            0.019185815 = score(doc=6693,freq=1.0), product of:
              0.0886846 = queryWeight, product of:
                1.1749601 = boost
                3.4614017 = idf(docFreq=3579, maxDocs=41962)
                0.021805855 = queryNorm
              0.2163376 = fieldWeight in 6693, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4614017 = idf(docFreq=3579, maxDocs=41962)
                0.0625 = fieldNorm(doc=6693)
          0.099555515 = weight(abstract_txt:ambiguous in 6693) [ClassicSimilarity], result of:
            0.099555515 = score(doc=6693,freq=1.0), product of:
              0.21097368 = queryWeight, product of:
                1.2814397 = boost
                7.550175 = idf(docFreq=59, maxDocs=41962)
                0.021805855 = queryNorm
              0.47188595 = fieldWeight in 6693, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.550175 = idf(docFreq=59, maxDocs=41962)
                0.0625 = fieldNorm(doc=6693)
          0.03430988 = weight(abstract_txt:language in 6693) [ClassicSimilarity], result of:
            0.03430988 = score(doc=6693,freq=1.0), product of:
              0.13065946 = queryWeight, product of:
                1.4261639 = boost
                4.2014413 = idf(docFreq=1707, maxDocs=41962)
                0.021805855 = queryNorm
              0.26259008 = fieldWeight in 6693, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2014413 = idf(docFreq=1707, maxDocs=41962)
                0.0625 = fieldNorm(doc=6693)
          0.14037855 = weight(abstract_txt:compounds in 6693) [ClassicSimilarity], result of:
            0.14037855 = score(doc=6693,freq=1.0), product of:
              0.2652886 = queryWeight, product of:
                1.4369555 = boost
                8.466466 = idf(docFreq=23, maxDocs=41962)
                0.021805855 = queryNorm
              0.5291541 = fieldWeight in 6693, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.466466 = idf(docFreq=23, maxDocs=41962)
                0.0625 = fieldNorm(doc=6693)
          0.23890617 = weight(abstract_txt:nominal in 6693) [ClassicSimilarity], result of:
            0.23890617 = score(doc=6693,freq=2.0), product of:
              0.3001417 = queryWeight, product of:
                1.5284357 = boost
                9.005463 = idf(docFreq=13, maxDocs=41962)
                0.021805855 = queryNorm
              0.79597795 = fieldWeight in 6693, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.005463 = idf(docFreq=13, maxDocs=41962)
                0.0625 = fieldNorm(doc=6693)
          0.056150634 = weight(abstract_txt:processing in 6693) [ClassicSimilarity], result of:
            0.056150634 = score(doc=6693,freq=1.0), product of:
              0.18145315 = queryWeight, product of:
                1.6806655 = boost
                4.951196 = idf(docFreq=806, maxDocs=41962)
                0.021805855 = queryNorm
              0.30944976 = fieldWeight in 6693, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.951196 = idf(docFreq=806, maxDocs=41962)
                0.0625 = fieldNorm(doc=6693)
          0.061860763 = weight(abstract_txt:natural in 6693) [ClassicSimilarity], result of:
            0.061860763 = score(doc=6693,freq=1.0), product of:
              0.19355524 = queryWeight, product of:
                1.7358072 = boost
                5.113642 = idf(docFreq=685, maxDocs=41962)
                0.021805855 = queryNorm
              0.31960264 = fieldWeight in 6693, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.113642 = idf(docFreq=685, maxDocs=41962)
                0.0625 = fieldNorm(doc=6693)
        0.32 = coord(8/25)
    
  3. Taylor, S.L.: Integrating natural language understanding with document structure analysis (1994) 0.20
    0.19764315 = sum of:
      0.19764315 = product of:
        0.70586836 = sum of:
          0.07096792 = weight(abstract_txt:prototype in 1863) [ClassicSimilarity], result of:
            0.07096792 = score(doc=1863,freq=1.0), product of:
              0.12847894 = queryWeight, product of:
                5.8919473 = idf(docFreq=314, maxDocs=41962)
                0.021805855 = queryNorm
              0.5523701 = fieldWeight in 1863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8919473 = idf(docFreq=314, maxDocs=41962)
                0.09375 = fieldNorm(doc=1863)
          0.028778723 = weight(abstract_txt:retrieval in 1863) [ClassicSimilarity], result of:
            0.028778723 = score(doc=1863,freq=1.0), product of:
              0.0886846 = queryWeight, product of:
                1.1749601 = boost
                3.4614017 = idf(docFreq=3579, maxDocs=41962)
                0.021805855 = queryNorm
              0.3245064 = fieldWeight in 1863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4614017 = idf(docFreq=3579, maxDocs=41962)
                0.09375 = fieldNorm(doc=1863)
          0.13359722 = weight(abstract_txt:employs in 1863) [ClassicSimilarity], result of:
            0.13359722 = score(doc=1863,freq=1.0), product of:
              0.19587944 = queryWeight, product of:
                1.2347484 = boost
                7.275072 = idf(docFreq=78, maxDocs=41962)
                0.021805855 = queryNorm
              0.682038 = fieldWeight in 1863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.275072 = idf(docFreq=78, maxDocs=41962)
                0.09375 = fieldNorm(doc=1863)
          0.080180794 = weight(abstract_txt:text in 1863) [ClassicSimilarity], result of:
            0.080180794 = score(doc=1863,freq=3.0), product of:
              0.12175133 = queryWeight, product of:
                1.3766891 = boost
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.021805855 = queryNorm
              0.65856194 = fieldWeight in 1863, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.09375 = fieldNorm(doc=1863)
          0.07278225 = weight(abstract_txt:language in 1863) [ClassicSimilarity], result of:
            0.07278225 = score(doc=1863,freq=2.0), product of:
              0.13065946 = queryWeight, product of:
                1.4261639 = boost
                4.2014413 = idf(docFreq=1707, maxDocs=41962)
                0.021805855 = queryNorm
              0.5570377 = fieldWeight in 1863, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2014413 = idf(docFreq=1707, maxDocs=41962)
                0.09375 = fieldNorm(doc=1863)
          0.18833496 = weight(abstract_txt:processing in 1863) [ClassicSimilarity], result of:
            0.18833496 = score(doc=1863,freq=5.0), product of:
              0.18145315 = queryWeight, product of:
                1.6806655 = boost
                4.951196 = idf(docFreq=806, maxDocs=41962)
                0.021805855 = queryNorm
              1.0379261 = fieldWeight in 1863, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.951196 = idf(docFreq=806, maxDocs=41962)
                0.09375 = fieldNorm(doc=1863)
          0.1312265 = weight(abstract_txt:natural in 1863) [ClassicSimilarity], result of:
            0.1312265 = score(doc=1863,freq=2.0), product of:
              0.19355524 = queryWeight, product of:
                1.7358072 = boost
                5.113642 = idf(docFreq=685, maxDocs=41962)
                0.021805855 = queryNorm
              0.6779796 = fieldWeight in 1863, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.113642 = idf(docFreq=685, maxDocs=41962)
                0.09375 = fieldNorm(doc=1863)
        0.28 = coord(7/25)
    
  4. Köhler, J.; Philippi, S.; Specht, M.; Rüegg, A.: Ontology based text indexing and querying for the semantic web (2006) 0.20
    0.19512689 = sum of:
      0.19512689 = product of:
        0.5420191 = sum of:
          0.047311947 = weight(abstract_txt:prototype in 281) [ClassicSimilarity], result of:
            0.047311947 = score(doc=281,freq=1.0), product of:
              0.12847894 = queryWeight, product of:
                5.8919473 = idf(docFreq=314, maxDocs=41962)
                0.021805855 = queryNorm
              0.3682467 = fieldWeight in 281, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8919473 = idf(docFreq=314, maxDocs=41962)
                0.0625 = fieldNorm(doc=281)
          0.027132839 = weight(abstract_txt:retrieval in 281) [ClassicSimilarity], result of:
            0.027132839 = score(doc=281,freq=2.0), product of:
              0.0886846 = queryWeight, product of:
                1.1749601 = boost
                3.4614017 = idf(docFreq=3579, maxDocs=41962)
                0.021805855 = queryNorm
              0.30594757 = fieldWeight in 281, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4614017 = idf(docFreq=3579, maxDocs=41962)
                0.0625 = fieldNorm(doc=281)
          0.085160814 = weight(abstract_txt:string in 281) [ClassicSimilarity], result of:
            0.085160814 = score(doc=281,freq=1.0), product of:
              0.19011277 = queryWeight, product of:
                1.2164371 = boost
                7.167183 = idf(docFreq=87, maxDocs=41962)
                0.021805855 = queryNorm
              0.44794893 = fieldWeight in 281, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.167183 = idf(docFreq=87, maxDocs=41962)
                0.0625 = fieldNorm(doc=281)
          0.094686 = weight(abstract_txt:disambiguation in 281) [ClassicSimilarity], result of:
            0.094686 = score(doc=281,freq=1.0), product of:
              0.20403685 = queryWeight, product of:
                1.2601967 = boost
                7.425012 = idf(docFreq=67, maxDocs=41962)
                0.021805855 = queryNorm
              0.46406326 = fieldWeight in 281, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.425012 = idf(docFreq=67, maxDocs=41962)
                0.0625 = fieldNorm(doc=281)
          0.053453863 = weight(abstract_txt:text in 281) [ClassicSimilarity], result of:
            0.053453863 = score(doc=281,freq=3.0), product of:
              0.12175133 = queryWeight, product of:
                1.3766891 = boost
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.021805855 = queryNorm
              0.4390413 = fieldWeight in 281, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.0625 = fieldNorm(doc=281)
          0.03430988 = weight(abstract_txt:language in 281) [ClassicSimilarity], result of:
            0.03430988 = score(doc=281,freq=1.0), product of:
              0.13065946 = queryWeight, product of:
                1.4261639 = boost
                4.2014413 = idf(docFreq=1707, maxDocs=41962)
                0.021805855 = queryNorm
              0.26259008 = fieldWeight in 281, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2014413 = idf(docFreq=1707, maxDocs=41962)
                0.0625 = fieldNorm(doc=281)
          0.056150634 = weight(abstract_txt:processing in 281) [ClassicSimilarity], result of:
            0.056150634 = score(doc=281,freq=1.0), product of:
              0.18145315 = queryWeight, product of:
                1.6806655 = boost
                4.951196 = idf(docFreq=806, maxDocs=41962)
                0.021805855 = queryNorm
              0.30944976 = fieldWeight in 281, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.951196 = idf(docFreq=806, maxDocs=41962)
                0.0625 = fieldNorm(doc=281)
          0.061860763 = weight(abstract_txt:natural in 281) [ClassicSimilarity], result of:
            0.061860763 = score(doc=281,freq=1.0), product of:
              0.19355524 = queryWeight, product of:
                1.7358072 = boost
                5.113642 = idf(docFreq=685, maxDocs=41962)
                0.021805855 = queryNorm
              0.31960264 = fieldWeight in 281, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.113642 = idf(docFreq=685, maxDocs=41962)
                0.0625 = fieldNorm(doc=281)
          0.08195236 = weight(abstract_txt:free in 281) [ClassicSimilarity], result of:
            0.08195236 = score(doc=281,freq=1.0), product of:
              0.2334725 = queryWeight, product of:
                1.9064125 = boost
                5.616241 = idf(docFreq=414, maxDocs=41962)
                0.021805855 = queryNorm
              0.35101506 = fieldWeight in 281, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.616241 = idf(docFreq=414, maxDocs=41962)
                0.0625 = fieldNorm(doc=281)
        0.36 = coord(9/25)
    
  5. Chowdhury, G.G.: Natural language processing (2002) 0.18
    0.18066551 = sum of:
      0.18066551 = product of:
        0.5645797 = sum of:
          0.07627313 = weight(abstract_txt:namely in 285) [ClassicSimilarity], result of:
            0.07627313 = score(doc=285,freq=1.0), product of:
              0.15222727 = queryWeight, product of:
                1.0885046 = boost
                6.413411 = idf(docFreq=186, maxDocs=41962)
                0.021805855 = queryNorm
              0.50104773 = fieldWeight in 285, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.413411 = idf(docFreq=186, maxDocs=41962)
                0.078125 = fieldNorm(doc=285)
          0.03272377 = weight(abstract_txt:systems in 285) [ClassicSimilarity], result of:
            0.03272377 = score(doc=285,freq=2.0), product of:
              0.08659383 = queryWeight, product of:
                1.1610274 = boost
                3.4203563 = idf(docFreq=3729, maxDocs=41962)
                0.021805855 = queryNorm
              0.37789956 = fieldWeight in 285, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4203563 = idf(docFreq=3729, maxDocs=41962)
                0.078125 = fieldNorm(doc=285)
          0.02398227 = weight(abstract_txt:retrieval in 285) [ClassicSimilarity], result of:
            0.02398227 = score(doc=285,freq=1.0), product of:
              0.0886846 = queryWeight, product of:
                1.1749601 = boost
                3.4614017 = idf(docFreq=3579, maxDocs=41962)
                0.021805855 = queryNorm
              0.270422 = fieldWeight in 285, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4614017 = idf(docFreq=3579, maxDocs=41962)
                0.078125 = fieldNorm(doc=285)
          0.027232265 = weight(abstract_txt:knowledge in 285) [ClassicSimilarity], result of:
            0.027232265 = score(doc=285,freq=1.0), product of:
              0.09652591 = queryWeight, product of:
                1.2258039 = boost
                3.6111858 = idf(docFreq=3081, maxDocs=41962)
                0.021805855 = queryNorm
              0.2821239 = fieldWeight in 285, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6111858 = idf(docFreq=3081, maxDocs=41962)
                0.078125 = fieldNorm(doc=285)
          0.054556116 = weight(abstract_txt:text in 285) [ClassicSimilarity], result of:
            0.054556116 = score(doc=285,freq=2.0), product of:
              0.12175133 = queryWeight, product of:
                1.3766891 = boost
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.021805855 = queryNorm
              0.44809464 = fieldWeight in 285, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.078125 = fieldNorm(doc=285)
          0.09589902 = weight(abstract_txt:language in 285) [ClassicSimilarity], result of:
            0.09589902 = score(doc=285,freq=5.0), product of:
              0.13065946 = queryWeight, product of:
                1.4261639 = boost
                4.2014413 = idf(docFreq=1707, maxDocs=41962)
                0.021805855 = queryNorm
              0.7339616 = fieldWeight in 285, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.2014413 = idf(docFreq=1707, maxDocs=41962)
                0.078125 = fieldNorm(doc=285)
          0.09926123 = weight(abstract_txt:processing in 285) [ClassicSimilarity], result of:
            0.09926123 = score(doc=285,freq=2.0), product of:
              0.18145315 = queryWeight, product of:
                1.6806655 = boost
                4.951196 = idf(docFreq=806, maxDocs=41962)
                0.021805855 = queryNorm
              0.54703504 = fieldWeight in 285, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.951196 = idf(docFreq=806, maxDocs=41962)
                0.078125 = fieldNorm(doc=285)
          0.15465191 = weight(abstract_txt:natural in 285) [ClassicSimilarity], result of:
            0.15465191 = score(doc=285,freq=4.0), product of:
              0.19355524 = queryWeight, product of:
                1.7358072 = boost
                5.113642 = idf(docFreq=685, maxDocs=41962)
                0.021805855 = queryNorm
              0.7990066 = fieldWeight in 285, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.113642 = idf(docFreq=685, maxDocs=41962)
                0.078125 = fieldNorm(doc=285)
        0.32 = coord(8/25)