Document (#29531)

Author
Robertson, S.E.
Sparck Jones, K.
Title
Simple, proven approaches to text retrieval
Issue
May, 1997, Update of 1994 and 1996 versions.
Source
http://www.cl.cam.ac.uk/TechReports/UCAM-CL-TR-356.pdf
Year
1997
Series
Technical Report TR356, University of Cambridge, Computer Laboratory
Abstract
This technical note describes straightforward techniques for document indexing and retrieval that have been solidly established through extensive testing and are easy to apply. They are useful for many different types of text material, are viable for very large files, and have the advantage that they do not require special skills or training for searching, but are easy for end users. The document and text retrieval methods described here have a sound theoretical basis, are well established by extensive testing, and the ideas involved are now implemented in some commercial retrieval systems. Testing in the last few years has, in particular, shown that the methods presented here work very well with full texts, not only title and abstracts, and with large files of texts containing three quarters of a million documents. These tests, the TREC Tests (see Harman 1993 - 1997; IP&M 1995), have been rigorous comparative evaluations involving many different approaches to information retrieval. These techniques depend on the use of simple terms for indexing both request and document texts; on term weighting exploiting statistical information about term occurrences; on scoring for request-document matching, using these weights, to obtain a ranked search output; and on relevance feedback to modify request weights or term sets in iterative searching. The normal implementation is via an inverted file organisation using a term list with linked document identifiers, plus counting data, and pointers to the actual texts. The user's request can be a word list, phrases, sentences or extended text.
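The pipeline outlined in the abstract (simple terms, statistical term weights, summed scores, ranked output, inverted file) can be illustrated with a small sketch. It is not the report's own implementation: the weight `log(N / n)` is assumed here as one common form of collection-frequency weighting, and the function names, whitespace tokenisation, and sample documents are hypothetical.

```python
import math
from collections import defaultdict


def build_inverted_file(docs):
    """Map each term to {doc_id: within-document frequency} (the counting data)."""
    index = defaultdict(dict)
    for doc_id, text in docs.items():
        for term in text.lower().split():  # naive tokenisation, assumed for brevity
            index[term][doc_id] = index[term].get(doc_id, 0) + 1
    return index


def rank(request, index, n_docs):
    """Score each document by summing the weights of the request terms it contains."""
    scores = defaultdict(float)
    for term in set(request.lower().split()):
        postings = index.get(term)
        if not postings:
            continue  # term does not occur in the collection
        # Assumed weight: rarer terms count for more (log of N over document frequency).
        weight = math.log(n_docs / len(postings))
        for doc_id in postings:
            scores[doc_id] += weight
    # Highest score first; ties broken by document identifier.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


# Hypothetical three-document collection.
docs = {
    "d1": "term weighting for text retrieval",
    "d2": "relevance feedback in text retrieval",
    "d3": "inverted file organisation",
}
index = build_inverted_file(docs)
ranking = rank("term weighting retrieval", index, len(docs))
```

Relevance feedback would then adjust the per-term weights, or add and drop request terms, before the next call to `rank`; that step is left out of this sketch.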
Footnote
Also available at: http://www.ftp.cl.cam.ac.uk/ftp/papers/reports/.
Theme
Retrieval algorithms
Retrieval studies

Similar documents (author)

  1. Robertson, S.E.; Sparck Jones, K.: Relevance weighting of search terms (1976) 5.64
    5.6376514 = sum of:
      5.6376514 = sum of:
        1.5048561 = weight(author_txt:jones in 137) [ClassicSimilarity], result of:
          1.5048561 = score(doc=137,freq=1.0), product of:
            0.49585345 = queryWeight, product of:
              6.9368706 = idf(docFreq=114, maxDocs=43556)
              0.071480855 = queryNorm
            3.0348809 = fieldWeight in 137, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.9368706 = idf(docFreq=114, maxDocs=43556)
              0.4375 = fieldNorm(doc=137)
        1.7626902 = weight(author_txt:robertson in 137) [ClassicSimilarity], result of:
          1.7626902 = score(doc=137,freq=1.0), product of:
            0.5509861 = queryWeight, product of:
              1.0541288 = boost
              7.312355 = idf(docFreq=78, maxDocs=43556)
              0.071480855 = queryNorm
            3.1991553 = fieldWeight in 137, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.312355 = idf(docFreq=78, maxDocs=43556)
              0.4375 = fieldNorm(doc=137)
        2.3701053 = weight(author_txt:sparck in 137) [ClassicSimilarity], result of:
          2.3701053 = score(doc=137,freq=1.0), product of:
            0.6712255 = queryWeight, product of:
              1.1634763 = boost
              8.070885 = idf(docFreq=36, maxDocs=43556)
              0.071480855 = queryNorm
            3.531012 = fieldWeight in 137, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.070885 = idf(docFreq=36, maxDocs=43556)
              0.4375 = fieldNorm(doc=137)
    
  2. Sparck Jones, K.; Walker, S.; Robertson, S.E.: A probabilistic model of information retrieval : development and comparative experiments - part 1 (2000) 4.83
    4.832273 = sum of:
      4.832273 = sum of:
        1.2898767 = weight(author_txt:jones in 4247) [ClassicSimilarity], result of:
          1.2898767 = score(doc=4247,freq=1.0), product of:
            0.49585345 = queryWeight, product of:
              6.9368706 = idf(docFreq=114, maxDocs=43556)
              0.071480855 = queryNorm
            2.6013265 = fieldWeight in 4247, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.9368706 = idf(docFreq=114, maxDocs=43556)
              0.375 = fieldNorm(doc=4247)
        1.5108773 = weight(author_txt:robertson in 4247) [ClassicSimilarity], result of:
          1.5108773 = score(doc=4247,freq=1.0), product of:
            0.5509861 = queryWeight, product of:
              1.0541288 = boost
              7.312355 = idf(docFreq=78, maxDocs=43556)
              0.071480855 = queryNorm
            2.7421331 = fieldWeight in 4247, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.312355 = idf(docFreq=78, maxDocs=43556)
              0.375 = fieldNorm(doc=4247)
        2.031519 = weight(author_txt:sparck in 4247) [ClassicSimilarity], result of:
          2.031519 = score(doc=4247,freq=1.0), product of:
            0.6712255 = queryWeight, product of:
              1.1634763 = boost
              8.070885 = idf(docFreq=36, maxDocs=43556)
              0.071480855 = queryNorm
            3.0265818 = fieldWeight in 4247, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.070885 = idf(docFreq=36, maxDocs=43556)
              0.375 = fieldNorm(doc=4247)
    
  3. Sparck Jones, K.; Walker, S.; Robertson, S.E.: A probabilistic model of information retrieval : development and comparative experiments - part 2 (2000) 4.83
    4.832273 = sum of:
      4.832273 = sum of:
        1.2898767 = weight(author_txt:jones in 4352) [ClassicSimilarity], result of:
          1.2898767 = score(doc=4352,freq=1.0), product of:
            0.49585345 = queryWeight, product of:
              6.9368706 = idf(docFreq=114, maxDocs=43556)
              0.071480855 = queryNorm
            2.6013265 = fieldWeight in 4352, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.9368706 = idf(docFreq=114, maxDocs=43556)
              0.375 = fieldNorm(doc=4352)
        1.5108773 = weight(author_txt:robertson in 4352) [ClassicSimilarity], result of:
          1.5108773 = score(doc=4352,freq=1.0), product of:
            0.5509861 = queryWeight, product of:
              1.0541288 = boost
              7.312355 = idf(docFreq=78, maxDocs=43556)
              0.071480855 = queryNorm
            2.7421331 = fieldWeight in 4352, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.312355 = idf(docFreq=78, maxDocs=43556)
              0.375 = fieldNorm(doc=4352)
        2.031519 = weight(author_txt:sparck in 4352) [ClassicSimilarity], result of:
          2.031519 = score(doc=4352,freq=1.0), product of:
            0.6712255 = queryWeight, product of:
              1.1634763 = boost
              8.070885 = idf(docFreq=36, maxDocs=43556)
              0.071480855 = queryNorm
            3.0265818 = fieldWeight in 4352, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.070885 = idf(docFreq=36, maxDocs=43556)
              0.375 = fieldNorm(doc=4352)
    
  4. Sparck Jones, K.: Fashionable trends and feasible strategies in information management (1988) 2.95
    2.9523516 = sum of:
      2.9523516 = product of:
        4.4285274 = sum of:
          1.7198356 = weight(author_txt:jones in 817) [ClassicSimilarity], result of:
            1.7198356 = score(doc=817,freq=1.0), product of:
              0.49585345 = queryWeight, product of:
                6.9368706 = idf(docFreq=114, maxDocs=43556)
                0.071480855 = queryNorm
              3.4684353 = fieldWeight in 817, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9368706 = idf(docFreq=114, maxDocs=43556)
                0.5 = fieldNorm(doc=817)
          2.7086918 = weight(author_txt:sparck in 817) [ClassicSimilarity], result of:
            2.7086918 = score(doc=817,freq=1.0), product of:
              0.6712255 = queryWeight, product of:
                1.1634763 = boost
                8.070885 = idf(docFreq=36, maxDocs=43556)
                0.071480855 = queryNorm
              4.0354424 = fieldWeight in 817, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.070885 = idf(docFreq=36, maxDocs=43556)
                0.5 = fieldNorm(doc=817)
        0.6666667 = coord(2/3)
    
  5. Sparck Jones, K.: Automatic classification (1976) 2.95
    2.9523516 = sum of:
      2.9523516 = product of:
        4.4285274 = sum of:
          1.7198356 = weight(author_txt:jones in 2908) [ClassicSimilarity], result of:
            1.7198356 = score(doc=2908,freq=1.0), product of:
              0.49585345 = queryWeight, product of:
                6.9368706 = idf(docFreq=114, maxDocs=43556)
                0.071480855 = queryNorm
              3.4684353 = fieldWeight in 2908, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9368706 = idf(docFreq=114, maxDocs=43556)
                0.5 = fieldNorm(doc=2908)
          2.7086918 = weight(author_txt:sparck in 2908) [ClassicSimilarity], result of:
            2.7086918 = score(doc=2908,freq=1.0), product of:
              0.6712255 = queryWeight, product of:
                1.1634763 = boost
                8.070885 = idf(docFreq=36, maxDocs=43556)
                0.071480855 = queryNorm
              4.0354424 = fieldWeight in 2908, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.070885 = idf(docFreq=36, maxDocs=43556)
                0.5 = fieldNorm(doc=2908)
        0.6666667 = coord(2/3)
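
The score trees above can be checked by hand. The sketch below recomputes two of them, assuming the classic Lucene TF-IDF formulas that the listed values appear consistent with: `tf = sqrt(freq)`, `idf = 1 + ln(maxDocs / (docFreq + 1))`, `queryWeight = boost * idf * queryNorm`, `fieldWeight = tf * idf * fieldNorm`, and a `coord` factor equal to the fraction of query terms matched. The helper names are hypothetical, and small differences from the listing are expected because the listing was computed in single precision.

```python
import math


def idf(doc_freq, max_docs):
    # Assumed classic form: 1 + ln(maxDocs / (docFreq + 1))
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def tf(freq):
    # Assumed classic form: square root of the term frequency
    return math.sqrt(freq)


def term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    """score = queryWeight * fieldWeight, as laid out in the explain tree."""
    query_weight = boost * idf(doc_freq, max_docs) * query_norm
    field_weight = tf(freq) * idf(doc_freq, max_docs) * field_norm
    return query_weight * field_weight


MAX_DOCS = 43556
QUERY_NORM = 0.071480855  # queryNorm of the author query, taken from the listing

# Result 1 (doc 137): all three author terms match, fieldNorm = 0.4375.
jones_137 = term_score(1.0, 114, MAX_DOCS, QUERY_NORM, 0.4375)
robertson_137 = term_score(1.0, 78, MAX_DOCS, QUERY_NORM, 0.4375, boost=1.0541288)
sparck_137 = term_score(1.0, 36, MAX_DOCS, QUERY_NORM, 0.4375, boost=1.1634763)
total_137 = jones_137 + robertson_137 + sparck_137  # listing reports 5.6376514

# Result 4 (doc 817): two of three terms match, so the sum is scaled by coord(2/3).
jones_817 = term_score(1.0, 114, MAX_DOCS, QUERY_NORM, 0.5)
sparck_817 = term_score(1.0, 36, MAX_DOCS, QUERY_NORM, 0.5, boost=1.1634763)
total_817 = (jones_817 + sparck_817) * (2 / 3)  # listing reports 2.9523516
```

This also shows why results 4 and 5 rank below results 2 and 3 despite a higher `fieldNorm`: the missing `robertson` term removes one summand and triggers the `coord(2/3)` penalty.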
    

Similar documents (content)

  1. Dang, E.K.F.; Luk, R.W.P.; Allan, J.: Beyond bag-of-words : bigram-enhanced context-dependent term weights (2014) 0.32
    0.31680396 = sum of:
      0.31680396 = product of:
        0.7920099 = sum of:
          0.024452776 = weight(abstract_txt:large in 3281) [ClassicSimilarity], result of:
            0.024452776 = score(doc=3281,freq=1.0), product of:
              0.10018982 = queryWeight, product of:
                1.0433357 = boost
                4.462893 = idf(docFreq=1364, maxDocs=43556)
                0.021517068 = queryNorm
              0.24406447 = fieldWeight in 3281, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.462893 = idf(docFreq=1364, maxDocs=43556)
                0.0546875 = fieldNorm(doc=3281)
          0.038547095 = weight(abstract_txt:approaches in 3281) [ClassicSimilarity], result of:
            0.038547095 = score(doc=3281,freq=2.0), product of:
              0.10770998 = queryWeight, product of:
                1.0817832 = boost
                4.627353 = idf(docFreq=1157, maxDocs=43556)
                0.021517068 = queryNorm
              0.35787857 = fieldWeight in 3281, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.627353 = idf(docFreq=1157, maxDocs=43556)
                0.0546875 = fieldNorm(doc=3281)
          0.047377393 = weight(abstract_txt:established in 3281) [ClassicSimilarity], result of:
            0.047377393 = score(doc=3281,freq=1.0), product of:
              0.15571088 = queryWeight, product of:
                1.3006837 = boost
                5.5637054 = idf(docFreq=453, maxDocs=43556)
                0.021517068 = queryNorm
              0.30426514 = fieldWeight in 3281, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5637054 = idf(docFreq=453, maxDocs=43556)
                0.0546875 = fieldNorm(doc=3281)
          0.059877973 = weight(abstract_txt:extensive in 3281) [ClassicSimilarity], result of:
            0.059877973 = score(doc=3281,freq=1.0), product of:
              0.1820188 = queryWeight, product of:
                1.4062754 = boost
                6.015376 = idf(docFreq=288, maxDocs=43556)
                0.021517068 = queryNorm
              0.32896587 = fieldWeight in 3281, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.015376 = idf(docFreq=288, maxDocs=43556)
                0.0546875 = fieldNorm(doc=3281)
          0.018367087 = weight(abstract_txt:have in 3281) [ClassicSimilarity], result of:
            0.018367087 = score(doc=3281,freq=1.0), product of:
              0.10430578 = queryWeight, product of:
                1.5055023 = boost
                3.2199109 = idf(docFreq=4730, maxDocs=43556)
                0.021517068 = queryNorm
              0.17608887 = fieldWeight in 3281, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2199109 = idf(docFreq=4730, maxDocs=43556)
                0.0546875 = fieldNorm(doc=3281)
          0.23450652 = weight(abstract_txt:weights in 3281) [ClassicSimilarity], result of:
            0.23450652 = score(doc=3281,freq=5.0), product of:
              0.2644751 = queryWeight, product of:
                1.6951364 = boost
                7.250986 = idf(docFreq=83, maxDocs=43556)
                0.021517068 = queryNorm
              0.88668656 = fieldWeight in 3281, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.250986 = idf(docFreq=83, maxDocs=43556)
                0.0546875 = fieldNorm(doc=3281)
          0.036532663 = weight(abstract_txt:text in 3281) [ClassicSimilarity], result of:
            0.036532663 = score(doc=3281,freq=1.0), product of:
              0.16496903 = queryWeight, product of:
                1.8933393 = boost
                4.0494018 = idf(docFreq=2063, maxDocs=43556)
                0.021517068 = queryNorm
              0.22145166 = fieldWeight in 3281, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0494018 = idf(docFreq=2063, maxDocs=43556)
                0.0546875 = fieldNorm(doc=3281)
          0.064467244 = weight(abstract_txt:retrieval in 3281) [ClassicSimilarity], result of:
            0.064467244 = score(doc=3281,freq=5.0), product of:
              0.15175892 = queryWeight, product of:
                2.030296 = boost
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.021517068 = queryNorm
              0.42480034 = fieldWeight in 3281, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.0546875 = fieldNorm(doc=3281)
          0.17392881 = weight(abstract_txt:term in 3281) [ClassicSimilarity], result of:
            0.17392881 = score(doc=3281,freq=8.0), product of:
              0.23343496 = queryWeight, product of:
                2.2522168 = boost
                4.816955 = idf(docFreq=957, maxDocs=43556)
                0.021517068 = queryNorm
              0.7450847 = fieldWeight in 3281, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.816955 = idf(docFreq=957, maxDocs=43556)
                0.0546875 = fieldNorm(doc=3281)
          0.09395237 = weight(abstract_txt:document in 3281) [ClassicSimilarity], result of:
            0.09395237 = score(doc=3281,freq=3.0), product of:
              0.23128614 = queryWeight, product of:
                2.5064385 = boost
                4.28854 = idf(docFreq=1624, maxDocs=43556)
                0.021517068 = queryNorm
              0.40621704 = fieldWeight in 3281, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.28854 = idf(docFreq=1624, maxDocs=43556)
                0.0546875 = fieldNorm(doc=3281)
        0.4 = coord(10/25)
    
  2. Maron, M.E.; Kuhns, I.L.: On relevance, probabilistic indexing and information retrieval (1960) 0.26
    0.2609894 = sum of:
      0.2609894 = product of:
        0.81559193 = sum of:
          0.024606392 = weight(abstract_txt:searching in 2926) [ClassicSimilarity], result of:
            0.024606392 = score(doc=2926,freq=1.0), product of:
              0.092039764 = queryWeight, product of:
                4.2775235 = idf(docFreq=1642, maxDocs=43556)
                0.021517068 = queryNorm
              0.26734522 = fieldWeight in 2926, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2775235 = idf(docFreq=1642, maxDocs=43556)
                0.0625 = fieldNorm(doc=2926)
          0.036566883 = weight(abstract_txt:indexing in 2926) [ClassicSimilarity], result of:
            0.036566883 = score(doc=2926,freq=2.0), product of:
              0.09513176 = queryWeight, product of:
                1.0166583 = boost
                4.3487797 = idf(docFreq=1529, maxDocs=43556)
                0.021517068 = queryNorm
              0.38438144 = fieldWeight in 2926, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3487797 = idf(docFreq=1529, maxDocs=43556)
                0.0625 = fieldNorm(doc=2926)
          0.06963193 = weight(abstract_txt:list in 2926) [ClassicSimilarity], result of:
            0.06963193 = score(doc=2926,freq=2.0), product of:
              0.14615236 = queryWeight, product of:
                1.2601295 = boost
                5.3902335 = idf(docFreq=539, maxDocs=43556)
                0.021517068 = queryNorm
              0.4764338 = fieldWeight in 2926, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.3902335 = idf(docFreq=539, maxDocs=43556)
                0.0625 = fieldNorm(doc=2926)
          0.020990957 = weight(abstract_txt:have in 2926) [ClassicSimilarity], result of:
            0.020990957 = score(doc=2926,freq=1.0), product of:
              0.10430578 = queryWeight, product of:
                1.5055023 = boost
                3.2199109 = idf(docFreq=4730, maxDocs=43556)
                0.021517068 = queryNorm
              0.20124443 = fieldWeight in 2926, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2199109 = idf(docFreq=4730, maxDocs=43556)
                0.0625 = fieldNorm(doc=2926)
          0.032949287 = weight(abstract_txt:retrieval in 2926) [ClassicSimilarity], result of:
            0.032949287 = score(doc=2926,freq=1.0), product of:
              0.15175892 = queryWeight, product of:
                2.030296 = boost
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.021517068 = queryNorm
              0.21711598 = fieldWeight in 2926, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.0625 = fieldNorm(doc=2926)
          0.070277855 = weight(abstract_txt:term in 2926) [ClassicSimilarity], result of:
            0.070277855 = score(doc=2926,freq=1.0), product of:
              0.23343496 = queryWeight, product of:
                2.2522168 = boost
                4.816955 = idf(docFreq=957, maxDocs=43556)
                0.021517068 = queryNorm
              0.3010597 = fieldWeight in 2926, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.816955 = idf(docFreq=957, maxDocs=43556)
                0.0625 = fieldNorm(doc=2926)
          0.087670624 = weight(abstract_txt:document in 2926) [ClassicSimilarity], result of:
            0.087670624 = score(doc=2926,freq=2.0), product of:
              0.23128614 = queryWeight, product of:
                2.5064385 = boost
                4.28854 = idf(docFreq=1624, maxDocs=43556)
                0.021517068 = queryNorm
              0.37905696 = fieldWeight in 2926, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.28854 = idf(docFreq=1624, maxDocs=43556)
                0.0625 = fieldNorm(doc=2926)
          0.47289798 = weight(abstract_txt:request in 2926) [ClassicSimilarity], result of:
            0.47289798 = score(doc=2926,freq=5.0), product of:
              0.4865661 = queryWeight, product of:
                3.251608 = boost
                6.954415 = idf(docFreq=112, maxDocs=43556)
                0.021517068 = queryNorm
              0.97190905 = fieldWeight in 2926, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.954415 = idf(docFreq=112, maxDocs=43556)
                0.0625 = fieldNorm(doc=2926)
        0.32 = coord(8/25)
    
  3. Dumais, S.T.: Latent semantic analysis (2003) 0.25
    0.24629201 = sum of:
      0.24629201 = product of:
        0.4736385 = sum of:
          0.017399346 = weight(abstract_txt:searching in 4460) [ClassicSimilarity], result of:
            0.017399346 = score(doc=4460,freq=2.0), product of:
              0.092039764 = queryWeight, product of:
                4.2775235 = idf(docFreq=1642, maxDocs=43556)
                0.021517068 = queryNorm
              0.18904161 = fieldWeight in 4460, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2775235 = idf(docFreq=1642, maxDocs=43556)
                0.03125 = fieldNorm(doc=4460)
          0.018283442 = weight(abstract_txt:indexing in 4460) [ClassicSimilarity], result of:
            0.018283442 = score(doc=4460,freq=2.0), product of:
              0.09513176 = queryWeight, product of:
                1.0166583 = boost
                4.3487797 = idf(docFreq=1529, maxDocs=43556)
                0.021517068 = queryNorm
              0.19219072 = fieldWeight in 4460, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3487797 = idf(docFreq=1529, maxDocs=43556)
                0.03125 = fieldNorm(doc=4460)
          0.031244608 = weight(abstract_txt:large in 4460) [ClassicSimilarity], result of:
            0.031244608 = score(doc=4460,freq=5.0), product of:
              0.10018982 = queryWeight, product of:
                1.0433357 = boost
                4.462893 = idf(docFreq=1364, maxDocs=43556)
                0.021517068 = queryNorm
              0.31185412 = fieldWeight in 4460, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.462893 = idf(docFreq=1364, maxDocs=43556)
                0.03125 = fieldNorm(doc=4460)
          0.020637924 = weight(abstract_txt:techniques in 4460) [ClassicSimilarity], result of:
            0.020637924 = score(doc=4460,freq=2.0), product of:
              0.10313298 = queryWeight, product of:
                1.0585492 = boost
                4.527969 = idf(docFreq=1278, maxDocs=43556)
                0.021517068 = queryNorm
              0.20010984 = fieldWeight in 4460, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.527969 = idf(docFreq=1278, maxDocs=43556)
                0.03125 = fieldNorm(doc=4460)
          0.026977347 = weight(abstract_txt:approaches in 4460) [ClassicSimilarity], result of:
            0.026977347 = score(doc=4460,freq=3.0), product of:
              0.10770998 = queryWeight, product of:
                1.0817832 = boost
                4.627353 = idf(docFreq=1157, maxDocs=43556)
                0.021517068 = queryNorm
              0.25046283 = fieldWeight in 4460, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.627353 = idf(docFreq=1157, maxDocs=43556)
                0.03125 = fieldNorm(doc=4460)
          0.013409075 = weight(abstract_txt:these in 4460) [ClassicSimilarity], result of:
            0.013409075 = score(doc=4460,freq=3.0), product of:
              0.07736647 = queryWeight, product of:
                1.1228824 = boost
                3.2021039 = idf(docFreq=4815, maxDocs=43556)
                0.021517068 = queryNorm
              0.17331895 = fieldWeight in 4460, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.2021039 = idf(docFreq=4815, maxDocs=43556)
                0.03125 = fieldNorm(doc=4460)
          0.024618605 = weight(abstract_txt:list in 4460) [ClassicSimilarity], result of:
            0.024618605 = score(doc=4460,freq=1.0), product of:
              0.14615236 = queryWeight, product of:
                1.2601295 = boost
                5.3902335 = idf(docFreq=539, maxDocs=43556)
                0.021517068 = queryNorm
              0.1684448 = fieldWeight in 4460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3902335 = idf(docFreq=539, maxDocs=43556)
                0.03125 = fieldNorm(doc=4460)
          0.027768426 = weight(abstract_txt:have in 4460) [ClassicSimilarity], result of:
            0.027768426 = score(doc=4460,freq=7.0), product of:
              0.10430578 = queryWeight, product of:
                1.5055023 = boost
                3.2199109 = idf(docFreq=4730, maxDocs=43556)
                0.021517068 = queryNorm
              0.26622134 = fieldWeight in 4460, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.2199109 = idf(docFreq=4730, maxDocs=43556)
                0.03125 = fieldNorm(doc=4460)
          0.051135078 = weight(abstract_txt:text in 4460) [ClassicSimilarity], result of:
            0.051135078 = score(doc=4460,freq=6.0), product of:
              0.16496903 = queryWeight, product of:
                1.8933393 = boost
                4.0494018 = idf(docFreq=2063, maxDocs=43556)
                0.021517068 = queryNorm
              0.30996776 = fieldWeight in 4460, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.0494018 = idf(docFreq=2063, maxDocs=43556)
                0.03125 = fieldNorm(doc=4460)
          0.04942393 = weight(abstract_txt:retrieval in 4460) [ClassicSimilarity], result of:
            0.04942393 = score(doc=4460,freq=9.0), product of:
              0.15175892 = queryWeight, product of:
                2.030296 = boost
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.021517068 = queryNorm
              0.32567397 = fieldWeight in 4460, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.03125 = fieldNorm(doc=4460)
          0.035138927 = weight(abstract_txt:term in 4460) [ClassicSimilarity], result of:
            0.035138927 = score(doc=4460,freq=1.0), product of:
              0.23343496 = queryWeight, product of:
                2.2522168 = boost
                4.816955 = idf(docFreq=957, maxDocs=43556)
                0.021517068 = queryNorm
              0.15052985 = fieldWeight in 4460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.816955 = idf(docFreq=957, maxDocs=43556)
                0.03125 = fieldNorm(doc=4460)
          0.043835312 = weight(abstract_txt:document in 4460) [ClassicSimilarity], result of:
            0.043835312 = score(doc=4460,freq=2.0), product of:
              0.23128614 = queryWeight, product of:
                2.5064385 = boost
                4.28854 = idf(docFreq=1624, maxDocs=43556)
                0.021517068 = queryNorm
              0.18952848 = fieldWeight in 4460, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.28854 = idf(docFreq=1624, maxDocs=43556)
                0.03125 = fieldNorm(doc=4460)
          0.113766484 = weight(abstract_txt:texts in 4460) [ClassicSimilarity], result of:
            0.113766484 = score(doc=4460,freq=4.0), product of:
              0.32183242 = queryWeight, product of:
                2.6444912 = boost
                5.6559367 = idf(docFreq=413, maxDocs=43556)
                0.021517068 = queryNorm
              0.35349604 = fieldWeight in 4460, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.6559367 = idf(docFreq=413, maxDocs=43556)
                0.03125 = fieldNorm(doc=4460)
        0.52 = coord(13/25)
    
  4. Patrick, T.B.; Sievert, M.C.; Popescu, M.: Text indexing of images based on graphical image content (1999) 0.21
    0.2145687 = sum of:
      0.2145687 = product of:
        0.6705272 = sum of:
          0.05781732 = weight(abstract_txt:indexing in 678) [ClassicSimilarity], result of:
            0.05781732 = score(doc=678,freq=5.0), product of:
              0.09513176 = queryWeight, product of:
                1.0166583 = boost
                4.3487797 = idf(docFreq=1529, maxDocs=43556)
                0.021517068 = queryNorm
              0.6077604 = fieldWeight in 678, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.3487797 = idf(docFreq=1529, maxDocs=43556)
                0.0625 = fieldNorm(doc=678)
          0.027946029 = weight(abstract_txt:large in 678) [ClassicSimilarity], result of:
            0.027946029 = score(doc=678,freq=1.0), product of:
              0.10018982 = queryWeight, product of:
                1.0433357 = boost
                4.462893 = idf(docFreq=1364, maxDocs=43556)
                0.021517068 = queryNorm
              0.2789308 = fieldWeight in 678, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.462893 = idf(docFreq=1364, maxDocs=43556)
                0.0625 = fieldNorm(doc=678)
          0.037580527 = weight(abstract_txt:very in 678) [ClassicSimilarity], result of:
            0.037580527 = score(doc=678,freq=1.0), product of:
              0.1220634 = queryWeight, product of:
                1.1516088 = boost
                4.926034 = idf(docFreq=858, maxDocs=43556)
                0.021517068 = queryNorm
              0.30787712 = fieldWeight in 678, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.926034 = idf(docFreq=858, maxDocs=43556)
                0.0625 = fieldNorm(doc=678)
          0.11985658 = weight(abstract_txt:weights in 678) [ClassicSimilarity], result of:
            0.11985658 = score(doc=678,freq=1.0), product of:
              0.2644751 = queryWeight, product of:
                1.6951364 = boost
                7.250986 = idf(docFreq=83, maxDocs=43556)
                0.021517068 = queryNorm
              0.45318663 = fieldWeight in 678, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.250986 = idf(docFreq=83, maxDocs=43556)
                0.0625 = fieldNorm(doc=678)
          0.08350323 = weight(abstract_txt:text in 678) [ClassicSimilarity], result of:
            0.08350323 = score(doc=678,freq=4.0), product of:
              0.16496903 = queryWeight, product of:
                1.8933393 = boost
                4.0494018 = idf(docFreq=2063, maxDocs=43556)
                0.021517068 = queryNorm
              0.5061752 = fieldWeight in 678, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.0494018 = idf(docFreq=2063, maxDocs=43556)
                0.0625 = fieldNorm(doc=678)
          0.032949287 = weight(abstract_txt:retrieval in 678) [ClassicSimilarity], result of:
            0.032949287 = score(doc=678,freq=1.0), product of:
              0.15175892 = queryWeight, product of:
                2.030296 = boost
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.021517068 = queryNorm
              0.21711598 = fieldWeight in 678, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.0625 = fieldNorm(doc=678)
          0.09938789 = weight(abstract_txt:term in 678) [ClassicSimilarity], result of:
            0.09938789 = score(doc=678,freq=2.0), product of:
              0.23343496 = queryWeight, product of:
                2.2522168 = boost
                4.816955 = idf(docFreq=957, maxDocs=43556)
                0.021517068 = queryNorm
              0.42576268 = fieldWeight in 678, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.816955 = idf(docFreq=957, maxDocs=43556)
                0.0625 = fieldNorm(doc=678)
          0.2114864 = weight(abstract_txt:request in 678) [ClassicSimilarity], result of:
            0.2114864 = score(doc=678,freq=1.0), product of:
              0.4865661 = queryWeight, product of:
                3.251608 = boost
                6.954415 = idf(docFreq=112, maxDocs=43556)
                0.021517068 = queryNorm
              0.43465093 = fieldWeight in 678, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.954415 = idf(docFreq=112, maxDocs=43556)
                0.0625 = fieldNorm(doc=678)
        0.32 = coord(8/25)
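
The figures in the explain tree for a single term can be rechecked by hand. The sketch below does this for the term "request" in doc 678, assuming the usual ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), with which the listed values are consistent. The function and variable names are illustrative only and are not part of the listing; boost, queryNorm and fieldNorm are copied from the explain output rather than derived.

```python
import math

def classic_idf(doc_freq, max_docs):
    # Assumed ClassicSimilarity idf: 1 + ln(maxDocs / (docFreq + 1))
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))

def classic_tf(freq):
    # Assumed ClassicSimilarity tf: square root of the raw term frequency
    return math.sqrt(freq)

# Values copied from the explain output for abstract_txt:request in doc 678
BOOST = 3.251608
QUERY_NORM = 0.021517068
FIELD_NORM = 0.0625

idf = classic_idf(doc_freq=112, max_docs=43556)
query_weight = BOOST * idf * QUERY_NORM                # listed as 0.4865661
field_weight = classic_tf(1.0) * idf * FIELD_NORM      # listed as 0.43465093
term_score = query_weight * field_weight               # listed as 0.2114864

print(f"idf={idf:.6f} queryWeight={query_weight:.7f} "
      f"fieldWeight={field_weight:.7f} score={term_score:.7f}")
```

Small differences in the last digits are to be expected, since the listing was produced with single-precision arithmetic and Python uses doubles.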
    
  5. Can, F.; Kocberber, S.; Balcik, E.; Kaynak, C.; Ocalan, H.C.: Information retrieval on Turkish texts (2008) 0.21
    0.20809701 = sum of:
      0.20809701 = product of:
        0.6503032 = sum of:
          0.038785037 = weight(abstract_txt:indexing in 3371) [ClassicSimilarity], result of:
            0.038785037 = score(doc=3371,freq=1.0), product of:
              0.09513176 = queryWeight, product of:
                1.0166583 = boost
                4.3487797 = idf(docFreq=1529, maxDocs=43556)
                0.021517068 = queryNorm
              0.4076981 = fieldWeight in 3371, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3487797 = idf(docFreq=1529, maxDocs=43556)
                0.09375 = fieldNorm(doc=3371)
          0.041919045 = weight(abstract_txt:large in 3371) [ClassicSimilarity], result of:
            0.041919045 = score(doc=3371,freq=1.0), product of:
              0.10018982 = queryWeight, product of:
                1.0433357 = boost
                4.462893 = idf(docFreq=1364, maxDocs=43556)
                0.021517068 = queryNorm
              0.41839623 = fieldWeight in 3371, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.462893 = idf(docFreq=1364, maxDocs=43556)
                0.09375 = fieldNorm(doc=3371)
          0.023225201 = weight(abstract_txt:these in 3371) [ClassicSimilarity], result of:
            0.023225201 = score(doc=3371,freq=1.0), product of:
              0.07736647 = queryWeight, product of:
                1.1228824 = boost
                3.2021039 = idf(docFreq=4815, maxDocs=43556)
                0.021517068 = queryNorm
              0.30019724 = fieldWeight in 3371, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2021039 = idf(docFreq=4815, maxDocs=43556)
                0.09375 = fieldNorm(doc=3371)
          0.07151456 = weight(abstract_txt:simple in 3371) [ClassicSimilarity], result of:
            0.07151456 = score(doc=3371,freq=1.0), product of:
              0.1430471 = queryWeight, product of:
                1.2466707 = boost
                5.3326635 = idf(docFreq=571, maxDocs=43556)
                0.021517068 = queryNorm
              0.4999372 = fieldWeight in 3371, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3326635 = idf(docFreq=571, maxDocs=43556)
                0.09375 = fieldNorm(doc=3371)
          0.07385581 = weight(abstract_txt:list in 3371) [ClassicSimilarity], result of:
            0.07385581 = score(doc=3371,freq=1.0), product of:
              0.14615236 = queryWeight, product of:
                1.2601295 = boost
                5.3902335 = idf(docFreq=539, maxDocs=43556)
                0.021517068 = queryNorm
              0.5053344 = fieldWeight in 3371, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3902335 = idf(docFreq=539, maxDocs=43556)
                0.09375 = fieldNorm(doc=3371)
          0.09884786 = weight(abstract_txt:retrieval in 3371) [ClassicSimilarity], result of:
            0.09884786 = score(doc=3371,freq=4.0), product of:
              0.15175892 = queryWeight, product of:
                2.030296 = boost
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.021517068 = queryNorm
              0.65134794 = fieldWeight in 3371, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.09375 = fieldNorm(doc=3371)
          0.13150594 = weight(abstract_txt:document in 3371) [ClassicSimilarity], result of:
            0.13150594 = score(doc=3371,freq=2.0), product of:
              0.23128614 = queryWeight, product of:
                2.5064385 = boost
                4.28854 = idf(docFreq=1624, maxDocs=43556)
                0.021517068 = queryNorm
              0.56858546 = fieldWeight in 3371, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.28854 = idf(docFreq=1624, maxDocs=43556)
                0.09375 = fieldNorm(doc=3371)
          0.17064972 = weight(abstract_txt:texts in 3371) [ClassicSimilarity], result of:
            0.17064972 = score(doc=3371,freq=1.0), product of:
              0.32183242 = queryWeight, product of:
                2.6444912 = boost
                5.6559367 = idf(docFreq=413, maxDocs=43556)
                0.021517068 = queryNorm
              0.53024405 = fieldWeight in 3371, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6559367 = idf(docFreq=413, maxDocs=43556)
                0.09375 = fieldNorm(doc=3371)
        0.32 = coord(8/25)
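
The document-level score for hit 5 (doc 3371) can be reassembled in the same way: each matching term contributes queryWeight times fieldWeight, the contributions are summed, and the sum is scaled by coord(8/25), i.e. 8 of the 25 query terms matched. The following is a minimal sketch under the same assumed tf and idf definitions; the `TERMS` table and the helper name `term_score` are hypothetical, while the per-term boost, docFreq and freq values are taken from the listing.

```python
import math

MAX_DOCS = 43556
QUERY_NORM = 0.021517068
FIELD_NORM = 0.09375          # fieldNorm(doc=3371)
COORD = 8 / 25                # 8 of 25 query terms matched

# (term, boost, docFreq, freq in doc 3371), copied from the explain output
TERMS = [
    ("indexing",  1.0166583, 1529, 1.0),
    ("large",     1.0433357, 1364, 1.0),
    ("these",     1.1228824, 4815, 1.0),
    ("simple",    1.2466707,  571, 1.0),
    ("list",      1.2601295,  539, 1.0),
    ("retrieval", 2.030296,  3669, 4.0),
    ("document",  2.5064385, 1624, 2.0),
    ("texts",     2.6444912,  413, 1.0),
]

def term_score(boost, doc_freq, freq):
    # Assumed ClassicSimilarity formulas: tf = sqrt(freq),
    # idf = 1 + ln(maxDocs / (docFreq + 1))
    idf = 1.0 + math.log(MAX_DOCS / (doc_freq + 1.0))
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * FIELD_NORM
    return query_weight * field_weight

raw_sum = sum(term_score(boost, df, freq) for _, boost, df, freq in TERMS)
total = COORD * raw_sum

# The listing reports 0.6503032 for the sum and 0.20809701 for the product.
print(f"sum={raw_sum:.7f} score={total:.8f}")
```

The coord factor explains why a document matching only a third of the query terms ends up with a displayed score of about 0.21 even though its summed term weights are roughly three times larger.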