Document (#29533)

Robertson, S.E.
Sparck Jones, K.
Simple, proven approaches to text retrieval
May, 1997, Update of 1994 and 1996 versions.
Technical Report TR356, University of Cambridge, Computer Laboratory
This technical note describes straightforward techniques for document indexing and retrieval that have been solidly established through extensive testing and are easy to apply. They are useful for many different types of text material, are viable for very large files, and have the advantage that they do not require special skills or training for searching, but are easy for end users. The document and text retrieval methods described here have a sound theoretical basis, are well established by extensive testing, and the ideas involved are now implemented in some commercial retrieval systems. Testing in the last few years has, in particular, shown that the methods presented here work very well with full texts, not only titles and abstracts, and with large files of texts containing three quarters of a million documents. These tests, the TREC Tests (see Harman 1993 - 1997; IP&M 1995), have been rigorous comparative evaluations involving many different approaches to information retrieval. These techniques depend on the use of simple terms for indexing both request and document texts; on term weighting exploiting statistical information about term occurrences; on scoring for request-document matching, using these weights, to obtain a ranked search output; and on relevance feedback to modify request weights or term sets in iterative searching. The normal implementation is via an inverted file organisation using a term list with linked document identifiers, plus counting data, and pointers to the actual texts. The user's request can be a word list, phrases, sentences or extended text.
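The pipeline outlined in the abstract (simple terms, statistical term weights, summed scores, ranked output, inverted file) can be illustrated with a minimal Python sketch. It is not taken from the report: the tokenizer, the function names, the sample documents, and the weight log(N/n) are all assumptions chosen for illustration, and relevance feedback is left out.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Split text into lowercase word terms (a deliberately simple indexer)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_inverted_file(docs):
    """Map each term to {doc_id: within-document frequency}.

    The frequencies are the "counting data"; this simple scorer does not use them.
    """
    index = defaultdict(dict)
    for doc_id, text in docs.items():
        for term, freq in Counter(tokenize(text)).items():
            index[term][doc_id] = freq
    return index


def rank(request, index, n_docs):
    """Score documents by summing the weights of matching request terms."""
    scores = defaultdict(float)
    for term in set(tokenize(request)):
        postings = index.get(term)
        if not postings:
            continue
        # Assumed collection-frequency weight: rarer terms count for more.
        weight = math.log(n_docs / len(postings))
        for doc_id in postings:
            scores[doc_id] += weight
    # Highest score first; ties broken by document identifier.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


# Hypothetical three-document collection.
docs = {
    "d1": "term weighting for text retrieval",
    "d2": "relevance feedback in iterative searching",
    "d3": "inverted file organisation for text",
}
index = build_inverted_file(docs)
ranking = rank("text retrieval weighting", index, len(docs))
```

Here the request is free text, as the abstract allows; documents that share no term with the request (d2 in this sample) simply do not appear in the ranking.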

Similar documents (author)

  1. Robertson, S.E.; Sparck Jones, K.: Relevance weighting of search terms (1976) 5.64
    5.6415434 = sum of:
      5.6415434 = sum of:
        1.5008372 = weight(author_txt:jones in 71) [ClassicSimilarity], result of:
          1.5008372 = score(doc=71,freq=1.0), product of:
            0.49468306 = queryWeight, product of:
              6.9347134 = idf(docFreq=116, maxDocs=44218)
              0.07133432 = queryNorm
            3.033937 = fieldWeight in 71, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.9347134 = idf(docFreq=116, maxDocs=44218)
              0.4375 = fieldNorm(doc=71)
        1.7614334 = weight(author_txt:robertson in 71) [ClassicSimilarity], result of:
          1.7614334 = score(doc=71,freq=1.0), product of:
            0.55040467 = queryWeight, product of:
              1.054818 = boost
              7.314861 = idf(docFreq=79, maxDocs=44218)
              0.07133432 = queryNorm
            3.2002516 = fieldWeight in 71, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.314861 = idf(docFreq=79, maxDocs=44218)
              0.4375 = fieldNorm(doc=71)
        2.379273 = weight(author_txt:sparck in 71) [ClassicSimilarity], result of:
          2.379273 = score(doc=71,freq=1.0), product of:
            0.6725648 = queryWeight, product of:
              1.1660135 = boost
              8.085969 = idf(docFreq=36, maxDocs=44218)
              0.07133432 = queryNorm
            3.5376115 = fieldWeight in 71, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.085969 = idf(docFreq=36, maxDocs=44218)
              0.4375 = fieldNorm(doc=71)
  2. Sparck Jones, K.; Walker, S.; Robertson, S.E.: ¬A probabilistic model of information retrieval : development and comparative experiments - part 1 (2000) 4.84
    4.835609 = sum of:
      4.835609 = sum of:
        1.2864319 = weight(author_txt:jones in 4181) [ClassicSimilarity], result of:
          1.2864319 = score(doc=4181,freq=1.0), product of:
            0.49468306 = queryWeight, product of:
              6.9347134 = idf(docFreq=116, maxDocs=44218)
              0.07133432 = queryNorm
            2.6005175 = fieldWeight in 4181, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.9347134 = idf(docFreq=116, maxDocs=44218)
              0.375 = fieldNorm(doc=4181)
        1.5098001 = weight(author_txt:robertson in 4181) [ClassicSimilarity], result of:
          1.5098001 = score(doc=4181,freq=1.0), product of:
            0.55040467 = queryWeight, product of:
              1.054818 = boost
              7.314861 = idf(docFreq=79, maxDocs=44218)
              0.07133432 = queryNorm
            2.7430727 = fieldWeight in 4181, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.314861 = idf(docFreq=79, maxDocs=44218)
              0.375 = fieldNorm(doc=4181)
        2.039377 = weight(author_txt:sparck in 4181) [ClassicSimilarity], result of:
          2.039377 = score(doc=4181,freq=1.0), product of:
            0.6725648 = queryWeight, product of:
              1.1660135 = boost
              8.085969 = idf(docFreq=36, maxDocs=44218)
              0.07133432 = queryNorm
            3.0322385 = fieldWeight in 4181, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.085969 = idf(docFreq=36, maxDocs=44218)
              0.375 = fieldNorm(doc=4181)
  3. Sparck Jones, K.; Walker, S.; Robertson, S.E.: ¬A probabilistic model of information retrieval : development and comparative experiments - part 2 (2000) 4.84
    4.835609 = sum of:
      4.835609 = sum of:
        1.2864319 = weight(author_txt:jones in 4286) [ClassicSimilarity], result of:
          1.2864319 = score(doc=4286,freq=1.0), product of:
            0.49468306 = queryWeight, product of:
              6.9347134 = idf(docFreq=116, maxDocs=44218)
              0.07133432 = queryNorm
            2.6005175 = fieldWeight in 4286, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.9347134 = idf(docFreq=116, maxDocs=44218)
              0.375 = fieldNorm(doc=4286)
        1.5098001 = weight(author_txt:robertson in 4286) [ClassicSimilarity], result of:
          1.5098001 = score(doc=4286,freq=1.0), product of:
            0.55040467 = queryWeight, product of:
              1.054818 = boost
              7.314861 = idf(docFreq=79, maxDocs=44218)
              0.07133432 = queryNorm
            2.7430727 = fieldWeight in 4286, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.314861 = idf(docFreq=79, maxDocs=44218)
              0.375 = fieldNorm(doc=4286)
        2.039377 = weight(author_txt:sparck in 4286) [ClassicSimilarity], result of:
          2.039377 = score(doc=4286,freq=1.0), product of:
            0.6725648 = queryWeight, product of:
              1.1660135 = boost
              8.085969 = idf(docFreq=36, maxDocs=44218)
              0.07133432 = queryNorm
            3.0322385 = fieldWeight in 4286, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.085969 = idf(docFreq=36, maxDocs=44218)
              0.375 = fieldNorm(doc=4286)
  4. Sparck Jones, K.: Fashionable trends and feasible strategies in information management (1988) 2.96
    2.9562747 = sum of:
      2.9562747 = product of:
        4.434412 = sum of:
          1.7152426 = weight(author_txt:jones in 817) [ClassicSimilarity], result of:
            1.7152426 = score(doc=817,freq=1.0), product of:
              0.49468306 = queryWeight, product of:
                6.9347134 = idf(docFreq=116, maxDocs=44218)
                0.07133432 = queryNorm
              3.4673567 = fieldWeight in 817, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9347134 = idf(docFreq=116, maxDocs=44218)
                0.5 = fieldNorm(doc=817)
          2.7191691 = weight(author_txt:sparck in 817) [ClassicSimilarity], result of:
            2.7191691 = score(doc=817,freq=1.0), product of:
              0.6725648 = queryWeight, product of:
                1.1660135 = boost
                8.085969 = idf(docFreq=36, maxDocs=44218)
                0.07133432 = queryNorm
              4.0429845 = fieldWeight in 817, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.085969 = idf(docFreq=36, maxDocs=44218)
                0.5 = fieldNorm(doc=817)
        0.6666667 = coord(2/3)
  5. Sparck Jones, K.: Automatic classification (1976) 2.96
    2.9562747 = sum of:
      2.9562747 = product of:
        4.434412 = sum of:
          1.7152426 = weight(author_txt:jones in 2908) [ClassicSimilarity], result of:
            1.7152426 = score(doc=2908,freq=1.0), product of:
              0.49468306 = queryWeight, product of:
                6.9347134 = idf(docFreq=116, maxDocs=44218)
                0.07133432 = queryNorm
              3.4673567 = fieldWeight in 2908, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9347134 = idf(docFreq=116, maxDocs=44218)
                0.5 = fieldNorm(doc=2908)
          2.7191691 = weight(author_txt:sparck in 2908) [ClassicSimilarity], result of:
            2.7191691 = score(doc=2908,freq=1.0), product of:
              0.6725648 = queryWeight, product of:
                1.1660135 = boost
                8.085969 = idf(docFreq=36, maxDocs=44218)
                0.07133432 = queryNorm
              4.0429845 = fieldWeight in 2908, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.085969 = idf(docFreq=36, maxDocs=44218)
                0.5 = fieldNorm(doc=2908)
        0.6666667 = coord(2/3)
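
The arithmetic in the score explanations above can be reproduced with a short Python sketch. The formulas are inferred from the listed figures rather than quoted from the listing: tf appears to be the square root of the term frequency, and idf appears to be 1 + ln(maxDocs / (docFreq + 1)), which agrees with the printed idf values to about six decimal places. The function names are hypothetical.

```python
import math


def idf(doc_freq, max_docs):
    """Inverse document frequency, as inferred from the explain output."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    """Per-term score = queryWeight * fieldWeight."""
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


# Values copied from the first author match (doc=71, term "jones", no boost).
jones = term_score(freq=1.0, doc_freq=116, max_docs=44218,
                   query_norm=0.07133432, field_norm=0.4375)

# Fourth author match (doc=817): only two of the three query terms match,
# so the summed term scores are scaled by coord(2/3).
doc_817 = (1.7152426 + 2.7191691) * (2 / 3)
```

Small differences in the last digits are to be expected, since the listing appears to use single-precision values while Python floats are double precision.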

Similar documents (content)

  1. Dang, E.K.F.; Luk, R.W.P.; Allan, J.: Beyond bag-of-words : bigram-enhanced context-dependent term weights (2014) 0.32
    0.31601536 = sum of:
      0.31601536 = product of:
        0.7900384 = sum of:
          0.024361953 = weight(abstract_txt:large in 1283) [ClassicSimilarity], result of:
            0.024361953 = score(doc=1283,freq=1.0), product of:
              0.100015 = queryWeight, product of:
                1.0395269 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.021600833 = queryNorm
              0.243583 = fieldWeight in 1283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1283)
          0.038161293 = weight(abstract_txt:approaches in 1283) [ClassicSimilarity], result of:
            0.038161293 = score(doc=1283,freq=2.0), product of:
              0.10706868 = queryWeight, product of:
                1.0755594 = boost
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.021600833 = queryNorm
              0.35641882 = fieldWeight in 1283, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1283)
          0.047037244 = weight(abstract_txt:established in 1283) [ClassicSimilarity], result of:
            0.047037244 = score(doc=1283,freq=1.0), product of:
              0.15507852 = queryWeight, product of:
                1.2944312 = boost
                5.5462847 = idf(docFreq=468, maxDocs=44218)
                0.021600833 = queryNorm
              0.30331245 = fieldWeight in 1283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5462847 = idf(docFreq=468, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1283)
          0.05994813 = weight(abstract_txt:extensive in 1283) [ClassicSimilarity], result of:
            0.05994813 = score(doc=1283,freq=1.0), product of:
              0.18229474 = queryWeight, product of:
                1.4034283 = boost
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.021600833 = queryNorm
              0.32885277 = fieldWeight in 1283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1283)
          0.01814628 = weight(abstract_txt:have in 1283) [ClassicSimilarity], result of:
            0.01814628 = score(doc=1283,freq=1.0), product of:
              0.10354413 = queryWeight, product of:
                1.4958254 = boost
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.021600833 = queryNorm
              0.17525166 = fieldWeight in 1283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1283)
          0.23420332 = weight(abstract_txt:weights in 1283) [ClassicSimilarity], result of:
            0.23420332 = score(doc=1283,freq=5.0), product of:
              0.2644412 = queryWeight, product of:
                1.6903154 = boost
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.021600833 = queryNorm
              0.88565373 = fieldWeight in 1283, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1283)
          0.03646329 = weight(abstract_txt:text in 1283) [ClassicSimilarity], result of:
            0.03646329 = score(doc=1283,freq=1.0), product of:
              0.16488114 = queryWeight, product of:
                1.8875725 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.021600833 = queryNorm
              0.22114895 = fieldWeight in 1283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1283)
          0.06468105 = weight(abstract_txt:retrieval in 1283) [ClassicSimilarity], result of:
            0.06468105 = score(doc=1283,freq=5.0), product of:
              0.152206 = queryWeight, product of:
                2.0276315 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.021600833 = queryNorm
              0.4249573 = fieldWeight in 1283, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1283)
          0.17260805 = weight(abstract_txt:term in 1283) [ClassicSimilarity], result of:
            0.17260805 = score(doc=1283,freq=8.0), product of:
              0.23242229 = queryWeight, product of:
                2.2410784 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.021600833 = queryNorm
              0.7426484 = fieldWeight in 1283, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1283)
          0.09442779 = weight(abstract_txt:document in 1283) [ClassicSimilarity], result of:
            0.09442779 = score(doc=1283,freq=3.0), product of:
              0.23223618 = queryWeight, product of:
                2.5045984 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.021600833 = queryNorm
              0.4066024 = fieldWeight in 1283, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1283)
        0.4 = coord(10/25)
  2. Maron, M.E.; Kuhns, I.L.: On relevance, probabilistic indexing and information retrieval (1960) 0.26
    0.2600389 = sum of:
      0.2600389 = product of:
        0.8126216 = sum of:
          0.024785452 = weight(abstract_txt:searching in 1928) [ClassicSimilarity], result of:
            0.024785452 = score(doc=1928,freq=1.0), product of:
              0.092553675 = queryWeight, product of:
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.021600833 = queryNorm
              0.26779544 = fieldWeight in 1928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.0625 = fieldNorm(doc=1928)
          0.036667943 = weight(abstract_txt:indexing in 1928) [ClassicSimilarity], result of:
            0.036667943 = score(doc=1928,freq=2.0), product of:
              0.09537696 = queryWeight, product of:
                1.0151376 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.021600833 = queryNorm
              0.38445285 = fieldWeight in 1928, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.0625 = fieldNorm(doc=1928)
          0.06958832 = weight(abstract_txt:list in 1928) [ClassicSimilarity], result of:
            0.06958832 = score(doc=1928,freq=2.0), product of:
              0.14619865 = queryWeight, product of:
                1.2568251 = boost
                5.3851523 = idf(docFreq=550, maxDocs=44218)
                0.021600833 = queryNorm
              0.47598472 = fieldWeight in 1928, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.3851523 = idf(docFreq=550, maxDocs=44218)
                0.0625 = fieldNorm(doc=1928)
          0.020738607 = weight(abstract_txt:have in 1928) [ClassicSimilarity], result of:
            0.020738607 = score(doc=1928,freq=1.0), product of:
              0.10354413 = queryWeight, product of:
                1.4958254 = boost
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.021600833 = queryNorm
              0.20028761 = fieldWeight in 1928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.0625 = fieldNorm(doc=1928)
          0.03305857 = weight(abstract_txt:retrieval in 1928) [ClassicSimilarity], result of:
            0.03305857 = score(doc=1928,freq=1.0), product of:
              0.152206 = queryWeight, product of:
                2.0276315 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.021600833 = queryNorm
              0.21719621 = fieldWeight in 1928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=1928)
          0.069744185 = weight(abstract_txt:term in 1928) [ClassicSimilarity], result of:
            0.069744185 = score(doc=1928,freq=1.0), product of:
              0.23242229 = queryWeight, product of:
                2.2410784 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.021600833 = queryNorm
              0.3000753 = fieldWeight in 1928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.0625 = fieldNorm(doc=1928)
          0.08811425 = weight(abstract_txt:document in 1928) [ClassicSimilarity], result of:
            0.08811425 = score(doc=1928,freq=2.0), product of:
              0.23223618 = queryWeight, product of:
                2.5045984 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.021600833 = queryNorm
              0.37941656 = fieldWeight in 1928, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=1928)
          0.4699243 = weight(abstract_txt:request in 1928) [ClassicSimilarity], result of:
            0.4699243 = score(doc=1928,freq=5.0), product of:
              0.4848801 = queryWeight, product of:
                3.2369452 = boost
                6.9347134 = idf(docFreq=116, maxDocs=44218)
                0.021600833 = queryNorm
              0.96915567 = fieldWeight in 1928, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.9347134 = idf(docFreq=116, maxDocs=44218)
                0.0625 = fieldNorm(doc=1928)
        0.32 = coord(8/25)
  3. Dumais, S.T.: Latent semantic analysis (2003) 0.25
    0.24605204 = sum of:
      0.24605204 = product of:
        0.47317702 = sum of:
          0.017525962 = weight(abstract_txt:searching in 2462) [ClassicSimilarity], result of:
            0.017525962 = score(doc=2462,freq=2.0), product of:
              0.092553675 = queryWeight, product of:
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.021600833 = queryNorm
              0.18935998 = fieldWeight in 2462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.018333972 = weight(abstract_txt:indexing in 2462) [ClassicSimilarity], result of:
            0.018333972 = score(doc=2462,freq=2.0), product of:
              0.09537696 = queryWeight, product of:
                1.0151376 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.021600833 = queryNorm
              0.19222642 = fieldWeight in 2462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.031128563 = weight(abstract_txt:large in 2462) [ClassicSimilarity], result of:
            0.031128563 = score(doc=2462,freq=5.0), product of:
              0.100015 = queryWeight, product of:
                1.0395269 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.021600833 = queryNorm
              0.31123894 = fieldWeight in 2462, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.020709218 = weight(abstract_txt:techniques in 2462) [ClassicSimilarity], result of:
            0.020709218 = score(doc=2462,freq=2.0), product of:
              0.10344629 = queryWeight, product of:
                1.0572084 = boost
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.021600833 = queryNorm
              0.20019296 = fieldWeight in 2462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.02670734 = weight(abstract_txt:approaches in 2462) [ClassicSimilarity], result of:
            0.02670734 = score(doc=2462,freq=3.0), product of:
              0.10706868 = queryWeight, product of:
                1.0755594 = boost
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.021600833 = queryNorm
              0.2494412 = fieldWeight in 2462, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.013263467 = weight(abstract_txt:these in 2462) [ClassicSimilarity], result of:
            0.013263467 = score(doc=2462,freq=3.0), product of:
              0.076861784 = queryWeight, product of:
                1.1161023 = boost
                3.1881294 = idf(docFreq=4957, maxDocs=44218)
                0.021600833 = queryNorm
              0.17256257 = fieldWeight in 2462, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.1881294 = idf(docFreq=4957, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.024603186 = weight(abstract_txt:list in 2462) [ClassicSimilarity], result of:
            0.024603186 = score(doc=2462,freq=1.0), product of:
              0.14619865 = queryWeight, product of:
                1.2568251 = boost
                5.3851523 = idf(docFreq=550, maxDocs=44218)
                0.021600833 = queryNorm
              0.16828601 = fieldWeight in 2462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3851523 = idf(docFreq=550, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.027434599 = weight(abstract_txt:have in 2462) [ClassicSimilarity], result of:
            0.027434599 = score(doc=2462,freq=7.0), product of:
              0.10354413 = queryWeight, product of:
                1.4958254 = boost
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.021600833 = queryNorm
              0.2649556 = fieldWeight in 2462, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.05103798 = weight(abstract_txt:text in 2462) [ClassicSimilarity], result of:
            0.05103798 = score(doc=2462,freq=6.0), product of:
              0.16488114 = queryWeight, product of:
                1.8875725 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.021600833 = queryNorm
              0.30954406 = fieldWeight in 2462, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.04958785 = weight(abstract_txt:retrieval in 2462) [ClassicSimilarity], result of:
            0.04958785 = score(doc=2462,freq=9.0), product of:
              0.152206 = queryWeight, product of:
                2.0276315 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.021600833 = queryNorm
              0.3257943 = fieldWeight in 2462, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.034872092 = weight(abstract_txt:term in 2462) [ClassicSimilarity], result of:
            0.034872092 = score(doc=2462,freq=1.0), product of:
              0.23242229 = queryWeight, product of:
                2.2410784 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.021600833 = queryNorm
              0.15003765 = fieldWeight in 2462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.044057123 = weight(abstract_txt:document in 2462) [ClassicSimilarity], result of:
            0.044057123 = score(doc=2462,freq=2.0), product of:
              0.23223618 = queryWeight, product of:
                2.5045984 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.021600833 = queryNorm
              0.18970828 = fieldWeight in 2462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.11391565 = weight(abstract_txt:texts in 2462) [ClassicSimilarity], result of:
            0.11391565 = score(doc=2462,freq=4.0), product of:
              0.32235026 = queryWeight, product of:
                2.63926 = boost
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.021600833 = queryNorm
              0.3533909 = fieldWeight in 2462, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
        0.52 = coord(13/25)
  4. Patrick, T.B.; Sievert, M.C.; Popescu, M.: Text indexing of images based on graphical image content (1999) 0.21
    0.21384765 = sum of:
      0.21384765 = product of:
        0.6682739 = sum of:
          0.05797711 = weight(abstract_txt:indexing in 6680) [ClassicSimilarity], result of:
            0.05797711 = score(doc=6680,freq=5.0), product of:
              0.09537696 = queryWeight, product of:
                1.0151376 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.021600833 = queryNorm
              0.6078733 = fieldWeight in 6680, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.0625 = fieldNorm(doc=6680)
          0.027842233 = weight(abstract_txt:large in 6680) [ClassicSimilarity], result of:
            0.027842233 = score(doc=6680,freq=1.0), product of:
              0.100015 = queryWeight, product of:
                1.0395269 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.021600833 = queryNorm
              0.27838057 = fieldWeight in 6680, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.0625 = fieldNorm(doc=6680)
          0.03755999 = weight(abstract_txt:very in 6680) [ClassicSimilarity], result of:
            0.03755999 = score(doc=6680,freq=1.0), product of:
              0.12210855 = queryWeight, product of:
                1.1486195 = boost
                4.921521 = idf(docFreq=875, maxDocs=44218)
                0.021600833 = queryNorm
              0.30759507 = fieldWeight in 6680, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.921521 = idf(docFreq=875, maxDocs=44218)
                0.0625 = fieldNorm(doc=6680)
          0.11970162 = weight(abstract_txt:weights in 6680) [ClassicSimilarity], result of:
            0.11970162 = score(doc=6680,freq=1.0), product of:
              0.2644412 = queryWeight, product of:
                1.6903154 = boost
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.021600833 = queryNorm
              0.45265874 = fieldWeight in 6680, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.0625 = fieldNorm(doc=6680)
          0.08334467 = weight(abstract_txt:text in 6680) [ClassicSimilarity], result of:
            0.08334467 = score(doc=6680,freq=4.0), product of:
              0.16488114 = queryWeight, product of:
                1.8875725 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.021600833 = queryNorm
              0.5054833 = fieldWeight in 6680, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=6680)
          0.03305857 = weight(abstract_txt:retrieval in 6680) [ClassicSimilarity], result of:
            0.03305857 = score(doc=6680,freq=1.0), product of:
              0.152206 = queryWeight, product of:
                2.0276315 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.021600833 = queryNorm
              0.21719621 = fieldWeight in 6680, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=6680)
          0.09863317 = weight(abstract_txt:term in 6680) [ClassicSimilarity], result of:
            0.09863317 = score(doc=6680,freq=2.0), product of:
              0.23242229 = queryWeight, product of:
                2.2410784 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.021600833 = queryNorm
              0.42437053 = fieldWeight in 6680, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.0625 = fieldNorm(doc=6680)
          0.21015653 = weight(abstract_txt:request in 6680) [ClassicSimilarity], result of:
            0.21015653 = score(doc=6680,freq=1.0), product of:
              0.4848801 = queryWeight, product of:
                3.2369452 = boost
                6.9347134 = idf(docFreq=116, maxDocs=44218)
                0.021600833 = queryNorm
              0.4334196 = fieldWeight in 6680, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9347134 = idf(docFreq=116, maxDocs=44218)
                0.0625 = fieldNorm(doc=6680)
        0.32 = coord(8/25)
  5. Can, F.; Kocberber, S.; Balcik, E.; Kaynak, C.; Ocalan, H.C.: Information retrieval on Turkish texts (2008) 0.21
    0.20828682 = sum of:
      0.20828682 = product of:
        0.6508963 = sum of:
          0.03889223 = weight(abstract_txt:indexing in 1373) [ClassicSimilarity], result of:
            0.03889223 = score(doc=1373,freq=1.0), product of:
              0.09537696 = queryWeight, product of:
                1.0151376 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.021600833 = queryNorm
              0.40777382 = fieldWeight in 1373, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.09375 = fieldNorm(doc=1373)
          0.04176335 = weight(abstract_txt:large in 1373) [ClassicSimilarity], result of:
            0.04176335 = score(doc=1373,freq=1.0), product of:
              0.100015 = queryWeight, product of:
                1.0395269 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.021600833 = queryNorm
              0.41757086 = fieldWeight in 1373, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.09375 = fieldNorm(doc=1373)
          0.022973 = weight(abstract_txt:these in 1373) [ClassicSimilarity], result of:
            0.022973 = score(doc=1373,freq=1.0), product of:
              0.076861784 = queryWeight, product of:
                1.1161023 = boost
                3.1881294 = idf(docFreq=4957, maxDocs=44218)
                0.021600833 = queryNorm
              0.29888713 = fieldWeight in 1373, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1881294 = idf(docFreq=4957, maxDocs=44218)
                0.09375 = fieldNorm(doc=1373)
          0.07123764 = weight(abstract_txt:simple in 1373) [ClassicSimilarity], result of:
            0.07123764 = score(doc=1373,freq=1.0), product of:
              0.14278238 = queryWeight, product of:
                1.242054 = boost
                5.321862 = idf(docFreq=586, maxDocs=44218)
                0.021600833 = queryNorm
              0.49892458 = fieldWeight in 1373, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.321862 = idf(docFreq=586, maxDocs=44218)
                0.09375 = fieldNorm(doc=1373)
          0.07380956 = weight(abstract_txt:list in 1373) [ClassicSimilarity], result of:
            0.07380956 = score(doc=1373,freq=1.0), product of:
              0.14619865 = queryWeight, product of:
                1.2568251 = boost
                5.3851523 = idf(docFreq=550, maxDocs=44218)
                0.021600833 = queryNorm
              0.504858 = fieldWeight in 1373, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3851523 = idf(docFreq=550, maxDocs=44218)
                0.09375 = fieldNorm(doc=1373)
          0.0991757 = weight(abstract_txt:retrieval in 1373) [ClassicSimilarity], result of:
            0.0991757 = score(doc=1373,freq=4.0), product of:
              0.152206 = queryWeight, product of:
                2.0276315 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.021600833 = queryNorm
              0.6515886 = fieldWeight in 1373, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=1373)
          0.13217138 = weight(abstract_txt:document in 1373) [ClassicSimilarity], result of:
            0.13217138 = score(doc=1373,freq=2.0), product of:
              0.23223618 = queryWeight, product of:
                2.5045984 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.021600833 = queryNorm
              0.5691248 = fieldWeight in 1373, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.09375 = fieldNorm(doc=1373)
          0.17087348 = weight(abstract_txt:texts in 1373) [ClassicSimilarity], result of:
            0.17087348 = score(doc=1373,freq=1.0), product of:
              0.32235026 = queryWeight, product of:
                2.63926 = boost
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.021600833 = queryNorm
              0.53008634 = fieldWeight in 1373, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.09375 = fieldNorm(doc=1373)
        0.32 = coord(8/25)
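The arithmetic in the explain tree for result 5 (doc 1373) can be re-derived by hand. The following is a minimal sketch, not the search engine's own code. It assumes the relationships that the listing itself displays: tf is the square root of termFreq, queryWeight = boost × idf × queryNorm, fieldWeight = tf × idf × fieldNorm, and the document score is coord times the sum of the term scores. The idf formula, 1 + ln(maxDocs / (docFreq + 1)), is an inference that agrees with the idf values shown above. The names `idf`, `term_score`, `document_score` and `TERMS_1373` are hypothetical and chosen only for illustration.

```python
import math

# Constants copied from the explain output above.
QUERY_NORM = 0.021600833
MAX_DOCS = 44218


def idf(doc_freq, max_docs=MAX_DOCS):
    # Assumed formula; it agrees with the idf values in the listing,
    # e.g. docFreq=1397 gives roughly 4.454089.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost, doc_freq, freq, field_norm, query_norm=QUERY_NORM):
    # score = queryWeight * fieldWeight, as laid out in the tree.
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * query_norm
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


def document_score(terms, field_norm, matched, total_query_terms):
    # Sum the per-term scores, then apply coord(matched/total).
    raw = sum(term_score(b, df, f, field_norm) for b, df, f in terms)
    return raw * matched / total_query_terms


# (boost, docFreq, termFreq) for each matching term of doc 1373.
TERMS_1373 = [
    (1.0151376, 1551, 1.0),  # indexing
    (1.0395269, 1397, 1.0),  # large
    (1.1161023, 4957, 1.0),  # these
    (1.242054, 586, 1.0),    # simple
    (1.2568251, 550, 1.0),   # list
    (2.0276315, 3720, 4.0),  # retrieval
    (2.5045984, 1642, 2.0),  # document
    (2.63926, 420, 1.0),     # texts
]

score_1373 = document_score(TERMS_1373, field_norm=0.09375,
                            matched=8, total_query_terms=25)
print(round(score_1373, 4))
```

Under these assumptions the result should agree with the 0.20828682 reported for doc 1373 up to small rounding differences, since the listing was presumably produced with single-precision floats while Python uses double precision.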