Document (#5504)

Author
Waller, W.G.
Kraft, D.H.
Title
¬A mathematical model of a weighted Boolean retrieval system
Source
Information processing and management. 15(1979), S.235-245
Year
1979
Abstract
The use of weights to denote a query representation and/or the indexing of a document is analysed as a generalization of a Boolean retrieval system. Criteria are given for the functions used to evaluate the relevance of the records to a specific query, including self-consistency. Various mechnaisms suggested in the literature for evaluating the relevance of records with regard to a given query are tested and found to be less than satisfactory. A new approach is suggested to avoid some of the perils of a weighted Boolean retrieval system

Similar documents (author)

  1. Kraft, A.: Mit silbernen Scheibchen will sich der Buchhandel seine Zukunft vergolden : CD-ROMs sind auch bei der eher innovationsscheuen Branche auf dem Vormarsch, doch Experten warnen vor unübersichtlichem Markt mit minderwertigen Angeboten (1995) 5.87
    5.871439 = sum of:
      5.871439 = weight(author_txt:kraft in 1858) [ClassicSimilarity], result of:
        5.871439 = fieldWeight in 1858, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.394302 = idf(docFreq=9, maxDocs=44218)
          0.625 = fieldNorm(doc=1858)
    
  2. Kraft, U.: Wo Gott wohnt : Religion (2002) 5.87
    5.871439 = sum of:
      5.871439 = weight(author_txt:kraft in 953) [ClassicSimilarity], result of:
        5.871439 = fieldWeight in 953, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.394302 = idf(docFreq=9, maxDocs=44218)
          0.625 = fieldNorm(doc=953)
    
  3. Kraft, M.: Juristische Online-Datenbanken : Eine Einkaufshilfe (2005) 5.87
    5.871439 = sum of:
      5.871439 = weight(author_txt:kraft in 3054) [ClassicSimilarity], result of:
        5.871439 = fieldWeight in 3054, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.394302 = idf(docFreq=9, maxDocs=44218)
          0.625 = fieldNorm(doc=3054)
    
  4. Born, J.; Kraft, U.: Lernen im Schlaf - kein Traum (2004) 4.70
    4.697151 = sum of:
      4.697151 = weight(author_txt:kraft in 2892) [ClassicSimilarity], result of:
        4.697151 = fieldWeight in 2892, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.394302 = idf(docFreq=9, maxDocs=44218)
          0.5 = fieldNorm(doc=2892)
    
  5. Colvin, E.; Kraft, D.H.: Fuzzy retrieval for software reuse (2016) 4.70
    4.697151 = sum of:
      4.697151 = weight(author_txt:kraft in 3119) [ClassicSimilarity], result of:
        4.697151 = fieldWeight in 3119, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.394302 = idf(docFreq=9, maxDocs=44218)
          0.5 = fieldNorm(doc=3119)
    

Similar documents (content)

  1. Petry, F.E.; Buckles, B.P.; Prabhu, D.: Fuzzy information retrieval using genetic algorithms and relevance feedback (1993) 0.45
    0.45231792 = sum of:
      0.45231792 = product of:
        1.2564386 = sum of:
          0.0455447 = weight(abstract_txt:functions in 7962) [ClassicSimilarity], result of:
            0.0455447 = score(doc=7962,freq=1.0), product of:
              0.106337555 = queryWeight, product of:
                1.0288211 = boost
                5.4822793 = idf(docFreq=499, maxDocs=44218)
                0.018853225 = queryNorm
              0.42830306 = fieldWeight in 7962, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4822793 = idf(docFreq=499, maxDocs=44218)
                0.078125 = fieldNorm(doc=7962)
          0.054864008 = weight(abstract_txt:tested in 7962) [ClassicSimilarity], result of:
            0.054864008 = score(doc=7962,freq=1.0), product of:
              0.1203889 = queryWeight, product of:
                1.0946865 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.018853225 = queryNorm
              0.45572314 = fieldWeight in 7962, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.078125 = fieldNorm(doc=7962)
          0.10500913 = weight(abstract_txt:weights in 7962) [ClassicSimilarity], result of:
            0.10500913 = score(doc=7962,freq=1.0), product of:
              0.1855864 = queryWeight, product of:
                1.359157 = boost
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.018853225 = queryNorm
              0.56582344 = fieldWeight in 7962, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.078125 = fieldNorm(doc=7962)
          0.13263053 = weight(abstract_txt:relevance in 7962) [ClassicSimilarity], result of:
            0.13263053 = score(doc=7962,freq=4.0), product of:
              0.17211303 = queryWeight, product of:
                1.8510511 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.018853225 = queryNorm
              0.7706013 = fieldWeight in 7962, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.078125 = fieldNorm(doc=7962)
          0.031802226 = weight(abstract_txt:system in 7962) [ClassicSimilarity], result of:
            0.031802226 = score(doc=7962,freq=1.0), product of:
              0.12070915 = queryWeight, product of:
                1.8985728 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.018853225 = queryNorm
              0.2634616 = fieldWeight in 7962, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.078125 = fieldNorm(doc=7962)
          0.04921611 = weight(abstract_txt:retrieval in 7962) [ClassicSimilarity], result of:
            0.04921611 = score(doc=7962,freq=2.0), product of:
              0.12818289 = queryWeight, product of:
                1.9564655 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.018853225 = queryNorm
              0.38395226 = fieldWeight in 7962, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=7962)
          0.37288165 = weight(abstract_txt:weighted in 7962) [ClassicSimilarity], result of:
            0.37288165 = score(doc=7962,freq=4.0), product of:
              0.34284574 = queryWeight, product of:
                2.6125278 = boost
                6.9606886 = idf(docFreq=113, maxDocs=44218)
                0.018853225 = queryNorm
              1.0876076 = fieldWeight in 7962, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.9606886 = idf(docFreq=113, maxDocs=44218)
                0.078125 = fieldNorm(doc=7962)
          0.17816345 = weight(abstract_txt:query in 7962) [ClassicSimilarity], result of:
            0.17816345 = score(doc=7962,freq=4.0), product of:
              0.23986171 = queryWeight, product of:
                2.6763175 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.018853225 = queryNorm
              0.74277574 = fieldWeight in 7962, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.078125 = fieldNorm(doc=7962)
          0.28632677 = weight(abstract_txt:boolean in 7962) [ClassicSimilarity], result of:
            0.28632677 = score(doc=7962,freq=2.0), product of:
              0.4146355 = queryWeight, product of:
                3.5187664 = boost
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.018853225 = queryNorm
              0.6905505 = fieldWeight in 7962, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.078125 = fieldNorm(doc=7962)
        0.36 = coord(9/25)
    
  2. Bordogna, G.; Pasi, G.: ¬A fuzzy linguistic approach generalizing Boolean information retrieval : a model and its evaluation (1993) 0.23
    0.22914386 = sum of:
      0.22914386 = product of:
        1.1457193 = sum of:
          0.1680146 = weight(abstract_txt:weights in 2569) [ClassicSimilarity], result of:
            0.1680146 = score(doc=2569,freq=1.0), product of:
              0.1855864 = queryWeight, product of:
                1.359157 = boost
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.018853225 = queryNorm
              0.9053175 = fieldWeight in 2569, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.125 = fieldNorm(doc=2569)
          0.078745775 = weight(abstract_txt:retrieval in 2569) [ClassicSimilarity], result of:
            0.078745775 = score(doc=2569,freq=2.0), product of:
              0.12818289 = queryWeight, product of:
                1.9564655 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.018853225 = queryNorm
              0.6143236 = fieldWeight in 2569, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.125 = fieldNorm(doc=2569)
          0.2983053 = weight(abstract_txt:weighted in 2569) [ClassicSimilarity], result of:
            0.2983053 = score(doc=2569,freq=1.0), product of:
              0.34284574 = queryWeight, product of:
                2.6125278 = boost
                6.9606886 = idf(docFreq=113, maxDocs=44218)
                0.018853225 = queryNorm
              0.8700861 = fieldWeight in 2569, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9606886 = idf(docFreq=113, maxDocs=44218)
                0.125 = fieldNorm(doc=2569)
          0.14253077 = weight(abstract_txt:query in 2569) [ClassicSimilarity], result of:
            0.14253077 = score(doc=2569,freq=1.0), product of:
              0.23986171 = queryWeight, product of:
                2.6763175 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.018853225 = queryNorm
              0.5942206 = fieldWeight in 2569, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.125 = fieldNorm(doc=2569)
          0.45812282 = weight(abstract_txt:boolean in 2569) [ClassicSimilarity], result of:
            0.45812282 = score(doc=2569,freq=2.0), product of:
              0.4146355 = queryWeight, product of:
                3.5187664 = boost
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.018853225 = queryNorm
              1.1048808 = fieldWeight in 2569, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.125 = fieldNorm(doc=2569)
        0.2 = coord(5/25)
    
  3. Harman, D.: Ranking algorithms (1992) 0.19
    0.19068165 = sum of:
      0.19068165 = product of:
        0.7945069 = sum of:
          0.09449047 = weight(abstract_txt:records in 3511) [ClassicSimilarity], result of:
            0.09449047 = score(doc=3511,freq=2.0), product of:
              0.1382191 = queryWeight, product of:
                1.658806 = boost
                4.4196396 = idf(docFreq=1446, maxDocs=44218)
                0.018853225 = queryNorm
              0.68362814 = fieldWeight in 3511, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4196396 = idf(docFreq=1446, maxDocs=44218)
                0.109375 = fieldNorm(doc=3511)
          0.092841364 = weight(abstract_txt:relevance in 3511) [ClassicSimilarity], result of:
            0.092841364 = score(doc=3511,freq=1.0), product of:
              0.17211303 = queryWeight, product of:
                1.8510511 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.018853225 = queryNorm
              0.5394209 = fieldWeight in 3511, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.109375 = fieldNorm(doc=3511)
          0.0629652 = weight(abstract_txt:system in 3511) [ClassicSimilarity], result of:
            0.0629652 = score(doc=3511,freq=2.0), product of:
              0.12070915 = queryWeight, product of:
                1.8985728 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.018853225 = queryNorm
              0.52162737 = fieldWeight in 3511, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.109375 = fieldNorm(doc=3511)
          0.08438805 = weight(abstract_txt:retrieval in 3511) [ClassicSimilarity], result of:
            0.08438805 = score(doc=3511,freq=3.0), product of:
              0.12818289 = queryWeight, product of:
                1.9564655 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.018853225 = queryNorm
              0.658341 = fieldWeight in 3511, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.109375 = fieldNorm(doc=3511)
          0.17637283 = weight(abstract_txt:query in 3511) [ClassicSimilarity], result of:
            0.17637283 = score(doc=3511,freq=2.0), product of:
              0.23986171 = queryWeight, product of:
                2.6763175 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.018853225 = queryNorm
              0.73531044 = fieldWeight in 3511, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.109375 = fieldNorm(doc=3511)
          0.28344905 = weight(abstract_txt:boolean in 3511) [ClassicSimilarity], result of:
            0.28344905 = score(doc=3511,freq=1.0), product of:
              0.4146355 = queryWeight, product of:
                3.5187664 = boost
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.018853225 = queryNorm
              0.68361014 = fieldWeight in 3511, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.109375 = fieldNorm(doc=3511)
        0.24 = coord(6/25)
    
  4. Smith, M.P.; Smith, M.: ¬The use of genetic programming to build Boolean queries for text retrieval through relevance feedback (1997) 0.18
    0.18227254 = sum of:
      0.18227254 = product of:
        0.7594689 = sum of:
          0.08122761 = weight(abstract_txt:given in 761) [ClassicSimilarity], result of:
            0.08122761 = score(doc=761,freq=2.0), product of:
              0.15638576 = queryWeight, product of:
                1.7644532 = boost
                4.701121 = idf(docFreq=1091, maxDocs=44218)
                0.018853225 = queryNorm
              0.51940536 = fieldWeight in 761, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.701121 = idf(docFreq=1091, maxDocs=44218)
                0.078125 = fieldNorm(doc=761)
          0.06631526 = weight(abstract_txt:relevance in 761) [ClassicSimilarity], result of:
            0.06631526 = score(doc=761,freq=1.0), product of:
              0.17211303 = queryWeight, product of:
                1.8510511 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.018853225 = queryNorm
              0.38530064 = fieldWeight in 761, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.078125 = fieldNorm(doc=761)
          0.031802226 = weight(abstract_txt:system in 761) [ClassicSimilarity], result of:
            0.031802226 = score(doc=761,freq=1.0), product of:
              0.12070915 = queryWeight, product of:
                1.8985728 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.018853225 = queryNorm
              0.2634616 = fieldWeight in 761, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.078125 = fieldNorm(doc=761)
          0.04921611 = weight(abstract_txt:retrieval in 761) [ClassicSimilarity], result of:
            0.04921611 = score(doc=761,freq=2.0), product of:
              0.12818289 = queryWeight, product of:
                1.9564655 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.018853225 = queryNorm
              0.38395226 = fieldWeight in 761, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=761)
          0.12598059 = weight(abstract_txt:query in 761) [ClassicSimilarity], result of:
            0.12598059 = score(doc=761,freq=2.0), product of:
              0.23986171 = queryWeight, product of:
                2.6763175 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.018853225 = queryNorm
              0.52522177 = fieldWeight in 761, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.078125 = fieldNorm(doc=761)
          0.40492716 = weight(abstract_txt:boolean in 761) [ClassicSimilarity], result of:
            0.40492716 = score(doc=761,freq=4.0), product of:
              0.4146355 = queryWeight, product of:
                3.5187664 = boost
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.018853225 = queryNorm
              0.97658587 = fieldWeight in 761, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.078125 = fieldNorm(doc=761)
        0.24 = coord(6/25)
    
  5. Losee, R.M.: Upper bounds for retrieval performance and their user measuring performance and generating optimal queries : can it get any better than this? (1994) 0.15
    0.15176055 = sum of:
      0.15176055 = product of:
        0.63233566 = sum of:
          0.0455447 = weight(abstract_txt:functions in 7418) [ClassicSimilarity], result of:
            0.0455447 = score(doc=7418,freq=1.0), product of:
              0.106337555 = queryWeight, product of:
                1.0288211 = boost
                5.4822793 = idf(docFreq=499, maxDocs=44218)
                0.018853225 = queryNorm
              0.42830306 = fieldWeight in 7418, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4822793 = idf(docFreq=499, maxDocs=44218)
                0.078125 = fieldNorm(doc=7418)
          0.05743659 = weight(abstract_txt:given in 7418) [ClassicSimilarity], result of:
            0.05743659 = score(doc=7418,freq=1.0), product of:
              0.15638576 = queryWeight, product of:
                1.7644532 = boost
                4.701121 = idf(docFreq=1091, maxDocs=44218)
                0.018853225 = queryNorm
              0.36727506 = fieldWeight in 7418, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.701121 = idf(docFreq=1091, maxDocs=44218)
                0.078125 = fieldNorm(doc=7418)
          0.031802226 = weight(abstract_txt:system in 7418) [ClassicSimilarity], result of:
            0.031802226 = score(doc=7418,freq=1.0), product of:
              0.12070915 = queryWeight, product of:
                1.8985728 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.018853225 = queryNorm
              0.2634616 = fieldWeight in 7418, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.078125 = fieldNorm(doc=7418)
          0.08524481 = weight(abstract_txt:retrieval in 7418) [ClassicSimilarity], result of:
            0.08524481 = score(doc=7418,freq=6.0), product of:
              0.12818289 = queryWeight, product of:
                1.9564655 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.018853225 = queryNorm
              0.6650249 = fieldWeight in 7418, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=7418)
          0.12598059 = weight(abstract_txt:query in 7418) [ClassicSimilarity], result of:
            0.12598059 = score(doc=7418,freq=2.0), product of:
              0.23986171 = queryWeight, product of:
                2.6763175 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.018853225 = queryNorm
              0.52522177 = fieldWeight in 7418, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.078125 = fieldNorm(doc=7418)
          0.28632677 = weight(abstract_txt:boolean in 7418) [ClassicSimilarity], result of:
            0.28632677 = score(doc=7418,freq=2.0), product of:
              0.4146355 = queryWeight, product of:
                3.5187664 = boost
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.018853225 = queryNorm
              0.6905505 = fieldWeight in 7418, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.078125 = fieldNorm(doc=7418)
        0.24 = coord(6/25)