Document (#21513)

Author
Fox, E.
Betrabet, S.
Koushik, M.
Lee, W.
Title
Extended Boolean models
Source
Information retrieval: data structures and algorithms. Ed.: W.B. Frakes and R. Baeza-Yates
Imprint
Englewood Cliffs, NJ : Prentice Hall
Year
1992
Pages
pp. 393-418
Abstract
The classical interpretation of Boolean operators in an information retrieval system is in general too strict. A standard Boolean query rarely comes close to retrieving all and only those documents which are relevant to a query. Many models have been proposed with the aim of softening the interpretation of the Boolean operators in order to improve the precision and recall of the search results. This chapter discusses three such models: the Mixed Min and Max (MMM), the Paice, and the P-norm models. The MMM and Paice models are essentially variations of the classical fuzzy-set model, while the P-norm scheme is a distance-based approach. Our experimental results indicate that each of the above models provides better performance than the classical Boolean model in terms of retrieval effectiveness.
Theme
Retrieval algorithms
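
The three models named in the abstract can be illustrated with a minimal sketch. It uses the formulas as they are commonly stated for these models, not code from the chapter itself. The function names, the default coefficients (`c=0.6`, `r=0.7`, `p=2.0`) and the sample weights are hypothetical choices for illustration only. Each function takes the document's term weights in [0, 1] for the terms of one AND or OR clause.

```python
def mmm_or(d, c=0.6):
    # MMM: OR is a mix of max and min, weighted toward max (c is an assumed coefficient).
    return c * max(d) + (1 - c) * min(d)

def mmm_and(d, c=0.6):
    # MMM: AND is a mix of min and max, weighted toward min.
    return c * min(d) + (1 - c) * max(d)

def paice(d, r, descending):
    # Paice: all term weights contribute, with geometrically decaying
    # influence r**i after sorting (descending for OR, ascending for AND).
    ordered = sorted(d, reverse=descending)
    num = sum((r ** i) * w for i, w in enumerate(ordered))
    den = sum(r ** i for i in range(len(ordered)))
    return num / den

def paice_or(d, r=0.7):
    return paice(d, r, descending=True)

def paice_and(d, r=0.7):
    return paice(d, r, descending=False)

def pnorm_or(d, p=2.0):
    # P-norm OR: normalized distance from the all-zero point
    # (unweighted query terms assumed).
    return (sum(w ** p for w in d) / len(d)) ** (1 / p)

def pnorm_and(d, p=2.0):
    # P-norm AND: one minus the normalized distance from the all-one point.
    return 1 - (sum((1 - w) ** p for w in d) / len(d)) ** (1 / p)

weights = [0.2, 0.8]  # hypothetical document weights for two query terms
for name, fn in [("MMM", (mmm_or, mmm_and)),
                 ("Paice", (paice_or, paice_and)),
                 ("P-norm", (pnorm_or, pnorm_and))]:
    print(name, "OR:", round(fn[0](weights), 4), "AND:", round(fn[1](weights), 4))
```

With strict Boolean logic the same document would score 1 for OR and 0 for AND once the weights are thresholded. The softened operators instead return intermediate values, which is what allows documents to be ranked.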

Similar documents (content)

  1. Lee, J.H.; Kim, M.H.; Lee, Y.J.: Information retrieval based on conceptual distance in is-a hierarchies (1993) 0.26
    0.26296702 = sum of:
      0.26296702 = product of:
        0.9391679 = sum of:
          0.070883125 = weight(abstract_txt:extended in 6729) [ClassicSimilarity], result of:
            0.070883125 = score(doc=6729,freq=2.0), product of:
              0.105327845 = queryWeight, product of:
                1.1024216 = boost
                6.091085 = idf(docFreq=271, maxDocs=44218)
                0.015685588 = queryNorm
              0.67297614 = fieldWeight in 6729, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.091085 = idf(docFreq=271, maxDocs=44218)
                0.078125 = fieldNorm(doc=6729)
          0.059804145 = weight(abstract_txt:distance in 6729) [ClassicSimilarity], result of:
            0.059804145 = score(doc=6729,freq=1.0), product of:
              0.118489206 = queryWeight, product of:
                1.169272 = boost
                6.4604454 = idf(docFreq=187, maxDocs=44218)
                0.015685588 = queryNorm
              0.5047223 = fieldWeight in 6729, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4604454 = idf(docFreq=187, maxDocs=44218)
                0.078125 = fieldNorm(doc=6729)
          0.018616179 = weight(abstract_txt:retrieval in 6729) [ClassicSimilarity], result of:
            0.018616179 = score(doc=6729,freq=1.0), product of:
              0.06856908 = queryWeight, product of:
                1.2579266 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.015685588 = queryNorm
              0.27149525 = fieldWeight in 6729, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=6729)
          0.039735366 = weight(abstract_txt:model in 6729) [ClassicSimilarity], result of:
            0.039735366 = score(doc=6729,freq=2.0), product of:
              0.09022137 = queryWeight, product of:
                1.4429319 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.015685588 = queryNorm
              0.44042078 = fieldWeight in 6729, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.078125 = fieldNorm(doc=6729)
          0.047652632 = weight(abstract_txt:query in 6729) [ClassicSimilarity], result of:
            0.047652632 = score(doc=6729,freq=1.0), product of:
              0.12830961 = queryWeight, product of:
                1.7207617 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.015685588 = queryNorm
              0.37138787 = fieldWeight in 6729, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.078125 = fieldNorm(doc=6729)
          0.16095568 = weight(abstract_txt:operators in 6729) [ClassicSimilarity], result of:
            0.16095568 = score(doc=6729,freq=1.0), product of:
              0.28884986 = queryWeight, product of:
                2.5818274 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.015685588 = queryNorm
              0.5572296 = fieldWeight in 6729, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.078125 = fieldNorm(doc=6729)
          0.5415208 = weight(abstract_txt:boolean in 6729) [ClassicSimilarity], result of:
            0.5415208 = score(doc=6729,freq=4.0), product of:
              0.554504 = queryWeight, product of:
                5.656053 = boost
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.015685588 = queryNorm
              0.97658587 = fieldWeight in 6729, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.078125 = fieldNorm(doc=6729)
        0.28 = coord(7/25)
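
The arithmetic in the explanation tree above can be reproduced with a short sketch. It assumes the TF-IDF formulas that match the printed numbers: tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = boost * idf * queryNorm, and fieldWeight = tf * idf * fieldNorm. The function and variable names are illustrative. All input values are copied from the listing for doc 6729.

```python
import math

def idf(doc_freq, max_docs):
    # Inverse document frequency in the form that reproduces the listed values.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    # score = queryWeight * fieldWeight for one matching term
    tf = math.sqrt(freq)
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = tf * term_idf * field_norm
    return query_weight * field_weight

MAX_DOCS = 44218
QUERY_NORM = 0.015685588
FIELD_NORM = 0.078125  # fieldNorm(doc=6729)

# (term, termFreq, docFreq, boost) as printed for doc 6729
terms = [
    ("extended", 2.0, 271, 1.1024216),
    ("distance", 1.0, 187, 1.169272),
    ("retrieval", 1.0, 3720, 1.2579266),
    ("model", 2.0, 2231, 1.4429319),
    ("query", 1.0, 1035, 1.7207617),
    ("operators", 1.0, 95, 2.5818274),
    ("boolean", 4.0, 231, 5.656053),
]

total = sum(
    term_score(freq, df, MAX_DOCS, boost, QUERY_NORM, FIELD_NORM)
    for _, freq, df, boost in terms
)
coord = 7 / 25          # 7 of the 25 query terms matched
score = total * coord   # close to the listed 0.26296702
print(round(total, 4), round(score, 4))
```

Small differences in the last digits are expected, because the listing was produced with single-precision floats while Python uses double precision.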
    
  2. Lucas, W.; Topi, H.: Form and function : the impact of query term and operator usage on Web search results (2002) 0.22
    0.21868889 = sum of:
      0.21868889 = product of:
        0.6834028 = sum of:
          0.014892944 = weight(abstract_txt:retrieval in 198) [ClassicSimilarity], result of:
            0.014892944 = score(doc=198,freq=1.0), product of:
              0.06856908 = queryWeight, product of:
                1.2579266 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.015685588 = queryNorm
              0.21719621 = fieldWeight in 198, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=198)
          0.029973544 = weight(abstract_txt:results in 198) [ClassicSimilarity], result of:
            0.029973544 = score(doc=198,freq=4.0), product of:
              0.068856776 = queryWeight, product of:
                1.2605628 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.015685588 = queryNorm
              0.43530276 = fieldWeight in 198, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.0625 = fieldNorm(doc=198)
          0.063283145 = weight(abstract_txt:rarely in 198) [ClassicSimilarity], result of:
            0.063283145 = score(doc=198,freq=1.0), product of:
              0.14277647 = queryWeight, product of:
                1.2835253 = boost
                7.0917172 = idf(docFreq=99, maxDocs=44218)
                0.015685588 = queryNorm
              0.44323233 = fieldWeight in 198, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0917172 = idf(docFreq=99, maxDocs=44218)
                0.0625 = fieldNorm(doc=198)
          0.022477718 = weight(abstract_txt:model in 198) [ClassicSimilarity], result of:
            0.022477718 = score(doc=198,freq=1.0), product of:
              0.09022137 = queryWeight, product of:
                1.4429319 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.015685588 = queryNorm
              0.24913962 = fieldWeight in 198, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.0625 = fieldNorm(doc=198)
          0.038122106 = weight(abstract_txt:query in 198) [ClassicSimilarity], result of:
            0.038122106 = score(doc=198,freq=1.0), product of:
              0.12830961 = queryWeight, product of:
                1.7207617 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.015685588 = queryNorm
              0.2971103 = fieldWeight in 198, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.0625 = fieldNorm(doc=198)
          0.07501831 = weight(abstract_txt:interpretation in 198) [ClassicSimilarity], result of:
            0.07501831 = score(doc=198,freq=1.0), product of:
              0.20148967 = queryWeight, product of:
                2.1563413 = boost
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.015685588 = queryNorm
              0.3723184 = fieldWeight in 198, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.0625 = fieldNorm(doc=198)
          0.22302674 = weight(abstract_txt:operators in 198) [ClassicSimilarity], result of:
            0.22302674 = score(doc=198,freq=3.0), product of:
              0.28884986 = queryWeight, product of:
                2.5818274 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.015685588 = queryNorm
              0.77211994 = fieldWeight in 198, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.0625 = fieldNorm(doc=198)
          0.2166083 = weight(abstract_txt:boolean in 198) [ClassicSimilarity], result of:
            0.2166083 = score(doc=198,freq=1.0), product of:
              0.554504 = queryWeight, product of:
                5.656053 = boost
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.015685588 = queryNorm
              0.39063436 = fieldWeight in 198, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.0625 = fieldNorm(doc=198)
        0.32 = coord(8/25)
    
  3. Petry, F.E.; Buckles, B.P.; Prabhu, D.: Fuzzy information retrieval using genetic algorithms and relevance feedback (1993) 0.20
    0.19695686 = sum of:
      0.19695686 = product of:
        0.70341736 = sum of:
          0.037409745 = weight(abstract_txt:precision in 7962) [ClassicSimilarity], result of:
            0.037409745 = score(doc=7962,freq=1.0), product of:
              0.0866658 = queryWeight, product of:
                5.5251865 = idf(docFreq=478, maxDocs=44218)
                0.015685588 = queryNorm
              0.4316552 = fieldWeight in 7962, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5251865 = idf(docFreq=478, maxDocs=44218)
                0.078125 = fieldNorm(doc=7962)
          0.0421393 = weight(abstract_txt:recall in 7962) [ClassicSimilarity], result of:
            0.0421393 = score(doc=7962,freq=1.0), product of:
              0.093824476 = queryWeight, product of:
                1.0404811 = boost
                5.7488523 = idf(docFreq=382, maxDocs=44218)
                0.015685588 = queryNorm
              0.44912907 = fieldWeight in 7962, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7488523 = idf(docFreq=382, maxDocs=44218)
                0.078125 = fieldNorm(doc=7962)
          0.10058935 = weight(abstract_txt:fuzzy in 7962) [ClassicSimilarity], result of:
            0.10058935 = score(doc=7962,freq=2.0), product of:
              0.1330095 = queryWeight, product of:
                1.2388463 = boost
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.015685588 = queryNorm
              0.75625694 = fieldWeight in 7962, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.078125 = fieldNorm(doc=7962)
          0.026327252 = weight(abstract_txt:retrieval in 7962) [ClassicSimilarity], result of:
            0.026327252 = score(doc=7962,freq=2.0), product of:
              0.06856908 = queryWeight, product of:
                1.2579266 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.015685588 = queryNorm
              0.38395226 = fieldWeight in 7962, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=7962)
          0.018733466 = weight(abstract_txt:results in 7962) [ClassicSimilarity], result of:
            0.018733466 = score(doc=7962,freq=1.0), product of:
              0.068856776 = queryWeight, product of:
                1.2605628 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.015685588 = queryNorm
              0.27206424 = fieldWeight in 7962, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.078125 = fieldNorm(doc=7962)
          0.095305264 = weight(abstract_txt:query in 7962) [ClassicSimilarity], result of:
            0.095305264 = score(doc=7962,freq=4.0), product of:
              0.12830961 = queryWeight, product of:
                1.7207617 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.015685588 = queryNorm
              0.74277574 = fieldWeight in 7962, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.078125 = fieldNorm(doc=7962)
          0.382913 = weight(abstract_txt:boolean in 7962) [ClassicSimilarity], result of:
            0.382913 = score(doc=7962,freq=2.0), product of:
              0.554504 = queryWeight, product of:
                5.656053 = boost
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.015685588 = queryNorm
              0.6905505 = fieldWeight in 7962, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.078125 = fieldNorm(doc=7962)
        0.28 = coord(7/25)
    
  4. Kim, Y.W.; Kim, J.H.: A model of knowledge based information retrieval with hierarchical concept graph (1990) 0.19
    0.18608083 = sum of:
      0.18608083 = product of:
        0.6645744 = sum of:
          0.059804145 = weight(abstract_txt:distance in 3909) [ClassicSimilarity], result of:
            0.059804145 = score(doc=3909,freq=1.0), product of:
              0.118489206 = queryWeight, product of:
                1.169272 = boost
                6.4604454 = idf(docFreq=187, maxDocs=44218)
                0.015685588 = queryNorm
              0.5047223 = fieldWeight in 3909, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4604454 = idf(docFreq=187, maxDocs=44218)
                0.078125 = fieldNorm(doc=3909)
          0.018616179 = weight(abstract_txt:retrieval in 3909) [ClassicSimilarity], result of:
            0.018616179 = score(doc=3909,freq=1.0), product of:
              0.06856908 = queryWeight, product of:
                1.2579266 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.015685588 = queryNorm
              0.27149525 = fieldWeight in 3909, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=3909)
          0.032447316 = weight(abstract_txt:results in 3909) [ClassicSimilarity], result of:
            0.032447316 = score(doc=3909,freq=3.0), product of:
              0.068856776 = queryWeight, product of:
                1.2605628 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.015685588 = queryNorm
              0.47122908 = fieldWeight in 3909, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.078125 = fieldNorm(doc=3909)
          0.07433806 = weight(abstract_txt:model in 3909) [ClassicSimilarity], result of:
            0.07433806 = score(doc=3909,freq=7.0), product of:
              0.09022137 = queryWeight, product of:
                1.4429319 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.015685588 = queryNorm
              0.82395184 = fieldWeight in 3909, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.078125 = fieldNorm(doc=3909)
          0.047652632 = weight(abstract_txt:query in 3909) [ClassicSimilarity], result of:
            0.047652632 = score(doc=3909,freq=1.0), product of:
              0.12830961 = queryWeight, product of:
                1.7207617 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.015685588 = queryNorm
              0.37138787 = fieldWeight in 3909, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.078125 = fieldNorm(doc=3909)
          0.16095568 = weight(abstract_txt:operators in 3909) [ClassicSimilarity], result of:
            0.16095568 = score(doc=3909,freq=1.0), product of:
              0.28884986 = queryWeight, product of:
                2.5818274 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.015685588 = queryNorm
              0.5572296 = fieldWeight in 3909, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.078125 = fieldNorm(doc=3909)
          0.2707604 = weight(abstract_txt:boolean in 3909) [ClassicSimilarity], result of:
            0.2707604 = score(doc=3909,freq=1.0), product of:
              0.554504 = queryWeight, product of:
                5.656053 = boost
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.015685588 = queryNorm
              0.48829293 = fieldWeight in 3909, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.078125 = fieldNorm(doc=3909)
        0.28 = coord(7/25)
    
  5. Losee, R.M.: Upper bounds for retrieval performance and their use measuring performance and generating optimal queries : can it get any better than this? (1994) 0.18
    0.1825651 = sum of:
      0.1825651 = product of:
        0.6520182 = sum of:
          0.037409745 = weight(abstract_txt:precision in 7418) [ClassicSimilarity], result of:
            0.037409745 = score(doc=7418,freq=1.0), product of:
              0.0866658 = queryWeight, product of:
                5.5251865 = idf(docFreq=478, maxDocs=44218)
                0.015685588 = queryNorm
              0.4316552 = fieldWeight in 7418, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5251865 = idf(docFreq=478, maxDocs=44218)
                0.078125 = fieldNorm(doc=7418)
          0.0421393 = weight(abstract_txt:recall in 7418) [ClassicSimilarity], result of:
            0.0421393 = score(doc=7418,freq=1.0), product of:
              0.093824476 = queryWeight, product of:
                1.0404811 = boost
                5.7488523 = idf(docFreq=382, maxDocs=44218)
                0.015685588 = queryNorm
              0.44912907 = fieldWeight in 7418, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7488523 = idf(docFreq=382, maxDocs=44218)
                0.078125 = fieldNorm(doc=7418)
          0.057831563 = weight(abstract_txt:close in 7418) [ClassicSimilarity], result of:
            0.057831563 = score(doc=7418,freq=1.0), product of:
              0.11586917 = queryWeight, product of:
                1.1562722 = boost
                6.3886194 = idf(docFreq=201, maxDocs=44218)
                0.015685588 = queryNorm
              0.49911088 = fieldWeight in 7418, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3886194 = idf(docFreq=201, maxDocs=44218)
                0.078125 = fieldNorm(doc=7418)
          0.045600142 = weight(abstract_txt:retrieval in 7418) [ClassicSimilarity], result of:
            0.045600142 = score(doc=7418,freq=6.0), product of:
              0.06856908 = queryWeight, product of:
                1.2579266 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.015685588 = queryNorm
              0.6650249 = fieldWeight in 7418, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=7418)
          0.018733466 = weight(abstract_txt:results in 7418) [ClassicSimilarity], result of:
            0.018733466 = score(doc=7418,freq=1.0), product of:
              0.068856776 = queryWeight, product of:
                1.2605628 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.015685588 = queryNorm
              0.27206424 = fieldWeight in 7418, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.078125 = fieldNorm(doc=7418)
          0.067391 = weight(abstract_txt:query in 7418) [ClassicSimilarity], result of:
            0.067391 = score(doc=7418,freq=2.0), product of:
              0.12830961 = queryWeight, product of:
                1.7207617 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.015685588 = queryNorm
              0.52522177 = fieldWeight in 7418, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.078125 = fieldNorm(doc=7418)
          0.382913 = weight(abstract_txt:boolean in 7418) [ClassicSimilarity], result of:
            0.382913 = score(doc=7418,freq=2.0), product of:
              0.554504 = queryWeight, product of:
                5.656053 = boost
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.015685588 = queryNorm
              0.6905505 = fieldWeight in 7418, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.078125 = fieldNorm(doc=7418)
        0.28 = coord(7/25)