Document (#5504)

Author
Waller, W.G.
Kraft, D.H.
Title
¬A mathematical model of a weighted Boolean retrieval system
Source
Information processing and management. 15(1979), S.235-245
Year
1979
Abstract
The use of weights to denote a query representation and/or the indexing of a document is analysed as a generalization of a Boolean retrieval system. Criteria are given for the functions used to evaluate the relevance of the records to a specific query, including self-consistency. Various mechnaisms suggested in the literature for evaluating the relevance of records with regard to a given query are tested and found to be less than satisfactory. A new approach is suggested to avoid some of the perils of a weighted Boolean retrieval system

Similar documents (author)

  1. Kraft, A.: Mit silbernen Scheibchen will sich der Buchhandel seine Zukunft vergolden : CD-ROMs sind auch bei der eher innovationsscheuen Branche auf dem Vormarsch, doch Experten warnen vor unübersichtlichem Markt mit minderwertigen Angeboten (1995) 5.85
    5.850191 = sum of:
      5.850191 = weight(author_txt:kraft in 1927) [ClassicSimilarity], result of:
        5.850191 = fieldWeight in 1927, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.360306 = idf(docFreq=9, maxDocs=42740)
          0.625 = fieldNorm(doc=1927)
    
  2. Kraft, U.: Wo Gott wohnt : Religion (2002) 5.85
    5.850191 = sum of:
      5.850191 = weight(author_txt:kraft in 1954) [ClassicSimilarity], result of:
        5.850191 = fieldWeight in 1954, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.360306 = idf(docFreq=9, maxDocs=42740)
          0.625 = fieldNorm(doc=1954)
    
  3. Kraft, M.: Juristische Online-Datenbanken : Eine Einkaufshilfe (2005) 5.85
    5.850191 = sum of:
      5.850191 = weight(author_txt:kraft in 4055) [ClassicSimilarity], result of:
        5.850191 = fieldWeight in 4055, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.360306 = idf(docFreq=9, maxDocs=42740)
          0.625 = fieldNorm(doc=4055)
    
  4. Born, J.; Kraft, U.: Lernen im Schlaf - kein Traum (2004) 4.68
    4.680153 = sum of:
      4.680153 = weight(author_txt:kraft in 3893) [ClassicSimilarity], result of:
        4.680153 = fieldWeight in 3893, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.360306 = idf(docFreq=9, maxDocs=42740)
          0.5 = fieldNorm(doc=3893)
    
  5. Colvin, E.; Kraft, D.H.: Fuzzy retrieval for software reuse (2016) 4.68
    4.680153 = sum of:
      4.680153 = weight(author_txt:kraft in 5120) [ClassicSimilarity], result of:
        4.680153 = fieldWeight in 5120, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.360306 = idf(docFreq=9, maxDocs=42740)
          0.5 = fieldNorm(doc=5120)
    

Similar documents (content)

  1. Petry, F.E.; Buckles, B.P.; Prabhu, D.: Fuzzy information retrieval using genetic algorithms and relevance feedback (1993) 0.45
    0.45060456 = sum of:
      0.45060456 = product of:
        1.2516793 = sum of:
          0.046069883 = weight(abstract_txt:functions in 7962) [ClassicSimilarity], result of:
            0.046069883 = score(doc=7962,freq=1.0), product of:
              0.10722545 = queryWeight, product of:
                1.0238776 = boost
                5.4995756 = idf(docFreq=474, maxDocs=42740)
                0.019042354 = queryNorm
              0.42965436 = fieldWeight in 7962, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4995756 = idf(docFreq=474, maxDocs=42740)
                0.078125 = fieldNorm(doc=7962)
          0.055584565 = weight(abstract_txt:tested in 7962) [ClassicSimilarity], result of:
            0.055584565 = score(doc=7962,freq=1.0), product of:
              0.12152229 = queryWeight, product of:
                1.0900015 = boost
                5.8547482 = idf(docFreq=332, maxDocs=42740)
                0.019042354 = queryNorm
              0.4574022 = fieldWeight in 7962, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8547482 = idf(docFreq=332, maxDocs=42740)
                0.078125 = fieldNorm(doc=7962)
          0.10635388 = weight(abstract_txt:weights in 7962) [ClassicSimilarity], result of:
            0.10635388 = score(doc=7962,freq=1.0), product of:
              0.18729322 = queryWeight, product of:
                1.3531942 = boost
                7.268441 = idf(docFreq=80, maxDocs=42740)
                0.019042354 = queryNorm
              0.56784695 = fieldWeight in 7962, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.268441 = idf(docFreq=80, maxDocs=42740)
                0.078125 = fieldNorm(doc=7962)
          0.13309327 = weight(abstract_txt:relevance in 7962) [ClassicSimilarity], result of:
            0.13309327 = score(doc=7962,freq=4.0), product of:
              0.17262906 = queryWeight, product of:
                1.8372619 = boost
                4.934262 = idf(docFreq=835, maxDocs=42740)
                0.019042354 = queryNorm
              0.7709784 = fieldWeight in 7962, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.934262 = idf(docFreq=835, maxDocs=42740)
                0.078125 = fieldNorm(doc=7962)
          0.03161326 = weight(abstract_txt:system in 7962) [ClassicSimilarity], result of:
            0.03161326 = score(doc=7962,freq=1.0), product of:
              0.1203113 = queryWeight, product of:
                1.8785076 = boost
                3.3633559 = idf(docFreq=4021, maxDocs=42740)
                0.019042354 = queryNorm
              0.2627622 = fieldWeight in 7962, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3633559 = idf(docFreq=4021, maxDocs=42740)
                0.078125 = fieldNorm(doc=7962)
          0.048876576 = weight(abstract_txt:retrieval in 7962) [ClassicSimilarity], result of:
            0.048876576 = score(doc=7962,freq=2.0), product of:
              0.1276784 = queryWeight, product of:
                1.935167 = boost
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.019042354 = queryNorm
              0.3828101 = fieldWeight in 7962, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.078125 = fieldNorm(doc=7962)
          0.37101558 = weight(abstract_txt:weighted in 7962) [ClassicSimilarity], result of:
            0.37101558 = score(doc=7962,freq=4.0), product of:
              0.34193057 = queryWeight, product of:
                2.5857294 = boost
                6.9443917 = idf(docFreq=111, maxDocs=42740)
                0.019042354 = queryNorm
              1.0850612 = fieldWeight in 7962, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.9443917 = idf(docFreq=111, maxDocs=42740)
                0.078125 = fieldNorm(doc=7962)
          0.17623541 = weight(abstract_txt:query in 7962) [ClassicSimilarity], result of:
            0.17623541 = score(doc=7962,freq=4.0), product of:
              0.2382881 = queryWeight, product of:
                2.6436925 = boost
                4.7333736 = idf(docFreq=1021, maxDocs=42740)
                0.019042354 = queryNorm
              0.73958963 = fieldWeight in 7962, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.7333736 = idf(docFreq=1021, maxDocs=42740)
                0.078125 = fieldNorm(doc=7962)
          0.28283688 = weight(abstract_txt:boolean in 7962) [ClassicSimilarity], result of:
            0.28283688 = score(doc=7962,freq=2.0), product of:
              0.41153577 = queryWeight, product of:
                3.4742699 = boost
                6.220473 = idf(docFreq=230, maxDocs=42740)
                0.019042354 = queryNorm
              0.68727165 = fieldWeight in 7962, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.220473 = idf(docFreq=230, maxDocs=42740)
                0.078125 = fieldNorm(doc=7962)
        0.36 = coord(9/25)
    
  2. Bordogna, G.; Pasi, G.: ¬A fuzzy linguistic approach generalizing Boolean information retrieval : a model and its evaluation (1993) 0.23
    0.22774172 = sum of:
      0.22774172 = product of:
        1.1387086 = sum of:
          0.17016621 = weight(abstract_txt:weights in 3570) [ClassicSimilarity], result of:
            0.17016621 = score(doc=3570,freq=1.0), product of:
              0.18729322 = queryWeight, product of:
                1.3531942 = boost
                7.268441 = idf(docFreq=80, maxDocs=42740)
                0.019042354 = queryNorm
              0.90855515 = fieldWeight in 3570, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.268441 = idf(docFreq=80, maxDocs=42740)
                0.125 = fieldNorm(doc=3570)
          0.07820252 = weight(abstract_txt:retrieval in 3570) [ClassicSimilarity], result of:
            0.07820252 = score(doc=3570,freq=2.0), product of:
              0.1276784 = queryWeight, product of:
                1.935167 = boost
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.019042354 = queryNorm
              0.61249614 = fieldWeight in 3570, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.125 = fieldNorm(doc=3570)
          0.29681247 = weight(abstract_txt:weighted in 3570) [ClassicSimilarity], result of:
            0.29681247 = score(doc=3570,freq=1.0), product of:
              0.34193057 = queryWeight, product of:
                2.5857294 = boost
                6.9443917 = idf(docFreq=111, maxDocs=42740)
                0.019042354 = queryNorm
              0.86804897 = fieldWeight in 3570, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9443917 = idf(docFreq=111, maxDocs=42740)
                0.125 = fieldNorm(doc=3570)
          0.14098834 = weight(abstract_txt:query in 3570) [ClassicSimilarity], result of:
            0.14098834 = score(doc=3570,freq=1.0), product of:
              0.2382881 = queryWeight, product of:
                2.6436925 = boost
                4.7333736 = idf(docFreq=1021, maxDocs=42740)
                0.019042354 = queryNorm
              0.5916717 = fieldWeight in 3570, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7333736 = idf(docFreq=1021, maxDocs=42740)
                0.125 = fieldNorm(doc=3570)
          0.452539 = weight(abstract_txt:boolean in 3570) [ClassicSimilarity], result of:
            0.452539 = score(doc=3570,freq=2.0), product of:
              0.41153577 = queryWeight, product of:
                3.4742699 = boost
                6.220473 = idf(docFreq=230, maxDocs=42740)
                0.019042354 = queryNorm
              1.0996346 = fieldWeight in 3570, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.220473 = idf(docFreq=230, maxDocs=42740)
                0.125 = fieldNorm(doc=3570)
        0.2 = coord(5/25)
    
  3. Harman, D.: Ranking algorithms (1992) 0.19
    0.18925133 = sum of:
      0.18925133 = product of:
        0.7885472 = sum of:
          0.09452664 = weight(abstract_txt:records in 4512) [ClassicSimilarity], result of:
            0.09452664 = score(doc=4512,freq=2.0), product of:
              0.13834728 = queryWeight, product of:
                1.6447482 = boost
                4.4172354 = idf(docFreq=1401, maxDocs=42740)
                0.019042354 = queryNorm
              0.6832562 = fieldWeight in 4512, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4172354 = idf(docFreq=1401, maxDocs=42740)
                0.109375 = fieldNorm(doc=4512)
          0.09316529 = weight(abstract_txt:relevance in 4512) [ClassicSimilarity], result of:
            0.09316529 = score(doc=4512,freq=1.0), product of:
              0.17262906 = queryWeight, product of:
                1.8372619 = boost
                4.934262 = idf(docFreq=835, maxDocs=42740)
                0.019042354 = queryNorm
              0.5396849 = fieldWeight in 4512, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.934262 = idf(docFreq=835, maxDocs=42740)
                0.109375 = fieldNorm(doc=4512)
          0.06259106 = weight(abstract_txt:system in 4512) [ClassicSimilarity], result of:
            0.06259106 = score(doc=4512,freq=2.0), product of:
              0.1203113 = queryWeight, product of:
                1.8785076 = boost
                3.3633559 = idf(docFreq=4021, maxDocs=42740)
                0.019042354 = queryNorm
              0.5202426 = fieldWeight in 4512, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3633559 = idf(docFreq=4021, maxDocs=42740)
                0.109375 = fieldNorm(doc=4512)
          0.08380587 = weight(abstract_txt:retrieval in 4512) [ClassicSimilarity], result of:
            0.08380587 = score(doc=4512,freq=3.0), product of:
              0.1276784 = queryWeight, product of:
                1.935167 = boost
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.019042354 = queryNorm
              0.6563825 = fieldWeight in 4512, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.109375 = fieldNorm(doc=4512)
          0.17446417 = weight(abstract_txt:query in 4512) [ClassicSimilarity], result of:
            0.17446417 = score(doc=4512,freq=2.0), product of:
              0.2382881 = queryWeight, product of:
                2.6436925 = boost
                4.7333736 = idf(docFreq=1021, maxDocs=42740)
                0.019042354 = queryNorm
              0.7321564 = fieldWeight in 4512, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7333736 = idf(docFreq=1021, maxDocs=42740)
                0.109375 = fieldNorm(doc=4512)
          0.2799942 = weight(abstract_txt:boolean in 4512) [ClassicSimilarity], result of:
            0.2799942 = score(doc=4512,freq=1.0), product of:
              0.41153577 = queryWeight, product of:
                3.4742699 = boost
                6.220473 = idf(docFreq=230, maxDocs=42740)
                0.019042354 = queryNorm
              0.6803642 = fieldWeight in 4512, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.220473 = idf(docFreq=230, maxDocs=42740)
                0.109375 = fieldNorm(doc=4512)
        0.24 = coord(6/25)
    
  4. Smith, M.P.; Smith, M.: ¬The use of genetic programming to build Boolean queries for text retrieval through relevance feedback (1997) 0.18
    0.18074659 = sum of:
      0.18074659 = product of:
        0.75311077 = sum of:
          0.08146533 = weight(abstract_txt:given in 1762) [ClassicSimilarity], result of:
            0.08146533 = score(doc=1762,freq=2.0), product of:
              0.15679602 = queryWeight, product of:
                1.7509818 = boost
                4.702543 = idf(docFreq=1053, maxDocs=42740)
                0.019042354 = queryNorm
              0.5195625 = fieldWeight in 1762, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.702543 = idf(docFreq=1053, maxDocs=42740)
                0.078125 = fieldNorm(doc=1762)
          0.066546634 = weight(abstract_txt:relevance in 1762) [ClassicSimilarity], result of:
            0.066546634 = score(doc=1762,freq=1.0), product of:
              0.17262906 = queryWeight, product of:
                1.8372619 = boost
                4.934262 = idf(docFreq=835, maxDocs=42740)
                0.019042354 = queryNorm
              0.3854892 = fieldWeight in 1762, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.934262 = idf(docFreq=835, maxDocs=42740)
                0.078125 = fieldNorm(doc=1762)
          0.03161326 = weight(abstract_txt:system in 1762) [ClassicSimilarity], result of:
            0.03161326 = score(doc=1762,freq=1.0), product of:
              0.1203113 = queryWeight, product of:
                1.8785076 = boost
                3.3633559 = idf(docFreq=4021, maxDocs=42740)
                0.019042354 = queryNorm
              0.2627622 = fieldWeight in 1762, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3633559 = idf(docFreq=4021, maxDocs=42740)
                0.078125 = fieldNorm(doc=1762)
          0.048876576 = weight(abstract_txt:retrieval in 1762) [ClassicSimilarity], result of:
            0.048876576 = score(doc=1762,freq=2.0), product of:
              0.1276784 = queryWeight, product of:
                1.935167 = boost
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.019042354 = queryNorm
              0.3828101 = fieldWeight in 1762, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.078125 = fieldNorm(doc=1762)
          0.12461725 = weight(abstract_txt:query in 1762) [ClassicSimilarity], result of:
            0.12461725 = score(doc=1762,freq=2.0), product of:
              0.2382881 = queryWeight, product of:
                2.6436925 = boost
                4.7333736 = idf(docFreq=1021, maxDocs=42740)
                0.019042354 = queryNorm
              0.5229688 = fieldWeight in 1762, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7333736 = idf(docFreq=1021, maxDocs=42740)
                0.078125 = fieldNorm(doc=1762)
          0.39999172 = weight(abstract_txt:boolean in 1762) [ClassicSimilarity], result of:
            0.39999172 = score(doc=1762,freq=4.0), product of:
              0.41153577 = queryWeight, product of:
                3.4742699 = boost
                6.220473 = idf(docFreq=230, maxDocs=42740)
                0.019042354 = queryNorm
              0.97194886 = fieldWeight in 1762, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.220473 = idf(docFreq=230, maxDocs=42740)
                0.078125 = fieldNorm(doc=1762)
        0.24 = coord(6/25)
    
  5. Losee, R.M.: Upper bounds for retrieval performance and their user measuring performance and generating optimal queries : can it get any better than this? (1994) 0.15
    0.1505757 = sum of:
      0.1505757 = product of:
        0.6273987 = sum of:
          0.046069883 = weight(abstract_txt:functions in 7418) [ClassicSimilarity], result of:
            0.046069883 = score(doc=7418,freq=1.0), product of:
              0.10722545 = queryWeight, product of:
                1.0238776 = boost
                5.4995756 = idf(docFreq=474, maxDocs=42740)
                0.019042354 = queryNorm
              0.42965436 = fieldWeight in 7418, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4995756 = idf(docFreq=474, maxDocs=42740)
                0.078125 = fieldNorm(doc=7418)
          0.05760469 = weight(abstract_txt:given in 7418) [ClassicSimilarity], result of:
            0.05760469 = score(doc=7418,freq=1.0), product of:
              0.15679602 = queryWeight, product of:
                1.7509818 = boost
                4.702543 = idf(docFreq=1053, maxDocs=42740)
                0.019042354 = queryNorm
              0.36738616 = fieldWeight in 7418, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.702543 = idf(docFreq=1053, maxDocs=42740)
                0.078125 = fieldNorm(doc=7418)
          0.03161326 = weight(abstract_txt:system in 7418) [ClassicSimilarity], result of:
            0.03161326 = score(doc=7418,freq=1.0), product of:
              0.1203113 = queryWeight, product of:
                1.8785076 = boost
                3.3633559 = idf(docFreq=4021, maxDocs=42740)
                0.019042354 = queryNorm
              0.2627622 = fieldWeight in 7418, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3633559 = idf(docFreq=4021, maxDocs=42740)
                0.078125 = fieldNorm(doc=7418)
          0.084656715 = weight(abstract_txt:retrieval in 7418) [ClassicSimilarity], result of:
            0.084656715 = score(doc=7418,freq=6.0), product of:
              0.1276784 = queryWeight, product of:
                1.935167 = boost
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.019042354 = queryNorm
              0.66304654 = fieldWeight in 7418, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.078125 = fieldNorm(doc=7418)
          0.12461725 = weight(abstract_txt:query in 7418) [ClassicSimilarity], result of:
            0.12461725 = score(doc=7418,freq=2.0), product of:
              0.2382881 = queryWeight, product of:
                2.6436925 = boost
                4.7333736 = idf(docFreq=1021, maxDocs=42740)
                0.019042354 = queryNorm
              0.5229688 = fieldWeight in 7418, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7333736 = idf(docFreq=1021, maxDocs=42740)
                0.078125 = fieldNorm(doc=7418)
          0.28283688 = weight(abstract_txt:boolean in 7418) [ClassicSimilarity], result of:
            0.28283688 = score(doc=7418,freq=2.0), product of:
              0.41153577 = queryWeight, product of:
                3.4742699 = boost
                6.220473 = idf(docFreq=230, maxDocs=42740)
                0.019042354 = queryNorm
              0.68727165 = fieldWeight in 7418, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.220473 = idf(docFreq=230, maxDocs=42740)
                0.078125 = fieldNorm(doc=7418)
        0.24 = coord(6/25)