Document (#12601)

Author
Nakkouzi, Z.S.
Eastman, C.M.
Title
Query formulation for handling negation in information retrieval systems
Source
Journal of the American Society for Information Science. 41(1990) no.3, p.171-182
Year
1990
Abstract
Queries containing negation are widely recognised as presenting problems for both users and systems. In information retrieval systems such problems usually manifest themselves in the use of the NOT operator. Describes an algorithm to transform Boolean queries with negated terms into queries without negation; the transformation process is based on the use of a hierarchical thesaurus. Examines a set of user requests submitted to the Thomas Cooper Library at the University of South Carolina to determine the pattern and frequency of use of negation.
Theme
Retrieval algorithms
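
The abstract says only that negated terms are removed with the help of a hierarchical thesaurus; it does not give the procedure. As a minimal sketch of one plausible reading, and not the authors' published algorithm, the code below replaces NOT t with the disjunction of t's siblings under the same parent. The toy thesaurus, the function names, and the sibling rule itself are all assumptions made for illustration.

```python
from typing import Dict, List, Optional

# Hypothetical toy thesaurus: child term -> parent term (None marks the root).
PARENT: Dict[str, Optional[str]] = {
    "animals": None,
    "mammals": "animals",
    "birds": "animals",
    "reptiles": "animals",
    "dogs": "mammals",
    "cats": "mammals",
}


def siblings(term: str) -> List[str]:
    """Return the other terms that share the parent of `term`, sorted by name."""
    parent = PARENT[term]
    if parent is None:
        return []
    return sorted(t for t, p in PARENT.items() if p == parent and t != term)


def rewrite_not(term: str) -> str:
    """Rewrite 'NOT term' as an OR over the siblings of `term` (assumed rule)."""
    sibs = siblings(term)
    if not sibs:
        raise ValueError(f"no siblings available to replace NOT {term}")
    return "(" + " OR ".join(sibs) + ")"


# 'animals AND NOT mammals' becomes a negation-free query.
query = "animals AND " + rewrite_not("mammals")
```

Under this assumed rule the rewritten query can only retrieve documents indexed under a sibling term, so its quality depends on how completely the thesaurus partitions the parent concept.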

Similar documents (author)

  1. Eastman, C.M.: Overlaps in postings to thesaurus terms : a preliminary study (1988) 5.81
    5.81187 = sum of:
      5.81187 = weight(author_txt:eastman in 3555) [ClassicSimilarity], result of:
        5.81187 = fieldWeight in 3555, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.298992 = idf(docFreq=10, maxDocs=44218)
          0.625 = fieldNorm(doc=3555)
    
  2. Eastman, C.M.: 30,000 hits may be better than 300 : precision anomalies in Internet searches (2002) 5.81
    5.81187 = sum of:
      5.81187 = weight(author_txt:eastman in 5231) [ClassicSimilarity], result of:
        5.81187 = fieldWeight in 5231, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.298992 = idf(docFreq=10, maxDocs=44218)
          0.625 = fieldNorm(doc=5231)
    
  3. Chang, Y.F.; Eastman, C.M.: An information retrieval system for reusable software (1993) 4.65
    4.649496 = sum of:
      4.649496 = weight(author_txt:eastman in 6348) [ClassicSimilarity], result of:
        4.649496 = fieldWeight in 6348, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.298992 = idf(docFreq=10, maxDocs=44218)
          0.5 = fieldNorm(doc=6348)
    
  4. Eastman, C.M.; Carter, R.M.: Anthropological perspectives on classification schemes (1994) 4.65
    4.649496 = sum of:
      4.649496 = weight(author_txt:eastman in 8888) [ClassicSimilarity], result of:
        4.649496 = fieldWeight in 8888, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.298992 = idf(docFreq=10, maxDocs=44218)
          0.5 = fieldNorm(doc=8888)
    
  5. Rose, J.R.; Eastman, C.M.: Hierarchical classification as an aid to browsing (1994) 4.65
    4.649496 = sum of:
      4.649496 = weight(author_txt:eastman in 8894) [ClassicSimilarity], result of:
        4.649496 = fieldWeight in 8894, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.298992 = idf(docFreq=10, maxDocs=44218)
          0.5 = fieldNorm(doc=8894)
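
The author scores above are each a single fieldWeight, the product of tf, idf, and fieldNorm. The sketch below recomputes them, assuming the classic Lucene TF-IDF definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which agree with the idf value printed in the listing. The function names are illustrative only, and fieldNorm is taken directly from the listing rather than derived.

```python
import math


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw term frequency."""
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency in the classic TF-IDF form (assumed)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the explain trees above."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Entries 1 and 2 use fieldNorm 0.625; entries 3 to 5 use fieldNorm 0.5.
score_norm_0625 = field_weight(1.0, 10, 44218, 0.625)
score_norm_05 = field_weight(1.0, 10, 44218, 0.5)
```

The two values should come out close to the 5.81187 and 4.649496 shown in the listing; the only difference between the two groups of entries is the fieldNorm, which is lower for the records with more than one author.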
    

Similar documents (content)

  1. Klein, S.T.: On the use of negation in Boolean IR queries (2009) 0.32
    0.31990734 = sum of:
      0.31990734 = product of:
        1.3329473 = sum of:
          0.05610788 = weight(abstract_txt:boolean in 3927) [ClassicSimilarity], result of:
            0.05610788 = score(doc=3927,freq=1.0), product of:
              0.095755145 = queryWeight, product of:
                1.050888 = boost
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.014578583 = queryNorm
              0.58595157 = fieldWeight in 3927, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.09375 = fieldNorm(doc=3927)
          0.019288534 = weight(abstract_txt:retrieval in 3927) [ClassicSimilarity], result of:
            0.019288534 = score(doc=3927,freq=1.0), product of:
              0.059204638 = queryWeight, product of:
                1.1686063 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.014578583 = queryNorm
              0.3257943 = fieldWeight in 3927, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=3927)
          0.12815465 = weight(abstract_txt:operator in 3927) [ClassicSimilarity], result of:
            0.12815465 = score(doc=3927,freq=1.0), product of:
              0.16607432 = queryWeight, product of:
                1.3839697 = boost
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.014578583 = queryNorm
              0.77167046 = fieldWeight in 3927, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.09375 = fieldNorm(doc=3927)
          0.3540727 = weight(abstract_txt:negated in 3927) [ClassicSimilarity], result of:
            0.3540727 = score(doc=3927,freq=3.0), product of:
              0.22672571 = queryWeight, product of:
                1.6170585 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.014578583 = queryNorm
              1.5616786 = fieldWeight in 3927, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.09375 = fieldNorm(doc=3927)
          0.15901071 = weight(abstract_txt:queries in 3927) [ClassicSimilarity], result of:
            0.15901071 = score(doc=3927,freq=3.0), product of:
              0.19176257 = queryWeight, product of:
                2.575834 = boost
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.014578583 = queryNorm
              0.82920617 = fieldWeight in 3927, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.09375 = fieldNorm(doc=3927)
          0.61631274 = weight(abstract_txt:negation in 3927) [ClassicSimilarity], result of:
            0.61631274 = score(doc=3927,freq=1.0), product of:
              0.7511045 = queryWeight, product of:
                5.886478 = boost
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.014578583 = queryNorm
              0.820542 = fieldWeight in 3927, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.09375 = fieldNorm(doc=3927)
        0.24 = coord(6/25)
    
  2. Young, C.W.; Eastman, C.M.; Oakman, R.L.: An analysis of ill-formed input in natural language queries to document retrieval systems (1991) 0.26
    0.26107138 = sum of:
      0.26107138 = product of:
        0.72519827 = sum of:
          0.040287785 = weight(abstract_txt:frequency in 5263) [ClassicSimilarity], result of:
            0.040287785 = score(doc=5263,freq=1.0), product of:
              0.086706035 = queryWeight, product of:
                5.947494 = idf(docFreq=313, maxDocs=44218)
                0.014578583 = queryNorm
              0.46464798 = fieldWeight in 5263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.947494 = idf(docFreq=313, maxDocs=44218)
                0.078125 = fieldNorm(doc=5263)
          0.06339696 = weight(abstract_txt:requests in 5263) [ClassicSimilarity], result of:
            0.06339696 = score(doc=5263,freq=1.0), product of:
              0.11730397 = queryWeight, product of:
                1.1631392 = boost
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.014578583 = queryNorm
              0.5404503 = fieldWeight in 5263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.078125 = fieldNorm(doc=5263)
          0.016073778 = weight(abstract_txt:retrieval in 5263) [ClassicSimilarity], result of:
            0.016073778 = score(doc=5263,freq=1.0), product of:
              0.059204638 = queryWeight, product of:
                1.1686063 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.014578583 = queryNorm
              0.27149525 = fieldWeight in 5263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=5263)
          0.06584095 = weight(abstract_txt:south in 5263) [ClassicSimilarity], result of:
            0.06584095 = score(doc=5263,freq=1.0), product of:
              0.12029968 = queryWeight, product of:
                1.1778977 = boost
                7.0055394 = idf(docFreq=108, maxDocs=44218)
                0.014578583 = queryNorm
              0.5473078 = fieldWeight in 5263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0055394 = idf(docFreq=108, maxDocs=44218)
                0.078125 = fieldNorm(doc=5263)
          0.07780325 = weight(abstract_txt:thomas in 5263) [ClassicSimilarity], result of:
            0.07780325 = score(doc=5263,freq=1.0), product of:
              0.13446179 = queryWeight, product of:
                1.2453023 = boost
                7.406428 = idf(docFreq=72, maxDocs=44218)
                0.014578583 = queryNorm
              0.57862717 = fieldWeight in 5263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.406428 = idf(docFreq=72, maxDocs=44218)
                0.078125 = fieldNorm(doc=5263)
          0.10803609 = weight(abstract_txt:carolina in 5263) [ClassicSimilarity], result of:
            0.10803609 = score(doc=5263,freq=1.0), product of:
              0.16735794 = queryWeight, product of:
                1.3893079 = boost
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.014578583 = queryNorm
              0.6455391 = fieldWeight in 5263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.078125 = fieldNorm(doc=5263)
          0.030398048 = weight(abstract_txt:problems in 5263) [ClassicSimilarity], result of:
            0.030398048 = score(doc=5263,freq=1.0), product of:
              0.09054008 = queryWeight, product of:
                1.4451429 = boost
                4.297489 = idf(docFreq=1634, maxDocs=44218)
                0.014578583 = queryNorm
              0.33574134 = fieldWeight in 5263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.297489 = idf(docFreq=1634, maxDocs=44218)
                0.078125 = fieldNorm(doc=5263)
          0.17035331 = weight(abstract_txt:cooper in 5263) [ClassicSimilarity], result of:
            0.17035331 = score(doc=5263,freq=1.0), product of:
              0.22672571 = queryWeight, product of:
                1.6170585 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.014578583 = queryNorm
              0.751363 = fieldWeight in 5263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.078125 = fieldNorm(doc=5263)
          0.15300813 = weight(abstract_txt:queries in 5263) [ClassicSimilarity], result of:
            0.15300813 = score(doc=5263,freq=4.0), product of:
              0.19176257 = queryWeight, product of:
                2.575834 = boost
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.014578583 = queryNorm
              0.7979041 = fieldWeight in 5263, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.078125 = fieldNorm(doc=5263)
        0.36 = coord(9/25)
    
  3. McQuire, A.R.; Eastman, C.M.: The ambiguity of negation in natural language queries to information retrieval systems (1998) 0.21
    0.21307456 = sum of:
      0.21307456 = product of:
        1.0653728 = sum of:
          0.012859023 = weight(abstract_txt:retrieval in 1147) [ClassicSimilarity], result of:
            0.012859023 = score(doc=1147,freq=1.0), product of:
              0.059204638 = queryWeight, product of:
                1.1686063 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.014578583 = queryNorm
              0.21719621 = fieldWeight in 1147, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=1147)
          0.23604847 = weight(abstract_txt:negated in 1147) [ClassicSimilarity], result of:
            0.23604847 = score(doc=1147,freq=3.0), product of:
              0.22672571 = queryWeight, product of:
                1.6170585 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.014578583 = queryNorm
              1.0411191 = fieldWeight in 1147, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.0625 = fieldNorm(doc=1147)
          0.018254215 = weight(abstract_txt:systems in 1147) [ClassicSimilarity], result of:
            0.018254215 = score(doc=1147,freq=1.0), product of:
              0.085603125 = queryWeight, product of:
                1.7209996 = boost
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.014578583 = queryNorm
              0.2132424 = fieldWeight in 1147, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.0625 = fieldNorm(doc=1147)
          0.08655447 = weight(abstract_txt:queries in 1147) [ClassicSimilarity], result of:
            0.08655447 = score(doc=1147,freq=2.0), product of:
              0.19176257 = queryWeight, product of:
                2.575834 = boost
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.014578583 = queryNorm
              0.4513627 = fieldWeight in 1147, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.0625 = fieldNorm(doc=1147)
          0.7116567 = weight(abstract_txt:negation in 1147) [ClassicSimilarity], result of:
            0.7116567 = score(doc=1147,freq=3.0), product of:
              0.7511045 = queryWeight, product of:
                5.886478 = boost
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.014578583 = queryNorm
              0.94748026 = fieldWeight in 1147, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.0625 = fieldNorm(doc=1147)
        0.2 = coord(5/25)
    
  4. Lucas, W.; Topi, H.: Form and function : the impact of query term and operator usage on Web search results (2002) 0.10
    0.09921498 = sum of:
      0.09921498 = product of:
        0.41339576 = sum of:
          0.03740525 = weight(abstract_txt:boolean in 198) [ClassicSimilarity], result of:
            0.03740525 = score(doc=198,freq=1.0), product of:
              0.095755145 = queryWeight, product of:
                1.050888 = boost
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.014578583 = queryNorm
              0.39063436 = fieldWeight in 198, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.0625 = fieldNorm(doc=198)
          0.012859023 = weight(abstract_txt:retrieval in 198) [ClassicSimilarity], result of:
            0.012859023 = score(doc=198,freq=1.0), product of:
              0.059204638 = queryWeight, product of:
                1.1686063 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.014578583 = queryNorm
              0.21719621 = fieldWeight in 198, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=198)
          0.07449053 = weight(abstract_txt:submitted in 198) [ClassicSimilarity], result of:
            0.07449053 = score(doc=198,freq=2.0), product of:
              0.12029968 = queryWeight, product of:
                1.1778977 = boost
                7.0055394 = idf(docFreq=108, maxDocs=44218)
                0.014578583 = queryNorm
              0.61920804 = fieldWeight in 198, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.0055394 = idf(docFreq=108, maxDocs=44218)
                0.0625 = fieldNorm(doc=198)
          0.14798023 = weight(abstract_txt:operator in 198) [ClassicSimilarity], result of:
            0.14798023 = score(doc=198,freq=3.0), product of:
              0.16607432 = queryWeight, product of:
                1.3839697 = boost
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.014578583 = queryNorm
              0.89104825 = fieldWeight in 198, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.0625 = fieldNorm(doc=198)
          0.018254215 = weight(abstract_txt:systems in 198) [ClassicSimilarity], result of:
            0.018254215 = score(doc=198,freq=1.0), product of:
              0.085603125 = queryWeight, product of:
                1.7209996 = boost
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.014578583 = queryNorm
              0.2132424 = fieldWeight in 198, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.0625 = fieldNorm(doc=198)
          0.122406505 = weight(abstract_txt:queries in 198) [ClassicSimilarity], result of:
            0.122406505 = score(doc=198,freq=4.0), product of:
              0.19176257 = queryWeight, product of:
                2.575834 = boost
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.014578583 = queryNorm
              0.63832325 = fieldWeight in 198, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.0625 = fieldNorm(doc=198)
        0.24 = coord(6/25)
    
  5. Spink, A.; Wolfram, D.; Jansen, B.J.; Saracevic, T.: Searching the Web : the public and their queries (2001) 0.08
    0.08355324 = sum of:
      0.08355324 = product of:
        0.3481385 = sum of:
          0.02417267 = weight(abstract_txt:frequency in 6980) [ClassicSimilarity], result of:
            0.02417267 = score(doc=6980,freq=1.0), product of:
              0.086706035 = queryWeight, product of:
                5.947494 = idf(docFreq=313, maxDocs=44218)
                0.014578583 = queryNorm
              0.27878878 = fieldWeight in 6980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.947494 = idf(docFreq=313, maxDocs=44218)
                0.046875 = fieldNorm(doc=6980)
          0.03967426 = weight(abstract_txt:boolean in 6980) [ClassicSimilarity], result of:
            0.03967426 = score(doc=6980,freq=2.0), product of:
              0.095755145 = queryWeight, product of:
                1.050888 = boost
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.014578583 = queryNorm
              0.4143303 = fieldWeight in 6980, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.046875 = fieldNorm(doc=6980)
          0.028468512 = weight(abstract_txt:containing in 6980) [ClassicSimilarity], result of:
            0.028468512 = score(doc=6980,freq=1.0), product of:
              0.0966962 = queryWeight, product of:
                1.0560392 = boost
                6.280787 = idf(docFreq=224, maxDocs=44218)
                0.014578583 = queryNorm
              0.2944119 = fieldWeight in 6980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.280787 = idf(docFreq=224, maxDocs=44218)
                0.046875 = fieldNorm(doc=6980)
          0.039504573 = weight(abstract_txt:submitted in 6980) [ClassicSimilarity], result of:
            0.039504573 = score(doc=6980,freq=1.0), product of:
              0.12029968 = queryWeight, product of:
                1.1778977 = boost
                7.0055394 = idf(docFreq=108, maxDocs=44218)
                0.014578583 = queryNorm
              0.32838467 = fieldWeight in 6980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0055394 = idf(docFreq=108, maxDocs=44218)
                0.046875 = fieldNorm(doc=6980)
          0.064077325 = weight(abstract_txt:operator in 6980) [ClassicSimilarity], result of:
            0.064077325 = score(doc=6980,freq=1.0), product of:
              0.16607432 = queryWeight, product of:
                1.3839697 = boost
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.014578583 = queryNorm
              0.38583523 = fieldWeight in 6980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.046875 = fieldNorm(doc=6980)
          0.15224117 = weight(abstract_txt:queries in 6980) [ClassicSimilarity], result of:
            0.15224117 = score(doc=6980,freq=11.0), product of:
              0.19176257 = queryWeight, product of:
                2.575834 = boost
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.014578583 = queryNorm
              0.79390454 = fieldWeight in 6980, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.046875 = fieldNorm(doc=6980)
        0.24 = coord(6/25)
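
The content scores above combine, for each matching term, a queryWeight (boost * idf * queryNorm) with a fieldWeight (tf * idf * fieldNorm), sum the products, and scale the sum by coord (matching terms / query terms). The sketch below recomputes the score of entry 5 from the figures in its explain tree. It assumes the classic idf form 1 + ln(maxDocs / (docFreq + 1)) and treats the term "frequency", which has no boost line in the listing, as having boost 1.0. All names are illustrative.

```python
import math

MAX_DOCS = 44218          # maxDocs from the listing
QUERY_NORM = 0.014578583  # queryNorm from the listing
FIELD_NORM = 0.046875     # fieldNorm(doc=6980)
QUERY_TERMS = 25          # denominator of coord(6/25)

# (term, boost, docFreq, termFreq) for the six terms matched in doc 6980.
# The boost of 1.0 for "frequency" is an assumption (no boost line is shown).
MATCHES = [
    ("frequency", 1.0, 313, 1.0),
    ("boolean", 1.050888, 231, 2.0),
    ("containing", 1.0560392, 224, 1.0),
    ("submitted", 1.1778977, 108, 1.0),
    ("operator", 1.3839697, 31, 1.0),
    ("queries", 2.575834, 727, 11.0),
]


def idf(doc_freq: int) -> float:
    """Classic TF-IDF inverse document frequency (assumed form)."""
    return 1.0 + math.log(MAX_DOCS / (doc_freq + 1))


def term_score(boost: float, doc_freq: int, freq: float) -> float:
    """Score of one term: queryWeight * fieldWeight."""
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    field_weight = math.sqrt(freq) * idf(doc_freq) * FIELD_NORM
    return query_weight * field_weight


def document_score(matches) -> float:
    """Sum of term scores scaled by the coordination factor."""
    coord = len(matches) / QUERY_TERMS
    return coord * sum(term_score(b, df, f) for _, b, df, f in matches)


total = document_score(MATCHES)
```

The result should agree with the 0.08355324 reported for entry 5 to within floating-point rounding. Because idf enters both queryWeight and fieldWeight, rare terms such as "operator" contribute far more per occurrence than common ones such as "retrieval".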