Document (#6188)

Author
Keen, E.M.
Title
Some aspects of proximity searching in text retrieval systems
Source
Journal of information science. 18(1992), S.89-98
Year
1992
Abstract
Describes and evaluates the proximity search facilities in external online systems and in-house retrieval software. Discusses and illustrates capabilities, syntax and circumstances of use. Presents measurements of the overheads required by proximity for storage, record input time and search time. The search strategy narrowing effect of proximity is illustrated by recall and precision test results. Usage and problems lead to a number of design ideas for better implementation: some based on existing Boolean strategies, one on the use of weighted proximity to automatically produce ranked output. A comparison of Boolean, quorum and proximate term pairs distance is included
Theme
Retrievalstudien
Suchtaktik

Similar documents (author)

  1. Keen, E.M.: ¬The Aberystwyth index languages tests (1973) 5.37
    5.369225 = sum of:
      5.369225 = weight(author_txt:keen in 773) [ClassicSimilarity], result of:
        5.369225 = fieldWeight in 773, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.59076 = idf(docFreq=21, maxDocs=43556)
          0.625 = fieldNorm(doc=773)
    
  2. Keen, E.M.: Prospects for classification suggested by evaluation tests (1976) 5.37
    5.369225 = sum of:
      5.369225 = weight(author_txt:keen in 1277) [ClassicSimilarity], result of:
        5.369225 = fieldWeight in 1277, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.59076 = idf(docFreq=21, maxDocs=43556)
          0.625 = fieldNorm(doc=1277)
    
  3. Keen, E.M.: On the generation and searching of entries in printed subject indexes (1977) 5.37
    5.369225 = sum of:
      5.369225 = weight(author_txt:keen in 2302) [ClassicSimilarity], result of:
        5.369225 = fieldWeight in 2302, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.59076 = idf(docFreq=21, maxDocs=43556)
          0.625 = fieldNorm(doc=2302)
    
  4. Keen, E.M.: Presenting results of experimental retrieval comparisons (1992) 5.37
    5.369225 = sum of:
      5.369225 = weight(author_txt:keen in 3644) [ClassicSimilarity], result of:
        5.369225 = fieldWeight in 3644, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.59076 = idf(docFreq=21, maxDocs=43556)
          0.625 = fieldNorm(doc=3644)
    
  5. Keen, M.: Query reformulation in ranked output interaction (1994) 5.37
    5.369225 = sum of:
      5.369225 = weight(author_txt:keen in 1131) [ClassicSimilarity], result of:
        5.369225 = fieldWeight in 1131, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.59076 = idf(docFreq=21, maxDocs=43556)
          0.625 = fieldNorm(doc=1131)
    

Similar documents (content)

  1. Boeri, R.J.; Hensel, M.: Set up a winning text retrieval system : carefully (1995) 0.18
    0.17646909 = sum of:
      0.17646909 = product of:
        0.88234544 = sum of:
          0.0281323 = weight(abstract_txt:systems in 2875) [ClassicSimilarity], result of:
            0.0281323 = score(doc=2875,freq=1.0), product of:
              0.06589059 = queryWeight, product of:
                1.1521355 = boost
                3.4156382 = idf(docFreq=3889, maxDocs=43556)
                0.016743567 = queryNorm
              0.42695478 = fieldWeight in 2875, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4156382 = idf(docFreq=3889, maxDocs=43556)
                0.125 = fieldNorm(doc=2875)
          0.11403047 = weight(abstract_txt:house in 2875) [ClassicSimilarity], result of:
            0.11403047 = score(doc=2875,freq=1.0), product of:
              0.13295066 = queryWeight, product of:
                1.1572365 = boost
                6.8615212 = idf(docFreq=123, maxDocs=43556)
                0.016743567 = queryNorm
              0.85769016 = fieldWeight in 2875, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8615212 = idf(docFreq=123, maxDocs=43556)
                0.125 = fieldNorm(doc=2875)
          0.04185429 = weight(abstract_txt:retrieval in 2875) [ClassicSimilarity], result of:
            0.04185429 = score(doc=2875,freq=2.0), product of:
              0.06815586 = queryWeight, product of:
                1.171773 = boost
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.016743567 = queryNorm
              0.6140967 = fieldWeight in 2875, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.125 = fieldNorm(doc=2875)
          0.03519587 = weight(abstract_txt:some in 2875) [ClassicSimilarity], result of:
            0.03519587 = score(doc=2875,freq=1.0), product of:
              0.07650345 = queryWeight, product of:
                1.2414589 = boost
                3.6804478 = idf(docFreq=2984, maxDocs=43556)
                0.016743567 = queryNorm
              0.46005598 = fieldWeight in 2875, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6804478 = idf(docFreq=2984, maxDocs=43556)
                0.125 = fieldNorm(doc=2875)
          0.6631325 = weight(abstract_txt:proximity in 2875) [ClassicSimilarity], result of:
            0.6631325 = score(doc=2875,freq=1.0), product of:
              0.73519087 = queryWeight, product of:
                6.0850186 = boost
                7.2158947 = idf(docFreq=86, maxDocs=43556)
                0.016743567 = queryNorm
              0.90198684 = fieldWeight in 2875, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2158947 = idf(docFreq=86, maxDocs=43556)
                0.125 = fieldNorm(doc=2875)
        0.2 = coord(5/25)
    
  2. Ojala, M.: Who's hosting this search? (1995) 0.15
    0.15472934 = sum of:
      0.15472934 = product of:
        0.96705836 = sum of:
          0.08110966 = weight(abstract_txt:input in 2742) [ClassicSimilarity], result of:
            0.08110966 = score(doc=2742,freq=1.0), product of:
              0.10593958 = queryWeight, product of:
                1.0330135 = boost
                6.1249747 = idf(docFreq=258, maxDocs=43556)
                0.016743567 = queryNorm
              0.76562184 = fieldWeight in 2742, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1249747 = idf(docFreq=258, maxDocs=43556)
                0.125 = fieldNorm(doc=2742)
          0.051691514 = weight(abstract_txt:search in 2742) [ClassicSimilarity], result of:
            0.051691514 = score(doc=2742,freq=1.0), product of:
              0.113152236 = queryWeight, product of:
                1.8491368 = boost
                3.6546526 = idf(docFreq=3062, maxDocs=43556)
                0.016743567 = queryNorm
              0.45683157 = fieldWeight in 2742, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6546526 = idf(docFreq=3062, maxDocs=43556)
                0.125 = fieldNorm(doc=2742)
          0.17112471 = weight(abstract_txt:boolean in 2742) [ClassicSimilarity], result of:
            0.17112471 = score(doc=2742,freq=1.0), product of:
              0.21956429 = queryWeight, product of:
                2.1031618 = boost
                6.2350655 = idf(docFreq=231, maxDocs=43556)
                0.016743567 = queryNorm
              0.7793832 = fieldWeight in 2742, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2350655 = idf(docFreq=231, maxDocs=43556)
                0.125 = fieldNorm(doc=2742)
          0.6631325 = weight(abstract_txt:proximity in 2742) [ClassicSimilarity], result of:
            0.6631325 = score(doc=2742,freq=1.0), product of:
              0.73519087 = queryWeight, product of:
                6.0850186 = boost
                7.2158947 = idf(docFreq=86, maxDocs=43556)
                0.016743567 = queryNorm
              0.90198684 = fieldWeight in 2742, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2158947 = idf(docFreq=86, maxDocs=43556)
                0.125 = fieldNorm(doc=2742)
        0.16 = coord(4/25)
    
  3. Milstead, J.L.: Specifications for thesaurus software (1991) 0.15
    0.15286306 = sum of:
      0.15286306 = product of:
        0.6369294 = sum of:
          0.047131564 = weight(abstract_txt:capabilities in 2291) [ClassicSimilarity], result of:
            0.047131564 = score(doc=2291,freq=1.0), product of:
              0.10091703 = queryWeight, product of:
                1.0082288 = boost
                5.97802 = idf(docFreq=299, maxDocs=43556)
                0.016743567 = queryNorm
              0.46703282 = fieldWeight in 2291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.97802 = idf(docFreq=299, maxDocs=43556)
                0.078125 = fieldNorm(doc=2291)
          0.017582688 = weight(abstract_txt:systems in 2291) [ClassicSimilarity], result of:
            0.017582688 = score(doc=2291,freq=1.0), product of:
              0.06589059 = queryWeight, product of:
                1.1521355 = boost
                3.4156382 = idf(docFreq=3889, maxDocs=43556)
                0.016743567 = queryNorm
              0.26684675 = fieldWeight in 2291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4156382 = idf(docFreq=3889, maxDocs=43556)
                0.078125 = fieldNorm(doc=2291)
          0.018497158 = weight(abstract_txt:retrieval in 2291) [ClassicSimilarity], result of:
            0.018497158 = score(doc=2291,freq=1.0), product of:
              0.06815586 = queryWeight, product of:
                1.171773 = boost
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.016743567 = queryNorm
              0.27139497 = fieldWeight in 2291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.078125 = fieldNorm(doc=2291)
          0.032307196 = weight(abstract_txt:search in 2291) [ClassicSimilarity], result of:
            0.032307196 = score(doc=2291,freq=1.0), product of:
              0.113152236 = queryWeight, product of:
                1.8491368 = boost
                3.6546526 = idf(docFreq=3062, maxDocs=43556)
                0.016743567 = queryNorm
              0.28551972 = fieldWeight in 2291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6546526 = idf(docFreq=3062, maxDocs=43556)
                0.078125 = fieldNorm(doc=2291)
          0.10695294 = weight(abstract_txt:boolean in 2291) [ClassicSimilarity], result of:
            0.10695294 = score(doc=2291,freq=1.0), product of:
              0.21956429 = queryWeight, product of:
                2.1031618 = boost
                6.2350655 = idf(docFreq=231, maxDocs=43556)
                0.016743567 = queryNorm
              0.4871145 = fieldWeight in 2291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2350655 = idf(docFreq=231, maxDocs=43556)
                0.078125 = fieldNorm(doc=2291)
          0.41445783 = weight(abstract_txt:proximity in 2291) [ClassicSimilarity], result of:
            0.41445783 = score(doc=2291,freq=1.0), product of:
              0.73519087 = queryWeight, product of:
                6.0850186 = boost
                7.2158947 = idf(docFreq=86, maxDocs=43556)
                0.016743567 = queryNorm
              0.5637418 = fieldWeight in 2291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2158947 = idf(docFreq=86, maxDocs=43556)
                0.078125 = fieldNorm(doc=2291)
        0.24 = coord(6/25)
    
  4. Clarke, S.J.: Search engines for the World Wide Web : an evaluation of recent developments (2000) 0.14
    0.13918874 = sum of:
      0.13918874 = product of:
        0.8699296 = sum of:
          0.0754105 = weight(abstract_txt:capabilities in 105) [ClassicSimilarity], result of:
            0.0754105 = score(doc=105,freq=1.0), product of:
              0.10091703 = queryWeight, product of:
                1.0082288 = boost
                5.97802 = idf(docFreq=299, maxDocs=43556)
                0.016743567 = queryNorm
              0.7472525 = fieldWeight in 105, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.97802 = idf(docFreq=299, maxDocs=43556)
                0.125 = fieldNorm(doc=105)
          0.04185429 = weight(abstract_txt:retrieval in 105) [ClassicSimilarity], result of:
            0.04185429 = score(doc=105,freq=2.0), product of:
              0.06815586 = queryWeight, product of:
                1.171773 = boost
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.016743567 = queryNorm
              0.6140967 = fieldWeight in 105, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.125 = fieldNorm(doc=105)
          0.08953232 = weight(abstract_txt:search in 105) [ClassicSimilarity], result of:
            0.08953232 = score(doc=105,freq=3.0), product of:
              0.113152236 = queryWeight, product of:
                1.8491368 = boost
                3.6546526 = idf(docFreq=3062, maxDocs=43556)
                0.016743567 = queryNorm
              0.7912555 = fieldWeight in 105, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6546526 = idf(docFreq=3062, maxDocs=43556)
                0.125 = fieldNorm(doc=105)
          0.6631325 = weight(abstract_txt:proximity in 105) [ClassicSimilarity], result of:
            0.6631325 = score(doc=105,freq=1.0), product of:
              0.73519087 = queryWeight, product of:
                6.0850186 = boost
                7.2158947 = idf(docFreq=86, maxDocs=43556)
                0.016743567 = queryNorm
              0.90198684 = fieldWeight in 105, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2158947 = idf(docFreq=86, maxDocs=43556)
                0.125 = fieldNorm(doc=105)
        0.16 = coord(4/25)
    
  5. Loughran, H.: ¬A review of nearest neighbour information retrieval (1994) 0.14
    0.13508846 = sum of:
      0.13508846 = product of:
        0.48245877 = sum of:
          0.073579125 = weight(abstract_txt:output in 682) [ClassicSimilarity], result of:
            0.073579125 = score(doc=682,freq=1.0), product of:
              0.09927646 = queryWeight, product of:
                5.92923 = idf(docFreq=314, maxDocs=43556)
                0.016743567 = queryNorm
              0.7411538 = fieldWeight in 682, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.92923 = idf(docFreq=314, maxDocs=43556)
                0.125 = fieldNorm(doc=682)
          0.093139805 = weight(abstract_txt:ranked in 682) [ClassicSimilarity], result of:
            0.093139805 = score(doc=682,freq=1.0), product of:
              0.116171636 = queryWeight, product of:
                1.08175 = boost
                6.4139447 = idf(docFreq=193, maxDocs=43556)
                0.016743567 = queryNorm
              0.8017431 = fieldWeight in 682, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4139447 = idf(docFreq=193, maxDocs=43556)
                0.125 = fieldNorm(doc=682)
          0.0281323 = weight(abstract_txt:systems in 682) [ClassicSimilarity], result of:
            0.0281323 = score(doc=682,freq=1.0), product of:
              0.06589059 = queryWeight, product of:
                1.1521355 = boost
                3.4156382 = idf(docFreq=3889, maxDocs=43556)
                0.016743567 = queryNorm
              0.42695478 = fieldWeight in 682, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4156382 = idf(docFreq=3889, maxDocs=43556)
                0.125 = fieldNorm(doc=682)
          0.029595453 = weight(abstract_txt:retrieval in 682) [ClassicSimilarity], result of:
            0.029595453 = score(doc=682,freq=1.0), product of:
              0.06815586 = queryWeight, product of:
                1.171773 = boost
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.016743567 = queryNorm
              0.43423197 = fieldWeight in 682, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.125 = fieldNorm(doc=682)
          0.03519587 = weight(abstract_txt:some in 682) [ClassicSimilarity], result of:
            0.03519587 = score(doc=682,freq=1.0), product of:
              0.07650345 = queryWeight, product of:
                1.2414589 = boost
                3.6804478 = idf(docFreq=2984, maxDocs=43556)
                0.016743567 = queryNorm
              0.46005598 = fieldWeight in 682, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6804478 = idf(docFreq=2984, maxDocs=43556)
                0.125 = fieldNorm(doc=682)
          0.051691514 = weight(abstract_txt:search in 682) [ClassicSimilarity], result of:
            0.051691514 = score(doc=682,freq=1.0), product of:
              0.113152236 = queryWeight, product of:
                1.8491368 = boost
                3.6546526 = idf(docFreq=3062, maxDocs=43556)
                0.016743567 = queryNorm
              0.45683157 = fieldWeight in 682, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6546526 = idf(docFreq=3062, maxDocs=43556)
                0.125 = fieldNorm(doc=682)
          0.17112471 = weight(abstract_txt:boolean in 682) [ClassicSimilarity], result of:
            0.17112471 = score(doc=682,freq=1.0), product of:
              0.21956429 = queryWeight, product of:
                2.1031618 = boost
                6.2350655 = idf(docFreq=231, maxDocs=43556)
                0.016743567 = queryNorm
              0.7793832 = fieldWeight in 682, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2350655 = idf(docFreq=231, maxDocs=43556)
                0.125 = fieldNorm(doc=682)
        0.28 = coord(7/25)