Document (#6191)

Author
Keen, E.M.
Title
Some aspects of proximity searching in text retrieval systems
Source
Journal of information science. 18(1992), S.89-98
Year
1992
Abstract
Describes and evaluates the proximity search facilities in external online systems and in-house retrieval software. Discusses and illustrates capabilities, syntax and circumstances of use. Presents measurements of the overheads required by proximity for storage, record input time and search time. The search strategy narrowing effect of proximity is illustrated by recall and precision test results. Usage and problems lead to a number of design ideas for better implementation: some based on existing Boolean strategies, one on the use of weighted proximity to automatically produce ranked output. A comparison of Boolean, quorum and proximate term pairs distance is included
Theme
Retrievalstudien
Suchtaktik

Similar documents (author)

  1. Keen, E.M.: ¬The Aberystwyth index languages tests (1973) 5.35
    5.3510256 = sum of:
      5.3510256 = weight(author_txt:keen in 773) [ClassicSimilarity], result of:
        5.3510256 = score(doc=773,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.561642 = idf(docFreq=21, maxDocs=42306)
            0.116800025 = queryNorm
          5.351026 = fieldWeight in 773, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.561642 = idf(docFreq=21, maxDocs=42306)
            0.625 = fieldNorm(doc=773)
    
  2. Keen, E.M.: Prospects for classification suggested by evaluation tests (1976) 5.35
    5.3510256 = sum of:
      5.3510256 = weight(author_txt:keen in 1277) [ClassicSimilarity], result of:
        5.3510256 = score(doc=1277,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.561642 = idf(docFreq=21, maxDocs=42306)
            0.116800025 = queryNorm
          5.351026 = fieldWeight in 1277, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.561642 = idf(docFreq=21, maxDocs=42306)
            0.625 = fieldNorm(doc=1277)
    
  3. Keen, E.M.: On the generation and searching of entries in printed subject indexes (1977) 5.35
    5.3510256 = sum of:
      5.3510256 = weight(author_txt:keen in 2302) [ClassicSimilarity], result of:
        5.3510256 = score(doc=2302,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.561642 = idf(docFreq=21, maxDocs=42306)
            0.116800025 = queryNorm
          5.351026 = fieldWeight in 2302, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.561642 = idf(docFreq=21, maxDocs=42306)
            0.625 = fieldNorm(doc=2302)
    
  4. Keen, E.M.: Presenting results of experimental retrieval comparisons (1992) 5.35
    5.3510256 = sum of:
      5.3510256 = weight(author_txt:keen in 3644) [ClassicSimilarity], result of:
        5.3510256 = score(doc=3644,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.561642 = idf(docFreq=21, maxDocs=42306)
            0.116800025 = queryNorm
          5.351026 = fieldWeight in 3644, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.561642 = idf(docFreq=21, maxDocs=42306)
            0.625 = fieldNorm(doc=3644)
    
  5. Keen, M.: Query reformulation in ranked output interaction (1994) 5.35
    5.3510256 = sum of:
      5.3510256 = weight(author_txt:keen in 1134) [ClassicSimilarity], result of:
        5.3510256 = score(doc=1134,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.561642 = idf(docFreq=21, maxDocs=42306)
            0.116800025 = queryNorm
          5.351026 = fieldWeight in 1134, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.561642 = idf(docFreq=21, maxDocs=42306)
            0.625 = fieldNorm(doc=1134)
    

Similar documents (content)

  1. Boeri, R.J.; Hensel, M.: Set up a winning text retrieval system : carefully (1995) 0.18
    0.1780169 = sum of:
      0.1780169 = product of:
        0.8900845 = sum of:
          0.027965892 = weight(abstract_txt:systems in 2878) [ClassicSimilarity], result of:
            0.027965892 = score(doc=2878,freq=1.0), product of:
              0.06546853 = queryWeight, product of:
                1.1508172 = boost
                3.4173236 = idf(docFreq=3771, maxDocs=42306)
                0.016647158 = queryNorm
              0.42716545 = fieldWeight in 2878, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4173236 = idf(docFreq=3771, maxDocs=42306)
                0.125 = fieldNorm(doc=2878)
          0.11255319 = weight(abstract_txt:house in 2878) [ClassicSimilarity], result of:
            0.11255319 = score(doc=2878,freq=1.0), product of:
              0.13147463 = queryWeight, product of:
                1.1531771 = boost
                6.8486633 = idf(docFreq=121, maxDocs=42306)
                0.016647158 = queryNorm
              0.8560829 = fieldWeight in 2878, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8486633 = idf(docFreq=121, maxDocs=42306)
                0.125 = fieldNorm(doc=2878)
          0.041142907 = weight(abstract_txt:retrieval in 2878) [ClassicSimilarity], result of:
            0.041142907 = score(doc=2878,freq=2.0), product of:
              0.067215085 = queryWeight, product of:
                1.1660668 = boost
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.016647158 = queryNorm
              0.61210823 = fieldWeight in 2878, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.125 = fieldNorm(doc=2878)
          0.03508716 = weight(abstract_txt:some in 2878) [ClassicSimilarity], result of:
            0.03508716 = score(doc=2878,freq=1.0), product of:
              0.07615743 = queryWeight, product of:
                1.2412126 = boost
                3.6857507 = idf(docFreq=2883, maxDocs=42306)
                0.016647158 = queryNorm
              0.46071884 = fieldWeight in 2878, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6857507 = idf(docFreq=2883, maxDocs=42306)
                0.125 = fieldNorm(doc=2878)
          0.6733353 = weight(abstract_txt:proximity in 2878) [ClassicSimilarity], result of:
            0.6733353 = score(doc=2878,freq=1.0), product of:
              0.7408797 = queryWeight, product of:
                6.1211624 = boost
                7.2706575 = idf(docFreq=79, maxDocs=42306)
                0.016647158 = queryNorm
              0.9088322 = fieldWeight in 2878, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2706575 = idf(docFreq=79, maxDocs=42306)
                0.125 = fieldNorm(doc=2878)
        0.2 = coord(5/25)
    
  2. Ojala, M.: Who's hosting this search? (1995) 0.16
    0.15578246 = sum of:
      0.15578246 = product of:
        0.9736404 = sum of:
          0.08075729 = weight(abstract_txt:input in 2745) [ClassicSimilarity], result of:
            0.08075729 = score(doc=2745,freq=1.0), product of:
              0.10537185 = queryWeight, product of:
                1.0323747 = boost
                6.131223 = idf(docFreq=249, maxDocs=42306)
                0.016647158 = queryNorm
              0.7664029 = fieldWeight in 2745, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.131223 = idf(docFreq=249, maxDocs=42306)
                0.125 = fieldNorm(doc=2745)
          0.05135361 = weight(abstract_txt:search in 2745) [ClassicSimilarity], result of:
            0.05135361 = score(doc=2745,freq=1.0), product of:
              0.11238056 = queryWeight, product of:
                1.8466359 = boost
                3.6556938 = idf(docFreq=2971, maxDocs=42306)
                0.016647158 = queryNorm
              0.45696172 = fieldWeight in 2745, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6556938 = idf(docFreq=2971, maxDocs=42306)
                0.125 = fieldNorm(doc=2745)
          0.16819416 = weight(abstract_txt:boolean in 2745) [ClassicSimilarity], result of:
            0.16819416 = score(doc=2745,freq=1.0), product of:
              0.21651469 = queryWeight, product of:
                2.092829 = boost
                6.214605 = idf(docFreq=229, maxDocs=42306)
                0.016647158 = queryNorm
              0.7768256 = fieldWeight in 2745, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.214605 = idf(docFreq=229, maxDocs=42306)
                0.125 = fieldNorm(doc=2745)
          0.6733353 = weight(abstract_txt:proximity in 2745) [ClassicSimilarity], result of:
            0.6733353 = score(doc=2745,freq=1.0), product of:
              0.7408797 = queryWeight, product of:
                6.1211624 = boost
                7.2706575 = idf(docFreq=79, maxDocs=42306)
                0.016647158 = queryNorm
              0.9088322 = fieldWeight in 2745, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2706575 = idf(docFreq=79, maxDocs=42306)
                0.125 = fieldNorm(doc=2745)
        0.16 = coord(4/25)
    
  3. Milstead, J.L.: Specifications for thesaurus software (1991) 0.15
    0.15359333 = sum of:
      0.15359333 = product of:
        0.6399722 = sum of:
          0.046258856 = weight(abstract_txt:capabilities in 2291) [ClassicSimilarity], result of:
            0.046258856 = score(doc=2291,freq=1.0), product of:
              0.09942143 = queryWeight, product of:
                1.0028017 = boost
                5.9555907 = idf(docFreq=297, maxDocs=42306)
                0.016647158 = queryNorm
              0.46528053 = fieldWeight in 2291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9555907 = idf(docFreq=297, maxDocs=42306)
                0.078125 = fieldNorm(doc=2291)
          0.017478684 = weight(abstract_txt:systems in 2291) [ClassicSimilarity], result of:
            0.017478684 = score(doc=2291,freq=1.0), product of:
              0.06546853 = queryWeight, product of:
                1.1508172 = boost
                3.4173236 = idf(docFreq=3771, maxDocs=42306)
                0.016647158 = queryNorm
              0.2669784 = fieldWeight in 2291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4173236 = idf(docFreq=3771, maxDocs=42306)
                0.078125 = fieldNorm(doc=2291)
          0.01818277 = weight(abstract_txt:retrieval in 2291) [ClassicSimilarity], result of:
            0.01818277 = score(doc=2291,freq=1.0), product of:
              0.067215085 = queryWeight, product of:
                1.1660668 = boost
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.016647158 = queryNorm
              0.2705162 = fieldWeight in 2291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.078125 = fieldNorm(doc=2291)
          0.03209601 = weight(abstract_txt:search in 2291) [ClassicSimilarity], result of:
            0.03209601 = score(doc=2291,freq=1.0), product of:
              0.11238056 = queryWeight, product of:
                1.8466359 = boost
                3.6556938 = idf(docFreq=2971, maxDocs=42306)
                0.016647158 = queryNorm
              0.28560108 = fieldWeight in 2291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6556938 = idf(docFreq=2971, maxDocs=42306)
                0.078125 = fieldNorm(doc=2291)
          0.10512135 = weight(abstract_txt:boolean in 2291) [ClassicSimilarity], result of:
            0.10512135 = score(doc=2291,freq=1.0), product of:
              0.21651469 = queryWeight, product of:
                2.092829 = boost
                6.214605 = idf(docFreq=229, maxDocs=42306)
                0.016647158 = queryNorm
              0.485516 = fieldWeight in 2291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.214605 = idf(docFreq=229, maxDocs=42306)
                0.078125 = fieldNorm(doc=2291)
          0.42083457 = weight(abstract_txt:proximity in 2291) [ClassicSimilarity], result of:
            0.42083457 = score(doc=2291,freq=1.0), product of:
              0.7408797 = queryWeight, product of:
                6.1211624 = boost
                7.2706575 = idf(docFreq=79, maxDocs=42306)
                0.016647158 = queryNorm
              0.5680201 = fieldWeight in 2291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2706575 = idf(docFreq=79, maxDocs=42306)
                0.078125 = fieldNorm(doc=2291)
        0.24 = coord(6/25)
    
  4. Clarke, S.J.: Search engines for the World Wide Web : an evaluation of recent developments (2000) 0.14
    0.1403903 = sum of:
      0.1403903 = product of:
        0.87743944 = sum of:
          0.074014165 = weight(abstract_txt:capabilities in 108) [ClassicSimilarity], result of:
            0.074014165 = score(doc=108,freq=1.0), product of:
              0.09942143 = queryWeight, product of:
                1.0028017 = boost
                5.9555907 = idf(docFreq=297, maxDocs=42306)
                0.016647158 = queryNorm
              0.74444884 = fieldWeight in 108, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9555907 = idf(docFreq=297, maxDocs=42306)
                0.125 = fieldNorm(doc=108)
          0.041142907 = weight(abstract_txt:retrieval in 108) [ClassicSimilarity], result of:
            0.041142907 = score(doc=108,freq=2.0), product of:
              0.067215085 = queryWeight, product of:
                1.1660668 = boost
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.016647158 = queryNorm
              0.61210823 = fieldWeight in 108, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.125 = fieldNorm(doc=108)
          0.088947065 = weight(abstract_txt:search in 108) [ClassicSimilarity], result of:
            0.088947065 = score(doc=108,freq=3.0), product of:
              0.11238056 = queryWeight, product of:
                1.8466359 = boost
                3.6556938 = idf(docFreq=2971, maxDocs=42306)
                0.016647158 = queryNorm
              0.7914809 = fieldWeight in 108, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6556938 = idf(docFreq=2971, maxDocs=42306)
                0.125 = fieldNorm(doc=108)
          0.6733353 = weight(abstract_txt:proximity in 108) [ClassicSimilarity], result of:
            0.6733353 = score(doc=108,freq=1.0), product of:
              0.7408797 = queryWeight, product of:
                6.1211624 = boost
                7.2706575 = idf(docFreq=79, maxDocs=42306)
                0.016647158 = queryNorm
              0.9088322 = fieldWeight in 108, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2706575 = idf(docFreq=79, maxDocs=42306)
                0.125 = fieldNorm(doc=108)
        0.16 = coord(4/25)
    
  5. Loughran, H.: ¬A review of nearest neighbour information retrieval (1994) 0.13
    0.13348496 = sum of:
      0.13348496 = product of:
        0.47673202 = sum of:
          0.07339554 = weight(abstract_txt:output in 685) [ClassicSimilarity], result of:
            0.07339554 = score(doc=685,freq=1.0), product of:
              0.098866664 = queryWeight, product of:
                5.9389515 = idf(docFreq=302, maxDocs=42306)
                0.016647158 = queryNorm
              0.74236894 = fieldWeight in 685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9389515 = idf(docFreq=302, maxDocs=42306)
                0.125 = fieldNorm(doc=685)
          0.09164325 = weight(abstract_txt:ranked in 685) [ClassicSimilarity], result of:
            0.09164325 = score(doc=685,freq=1.0), product of:
              0.11464024 = queryWeight, product of:
                1.0768212 = boost
                6.395189 = idf(docFreq=191, maxDocs=42306)
                0.016647158 = queryNorm
              0.7993986 = fieldWeight in 685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.395189 = idf(docFreq=191, maxDocs=42306)
                0.125 = fieldNorm(doc=685)
          0.027965892 = weight(abstract_txt:systems in 685) [ClassicSimilarity], result of:
            0.027965892 = score(doc=685,freq=1.0), product of:
              0.06546853 = queryWeight, product of:
                1.1508172 = boost
                3.4173236 = idf(docFreq=3771, maxDocs=42306)
                0.016647158 = queryNorm
              0.42716545 = fieldWeight in 685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4173236 = idf(docFreq=3771, maxDocs=42306)
                0.125 = fieldNorm(doc=685)
          0.02909243 = weight(abstract_txt:retrieval in 685) [ClassicSimilarity], result of:
            0.02909243 = score(doc=685,freq=1.0), product of:
              0.067215085 = queryWeight, product of:
                1.1660668 = boost
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.016647158 = queryNorm
              0.4328259 = fieldWeight in 685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.125 = fieldNorm(doc=685)
          0.03508716 = weight(abstract_txt:some in 685) [ClassicSimilarity], result of:
            0.03508716 = score(doc=685,freq=1.0), product of:
              0.07615743 = queryWeight, product of:
                1.2412126 = boost
                3.6857507 = idf(docFreq=2883, maxDocs=42306)
                0.016647158 = queryNorm
              0.46071884 = fieldWeight in 685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6857507 = idf(docFreq=2883, maxDocs=42306)
                0.125 = fieldNorm(doc=685)
          0.05135361 = weight(abstract_txt:search in 685) [ClassicSimilarity], result of:
            0.05135361 = score(doc=685,freq=1.0), product of:
              0.11238056 = queryWeight, product of:
                1.8466359 = boost
                3.6556938 = idf(docFreq=2971, maxDocs=42306)
                0.016647158 = queryNorm
              0.45696172 = fieldWeight in 685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6556938 = idf(docFreq=2971, maxDocs=42306)
                0.125 = fieldNorm(doc=685)
          0.16819416 = weight(abstract_txt:boolean in 685) [ClassicSimilarity], result of:
            0.16819416 = score(doc=685,freq=1.0), product of:
              0.21651469 = queryWeight, product of:
                2.092829 = boost
                6.214605 = idf(docFreq=229, maxDocs=42306)
                0.016647158 = queryNorm
              0.7768256 = fieldWeight in 685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.214605 = idf(docFreq=229, maxDocs=42306)
                0.125 = fieldNorm(doc=685)
        0.28 = coord(7/25)