Document (#19317)

Author
Keen, E.M.
Hartley, R.J.
Title
Phrase processing in text retrieval
Source
Journal of document and text management. 2(1994) no.1, S.23-34
Year
1994
Abstract
After introducing types of records, queries and text processing options, the features needed in software for phrase processing are identified and different approaches in current text retrieval research in the Text Retrieval Conference (TREC) projects are enumerated. Then follow eight observations on issues in phrase searching relating both to practice and to research, giving the authors' selection of crucial and controversial issues, supported by 21 references
Theme
Retrievalstudien
Object
TREC

Similar documents (author)

  1. Keen, E.M.: ¬The Aberystwyth index languages tests (1973) 1.98
    1.9772978 = sum of:
      1.9772978 = product of:
        3.9545956 = sum of:
          3.9545956 = weight(author_txt:keen in 773) [ClassicSimilarity], result of:
            3.9545956 = score(doc=773,freq=1.0), product of:
              0.7352391 = queryWeight, product of:
                1.0415041 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.08203026 = queryNorm
              5.3786526 = fieldWeight in 773, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.625 = fieldNorm(doc=773)
        0.5 = coord(1/2)
    
  2. Keen, E.M.: Prospects for classification suggested by evaluation tests (1976) 1.98
    1.9772978 = sum of:
      1.9772978 = product of:
        3.9545956 = sum of:
          3.9545956 = weight(author_txt:keen in 1277) [ClassicSimilarity], result of:
            3.9545956 = score(doc=1277,freq=1.0), product of:
              0.7352391 = queryWeight, product of:
                1.0415041 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.08203026 = queryNorm
              5.3786526 = fieldWeight in 1277, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.625 = fieldNorm(doc=1277)
        0.5 = coord(1/2)
    
  3. Keen, E.M.: On the generation and searching of entries in printed subject indexes (1977) 1.98
    1.9772978 = sum of:
      1.9772978 = product of:
        3.9545956 = sum of:
          3.9545956 = weight(author_txt:keen in 2302) [ClassicSimilarity], result of:
            3.9545956 = score(doc=2302,freq=1.0), product of:
              0.7352391 = queryWeight, product of:
                1.0415041 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.08203026 = queryNorm
              5.3786526 = fieldWeight in 2302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.625 = fieldNorm(doc=2302)
        0.5 = coord(1/2)
    
  4. Keen, E.M.: Presenting results of experimental retrieval comparisons (1992) 1.98
    1.9772978 = sum of:
      1.9772978 = product of:
        3.9545956 = sum of:
          3.9545956 = weight(author_txt:keen in 3644) [ClassicSimilarity], result of:
            3.9545956 = score(doc=3644,freq=1.0), product of:
              0.7352391 = queryWeight, product of:
                1.0415041 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.08203026 = queryNorm
              5.3786526 = fieldWeight in 3644, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.625 = fieldNorm(doc=3644)
        0.5 = coord(1/2)
    
  5. Keen, E.M.: Some aspects of proximity searching in text retrieval systems (1992) 1.98
    1.9772978 = sum of:
      1.9772978 = product of:
        3.9545956 = sum of:
          3.9545956 = weight(author_txt:keen in 6190) [ClassicSimilarity], result of:
            3.9545956 = score(doc=6190,freq=1.0), product of:
              0.7352391 = queryWeight, product of:
                1.0415041 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.08203026 = queryNorm
              5.3786526 = fieldWeight in 6190, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.625 = fieldNorm(doc=6190)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Fagan, J.L.: ¬The effectiveness of a nonsyntactic approach to automatic phrase indexing for document retrieval (1989) 0.16
    0.16241549 = sum of:
      0.16241549 = product of:
        0.81207746 = sum of:
          0.03452532 = weight(abstract_txt:needed in 1845) [ClassicSimilarity], result of:
            0.03452532 = score(doc=1845,freq=1.0), product of:
              0.10490917 = queryWeight, product of:
                1.0311304 = boost
                5.2655563 = idf(docFreq=620, maxDocs=44218)
                0.019322155 = queryNorm
              0.32909727 = fieldWeight in 1845, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2655563 = idf(docFreq=620, maxDocs=44218)
                0.0625 = fieldNorm(doc=1845)
          0.01507133 = weight(abstract_txt:research in 1845) [ClassicSimilarity], result of:
            0.01507133 = score(doc=1845,freq=1.0), product of:
              0.076061696 = queryWeight, product of:
                1.2416663 = boost
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.019322155 = queryNorm
              0.19814612 = fieldWeight in 1845, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.0625 = fieldNorm(doc=1845)
          0.05157076 = weight(abstract_txt:retrieval in 1845) [ClassicSimilarity], result of:
            0.05157076 = score(doc=1845,freq=3.0), product of:
              0.13708523 = queryWeight, product of:
                2.0415633 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.019322155 = queryNorm
              0.37619486 = fieldWeight in 1845, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=1845)
          0.08846478 = weight(abstract_txt:text in 1845) [ClassicSimilarity], result of:
            0.08846478 = score(doc=1845,freq=2.0), product of:
              0.24750191 = queryWeight, product of:
                3.1675696 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.019322155 = queryNorm
              0.3574307 = fieldWeight in 1845, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=1845)
          0.6224453 = weight(abstract_txt:phrase in 1845) [ClassicSimilarity], result of:
            0.6224453 = score(doc=1845,freq=6.0), product of:
              0.5725047 = queryWeight, product of:
                4.1721225 = boost
                7.1017675 = idf(docFreq=98, maxDocs=44218)
                0.019322155 = queryNorm
              1.0872318 = fieldWeight in 1845, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.1017675 = idf(docFreq=98, maxDocs=44218)
                0.0625 = fieldNorm(doc=1845)
        0.2 = coord(5/25)
    
  2. Fidel, R.; Efthimiadis, E.N.: Terminological knowledge structure for intermediary expert systems (1995) 0.15
    0.14502001 = sum of:
      0.14502001 = product of:
        0.6042501 = sum of:
          0.04597881 = weight(abstract_txt:selection in 5695) [ClassicSimilarity], result of:
            0.04597881 = score(doc=5695,freq=1.0), product of:
              0.1094343 = queryWeight, product of:
                1.053134 = boost
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.019322155 = queryNorm
              0.42014992 = fieldWeight in 5695, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.078125 = fieldNorm(doc=5695)
          0.018839162 = weight(abstract_txt:research in 5695) [ClassicSimilarity], result of:
            0.018839162 = score(doc=5695,freq=1.0), product of:
              0.076061696 = queryWeight, product of:
                1.2416663 = boost
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.019322155 = queryNorm
              0.24768265 = fieldWeight in 5695, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.078125 = fieldNorm(doc=5695)
          0.03721799 = weight(abstract_txt:retrieval in 5695) [ClassicSimilarity], result of:
            0.03721799 = score(doc=5695,freq=1.0), product of:
              0.13708523 = queryWeight, product of:
                2.0415633 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.019322155 = queryNorm
              0.27149525 = fieldWeight in 5695, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=5695)
          0.10638129 = weight(abstract_txt:processing in 5695) [ClassicSimilarity], result of:
            0.10638129 = score(doc=5695,freq=1.0), product of:
              0.27609944 = queryWeight, product of:
                2.8973455 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.019322155 = queryNorm
              0.38530064 = fieldWeight in 5695, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.078125 = fieldNorm(doc=5695)
          0.078192554 = weight(abstract_txt:text in 5695) [ClassicSimilarity], result of:
            0.078192554 = score(doc=5695,freq=1.0), product of:
              0.24750191 = queryWeight, product of:
                3.1675696 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.019322155 = queryNorm
              0.3159271 = fieldWeight in 5695, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=5695)
          0.31764027 = weight(abstract_txt:phrase in 5695) [ClassicSimilarity], result of:
            0.31764027 = score(doc=5695,freq=1.0), product of:
              0.5725047 = queryWeight, product of:
                4.1721225 = boost
                7.1017675 = idf(docFreq=98, maxDocs=44218)
                0.019322155 = queryNorm
              0.5548256 = fieldWeight in 5695, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1017675 = idf(docFreq=98, maxDocs=44218)
                0.078125 = fieldNorm(doc=5695)
        0.24 = coord(6/25)
    
  3. TREC: experiment and evaluation in information retrieval (2005) 0.14
    0.14134191 = sum of:
      0.14134191 = product of:
        0.58892465 = sum of:
          0.040723644 = weight(abstract_txt:conference in 636) [ClassicSimilarity], result of:
            0.040723644 = score(doc=636,freq=1.0), product of:
              0.117116846 = queryWeight, product of:
                1.0894732 = boost
                5.563489 = idf(docFreq=460, maxDocs=44218)
                0.019322155 = queryNorm
              0.34771806 = fieldWeight in 636, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.563489 = idf(docFreq=460, maxDocs=44218)
                0.0625 = fieldNorm(doc=636)
          0.03370052 = weight(abstract_txt:research in 636) [ClassicSimilarity], result of:
            0.03370052 = score(doc=636,freq=5.0), product of:
              0.076061696 = queryWeight, product of:
                1.2416663 = boost
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.019322155 = queryNorm
              0.4430682 = fieldWeight in 636, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.0625 = fieldNorm(doc=636)
          0.20114587 = weight(abstract_txt:trec in 636) [ClassicSimilarity], result of:
            0.20114587 = score(doc=636,freq=8.0), product of:
              0.16983703 = queryWeight, product of:
                1.3119675 = boost
                6.699675 = idf(docFreq=147, maxDocs=44218)
                0.019322155 = queryNorm
              1.1843464 = fieldWeight in 636, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                6.699675 = idf(docFreq=147, maxDocs=44218)
                0.0625 = fieldNorm(doc=636)
          0.10314152 = weight(abstract_txt:retrieval in 636) [ClassicSimilarity], result of:
            0.10314152 = score(doc=636,freq=12.0), product of:
              0.13708523 = queryWeight, product of:
                2.0415633 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.019322155 = queryNorm
              0.7523897 = fieldWeight in 636, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=636)
          0.08510503 = weight(abstract_txt:processing in 636) [ClassicSimilarity], result of:
            0.08510503 = score(doc=636,freq=1.0), product of:
              0.27609944 = queryWeight, product of:
                2.8973455 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.019322155 = queryNorm
              0.3082405 = fieldWeight in 636, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.0625 = fieldNorm(doc=636)
          0.1251081 = weight(abstract_txt:text in 636) [ClassicSimilarity], result of:
            0.1251081 = score(doc=636,freq=4.0), product of:
              0.24750191 = queryWeight, product of:
                3.1675696 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.019322155 = queryNorm
              0.5054833 = fieldWeight in 636, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=636)
        0.24 = coord(6/25)
    
  4. Frohmann, B.: Rules of indexing : a critique of mentalism in information retrieval theory (1990) 0.14
    0.14052401 = sum of:
      0.14052401 = product of:
        0.70262 = sum of:
          0.022606995 = weight(abstract_txt:research in 3908) [ClassicSimilarity], result of:
            0.022606995 = score(doc=3908,freq=1.0), product of:
              0.076061696 = queryWeight, product of:
                1.2416663 = boost
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.019322155 = queryNorm
              0.2972192 = fieldWeight in 3908, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.09375 = fieldNorm(doc=3908)
          0.07735614 = weight(abstract_txt:retrieval in 3908) [ClassicSimilarity], result of:
            0.07735614 = score(doc=3908,freq=3.0), product of:
              0.13708523 = queryWeight, product of:
                2.0415633 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.019322155 = queryNorm
              0.5642923 = fieldWeight in 3908, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=3908)
          0.12765755 = weight(abstract_txt:processing in 3908) [ClassicSimilarity], result of:
            0.12765755 = score(doc=3908,freq=1.0), product of:
              0.27609944 = queryWeight, product of:
                2.8973455 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.019322155 = queryNorm
              0.46236074 = fieldWeight in 3908, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.09375 = fieldNorm(doc=3908)
          0.09383106 = weight(abstract_txt:text in 3908) [ClassicSimilarity], result of:
            0.09383106 = score(doc=3908,freq=1.0), product of:
              0.24750191 = queryWeight, product of:
                3.1675696 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.019322155 = queryNorm
              0.37911248 = fieldWeight in 3908, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.09375 = fieldNorm(doc=3908)
          0.3811683 = weight(abstract_txt:phrase in 3908) [ClassicSimilarity], result of:
            0.3811683 = score(doc=3908,freq=1.0), product of:
              0.5725047 = queryWeight, product of:
                4.1721225 = boost
                7.1017675 = idf(docFreq=98, maxDocs=44218)
                0.019322155 = queryNorm
              0.6657907 = fieldWeight in 3908, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1017675 = idf(docFreq=98, maxDocs=44218)
                0.09375 = fieldNorm(doc=3908)
        0.2 = coord(5/25)
    
  5. Harman, D.: ¬The Text REtrieval Conferences (TRECs) : providing a test-bed for information retrieval systems (1998) 0.14
    0.14044875 = sum of:
      0.14044875 = product of:
        0.5852031 = sum of:
          0.052611284 = weight(abstract_txt:projects in 1314) [ClassicSimilarity], result of:
            0.052611284 = score(doc=1314,freq=1.0), product of:
              0.10601811 = queryWeight, product of:
                1.0365659 = boost
                5.293313 = idf(docFreq=603, maxDocs=44218)
                0.019322155 = queryNorm
              0.4962481 = fieldWeight in 1314, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.293313 = idf(docFreq=603, maxDocs=44218)
                0.09375 = fieldNorm(doc=1314)
          0.10580313 = weight(abstract_txt:conference in 1314) [ClassicSimilarity], result of:
            0.10580313 = score(doc=1314,freq=3.0), product of:
              0.117116846 = queryWeight, product of:
                1.0894732 = boost
                5.563489 = idf(docFreq=460, maxDocs=44218)
                0.019322155 = queryNorm
              0.90339804 = fieldWeight in 1314, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.563489 = idf(docFreq=460, maxDocs=44218)
                0.09375 = fieldNorm(doc=1314)
          0.03197112 = weight(abstract_txt:research in 1314) [ClassicSimilarity], result of:
            0.03197112 = score(doc=1314,freq=2.0), product of:
              0.076061696 = queryWeight, product of:
                1.2416663 = boost
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.019322155 = queryNorm
              0.4203314 = fieldWeight in 1314, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.09375 = fieldNorm(doc=1314)
          0.18476427 = weight(abstract_txt:trec in 1314) [ClassicSimilarity], result of:
            0.18476427 = score(doc=1314,freq=3.0), product of:
              0.16983703 = queryWeight, product of:
                1.3119675 = boost
                6.699675 = idf(docFreq=147, maxDocs=44218)
                0.019322155 = queryNorm
              1.0878916 = fieldWeight in 1314, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.699675 = idf(docFreq=147, maxDocs=44218)
                0.09375 = fieldNorm(doc=1314)
          0.07735614 = weight(abstract_txt:retrieval in 1314) [ClassicSimilarity], result of:
            0.07735614 = score(doc=1314,freq=3.0), product of:
              0.13708523 = queryWeight, product of:
                2.0415633 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.019322155 = queryNorm
              0.5642923 = fieldWeight in 1314, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=1314)
          0.13269717 = weight(abstract_txt:text in 1314) [ClassicSimilarity], result of:
            0.13269717 = score(doc=1314,freq=2.0), product of:
              0.24750191 = queryWeight, product of:
                3.1675696 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.019322155 = queryNorm
              0.53614604 = fieldWeight in 1314, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.09375 = fieldNorm(doc=1314)
        0.24 = coord(6/25)