Document (#19791)

Author
Oh, S.G.
Title
Document representation and retrieval using empirical facts : evaluation of a pilot system
Source
Journal of the American Society for Information Science. 49(1998) no.10, pp.920-931
Year
1998
Abstract
This article investigates the potentialities of using empirical variables and their associated statistical relationships in document representation and retrieval. To this end, a newly devised empirical fact retrieval system (EFRS) was evaluated in comparison to a simulated traditional retrieval system (TRS) involving a set of predetermined empirical queries. Results indicate that the EFRS generally outperformed the TRS in terms of precision, search effort, and measures of user satisfaction. Possible advantages of the EFRS, as well as the necessity of establishing an efficient method for extracting empirical facts, are discussed.

Similar documents (content)

  1. Frisch, A.M.; Allen, J.F.: Knowledge retrieval as limited inference (1982) 0.11
    0.11085966 = sum of:
      0.11085966 = product of:
        0.46191525 = sum of:
          0.041568544 = weight(abstract_txt:efficient in 2090) [ClassicSimilarity], result of:
            0.041568544 = score(doc=2090,freq=1.0), product of:
              0.11486675 = queryWeight, product of:
                1.0098349 = boost
                5.7901587 = idf(docFreq=361, maxDocs=43556)
                0.019645067 = queryNorm
              0.36188492 = fieldWeight in 2090, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7901587 = idf(docFreq=361, maxDocs=43556)
                0.0625 = fieldNorm(doc=2090)
          0.017911742 = weight(abstract_txt:using in 2090) [ClassicSimilarity], result of:
            0.017911742 = score(doc=2090,freq=1.0), product of:
              0.08256317 = queryWeight, product of:
                1.2107692 = boost
                3.4711347 = idf(docFreq=3679, maxDocs=43556)
                0.019645067 = queryNorm
              0.21694592 = fieldWeight in 2090, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4711347 = idf(docFreq=3679, maxDocs=43556)
                0.0625 = fieldNorm(doc=2090)
          0.05181932 = weight(abstract_txt:representation in 2090) [ClassicSimilarity], result of:
            0.05181932 = score(doc=2090,freq=1.0), product of:
              0.16763149 = queryWeight, product of:
                1.7252259 = boost
                4.9460225 = idf(docFreq=841, maxDocs=43556)
                0.019645067 = queryNorm
              0.3091264 = fieldWeight in 2090, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9460225 = idf(docFreq=841, maxDocs=43556)
                0.0625 = fieldNorm(doc=2090)
          0.034654804 = weight(abstract_txt:system in 2090) [ClassicSimilarity], result of:
            0.034654804 = score(doc=2090,freq=2.0), product of:
              0.116472624 = queryWeight, product of:
                1.7612692 = boost
                3.3662362 = idf(docFreq=4086, maxDocs=43556)
                0.019645067 = queryNorm
              0.29753605 = fieldWeight in 2090, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3662362 = idf(docFreq=4086, maxDocs=43556)
                0.0625 = fieldNorm(doc=2090)
          0.0359078 = weight(abstract_txt:retrieval in 2090) [ClassicSimilarity], result of:
            0.0359078 = score(doc=2090,freq=1.0), product of:
              0.16538534 = queryWeight, product of:
                2.4234366 = boost
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.019645067 = queryNorm
              0.21711598 = fieldWeight in 2090, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.0625 = fieldNorm(doc=2090)
          0.28005305 = weight(abstract_txt:facts in 2090) [ClassicSimilarity], result of:
            0.28005305 = score(doc=2090,freq=3.0), product of:
              0.3579433 = queryWeight, product of:
                2.5210142 = boost
                7.2274556 = idf(docFreq=85, maxDocs=43556)
                0.019645067 = queryNorm
              0.782395 = fieldWeight in 2090, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.2274556 = idf(docFreq=85, maxDocs=43556)
                0.0625 = fieldNorm(doc=2090)
        0.24 = coord(6/25)
    
  2. Kostoff, R.N.; Eberhart, H.J.; Toothman, D.R.: Database tomography for information retrieval (1997) 0.10
    0.104886115 = sum of:
      0.104886115 = product of:
        0.5244306 = sum of:
          0.076136276 = weight(abstract_txt:involving in 999) [ClassicSimilarity], result of:
            0.076136276 = score(doc=999,freq=1.0), product of:
              0.13122544 = queryWeight, product of:
                1.0793498 = boost
                6.188741 = idf(docFreq=242, maxDocs=43556)
                0.019645067 = queryNorm
              0.5801945 = fieldWeight in 999, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.188741 = idf(docFreq=242, maxDocs=43556)
                0.09375 = fieldNorm(doc=999)
          0.10762541 = weight(abstract_txt:newly in 999) [ClassicSimilarity], result of:
            0.10762541 = score(doc=999,freq=1.0), product of:
              0.16528502 = queryWeight, product of:
                1.2113508 = boost
                6.9456043 = idf(docFreq=113, maxDocs=43556)
                0.019645067 = queryNorm
              0.6511504 = fieldWeight in 999, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9456043 = idf(docFreq=113, maxDocs=43556)
                0.09375 = fieldNorm(doc=999)
          0.22773995 = weight(abstract_txt:simulated in 999) [ClassicSimilarity], result of:
            0.22773995 = score(doc=999,freq=2.0), product of:
              0.21622527 = queryWeight, product of:
                1.3854996 = boost
                7.9441333 = idf(docFreq=41, maxDocs=43556)
                0.019645067 = queryNorm
              1.0532532 = fieldWeight in 999, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.9441333 = idf(docFreq=41, maxDocs=43556)
                0.09375 = fieldNorm(doc=999)
          0.036756974 = weight(abstract_txt:system in 999) [ClassicSimilarity], result of:
            0.036756974 = score(doc=999,freq=1.0), product of:
              0.116472624 = queryWeight, product of:
                1.7612692 = boost
                3.3662362 = idf(docFreq=4086, maxDocs=43556)
                0.019645067 = queryNorm
              0.31558466 = fieldWeight in 999, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3662362 = idf(docFreq=4086, maxDocs=43556)
                0.09375 = fieldNorm(doc=999)
          0.07617194 = weight(abstract_txt:retrieval in 999) [ClassicSimilarity], result of:
            0.07617194 = score(doc=999,freq=2.0), product of:
              0.16538534 = queryWeight, product of:
                2.4234366 = boost
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.019645067 = queryNorm
              0.46057254 = fieldWeight in 999, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.09375 = fieldNorm(doc=999)
        0.2 = coord(5/25)
    
  3. Suchanek, F.M.; Kasneci, G.; Weikum, G.: YAGO: a core of semantic knowledge unifying WordNet and Wikipedia (2007) 0.10
    0.09865924 = sum of:
      0.09865924 = product of:
        0.61662024 = sum of:
          0.052035194 = weight(abstract_txt:fact in 401) [ClassicSimilarity], result of:
            0.052035194 = score(doc=401,freq=1.0), product of:
              0.11497654 = queryWeight, product of:
                1.0103173 = boost
                5.792925 = idf(docFreq=360, maxDocs=43556)
                0.019645067 = queryNorm
              0.45257226 = fieldWeight in 401, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.792925 = idf(docFreq=360, maxDocs=43556)
                0.078125 = fieldNorm(doc=401)
          0.022389678 = weight(abstract_txt:using in 401) [ClassicSimilarity], result of:
            0.022389678 = score(doc=401,freq=1.0), product of:
              0.08256317 = queryWeight, product of:
                1.2107692 = boost
                3.4711347 = idf(docFreq=3679, maxDocs=43556)
                0.019645067 = queryNorm
              0.2711824 = fieldWeight in 401, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4711347 = idf(docFreq=3679, maxDocs=43556)
                0.078125 = fieldNorm(doc=401)
          0.3500663 = weight(abstract_txt:facts in 401) [ClassicSimilarity], result of:
            0.3500663 = score(doc=401,freq=3.0), product of:
              0.3579433 = queryWeight, product of:
                2.5210142 = boost
                7.2274556 = idf(docFreq=85, maxDocs=43556)
                0.019645067 = queryNorm
              0.9779937 = fieldWeight in 401, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.2274556 = idf(docFreq=85, maxDocs=43556)
                0.078125 = fieldNorm(doc=401)
          0.19212908 = weight(abstract_txt:empirical in 401) [ClassicSimilarity], result of:
            0.19212908 = score(doc=401,freq=1.0), product of:
              0.469674 = queryWeight, product of:
                4.5660057 = boost
                5.236083 = idf(docFreq=629, maxDocs=43556)
                0.019645067 = queryNorm
              0.409069 = fieldWeight in 401, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.236083 = idf(docFreq=629, maxDocs=43556)
                0.078125 = fieldNorm(doc=401)
        0.16 = coord(4/25)
    
  4. White, K.J.; Sutcliffe, R.F.E.: Applying incremental tree induction to retrieval : from manuals and medical texts (2006) 0.09
    0.09198124 = sum of:
      0.09198124 = product of:
        0.38325518 = sum of:
          0.050457273 = weight(abstract_txt:evaluated in 42) [ClassicSimilarity], result of:
            0.050457273 = score(doc=42,freq=1.0), product of:
              0.112640254 = queryWeight, product of:
                5.733768 = idf(docFreq=382, maxDocs=43556)
                0.019645067 = queryNorm
              0.44795063 = fieldWeight in 42, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.733768 = idf(docFreq=382, maxDocs=43556)
                0.078125 = fieldNorm(doc=42)
          0.044779357 = weight(abstract_txt:using in 42) [ClassicSimilarity], result of:
            0.044779357 = score(doc=42,freq=4.0), product of:
              0.08256317 = queryWeight, product of:
                1.2107692 = boost
                3.4711347 = idf(docFreq=3679, maxDocs=43556)
                0.019645067 = queryNorm
              0.5423648 = fieldWeight in 42, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4711347 = idf(docFreq=3679, maxDocs=43556)
                0.078125 = fieldNorm(doc=42)
          0.13419706 = weight(abstract_txt:simulated in 42) [ClassicSimilarity], result of:
            0.13419706 = score(doc=42,freq=1.0), product of:
              0.21622527 = queryWeight, product of:
                1.3854996 = boost
                7.9441333 = idf(docFreq=41, maxDocs=43556)
                0.019645067 = queryNorm
              0.6206354 = fieldWeight in 42, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9441333 = idf(docFreq=41, maxDocs=43556)
                0.078125 = fieldNorm(doc=42)
          0.059714083 = weight(abstract_txt:document in 42) [ClassicSimilarity], result of:
            0.059714083 = score(doc=42,freq=2.0), product of:
              0.12602662 = queryWeight, product of:
                1.4958888 = boost
                4.28854 = idf(docFreq=1624, maxDocs=43556)
                0.019645067 = queryNorm
              0.4738212 = fieldWeight in 42, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.28854 = idf(docFreq=1624, maxDocs=43556)
                0.078125 = fieldNorm(doc=42)
          0.030630808 = weight(abstract_txt:system in 42) [ClassicSimilarity], result of:
            0.030630808 = score(doc=42,freq=1.0), product of:
              0.116472624 = queryWeight, product of:
                1.7612692 = boost
                3.3662362 = idf(docFreq=4086, maxDocs=43556)
                0.019645067 = queryNorm
              0.2629872 = fieldWeight in 42, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3662362 = idf(docFreq=4086, maxDocs=43556)
                0.078125 = fieldNorm(doc=42)
          0.063476615 = weight(abstract_txt:retrieval in 42) [ClassicSimilarity], result of:
            0.063476615 = score(doc=42,freq=2.0), product of:
              0.16538534 = queryWeight, product of:
                2.4234366 = boost
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.019645067 = queryNorm
              0.38381043 = fieldWeight in 42, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.078125 = fieldNorm(doc=42)
        0.24 = coord(6/25)
    
  5. Spink, A.; Saracevic, T.: Human-computer interaction in information retrieval : nature and manifestations of feedback (1998) 0.09
    0.08950169 = sum of:
      0.08950169 = product of:
        0.55938554 = sum of:
          0.076136276 = weight(abstract_txt:involving in 4761) [ClassicSimilarity], result of:
            0.076136276 = score(doc=4761,freq=1.0), product of:
              0.13122544 = queryWeight, product of:
                1.0793498 = boost
                6.188741 = idf(docFreq=242, maxDocs=43556)
                0.019645067 = queryNorm
              0.5801945 = fieldWeight in 4761, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.188741 = idf(docFreq=242, maxDocs=43556)
                0.09375 = fieldNorm(doc=4761)
          0.036756974 = weight(abstract_txt:system in 4761) [ClassicSimilarity], result of:
            0.036756974 = score(doc=4761,freq=1.0), product of:
              0.116472624 = queryWeight, product of:
                1.7612692 = boost
                3.3662362 = idf(docFreq=4086, maxDocs=43556)
                0.019645067 = queryNorm
              0.31558466 = fieldWeight in 4761, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3662362 = idf(docFreq=4086, maxDocs=43556)
                0.09375 = fieldNorm(doc=4761)
          0.12043843 = weight(abstract_txt:retrieval in 4761) [ClassicSimilarity], result of:
            0.12043843 = score(doc=4761,freq=5.0), product of:
              0.16538534 = queryWeight, product of:
                2.4234366 = boost
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.019645067 = queryNorm
              0.72822917 = fieldWeight in 4761, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.09375 = fieldNorm(doc=4761)
          0.32605383 = weight(abstract_txt:empirical in 4761) [ClassicSimilarity], result of:
            0.32605383 = score(doc=4761,freq=2.0), product of:
              0.469674 = queryWeight, product of:
                4.5660057 = boost
                5.236083 = idf(docFreq=629, maxDocs=43556)
                0.019645067 = queryNorm
              0.6942131 = fieldWeight in 4761, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.236083 = idf(docFreq=629, maxDocs=43556)
                0.09375 = fieldNorm(doc=4761)
        0.16 = coord(4/25)
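
The score explanations above are labelled [ClassicSimilarity], which is Lucene's TF-IDF scoring. As a rough sketch of how the numbers in each tree appear to fit together, the Python below recomputes the total for entry 5 (doc 4761) from the raw inputs shown in the listing: tf is taken as the square root of termFreq, idf as 1 + ln(maxDocs / (docFreq + 1)), and the final score as coord times the sum of queryWeight * fieldWeight over the matching terms. The function and variable names are illustrative assumptions, not part of Lucene's API, and small differences in the last digits are expected because Lucene works in 32-bit floats.

```python
import math

# Values copied from the explanation trees above.
MAX_DOCS = 43556
QUERY_NORM = 0.019645067


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, boost, field_norm, query_norm=QUERY_NORM):
    """Score of one matching term: queryWeight * fieldWeight."""
    tf = math.sqrt(freq)                       # tf(freq) line in the tree
    term_idf = idf(doc_freq)                   # idf(docFreq, maxDocs) line
    query_weight = boost * term_idf * query_norm
    field_weight = tf * term_idf * field_norm
    return query_weight * field_weight


def document_score(terms, field_norm, matched, total_query_terms):
    """Sum the term scores, then scale by coord = matched / total."""
    coord = matched / total_query_terms
    return coord * sum(
        term_score(freq, doc_freq, boost, field_norm)
        for freq, doc_freq, boost in terms
    )


# Entry 5 (doc 4761): (termFreq, docFreq, boost) for each matching term.
doc_4761_terms = [
    (1.0, 242, 1.0793498),   # involving
    (1.0, 4086, 1.7612692),  # system
    (5.0, 3669, 2.4234366),  # retrieval
    (2.0, 629, 4.5660057),   # empirical
]

score_4761 = document_score(
    doc_4761_terms, field_norm=0.09375, matched=4, total_query_terms=25
)
print(f"doc 4761 score: {score_4761:.6f}")
```

The coord factor explains why a document with a large raw sum can still rank low: entry 5 matches only 4 of the 25 query terms, so its sum is scaled by 0.16, whereas entries 1 and 4 match 6 terms and are scaled by 0.24.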