Document (#26427)

Author
Nie, J.-Y.
Title
Query expansion and query translation as logical inference
Source
Journal of the American Society for Information Science and technology. 54(2003) no.4, S.335-346
Year
2003
Abstract
A number of studies have examined the problems of query expansion in monolingual Information Retrieval (IR), and query translation for crosslanguage IR. However, no link has been made between them. This article first shows that query translation is a special case of query expansion. There is also another set of studies an inferential IR. Again, there is no relationship established with query translation or query expansion. The second claim of this article is that logical inference is a general form that covers query expansion and query translation. This analysis provides a unified view of different subareas of IR. We further develop the inferential IR approach in two particular contexts: using fuzzy logic and probability theory. The evaluation formulas obtained are shown to strongly correspond to those used in other IR models. This indicates that inference is indeed the core of advanced IR.
Footnote
Beitrag eines Themenheftes: Mathematical, logical, and formal methods in information retrieval
Theme
Retrievalalgorithmen
Semantisches Umfeld in Indexierung u. Retrieval
Multilinguale Probleme

Similar documents (content)

  1. He, D.; Wu, D.: Enhancing query translation with relevance feedback in translingual information retrieval : a study of the medication process (2011) 0.31
    0.31197897 = sum of:
      0.31197897 = product of:
        1.1142106 = sum of:
          0.06647342 = weight(abstract_txt:monolingual in 1245) [ClassicSimilarity], result of:
            0.06647342 = score(doc=1245,freq=1.0), product of:
              0.13312785 = queryWeight, product of:
                1.27641 = boost
                7.9891224 = idf(docFreq=38, maxDocs=42306)
                0.013055084 = queryNorm
              0.49932015 = fieldWeight in 1245, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9891224 = idf(docFreq=38, maxDocs=42306)
                0.0625 = fieldNorm(doc=1245)
          0.021101246 = weight(abstract_txt:studies in 1245) [ClassicSimilarity], result of:
            0.021101246 = score(doc=1245,freq=1.0), product of:
              0.07805229 = queryWeight, product of:
                1.3821766 = boost
                4.325561 = idf(docFreq=1520, maxDocs=42306)
                0.013055084 = queryNorm
              0.27034757 = fieldWeight in 1245, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.325561 = idf(docFreq=1520, maxDocs=42306)
                0.0625 = fieldNorm(doc=1245)
          0.014504666 = weight(abstract_txt:that in 1245) [ClassicSimilarity], result of:
            0.014504666 = score(doc=1245,freq=4.0), product of:
              0.04825127 = queryWeight, product of:
                1.5368805 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.013055084 = queryNorm
              0.30060694 = fieldWeight in 1245, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.0625 = fieldNorm(doc=1245)
          0.0077100215 = weight(abstract_txt:this in 1245) [ClassicSimilarity], result of:
            0.0077100215 = score(doc=1245,freq=1.0), product of:
              0.05026056 = queryWeight, product of:
                1.5685537 = boost
                2.4544165 = idf(docFreq=9879, maxDocs=42306)
                0.013055084 = queryNorm
              0.15340103 = fieldWeight in 1245, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4544165 = idf(docFreq=9879, maxDocs=42306)
                0.0625 = fieldNorm(doc=1245)
          0.25769615 = weight(abstract_txt:expansion in 1245) [ClassicSimilarity], result of:
            0.25769615 = score(doc=1245,freq=3.0), product of:
              0.38951585 = queryWeight, product of:
                4.882062 = boost
                6.1114206 = idf(docFreq=254, maxDocs=42306)
                0.013055084 = queryNorm
              0.6615807 = fieldWeight in 1245, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1114206 = idf(docFreq=254, maxDocs=42306)
                0.0625 = fieldNorm(doc=1245)
          0.4079527 = weight(abstract_txt:translation in 1245) [ClassicSimilarity], result of:
            0.4079527 = score(doc=1245,freq=7.0), product of:
              0.39890313 = queryWeight, product of:
                4.9405403 = boost
                6.184624 = idf(docFreq=236, maxDocs=42306)
                0.013055084 = queryNorm
              1.0226861 = fieldWeight in 1245, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.184624 = idf(docFreq=236, maxDocs=42306)
                0.0625 = fieldNorm(doc=1245)
          0.33877248 = weight(abstract_txt:query in 1245) [ClassicSimilarity], result of:
            0.33877248 = score(doc=1245,freq=6.0), product of:
              0.46743932 = queryWeight, product of:
                7.5634227 = boost
                4.733989 = idf(docFreq=1010, maxDocs=42306)
                0.013055084 = queryNorm
              0.7247411 = fieldWeight in 1245, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.733989 = idf(docFreq=1010, maxDocs=42306)
                0.0625 = fieldNorm(doc=1245)
        0.28 = coord(7/25)
    
  2. Levow, G.-A.; Oard, D.W.; Resnik, P.: Dictionary-based techniques for cross-language information retrieval (2005) 0.28
    0.28307495 = sum of:
      0.28307495 = product of:
        1.0109819 = sum of:
          0.047323503 = weight(abstract_txt:unified in 3026) [ClassicSimilarity], result of:
            0.047323503 = score(doc=3026,freq=1.0), product of:
              0.09147059 = queryWeight, product of:
                1.0580263 = boost
                6.6222463 = idf(docFreq=152, maxDocs=42306)
                0.013055084 = queryNorm
              0.517363 = fieldWeight in 3026, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6222463 = idf(docFreq=152, maxDocs=42306)
                0.078125 = fieldNorm(doc=3026)
          0.018676555 = weight(abstract_txt:article in 3026) [ClassicSimilarity], result of:
            0.018676555 = score(doc=3026,freq=1.0), product of:
              0.062006626 = queryWeight, product of:
                1.2319406 = boost
                3.855393 = idf(docFreq=2433, maxDocs=42306)
                0.013055084 = queryNorm
              0.30120257 = fieldWeight in 3026, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.855393 = idf(docFreq=2433, maxDocs=42306)
                0.078125 = fieldNorm(doc=3026)
          0.018130833 = weight(abstract_txt:that in 3026) [ClassicSimilarity], result of:
            0.018130833 = score(doc=3026,freq=4.0), product of:
              0.04825127 = queryWeight, product of:
                1.5368805 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.013055084 = queryNorm
              0.37575868 = fieldWeight in 3026, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.078125 = fieldNorm(doc=3026)
          0.009637527 = weight(abstract_txt:this in 3026) [ClassicSimilarity], result of:
            0.009637527 = score(doc=3026,freq=1.0), product of:
              0.05026056 = queryWeight, product of:
                1.5685537 = boost
                2.4544165 = idf(docFreq=9879, maxDocs=42306)
                0.013055084 = queryNorm
              0.19175129 = fieldWeight in 3026, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4544165 = idf(docFreq=9879, maxDocs=42306)
                0.078125 = fieldNorm(doc=3026)
          0.18597618 = weight(abstract_txt:expansion in 3026) [ClassicSimilarity], result of:
            0.18597618 = score(doc=3026,freq=1.0), product of:
              0.38951585 = queryWeight, product of:
                4.882062 = boost
                6.1114206 = idf(docFreq=254, maxDocs=42306)
                0.013055084 = queryNorm
              0.47745472 = fieldWeight in 3026, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1114206 = idf(docFreq=254, maxDocs=42306)
                0.078125 = fieldNorm(doc=3026)
          0.38547906 = weight(abstract_txt:translation in 3026) [ClassicSimilarity], result of:
            0.38547906 = score(doc=3026,freq=4.0), product of:
              0.39890313 = queryWeight, product of:
                4.9405403 = boost
                6.184624 = idf(docFreq=236, maxDocs=42306)
                0.013055084 = queryNorm
              0.9663475 = fieldWeight in 3026, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.184624 = idf(docFreq=236, maxDocs=42306)
                0.078125 = fieldNorm(doc=3026)
          0.34575823 = weight(abstract_txt:query in 3026) [ClassicSimilarity], result of:
            0.34575823 = score(doc=3026,freq=4.0), product of:
              0.46743932 = queryWeight, product of:
                7.5634227 = boost
                4.733989 = idf(docFreq=1010, maxDocs=42306)
                0.013055084 = queryNorm
              0.7396858 = fieldWeight in 3026, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.733989 = idf(docFreq=1010, maxDocs=42306)
                0.078125 = fieldNorm(doc=3026)
        0.28 = coord(7/25)
    
  3. Efthimiadis, E.N.: Interactive query expansion : a user-based evaluation in a relevance feedback environment (2000) 0.24
    0.24422796 = sum of:
      0.24422796 = product of:
        1.0176165 = sum of:
          0.013073589 = weight(abstract_txt:article in 702) [ClassicSimilarity], result of:
            0.013073589 = score(doc=702,freq=1.0), product of:
              0.062006626 = queryWeight, product of:
                1.2319406 = boost
                3.855393 = idf(docFreq=2433, maxDocs=42306)
                0.013055084 = queryNorm
              0.2108418 = fieldWeight in 702, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.855393 = idf(docFreq=2433, maxDocs=42306)
                0.0546875 = fieldNorm(doc=702)
          0.018463591 = weight(abstract_txt:studies in 702) [ClassicSimilarity], result of:
            0.018463591 = score(doc=702,freq=1.0), product of:
              0.07805229 = queryWeight, product of:
                1.3821766 = boost
                4.325561 = idf(docFreq=1520, maxDocs=42306)
                0.013055084 = queryNorm
              0.23655412 = fieldWeight in 702, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.325561 = idf(docFreq=1520, maxDocs=42306)
                0.0546875 = fieldNorm(doc=702)
          0.010991233 = weight(abstract_txt:that in 702) [ClassicSimilarity], result of:
            0.010991233 = score(doc=702,freq=3.0), product of:
              0.04825127 = queryWeight, product of:
                1.5368805 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.013055084 = queryNorm
              0.22779158 = fieldWeight in 702, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.0546875 = fieldNorm(doc=702)
          0.006746269 = weight(abstract_txt:this in 702) [ClassicSimilarity], result of:
            0.006746269 = score(doc=702,freq=1.0), product of:
              0.05026056 = queryWeight, product of:
                1.5685537 = boost
                2.4544165 = idf(docFreq=9879, maxDocs=42306)
                0.013055084 = queryNorm
              0.1342259 = fieldWeight in 702, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4544165 = idf(docFreq=9879, maxDocs=42306)
                0.0546875 = fieldNorm(doc=702)
          0.46938267 = weight(abstract_txt:expansion in 702) [ClassicSimilarity], result of:
            0.46938267 = score(doc=702,freq=13.0), product of:
              0.38951585 = queryWeight, product of:
                4.882062 = boost
                6.1114206 = idf(docFreq=254, maxDocs=42306)
                0.013055084 = queryNorm
              1.2050413 = fieldWeight in 702, product of:
                3.6055512 = tf(freq=13.0), with freq of:
                  13.0 = termFreq=13.0
                6.1114206 = idf(docFreq=254, maxDocs=42306)
                0.0546875 = fieldNorm(doc=702)
          0.49895915 = weight(abstract_txt:query in 702) [ClassicSimilarity], result of:
            0.49895915 = score(doc=702,freq=17.0), product of:
              0.46743932 = queryWeight, product of:
                7.5634227 = boost
                4.733989 = idf(docFreq=1010, maxDocs=42306)
                0.013055084 = queryNorm
              1.0674309 = fieldWeight in 702, product of:
                4.1231055 = tf(freq=17.0), with freq of:
                  17.0 = termFreq=17.0
                4.733989 = idf(docFreq=1010, maxDocs=42306)
                0.0546875 = fieldNorm(doc=702)
        0.24 = coord(6/25)
    
  4. Nie, J.-Y.; Brisebois, M.: ¬An inferential approach to information retrieval and its implementation using a manual thesaurus (1996) 0.22
    0.21844135 = sum of:
      0.21844135 = product of:
        0.91017234 = sum of:
          0.055939104 = weight(abstract_txt:logic in 776) [ClassicSimilarity], result of:
            0.055939104 = score(doc=776,freq=1.0), product of:
              0.081712514 = queryWeight, product of:
                6.2590566 = idf(docFreq=219, maxDocs=42306)
                0.013055084 = queryNorm
              0.6845843 = fieldWeight in 776, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2590566 = idf(docFreq=219, maxDocs=42306)
                0.109375 = fieldNorm(doc=776)
          0.07200167 = weight(abstract_txt:fuzzy in 776) [ClassicSimilarity], result of:
            0.07200167 = score(doc=776,freq=1.0), product of:
              0.09668816 = queryWeight, product of:
                1.0877832 = boost
                6.808497 = idf(docFreq=126, maxDocs=42306)
                0.013055084 = queryNorm
              0.74467933 = fieldWeight in 776, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.808497 = idf(docFreq=126, maxDocs=42306)
                0.109375 = fieldNorm(doc=776)
          0.013492538 = weight(abstract_txt:this in 776) [ClassicSimilarity], result of:
            0.013492538 = score(doc=776,freq=1.0), product of:
              0.05026056 = queryWeight, product of:
                1.5685537 = boost
                2.4544165 = idf(docFreq=9879, maxDocs=42306)
                0.013055084 = queryNorm
              0.2684518 = fieldWeight in 776, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4544165 = idf(docFreq=9879, maxDocs=42306)
                0.109375 = fieldNorm(doc=776)
          0.10905436 = weight(abstract_txt:logical in 776) [ClassicSimilarity], result of:
            0.10905436 = score(doc=776,freq=1.0), product of:
              0.16066338 = queryWeight, product of:
                1.9830295 = boost
                6.205947 = idf(docFreq=231, maxDocs=42306)
                0.013055084 = queryNorm
              0.6787754 = fieldWeight in 776, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.205947 = idf(docFreq=231, maxDocs=42306)
                0.109375 = fieldNorm(doc=776)
          0.32651767 = weight(abstract_txt:inferential in 776) [ClassicSimilarity], result of:
            0.32651767 = score(doc=776,freq=1.0), product of:
              0.33375365 = queryWeight, product of:
                2.8581414 = boost
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.013055084 = queryNorm
              0.9783194 = fieldWeight in 776, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.109375 = fieldNorm(doc=776)
          0.33316696 = weight(abstract_txt:inference in 776) [ClassicSimilarity], result of:
            0.33316696 = score(doc=776,freq=2.0), product of:
              0.30733824 = queryWeight, product of:
                3.3591132 = boost
                7.008293 = idf(docFreq=103, maxDocs=42306)
                0.013055084 = queryNorm
              1.08404 = fieldWeight in 776, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.008293 = idf(docFreq=103, maxDocs=42306)
                0.109375 = fieldNorm(doc=776)
        0.24 = coord(6/25)
    
  5. Kim, S.; Ko, Y.; Oard, D.W.: Combining lexical and statistical translation evidence for cross-language information retrieval (2015) 0.20
    0.20324609 = sum of:
      0.20324609 = product of:
        0.84685874 = sum of:
          0.014941244 = weight(abstract_txt:article in 3607) [ClassicSimilarity], result of:
            0.014941244 = score(doc=3607,freq=1.0), product of:
              0.062006626 = queryWeight, product of:
                1.2319406 = boost
                3.855393 = idf(docFreq=2433, maxDocs=42306)
                0.013055084 = queryNorm
              0.24096206 = fieldWeight in 3607, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.855393 = idf(docFreq=2433, maxDocs=42306)
                0.0625 = fieldNorm(doc=3607)
          0.010256348 = weight(abstract_txt:that in 3607) [ClassicSimilarity], result of:
            0.010256348 = score(doc=3607,freq=2.0), product of:
              0.04825127 = queryWeight, product of:
                1.5368805 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.013055084 = queryNorm
              0.2125612 = fieldWeight in 3607, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.0625 = fieldNorm(doc=3607)
          0.0077100215 = weight(abstract_txt:this in 3607) [ClassicSimilarity], result of:
            0.0077100215 = score(doc=3607,freq=1.0), product of:
              0.05026056 = queryWeight, product of:
                1.5685537 = boost
                2.4544165 = idf(docFreq=9879, maxDocs=42306)
                0.013055084 = queryNorm
              0.15340103 = fieldWeight in 3607, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4544165 = idf(docFreq=9879, maxDocs=42306)
                0.0625 = fieldNorm(doc=3607)
          0.21040803 = weight(abstract_txt:expansion in 3607) [ClassicSimilarity], result of:
            0.21040803 = score(doc=3607,freq=2.0), product of:
              0.38951585 = queryWeight, product of:
                4.882062 = boost
                6.1114206 = idf(docFreq=254, maxDocs=42306)
                0.013055084 = queryNorm
              0.54017836 = fieldWeight in 3607, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1114206 = idf(docFreq=254, maxDocs=42306)
                0.0625 = fieldNorm(doc=3607)
          0.4079527 = weight(abstract_txt:translation in 3607) [ClassicSimilarity], result of:
            0.4079527 = score(doc=3607,freq=7.0), product of:
              0.39890313 = queryWeight, product of:
                4.9405403 = boost
                6.184624 = idf(docFreq=236, maxDocs=42306)
                0.013055084 = queryNorm
              1.0226861 = fieldWeight in 3607, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.184624 = idf(docFreq=236, maxDocs=42306)
                0.0625 = fieldNorm(doc=3607)
          0.19559038 = weight(abstract_txt:query in 3607) [ClassicSimilarity], result of:
            0.19559038 = score(doc=3607,freq=2.0), product of:
              0.46743932 = queryWeight, product of:
                7.5634227 = boost
                4.733989 = idf(docFreq=1010, maxDocs=42306)
                0.013055084 = queryNorm
              0.41842943 = fieldWeight in 3607, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.733989 = idf(docFreq=1010, maxDocs=42306)
                0.0625 = fieldNorm(doc=3607)
        0.24 = coord(6/25)