Document (#26426)

Author
Nie, J.-Y.
Title
Query expansion and query translation as logical inference
Source
Journal of the American Society for Information Science and technology. 54(2003) no.4, S.335-346
Year
2003
Abstract
A number of studies have examined the problems of query expansion in monolingual Information Retrieval (IR), and query translation for crosslanguage IR. However, no link has been made between them. This article first shows that query translation is a special case of query expansion. There is also another set of studies an inferential IR. Again, there is no relationship established with query translation or query expansion. The second claim of this article is that logical inference is a general form that covers query expansion and query translation. This analysis provides a unified view of different subareas of IR. We further develop the inferential IR approach in two particular contexts: using fuzzy logic and probability theory. The evaluation formulas obtained are shown to strongly correspond to those used in other IR models. This indicates that inference is indeed the core of advanced IR.
Footnote
Beitrag eines Themenheftes: Mathematical, logical, and formal methods in information retrieval
Theme
Retrievalalgorithmen
Semantisches Umfeld in Indexierung u. Retrieval
Multilinguale Probleme

Similar documents (content)

  1. He, D.; Wu, D.: Enhancing query translation with relevance feedback in translingual information retrieval : a study of the medication process (2011) 0.31
    0.31326684 = sum of:
      0.31326684 = product of:
        1.1188102 = sum of:
          0.067800336 = weight(abstract_txt:monolingual in 4244) [ClassicSimilarity], result of:
            0.067800336 = score(doc=4244,freq=1.0), product of:
              0.13503815 = queryWeight, product of:
                1.2879487 = boost
                8.033325 = idf(docFreq=38, maxDocs=44218)
                0.013051565 = queryNorm
              0.5020828 = fieldWeight in 4244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.033325 = idf(docFreq=38, maxDocs=44218)
                0.0625 = fieldNorm(doc=4244)
          0.02033466 = weight(abstract_txt:studies in 4244) [ClassicSimilarity], result of:
            0.02033466 = score(doc=4244,freq=1.0), product of:
              0.07623186 = queryWeight, product of:
                1.3685277 = boost
                4.26796 = idf(docFreq=1683, maxDocs=44218)
                0.013051565 = queryNorm
              0.2667475 = fieldWeight in 4244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.26796 = idf(docFreq=1683, maxDocs=44218)
                0.0625 = fieldNorm(doc=4244)
          0.013918496 = weight(abstract_txt:that in 4244) [ClassicSimilarity], result of:
            0.013918496 = score(doc=4244,freq=4.0), product of:
              0.04699267 = queryWeight, product of:
                1.5195514 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.013051565 = queryNorm
              0.2961844 = fieldWeight in 4244, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=4244)
          0.0073499987 = weight(abstract_txt:this in 4244) [ClassicSimilarity], result of:
            0.0073499987 = score(doc=4244,freq=1.0), product of:
              0.048735652 = queryWeight, product of:
                1.5474752 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.013051565 = queryNorm
              0.1508136 = fieldWeight in 4244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0625 = fieldNorm(doc=4244)
          0.25782534 = weight(abstract_txt:expansion in 4244) [ClassicSimilarity], result of:
            0.25782534 = score(doc=4244,freq=3.0), product of:
              0.39006343 = queryWeight, product of:
                4.894665 = boost
                6.1059003 = idf(docFreq=267, maxDocs=44218)
                0.013051565 = queryNorm
              0.6609831 = fieldWeight in 4244, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1059003 = idf(docFreq=267, maxDocs=44218)
                0.0625 = fieldNorm(doc=4244)
          0.4074419 = weight(abstract_txt:translation in 4244) [ClassicSimilarity], result of:
            0.4074419 = score(doc=4244,freq=7.0), product of:
              0.3989971 = queryWeight, product of:
                4.950399 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.013051565 = queryNorm
              1.0211651 = fieldWeight in 4244, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.0625 = fieldNorm(doc=4244)
          0.3441395 = weight(abstract_txt:query in 4244) [ClassicSimilarity], result of:
            0.3441395 = score(doc=4244,freq=6.0), product of:
              0.47286934 = queryWeight, product of:
                7.621508 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.013051565 = queryNorm
              0.72776866 = fieldWeight in 4244, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.0625 = fieldNorm(doc=4244)
        0.28 = coord(7/25)
    
  2. Levow, G.-A.; Oard, D.W.; Resnik, P.: Dictionary-based techniques for cross-language information retrieval (2005) 0.28
    0.28379753 = sum of:
      0.28379753 = product of:
        1.0135626 = sum of:
          0.046677034 = weight(abstract_txt:unified in 1025) [ClassicSimilarity], result of:
            0.046677034 = score(doc=1025,freq=1.0), product of:
              0.09073275 = queryWeight, product of:
                1.0557288 = boost
                6.5848994 = idf(docFreq=165, maxDocs=44218)
                0.013051565 = queryNorm
              0.51444525 = fieldWeight in 1025, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5848994 = idf(docFreq=165, maxDocs=44218)
                0.078125 = fieldNorm(doc=1025)
          0.017998176 = weight(abstract_txt:article in 1025) [ClassicSimilarity], result of:
            0.017998176 = score(doc=1025,freq=1.0), product of:
              0.060560703 = queryWeight, product of:
                1.219778 = boost
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.013051565 = queryNorm
              0.2971923 = fieldWeight in 1025, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.078125 = fieldNorm(doc=1025)
          0.017398119 = weight(abstract_txt:that in 1025) [ClassicSimilarity], result of:
            0.017398119 = score(doc=1025,freq=4.0), product of:
              0.04699267 = queryWeight, product of:
                1.5195514 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.013051565 = queryNorm
              0.3702305 = fieldWeight in 1025, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=1025)
          0.009187498 = weight(abstract_txt:this in 1025) [ClassicSimilarity], result of:
            0.009187498 = score(doc=1025,freq=1.0), product of:
              0.048735652 = queryWeight, product of:
                1.5474752 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.013051565 = queryNorm
              0.18851699 = fieldWeight in 1025, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.078125 = fieldNorm(doc=1025)
          0.1860694 = weight(abstract_txt:expansion in 1025) [ClassicSimilarity], result of:
            0.1860694 = score(doc=1025,freq=1.0), product of:
              0.39006343 = queryWeight, product of:
                4.894665 = boost
                6.1059003 = idf(docFreq=267, maxDocs=44218)
                0.013051565 = queryNorm
              0.47702345 = fieldWeight in 1025, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1059003 = idf(docFreq=267, maxDocs=44218)
                0.078125 = fieldNorm(doc=1025)
          0.38499644 = weight(abstract_txt:translation in 1025) [ClassicSimilarity], result of:
            0.38499644 = score(doc=1025,freq=4.0), product of:
              0.3989971 = queryWeight, product of:
                4.950399 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.013051565 = queryNorm
              0.9649104 = fieldWeight in 1025, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.078125 = fieldNorm(doc=1025)
          0.35123587 = weight(abstract_txt:query in 1025) [ClassicSimilarity], result of:
            0.35123587 = score(doc=1025,freq=4.0), product of:
              0.47286934 = queryWeight, product of:
                7.621508 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.013051565 = queryNorm
              0.74277574 = fieldWeight in 1025, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.078125 = fieldNorm(doc=1025)
        0.28 = coord(7/25)
    
  3. Efthimiadis, E.N.: Interactive query expansion : a user-based evaluation in a relevance feedback environment (2000) 0.25
    0.24572438 = sum of:
      0.24572438 = product of:
        1.0238516 = sum of:
          0.012598723 = weight(abstract_txt:article in 5701) [ClassicSimilarity], result of:
            0.012598723 = score(doc=5701,freq=1.0), product of:
              0.060560703 = queryWeight, product of:
                1.219778 = boost
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.013051565 = queryNorm
              0.20803462 = fieldWeight in 5701, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5701)
          0.017792827 = weight(abstract_txt:studies in 5701) [ClassicSimilarity], result of:
            0.017792827 = score(doc=5701,freq=1.0), product of:
              0.07623186 = queryWeight, product of:
                1.3685277 = boost
                4.26796 = idf(docFreq=1683, maxDocs=44218)
                0.013051565 = queryNorm
              0.23340407 = fieldWeight in 5701, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.26796 = idf(docFreq=1683, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5701)
          0.010547048 = weight(abstract_txt:that in 5701) [ClassicSimilarity], result of:
            0.010547048 = score(doc=5701,freq=3.0), product of:
              0.04699267 = queryWeight, product of:
                1.5195514 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.013051565 = queryNorm
              0.22444029 = fieldWeight in 5701, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5701)
          0.006431249 = weight(abstract_txt:this in 5701) [ClassicSimilarity], result of:
            0.006431249 = score(doc=5701,freq=1.0), product of:
              0.048735652 = queryWeight, product of:
                1.5474752 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.013051565 = queryNorm
              0.1319619 = fieldWeight in 5701, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5701)
          0.46961796 = weight(abstract_txt:expansion in 5701) [ClassicSimilarity], result of:
            0.46961796 = score(doc=5701,freq=13.0), product of:
              0.39006343 = queryWeight, product of:
                4.894665 = boost
                6.1059003 = idf(docFreq=267, maxDocs=44218)
                0.013051565 = queryNorm
              1.2039528 = fieldWeight in 5701, product of:
                3.6055512 = tf(freq=13.0), with freq of:
                  13.0 = termFreq=13.0
                6.1059003 = idf(docFreq=267, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5701)
          0.50686383 = weight(abstract_txt:query in 5701) [ClassicSimilarity], result of:
            0.50686383 = score(doc=5701,freq=17.0), product of:
              0.47286934 = queryWeight, product of:
                7.621508 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.013051565 = queryNorm
              1.0718899 = fieldWeight in 5701, product of:
                4.1231055 = tf(freq=17.0), with freq of:
                  17.0 = termFreq=17.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5701)
        0.24 = coord(6/25)
    
  4. Nie, J.-Y.; Brisebois, M.: ¬An inferential approach to information retrieval and its implementation using a manual thesaurus (1996) 0.21
    0.21435946 = sum of:
      0.21435946 = product of:
        0.89316446 = sum of:
          0.055535946 = weight(abstract_txt:logic in 7706) [ClassicSimilarity], result of:
            0.055535946 = score(doc=7706,freq=1.0), product of:
              0.08140655 = queryWeight, product of:
                6.237302 = idf(docFreq=234, maxDocs=44218)
                0.013051565 = queryNorm
              0.6822049 = fieldWeight in 7706, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.237302 = idf(docFreq=234, maxDocs=44218)
                0.109375 = fieldNorm(doc=7706)
          0.07339678 = weight(abstract_txt:fuzzy in 7706) [ClassicSimilarity], result of:
            0.07339678 = score(doc=7706,freq=1.0), product of:
              0.098038025 = queryWeight, product of:
                1.0974067 = boost
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.013051565 = queryNorm
              0.7486562 = fieldWeight in 7706, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.109375 = fieldNorm(doc=7706)
          0.012862498 = weight(abstract_txt:this in 7706) [ClassicSimilarity], result of:
            0.012862498 = score(doc=7706,freq=1.0), product of:
              0.048735652 = queryWeight, product of:
                1.5474752 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.013051565 = queryNorm
              0.2639238 = fieldWeight in 7706, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.109375 = fieldNorm(doc=7706)
          0.10759009 = weight(abstract_txt:logical in 7706) [ClassicSimilarity], result of:
            0.10759009 = score(doc=7706,freq=1.0), product of:
              0.15939257 = queryWeight, product of:
                1.9788796 = boost
                6.1714344 = idf(docFreq=250, maxDocs=44218)
                0.013051565 = queryNorm
              0.67500067 = fieldWeight in 7706, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1714344 = idf(docFreq=250, maxDocs=44218)
                0.109375 = fieldNorm(doc=7706)
          0.3187537 = weight(abstract_txt:inferential in 7706) [ClassicSimilarity], result of:
            0.3187537 = score(doc=7706,freq=1.0), product of:
              0.32879362 = queryWeight, product of:
                2.8421502 = boost
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.013051565 = queryNorm
              0.96946436 = fieldWeight in 7706, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.109375 = fieldNorm(doc=7706)
          0.32502544 = weight(abstract_txt:inference in 7706) [ClassicSimilarity], result of:
            0.32502544 = score(doc=7706,freq=2.0), product of:
              0.3026346 = queryWeight, product of:
                3.3395677 = boost
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.013051565 = queryNorm
              1.0739864 = fieldWeight in 7706, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.109375 = fieldNorm(doc=7706)
        0.24 = coord(6/25)
    
  5. Kim, S.; Ko, Y.; Oard, D.W.: Combining lexical and statistical translation evidence for cross-language information retrieval (2015) 0.20
    0.20357634 = sum of:
      0.20357634 = product of:
        0.8482348 = sum of:
          0.01439854 = weight(abstract_txt:article in 1606) [ClassicSimilarity], result of:
            0.01439854 = score(doc=1606,freq=1.0), product of:
              0.060560703 = queryWeight, product of:
                1.219778 = boost
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.013051565 = queryNorm
              0.23775385 = fieldWeight in 1606, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.0625 = fieldNorm(doc=1606)
          0.009841862 = weight(abstract_txt:that in 1606) [ClassicSimilarity], result of:
            0.009841862 = score(doc=1606,freq=2.0), product of:
              0.04699267 = queryWeight, product of:
                1.5195514 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.013051565 = queryNorm
              0.20943399 = fieldWeight in 1606, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=1606)
          0.0073499987 = weight(abstract_txt:this in 1606) [ClassicSimilarity], result of:
            0.0073499987 = score(doc=1606,freq=1.0), product of:
              0.048735652 = queryWeight, product of:
                1.5474752 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.013051565 = queryNorm
              0.1508136 = fieldWeight in 1606, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0625 = fieldNorm(doc=1606)
          0.2105135 = weight(abstract_txt:expansion in 1606) [ClassicSimilarity], result of:
            0.2105135 = score(doc=1606,freq=2.0), product of:
              0.39006343 = queryWeight, product of:
                4.894665 = boost
                6.1059003 = idf(docFreq=267, maxDocs=44218)
                0.013051565 = queryNorm
              0.53969043 = fieldWeight in 1606, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1059003 = idf(docFreq=267, maxDocs=44218)
                0.0625 = fieldNorm(doc=1606)
          0.4074419 = weight(abstract_txt:translation in 1606) [ClassicSimilarity], result of:
            0.4074419 = score(doc=1606,freq=7.0), product of:
              0.3989971 = queryWeight, product of:
                4.950399 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.013051565 = queryNorm
              1.0211651 = fieldWeight in 1606, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.0625 = fieldNorm(doc=1606)
          0.19868901 = weight(abstract_txt:query in 1606) [ClassicSimilarity], result of:
            0.19868901 = score(doc=1606,freq=2.0), product of:
              0.47286934 = queryWeight, product of:
                7.621508 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.013051565 = queryNorm
              0.4201774 = fieldWeight in 1606, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.0625 = fieldNorm(doc=1606)
        0.24 = coord(6/25)