Document (#38502)

Author
Wu, S.
Li, J.
Zeng, X.
Bi, Y.
Title
Adaptive data fusion methods in information retrieval
Source
Journal of the Association for Information Science and Technology. 65(2014) no.10, p.2048-2061
Year
2014
Abstract
Data fusion is currently used extensively in information retrieval for various tasks. It has proved to be a useful technology because it frequently improves retrieval performance. However, almost all prior research in data fusion has used static search environments, and dynamic search environments have generally not been considered. In this article, we investigate adaptive data fusion methods that can change their behavior when the search environment changes. Three adaptive data fusion methods are proposed and investigated. To test these proposed methods properly, we generate a benchmark from a historic Text REtrieval Conference data set. Experiments with the benchmark show that two of the proposed methods perform well and may potentially be used in practice.

Similar documents (author)

  1. Zeng, L.: ¬An introduction to thesauri and classification systems in the People's Republic of China (1986) 4.73
    4.734466 = sum of:
      4.734466 = weight(author_txt:zeng in 1731) [ClassicSimilarity], result of:
        4.734466 = score(doc=1731,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.5751467 = idf(docFreq=58, maxDocs=42306)
            0.13201064 = queryNorm
          4.7344666 = fieldWeight in 1731, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.5751467 = idf(docFreq=58, maxDocs=42306)
            0.625 = fieldNorm(doc=1731)
    
  2. Zeng, L.: Achieving compatibility of indexing languages in online access environment (1992) 4.73
    4.734466 = sum of:
      4.734466 = weight(author_txt:zeng in 1353) [ClassicSimilarity], result of:
        4.734466 = score(doc=1353,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.5751467 = idf(docFreq=58, maxDocs=42306)
            0.13201064 = queryNorm
          4.7344666 = fieldWeight in 1353, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.5751467 = idf(docFreq=58, maxDocs=42306)
            0.625 = fieldNorm(doc=1353)
    
  3. Zeng, L.: Automatic indexing for Chinese text : problems and progress (1992) 4.73
    4.734466 = sum of:
      4.734466 = weight(author_txt:zeng in 1358) [ClassicSimilarity], result of:
        4.734466 = score(doc=1358,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.5751467 = idf(docFreq=58, maxDocs=42306)
            0.13201064 = queryNorm
          4.7344666 = fieldWeight in 1358, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.5751467 = idf(docFreq=58, maxDocs=42306)
            0.625 = fieldNorm(doc=1358)
    
  4. Zeng, M.L.: Towards a unified medical language in a diverse cultural environment (1996) 4.73
    4.734466 = sum of:
      4.734466 = weight(author_txt:zeng in 5225) [ClassicSimilarity], result of:
        4.734466 = score(doc=5225,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.5751467 = idf(docFreq=58, maxDocs=42306)
            0.13201064 = queryNorm
          4.7344666 = fieldWeight in 5225, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.5751467 = idf(docFreq=58, maxDocs=42306)
            0.625 = fieldNorm(doc=5225)
    
  5. Zeng, M.L.: Developing control mechanisms for discipline-based virtual libraries : a study of the process (1995) 4.73
    4.734466 = sum of:
      4.734466 = weight(author_txt:zeng in 6906) [ClassicSimilarity], result of:
        4.734466 = score(doc=6906,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.5751467 = idf(docFreq=58, maxDocs=42306)
            0.13201064 = queryNorm
          4.7344666 = fieldWeight in 6906, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.5751467 = idf(docFreq=58, maxDocs=42306)
            0.625 = fieldNorm(doc=6906)
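
The author scores above all follow the same arithmetic, so a short worked example may help in reading the explain trees. The following is a minimal sketch, not Lucene's own code: the function names are hypothetical, and the idf formula 1 + ln(maxDocs / (docFreq + 1)) is assumed because it reproduces the idf value printed above (some Lucene versions use a slightly different numerator). It also assumes a one-term query with boost 1, so that queryNorm is simply 1 / idf.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Assumed idf formula: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def single_term_score(freq: float, doc_freq: int, max_docs: int,
                      field_norm: float) -> float:
    """Score of a one-term query against one document (hypothetical helper)."""
    idf = classic_idf(doc_freq, max_docs)
    # Assumption: single term, boost 1, so queryNorm = 1 / sqrt(idf^2) = 1 / idf.
    query_norm = 1.0 / idf
    query_weight = idf * query_norm                     # ~1.0 in the tree above
    field_weight = math.sqrt(freq) * idf * field_norm   # tf * idf * fieldNorm
    return query_weight * field_weight


# Values taken from the explain tree for author_txt:zeng.
score = single_term_score(freq=1.0, doc_freq=58, max_docs=42306, field_norm=0.625)
print(round(score, 4))  # 4.7345
```

Because queryWeight normalizes to 1 for a single-term query, the score reduces to the fieldWeight, which is why all five author matches share the same value.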
    

Similar documents (content)

  1. Beitzel, S.M.; Jensen, E.C.; Chowdhury, A.; Grossman, D.; Frieder, O; Goharian, N.: Fusion of effective retrieval strategies in the same information retrieval system (2004) 0.27
    0.26737747 = sum of:
      0.26737747 = product of:
        1.1140728 = sum of:
          0.045616593 = weight(abstract_txt:prior in 3503) [ClassicSimilarity], result of:
            0.045616593 = score(doc=3503,freq=1.0), product of:
              0.0932876 = queryWeight, product of:
                1.0817184 = boost
                6.2590566 = idf(docFreq=219, maxDocs=42306)
                0.013778464 = queryNorm
              0.4889888 = fieldWeight in 3503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2590566 = idf(docFreq=219, maxDocs=42306)
                0.078125 = fieldNorm(doc=3503)
          0.01793677 = weight(abstract_txt:have in 3503) [ClassicSimilarity], result of:
            0.01793677 = score(doc=3503,freq=2.0), product of:
              0.050069302 = queryWeight, product of:
                1.1207353 = boost
                3.2424083 = idf(docFreq=4492, maxDocs=42306)
                0.013778464 = queryNorm
              0.35823888 = fieldWeight in 3503, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.2424083 = idf(docFreq=4492, maxDocs=42306)
                0.078125 = fieldNorm(doc=3503)
          0.025467725 = weight(abstract_txt:been in 3503) [ClassicSimilarity], result of:
            0.025467725 = score(doc=3503,freq=2.0), product of:
              0.063251205 = queryWeight, product of:
                1.2596551 = boost
                3.6443186 = idf(docFreq=3005, maxDocs=42306)
                0.013778464 = queryNorm
              0.40264413 = fieldWeight in 3503, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6443186 = idf(docFreq=3005, maxDocs=42306)
                0.078125 = fieldNorm(doc=3503)
          0.087379746 = weight(abstract_txt:retrieval in 3503) [ClassicSimilarity], result of:
            0.087379746 = score(doc=3503,freq=8.0), product of:
              0.114201695 = queryWeight, product of:
                2.3936934 = boost
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.013778464 = queryNorm
              0.7651353 = fieldWeight in 3503, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.078125 = fieldNorm(doc=3503)
          0.07483201 = weight(abstract_txt:data in 3503) [ClassicSimilarity], result of:
            0.07483201 = score(doc=3503,freq=3.0), product of:
              0.16348463 = queryWeight, product of:
                3.5076509 = boost
                3.382671 = idf(docFreq=3904, maxDocs=42306)
                0.013778464 = queryNorm
              0.45773113 = fieldWeight in 3503, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.382671 = idf(docFreq=3904, maxDocs=42306)
                0.078125 = fieldNorm(doc=3503)
          0.86284 = weight(abstract_txt:fusion in 3503) [ClassicSimilarity], result of:
            0.86284 = score(doc=3503,freq=4.0), product of:
              0.7133985 = queryWeight, product of:
                6.6888795 = boost
                7.740661 = idf(docFreq=49, maxDocs=42306)
                0.013778464 = queryNorm
              1.2094783 = fieldWeight in 3503, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.740661 = idf(docFreq=49, maxDocs=42306)
                0.078125 = fieldNorm(doc=3503)
        0.24 = coord(6/25)
    
  2. Wu, S.; McClean, S.I.: Improving high accuracy retrieval by eliminating the uneven correlation effect in data fusion (2006) 0.26
    0.26190406 = sum of:
      0.26190406 = product of:
        1.091267 = sum of:
          0.024953172 = weight(abstract_txt:been in 1345) [ClassicSimilarity], result of:
            0.024953172 = score(doc=1345,freq=3.0), product of:
              0.063251205 = queryWeight, product of:
                1.2596551 = boost
                3.6443186 = idf(docFreq=3005, maxDocs=42306)
                0.013778464 = queryNorm
              0.39450905 = fieldWeight in 1345, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6443186 = idf(docFreq=3005, maxDocs=42306)
                0.0625 = fieldNorm(doc=1345)
          0.017258191 = weight(abstract_txt:used in 1345) [ClassicSimilarity], result of:
            0.017258191 = score(doc=1345,freq=1.0), product of:
              0.08166814 = queryWeight, product of:
                1.7530295 = boost
                3.381136 = idf(docFreq=3910, maxDocs=42306)
                0.013778464 = queryNorm
              0.211321 = fieldWeight in 1345, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.381136 = idf(docFreq=3910, maxDocs=42306)
                0.0625 = fieldNorm(doc=1345)
          0.04942945 = weight(abstract_txt:retrieval in 1345) [ClassicSimilarity], result of:
            0.04942945 = score(doc=1345,freq=4.0), product of:
              0.114201695 = queryWeight, product of:
                2.3936934 = boost
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.013778464 = queryNorm
              0.4328259 = fieldWeight in 1345, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.0625 = fieldNorm(doc=1345)
          0.07728616 = weight(abstract_txt:data in 1345) [ClassicSimilarity], result of:
            0.07728616 = score(doc=1345,freq=5.0), product of:
              0.16348463 = queryWeight, product of:
                3.5076509 = boost
                3.382671 = idf(docFreq=3904, maxDocs=42306)
                0.013778464 = queryNorm
              0.47274268 = fieldWeight in 1345, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.382671 = idf(docFreq=3904, maxDocs=42306)
                0.0625 = fieldNorm(doc=1345)
          0.07693283 = weight(abstract_txt:methods in 1345) [ClassicSimilarity], result of:
            0.07693283 = score(doc=1345,freq=2.0), product of:
              0.20816283 = queryWeight, product of:
                3.6131737 = boost
                4.181321 = idf(docFreq=1756, maxDocs=42306)
                0.013778464 = queryNorm
              0.36958006 = fieldWeight in 1345, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.181321 = idf(docFreq=1756, maxDocs=42306)
                0.0625 = fieldNorm(doc=1345)
          0.8454071 = weight(abstract_txt:fusion in 1345) [ClassicSimilarity], result of:
            0.8454071 = score(doc=1345,freq=6.0), product of:
              0.7133985 = queryWeight, product of:
                6.6888795 = boost
                7.740661 = idf(docFreq=49, maxDocs=42306)
                0.013778464 = queryNorm
              1.1850419 = fieldWeight in 1345, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.740661 = idf(docFreq=49, maxDocs=42306)
                0.0625 = fieldNorm(doc=1345)
        0.24 = coord(6/25)
    
  3. Wu, M.; Hawking, D.; Turpin, A.; Scholer, F.: Using anchor text for homepage and topic distillation search tasks (2012) 0.20
    0.19981574 = sum of:
      0.19981574 = product of:
        0.71362764 = sum of:
          0.014349417 = weight(abstract_txt:have in 2258) [ClassicSimilarity], result of:
            0.014349417 = score(doc=2258,freq=2.0), product of:
              0.050069302 = queryWeight, product of:
                1.1207353 = boost
                3.2424083 = idf(docFreq=4492, maxDocs=42306)
                0.013778464 = queryNorm
              0.2865911 = fieldWeight in 2258, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.2424083 = idf(docFreq=4492, maxDocs=42306)
                0.0625 = fieldNorm(doc=2258)
          0.014406721 = weight(abstract_txt:been in 2258) [ClassicSimilarity], result of:
            0.014406721 = score(doc=2258,freq=1.0), product of:
              0.063251205 = queryWeight, product of:
                1.2596551 = boost
                3.6443186 = idf(docFreq=3005, maxDocs=42306)
                0.013778464 = queryNorm
              0.22776991 = fieldWeight in 2258, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6443186 = idf(docFreq=3005, maxDocs=42306)
                0.0625 = fieldNorm(doc=2258)
          0.024406768 = weight(abstract_txt:used in 2258) [ClassicSimilarity], result of:
            0.024406768 = score(doc=2258,freq=2.0), product of:
              0.08166814 = queryWeight, product of:
                1.7530295 = boost
                3.381136 = idf(docFreq=3910, maxDocs=42306)
                0.013778464 = queryNorm
              0.298853 = fieldWeight in 2258, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.381136 = idf(docFreq=3910, maxDocs=42306)
                0.0625 = fieldNorm(doc=2258)
          0.0534309 = weight(abstract_txt:search in 2258) [ClassicSimilarity], result of:
            0.0534309 = score(doc=2258,freq=6.0), product of:
              0.09547002 = queryWeight, product of:
                1.8953804 = boost
                3.6556938 = idf(docFreq=2971, maxDocs=42306)
                0.013778464 = queryNorm
              0.55966157 = fieldWeight in 2258, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.6556938 = idf(docFreq=2971, maxDocs=42306)
                0.0625 = fieldNorm(doc=2258)
          0.024714725 = weight(abstract_txt:retrieval in 2258) [ClassicSimilarity], result of:
            0.024714725 = score(doc=2258,freq=1.0), product of:
              0.114201695 = queryWeight, product of:
                2.3936934 = boost
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.013778464 = queryNorm
              0.21641295 = fieldWeight in 2258, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.0625 = fieldNorm(doc=2258)
          0.09422309 = weight(abstract_txt:methods in 2258) [ClassicSimilarity], result of:
            0.09422309 = score(doc=2258,freq=3.0), product of:
              0.20816283 = queryWeight, product of:
                3.6131737 = boost
                4.181321 = idf(docFreq=1756, maxDocs=42306)
                0.013778464 = queryNorm
              0.45264128 = fieldWeight in 2258, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.181321 = idf(docFreq=1756, maxDocs=42306)
                0.0625 = fieldNorm(doc=2258)
          0.48809603 = weight(abstract_txt:fusion in 2258) [ClassicSimilarity], result of:
            0.48809603 = score(doc=2258,freq=2.0), product of:
              0.7133985 = queryWeight, product of:
                6.6888795 = boost
                7.740661 = idf(docFreq=49, maxDocs=42306)
                0.013778464 = queryNorm
              0.68418425 = fieldWeight in 2258, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.740661 = idf(docFreq=49, maxDocs=42306)
                0.0625 = fieldNorm(doc=2258)
        0.28 = coord(7/25)
    
  4. Larsen, B.; Ingwersen, P.; Lund, B.: Data fusion according to the principle of polyrepresentation (2009) 0.20
    0.1994799 = sum of:
      0.1994799 = product of:
        0.99739945 = sum of:
          0.015100919 = weight(abstract_txt:used in 572) [ClassicSimilarity], result of:
            0.015100919 = score(doc=572,freq=1.0), product of:
              0.08166814 = queryWeight, product of:
                1.7530295 = boost
                3.381136 = idf(docFreq=3910, maxDocs=42306)
                0.013778464 = queryNorm
              0.18490587 = fieldWeight in 572, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.381136 = idf(docFreq=3910, maxDocs=42306)
                0.0546875 = fieldNorm(doc=572)
          0.048355833 = weight(abstract_txt:retrieval in 572) [ClassicSimilarity], result of:
            0.048355833 = score(doc=572,freq=5.0), product of:
              0.114201695 = queryWeight, product of:
                2.3936934 = boost
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.013778464 = queryNorm
              0.4234248 = fieldWeight in 572, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.0546875 = fieldNorm(doc=572)
          0.067625396 = weight(abstract_txt:data in 572) [ClassicSimilarity], result of:
            0.067625396 = score(doc=572,freq=5.0), product of:
              0.16348463 = queryWeight, product of:
                3.5076509 = boost
                3.382671 = idf(docFreq=3904, maxDocs=42306)
                0.013778464 = queryNorm
              0.41364986 = fieldWeight in 572, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.382671 = idf(docFreq=3904, maxDocs=42306)
                0.0546875 = fieldNorm(doc=572)
          0.06731623 = weight(abstract_txt:methods in 572) [ClassicSimilarity], result of:
            0.06731623 = score(doc=572,freq=2.0), product of:
              0.20816283 = queryWeight, product of:
                3.6131737 = boost
                4.181321 = idf(docFreq=1756, maxDocs=42306)
                0.013778464 = queryNorm
              0.32338256 = fieldWeight in 572, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.181321 = idf(docFreq=1756, maxDocs=42306)
                0.0546875 = fieldNorm(doc=572)
          0.7990011 = weight(abstract_txt:fusion in 572) [ClassicSimilarity], result of:
            0.7990011 = score(doc=572,freq=7.0), product of:
              0.7133985 = queryWeight, product of:
                6.6888795 = boost
                7.740661 = idf(docFreq=49, maxDocs=42306)
                0.013778464 = queryNorm
              1.1199926 = fieldWeight in 572, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.740661 = idf(docFreq=49, maxDocs=42306)
                0.0546875 = fieldNorm(doc=572)
        0.2 = coord(5/25)
    
  5. Seco de Herrera, A.G.; Schaer, R.; Müller, H.: Shangri-La : a medical case-based retrieval tool (2017) 0.18
    0.1801312 = sum of:
      0.1801312 = product of:
        0.7505467 = sum of:
          0.041416213 = weight(abstract_txt:potentially in 843) [ClassicSimilarity], result of:
            0.041416213 = score(doc=843,freq=1.0), product of:
              0.101499125 = queryWeight, product of:
                1.128323 = boost
                6.5287204 = idf(docFreq=167, maxDocs=42306)
                0.013778464 = queryNorm
              0.40804502 = fieldWeight in 843, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5287204 = idf(docFreq=167, maxDocs=42306)
                0.0625 = fieldNorm(doc=843)
          0.060538467 = weight(abstract_txt:retrieval in 843) [ClassicSimilarity], result of:
            0.060538467 = score(doc=843,freq=6.0), product of:
              0.114201695 = queryWeight, product of:
                2.3936934 = boost
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.013778464 = queryNorm
              0.5301013 = fieldWeight in 843, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.0625 = fieldNorm(doc=843)
          0.11944886 = weight(abstract_txt:benchmark in 843) [ClassicSimilarity], result of:
            0.11944886 = score(doc=843,freq=1.0), product of:
              0.25910753 = queryWeight, product of:
                2.5495133 = boost
                7.376018 = idf(docFreq=71, maxDocs=42306)
                0.013778464 = queryNorm
              0.46100113 = fieldWeight in 843, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.376018 = idf(docFreq=71, maxDocs=42306)
                0.0625 = fieldNorm(doc=843)
          0.034563422 = weight(abstract_txt:data in 843) [ClassicSimilarity], result of:
            0.034563422 = score(doc=843,freq=1.0), product of:
              0.16348463 = queryWeight, product of:
                3.5076509 = boost
                3.382671 = idf(docFreq=3904, maxDocs=42306)
                0.013778464 = queryNorm
              0.21141694 = fieldWeight in 843, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.382671 = idf(docFreq=3904, maxDocs=42306)
                0.0625 = fieldNorm(doc=843)
          0.14944375 = weight(abstract_txt:adaptive in 843) [ClassicSimilarity], result of:
            0.14944375 = score(doc=843,freq=1.0), product of:
              0.3443824 = queryWeight, product of:
                3.5998416 = boost
                6.943154 = idf(docFreq=110, maxDocs=42306)
                0.013778464 = queryNorm
              0.43394712 = fieldWeight in 843, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.943154 = idf(docFreq=110, maxDocs=42306)
                0.0625 = fieldNorm(doc=843)
          0.34513602 = weight(abstract_txt:fusion in 843) [ClassicSimilarity], result of:
            0.34513602 = score(doc=843,freq=1.0), product of:
              0.7133985 = queryWeight, product of:
                6.6888795 = boost
                7.740661 = idf(docFreq=49, maxDocs=42306)
                0.013778464 = queryNorm
              0.48379132 = fieldWeight in 843, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.740661 = idf(docFreq=49, maxDocs=42306)
                0.0625 = fieldNorm(doc=843)
        0.24 = coord(6/25)
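
The content scores combine several term weights and then scale the sum by the coordination factor. As a rough illustration, the sketch below recomputes the first content hit (doc 3503) from the numbers printed in its explain tree. The function and variable names are hypothetical, the idf formula is assumed to be 1 + ln(maxDocs / (docFreq + 1)) because that matches the printed values, and queryNorm is copied from the output rather than derived, since it depends on all 25 query terms and not every term appears in the tree.

```python
import math

MAX_DOCS = 42306
QUERY_NORM = 0.013778464   # copied from the explain output, not derived here
FIELD_NORM = 0.078125      # fieldNorm(doc=3503)

# (term, boost, docFreq, termFreq) for the six matching terms in doc 3503.
TERMS = [
    ("prior",     1.0817184, 219,  1.0),
    ("have",      1.1207353, 4492, 2.0),
    ("been",      1.2596551, 3005, 2.0),
    ("retrieval", 2.3936934, 3604, 8.0),
    ("data",      3.5076509, 3904, 3.0),
    ("fusion",    6.6888795, 49,   4.0),
]


def term_weight(boost: float, doc_freq: int, freq: float) -> float:
    """queryWeight * fieldWeight for one term (assumed ClassicSimilarity form)."""
    idf = 1.0 + math.log(MAX_DOCS / (doc_freq + 1))
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * FIELD_NORM
    return query_weight * field_weight


def document_score(terms, matched: int, total: int) -> float:
    """Sum of term weights scaled by coord(matched/total)."""
    coord = matched / total
    return coord * sum(term_weight(b, df, f) for _, b, df, f in terms)


score = document_score(TERMS, matched=6, total=25)
print(round(score, 4))  # 0.2674
```

The rare, heavily boosted term "fusion" contributes most of the sum, while coord(6/25) penalizes the document for matching only 6 of the 25 query terms.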