Document (#38501)

Author
Wu, S.
Li, J.
Zeng, X.
Bi, Y.
Title
Adaptive data fusion methods in information retrieval
Source
Journal of the Association for Information Science and Technology. 65(2014) no.10, pp. 2048-2061
Year
2014
Abstract
Data fusion is currently used extensively in information retrieval for various tasks. It has proved to be a useful technology because it is able to improve retrieval performance frequently. However, in almost all prior research in data fusion, static search environments have been used, and dynamic search environments have generally not been considered. In this article, we investigate adaptive data fusion methods that can change their behavior when the search environment changes. Three adaptive data fusion methods are proposed and investigated. To test these proposed methods properly, we generate a benchmark from a historic Text REtrieval Conference data set. Experiments with the benchmark show that 2 of the proposed methods are good and may potentially be used in practice.

Similar documents (author)

  1. Zeng, L.: An introduction to thesauri and classification systems in the People's Republic of China (1986) 4.73
    4.7310953 = sum of:
      4.7310953 = weight(author_txt:zeng in 1731) [ClassicSimilarity], result of:
        4.7310953 = fieldWeight in 1731, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.5697527 = idf(docFreq=61, maxDocs=44218)
          0.625 = fieldNorm(doc=1731)
    
  2. Zeng, L.: Achieving compatibility of indexing languages in online access environment (1992) 4.73
    4.7310953 = sum of:
      4.7310953 = weight(author_txt:zeng in 1284) [ClassicSimilarity], result of:
        4.7310953 = fieldWeight in 1284, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.5697527 = idf(docFreq=61, maxDocs=44218)
          0.625 = fieldNorm(doc=1284)
    
  3. Zeng, L.: Automatic indexing for Chinese text : problems and progress (1992) 4.73
    4.7310953 = sum of:
      4.7310953 = weight(author_txt:zeng in 1289) [ClassicSimilarity], result of:
        4.7310953 = fieldWeight in 1289, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.5697527 = idf(docFreq=61, maxDocs=44218)
          0.625 = fieldNorm(doc=1289)
    
  4. Zeng, M.L.: Towards a unified medical language in a diverse cultural environment (1996) 4.73
    4.7310953 = sum of:
      4.7310953 = weight(author_txt:zeng in 5156) [ClassicSimilarity], result of:
        4.7310953 = fieldWeight in 5156, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.5697527 = idf(docFreq=61, maxDocs=44218)
          0.625 = fieldNorm(doc=5156)
    
  5. Zeng, M.L.: Developing control mechanisms for discipline-based virtual libraries : a study of the process (1995) 4.73
    4.7310953 = sum of:
      4.7310953 = weight(author_txt:zeng in 6837) [ClassicSimilarity], result of:
        4.7310953 = fieldWeight in 6837, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.5697527 = idf(docFreq=61, maxDocs=44218)
          0.625 = fieldNorm(doc=6837)
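
The five author matches above all report the same score because each trace multiplies the same three factors. As a minimal sketch, assuming the documented Lucene ClassicSimilarity formulas (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))), the figure can be recomputed as follows. The function names are hypothetical, and fieldNorm is taken directly from the trace rather than derived.

```python
import math


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw term frequency."""
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as printed in the explain output."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm (fieldNorm is copied from the trace)."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from the author_txt:zeng trace above.
idf_zeng = classic_idf(doc_freq=61, max_docs=44218)
score_zeng = field_weight(freq=1.0, doc_freq=61, max_docs=44218, field_norm=0.625)

print(f"idf   = {idf_zeng:.4f}")
print(f"score = {score_zeng:.4f}")
```

Lucene stores these values as 32-bit floats, so the last digits of a double-precision recomputation may differ slightly from the trace.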
    

Similar documents (content)

  1. Beitzel, S.M.; Jensen, E.C.; Chowdhury, A.; Grossman, D.; Frieder, O.; Goharian, N.: Fusion of effective retrieval strategies in the same information retrieval system (2004) 0.27
    0.26770642 = sum of:
      0.26770642 = product of:
        1.1154435 = sum of:
          0.043199558 = weight(abstract_txt:prior in 2502) [ClassicSimilarity], result of:
            0.043199558 = score(doc=2502,freq=1.0), product of:
              0.090113394 = queryWeight, product of:
                1.0634743 = boost
                6.1362057 = idf(docFreq=259, maxDocs=44218)
                0.013809006 = queryNorm
              0.47939107 = fieldWeight in 2502, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1362057 = idf(docFreq=259, maxDocs=44218)
                0.078125 = fieldNorm(doc=2502)
          0.017403906 = weight(abstract_txt:have in 2502) [ClassicSimilarity], result of:
            0.017403906 = score(doc=2502,freq=2.0), product of:
              0.049154997 = queryWeight, product of:
                1.110788 = boost
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.013809006 = queryNorm
              0.35406178 = fieldWeight in 2502, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.078125 = fieldNorm(doc=2502)
          0.025036788 = weight(abstract_txt:been in 2502) [ClassicSimilarity], result of:
            0.025036788 = score(doc=2502,freq=2.0), product of:
              0.06264055 = queryWeight, product of:
                1.2539352 = boost
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.013809006 = queryNorm
              0.3996898 = fieldWeight in 2502, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.078125 = fieldNorm(doc=2502)
          0.08877714 = weight(abstract_txt:retrieval in 2502) [ClassicSimilarity], result of:
            0.08877714 = score(doc=2502,freq=8.0), product of:
              0.11560961 = queryWeight, product of:
                2.4091249 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.013809006 = queryNorm
              0.7679045 = fieldWeight in 2502, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=2502)
          0.07216147 = weight(abstract_txt:data in 2502) [ClassicSimilarity], result of:
            0.07216147 = score(doc=2502,freq=3.0), product of:
              0.15983924 = queryWeight, product of:
                3.4693625 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.013809006 = queryNorm
              0.4514628 = fieldWeight in 2502, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.078125 = fieldNorm(doc=2502)
          0.8688646 = weight(abstract_txt:fusion in 2502) [ClassicSimilarity], result of:
            0.8688646 = score(doc=2502,freq=4.0), product of:
              0.71791756 = queryWeight, product of:
                6.7120414 = boost
                7.7456436 = idf(docFreq=51, maxDocs=44218)
                0.013809006 = queryNorm
              1.2102568 = fieldWeight in 2502, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.7456436 = idf(docFreq=51, maxDocs=44218)
                0.078125 = fieldNorm(doc=2502)
        0.24 = coord(6/25)
    
  2. Wu, S.; McClean, S.I.: Improving high accuracy retrieval by eliminating the uneven correlation effect in data fusion (2006) 0.26
    0.26232454 = sum of:
      0.26232454 = product of:
        1.0930189 = sum of:
          0.024530942 = weight(abstract_txt:been in 219) [ClassicSimilarity], result of:
            0.024530942 = score(doc=219,freq=3.0), product of:
              0.06264055 = queryWeight, product of:
                1.2539352 = boost
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.013809006 = queryNorm
              0.3916144 = fieldWeight in 219, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.0625 = fieldNorm(doc=219)
          0.017011276 = weight(abstract_txt:used in 219) [ClassicSimilarity], result of:
            0.017011276 = score(doc=219,freq=1.0), product of:
              0.08102297 = queryWeight, product of:
                1.7466145 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.013809006 = queryNorm
              0.2099562 = fieldWeight in 219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.0625 = fieldNorm(doc=219)
          0.05021994 = weight(abstract_txt:retrieval in 219) [ClassicSimilarity], result of:
            0.05021994 = score(doc=219,freq=4.0), product of:
              0.11560961 = queryWeight, product of:
                2.4091249 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.013809006 = queryNorm
              0.43439242 = fieldWeight in 219, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=219)
          0.074528046 = weight(abstract_txt:data in 219) [ClassicSimilarity], result of:
            0.074528046 = score(doc=219,freq=5.0), product of:
              0.15983924 = queryWeight, product of:
                3.4693625 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.013809006 = queryNorm
              0.46626878 = fieldWeight in 219, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=219)
          0.075418636 = weight(abstract_txt:methods in 219) [ClassicSimilarity], result of:
            0.075418636 = score(doc=219,freq=2.0), product of:
              0.20576695 = queryWeight, product of:
                3.5933964 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.013809006 = queryNorm
              0.36652455 = fieldWeight in 219, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.0625 = fieldNorm(doc=219)
          0.85131 = weight(abstract_txt:fusion in 219) [ClassicSimilarity], result of:
            0.85131 = score(doc=219,freq=6.0), product of:
              0.71791756 = queryWeight, product of:
                6.7120414 = boost
                7.7456436 = idf(docFreq=51, maxDocs=44218)
                0.013809006 = queryNorm
              1.1858047 = fieldWeight in 219, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.7456436 = idf(docFreq=51, maxDocs=44218)
                0.0625 = fieldNorm(doc=219)
        0.24 = coord(6/25)
    
  3. Wu, M.; Hawking, D.; Turpin, A.; Scholer, F.: Using anchor text for homepage and topic distillation search tasks (2012) 0.20
    0.20018053 = sum of:
      0.20018053 = product of:
        0.7149305 = sum of:
          0.013923125 = weight(abstract_txt:have in 257) [ClassicSimilarity], result of:
            0.013923125 = score(doc=257,freq=2.0), product of:
              0.049154997 = queryWeight, product of:
                1.110788 = boost
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.013809006 = queryNorm
              0.28324944 = fieldWeight in 257, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.0625 = fieldNorm(doc=257)
          0.014162946 = weight(abstract_txt:been in 257) [ClassicSimilarity], result of:
            0.014162946 = score(doc=257,freq=1.0), product of:
              0.06264055 = queryWeight, product of:
                1.2539352 = boost
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.013809006 = queryNorm
              0.22609869 = fieldWeight in 257, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.0625 = fieldNorm(doc=257)
          0.024057575 = weight(abstract_txt:used in 257) [ClassicSimilarity], result of:
            0.024057575 = score(doc=257,freq=2.0), product of:
              0.08102297 = queryWeight, product of:
                1.7466145 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.013809006 = queryNorm
              0.2969229 = fieldWeight in 257, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.0625 = fieldNorm(doc=257)
          0.053804204 = weight(abstract_txt:search in 257) [ClassicSimilarity], result of:
            0.053804204 = score(doc=257,freq=6.0), product of:
              0.09607505 = queryWeight, product of:
                1.9019464 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.013809006 = queryNorm
              0.56002265 = fieldWeight in 257, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0625 = fieldNorm(doc=257)
          0.02510997 = weight(abstract_txt:retrieval in 257) [ClassicSimilarity], result of:
            0.02510997 = score(doc=257,freq=1.0), product of:
              0.11560961 = queryWeight, product of:
                2.4091249 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.013809006 = queryNorm
              0.21719621 = fieldWeight in 257, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=257)
          0.09236859 = weight(abstract_txt:methods in 257) [ClassicSimilarity], result of:
            0.09236859 = score(doc=257,freq=3.0), product of:
              0.20576695 = queryWeight, product of:
                3.5933964 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.013809006 = queryNorm
              0.44889906 = fieldWeight in 257, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.0625 = fieldNorm(doc=257)
          0.49150404 = weight(abstract_txt:fusion in 257) [ClassicSimilarity], result of:
            0.49150404 = score(doc=257,freq=2.0), product of:
              0.71791756 = queryWeight, product of:
                6.7120414 = boost
                7.7456436 = idf(docFreq=51, maxDocs=44218)
                0.013809006 = queryNorm
              0.6846246 = fieldWeight in 257, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.7456436 = idf(docFreq=51, maxDocs=44218)
                0.0625 = fieldNorm(doc=257)
        0.28 = coord(7/25)
    
  4. Larsen, B.; Ingwersen, P.; Lund, B.: Data fusion according to the principle of polyrepresentation (2009) 0.20
    0.19995944 = sum of:
      0.19995944 = product of:
        0.9997972 = sum of:
          0.014884866 = weight(abstract_txt:used in 2752) [ClassicSimilarity], result of:
            0.014884866 = score(doc=2752,freq=1.0), product of:
              0.08102297 = queryWeight, product of:
                1.7466145 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.013809006 = queryNorm
              0.18371168 = fieldWeight in 2752, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2752)
          0.049129147 = weight(abstract_txt:retrieval in 2752) [ClassicSimilarity], result of:
            0.049129147 = score(doc=2752,freq=5.0), product of:
              0.11560961 = queryWeight, product of:
                2.4091249 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.013809006 = queryNorm
              0.4249573 = fieldWeight in 2752, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2752)
          0.06521204 = weight(abstract_txt:data in 2752) [ClassicSimilarity], result of:
            0.06521204 = score(doc=2752,freq=5.0), product of:
              0.15983924 = queryWeight, product of:
                3.4693625 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.013809006 = queryNorm
              0.40798518 = fieldWeight in 2752, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2752)
          0.06599131 = weight(abstract_txt:methods in 2752) [ClassicSimilarity], result of:
            0.06599131 = score(doc=2752,freq=2.0), product of:
              0.20576695 = queryWeight, product of:
                3.5933964 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.013809006 = queryNorm
              0.320709 = fieldWeight in 2752, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2752)
          0.80457985 = weight(abstract_txt:fusion in 2752) [ClassicSimilarity], result of:
            0.80457985 = score(doc=2752,freq=7.0), product of:
              0.71791756 = queryWeight, product of:
                6.7120414 = boost
                7.7456436 = idf(docFreq=51, maxDocs=44218)
                0.013809006 = queryNorm
              1.1207135 = fieldWeight in 2752, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.7456436 = idf(docFreq=51, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2752)
        0.2 = coord(5/25)
    
  5. Seco de Herrera, A.G.; Schaer, R.; Müller, H.: Shangri-La : a medical case-based retrieval tool (2017) 0.18
    0.17997876 = sum of:
      0.17997876 = product of:
        0.7499115 = sum of:
          0.04023338 = weight(abstract_txt:potentially in 3924) [ClassicSimilarity], result of:
            0.04023338 = score(doc=3924,freq=1.0), product of:
              0.09972426 = queryWeight, product of:
                1.1187493 = boost
                6.45514 = idf(docFreq=188, maxDocs=44218)
                0.013809006 = queryNorm
              0.40344626 = fieldWeight in 3924, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.45514 = idf(docFreq=188, maxDocs=44218)
                0.0625 = fieldNorm(doc=3924)
          0.061506614 = weight(abstract_txt:retrieval in 3924) [ClassicSimilarity], result of:
            0.061506614 = score(doc=3924,freq=6.0), product of:
              0.11560961 = queryWeight, product of:
                2.4091249 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.013809006 = queryNorm
              0.5320199 = fieldWeight in 3924, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=3924)
          0.11708943 = weight(abstract_txt:benchmark in 3924) [ClassicSimilarity], result of:
            0.11708943 = score(doc=3924,freq=1.0), product of:
              0.256113 = queryWeight, product of:
                2.5354974 = boost
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.013809006 = queryNorm
              0.4571788 = fieldWeight in 3924, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.0625 = fieldNorm(doc=3924)
          0.033329956 = weight(abstract_txt:data in 3924) [ClassicSimilarity], result of:
            0.033329956 = score(doc=3924,freq=1.0), product of:
              0.15983924 = queryWeight, product of:
                3.4693625 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.013809006 = queryNorm
              0.20852174 = fieldWeight in 3924, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=3924)
          0.15020625 = weight(abstract_txt:adaptive in 3924) [ClassicSimilarity], result of:
            0.15020625 = score(doc=3924,freq=1.0), product of:
              0.3461324 = queryWeight, product of:
                3.6100574 = boost
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.013809006 = queryNorm
              0.43395606 = fieldWeight in 3924, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.0625 = fieldNorm(doc=3924)
          0.34754586 = weight(abstract_txt:fusion in 3924) [ClassicSimilarity], result of:
            0.34754586 = score(doc=3924,freq=1.0), product of:
              0.71791756 = queryWeight, product of:
                6.7120414 = boost
                7.7456436 = idf(docFreq=51, maxDocs=44218)
                0.013809006 = queryNorm
              0.48410273 = fieldWeight in 3924, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7456436 = idf(docFreq=51, maxDocs=44218)
                0.0625 = fieldNorm(doc=3924)
        0.24 = coord(6/25)
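
The content-similarity traces combine several per-term scores. Each one is queryWeight (boost * idf * queryNorm) times fieldWeight (tf * idf * fieldNorm); the scores are summed and then scaled by the coord factor (matched terms / query terms). The sketch below recomputes the first content match (doc 2502) under the same ClassicSimilarity assumptions as before. The helper names are hypothetical, and boost, queryNorm, fieldNorm and coord are copied from the trace, not derived.

```python
import math

MAX_DOCS = 44218
QUERY_NORM = 0.013809006   # copied from the trace
FIELD_NORM = 0.078125      # fieldNorm(doc=2502), copied from the trace

# (term, boost, docFreq, freq) for the six matching terms in doc 2502.
TERMS_2502 = [
    ("prior",     1.0634743,  259, 1.0),
    ("have",      1.110788,  4876, 2.0),
    ("been",      1.2539352, 3226, 2.0),
    ("retrieval", 2.4091249, 3720, 8.0),
    ("data",      3.4693625, 4274, 3.0),
    ("fusion",    6.7120414,   51, 4.0),
]


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """ClassicSimilarity idf: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost: float, doc_freq: int, freq: float) -> float:
    """Per-term score = queryWeight * fieldWeight."""
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    field_weight = math.sqrt(freq) * idf(doc_freq) * FIELD_NORM
    return query_weight * field_weight


def document_score(terms, matched: int, query_terms: int) -> float:
    """Sum of per-term scores, scaled by coord = matched / query_terms."""
    total = sum(term_score(boost, df, freq) for _, boost, df, freq in terms)
    return total * (matched / query_terms)


fusion_score = term_score(6.7120414, 51, 4.0)
score_2502 = document_score(TERMS_2502, matched=6, query_terms=25)

print(f"fusion term score = {fusion_score:.4f}")
print(f"document score    = {score_2502:.4f}")
```

In this trace the term "fusion" contributes by far the largest share of the sum, since it has both the highest boost and the highest idf of the six matched terms.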