Document (#34447)

Author
Bai, J.
Nie, J.-Y.
Title
Adapting information retrieval to query contexts
Source
Information processing and management. 44(2008) no.6, S.1901-1922
Year
2008
Abstract
In current IR approaches documents are retrieved only according to the terms specified in the query. The same answers are returned for the same query whatever the user and the search goal are. In reality, many other contextual factors strongly influence document's relevance and they should be taken into account in IR operations. This paper proposes a method, based on language modeling, to integrate several contextual factors so that document ranking will be adapted to the specific query contexts. We will consider three contextual factors in this paper: the topic domain of the query, the characteristics of the document collection, as well as context words within the query. Each contextual factor is used to generate a new query language model to specify some aspect of the information need. All these query models are then combined together to produce a more complete model for the underlying information need. Our experiments on TREC collections show that each contextual factor can positively influence the IR effectiveness and the combined model results in the highest effectiveness. This study shows that it is both beneficial and feasible to integrate more contextual factors in the current IR practice.
Footnote
Beitrag in einem Themenheft "Adaptive information retrieval"
Theme
Semantisches Umfeld in Indexierung u. Retrieval

Similar documents (content)

  1. Lu, K.; Joo, S.; Lee, T.; Hu, R.: Factors that influence query reformulations and search performance in health information retrieval : a multilevel modeling approach (2017) 0.26
    0.2598949 = sum of:
      0.2598949 = product of:
        0.92819613 = sum of:
          0.021586433 = weight(abstract_txt:each in 3754) [ClassicSimilarity], result of:
            0.021586433 = score(doc=3754,freq=1.0), product of:
              0.08385641 = queryWeight, product of:
                1.137375 = boost
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.017900618 = queryNorm
              0.25742137 = fieldWeight in 3754, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.0625 = fieldNorm(doc=3754)
          0.03292895 = weight(abstract_txt:same in 3754) [ClassicSimilarity], result of:
            0.03292895 = score(doc=3754,freq=1.0), product of:
              0.111122236 = queryWeight, product of:
                1.3092905 = boost
                4.7412944 = idf(docFreq=1048, maxDocs=44218)
                0.017900618 = queryNorm
              0.2963309 = fieldWeight in 3754, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7412944 = idf(docFreq=1048, maxDocs=44218)
                0.0625 = fieldNorm(doc=3754)
          0.063208595 = weight(abstract_txt:influence in 3754) [ClassicSimilarity], result of:
            0.063208595 = score(doc=3754,freq=2.0), product of:
              0.1362249 = queryWeight, product of:
                1.4496521 = boost
                5.2495813 = idf(docFreq=630, maxDocs=44218)
                0.017900618 = queryNorm
              0.4640018 = fieldWeight in 3754, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2495813 = idf(docFreq=630, maxDocs=44218)
                0.0625 = fieldNorm(doc=3754)
          0.041512772 = weight(abstract_txt:model in 3754) [ClassicSimilarity], result of:
            0.041512772 = score(doc=3754,freq=2.0), product of:
              0.117821336 = queryWeight, product of:
                1.6511751 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.017900618 = queryNorm
              0.35233662 = fieldWeight in 3754, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.0625 = fieldNorm(doc=3754)
          0.13068137 = weight(abstract_txt:factors in 3754) [ClassicSimilarity], result of:
            0.13068137 = score(doc=3754,freq=3.0), product of:
              0.24332929 = queryWeight, product of:
                2.7399821 = boost
                4.9611073 = idf(docFreq=841, maxDocs=44218)
                0.017900618 = queryNorm
              0.5370556 = fieldWeight in 3754, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.9611073 = idf(docFreq=841, maxDocs=44218)
                0.0625 = fieldNorm(doc=3754)
          0.39827347 = weight(abstract_txt:query in 3754) [ClassicSimilarity], result of:
            0.39827347 = score(doc=3754,freq=9.0), product of:
              0.44683012 = queryWeight, product of:
                5.2509365 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.017900618 = queryNorm
              0.89133084 = fieldWeight in 3754, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.0625 = fieldNorm(doc=3754)
          0.24000454 = weight(abstract_txt:contextual in 3754) [ClassicSimilarity], result of:
            0.24000454 = score(doc=3754,freq=1.0), product of:
              0.60247046 = queryWeight, product of:
                5.2803664 = boost
                6.373877 = idf(docFreq=204, maxDocs=44218)
                0.017900618 = queryNorm
              0.39836732 = fieldWeight in 3754, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.373877 = idf(docFreq=204, maxDocs=44218)
                0.0625 = fieldNorm(doc=3754)
        0.28 = coord(7/25)
    
  2. Ponte, J.M.: Language models for relevance feedback (2000) 0.21
    0.21150182 = sum of:
      0.21150182 = product of:
        0.58750504 = sum of:
          0.008219473 = weight(abstract_txt:information in 35) [ClassicSimilarity], result of:
            0.008219473 = score(doc=35,freq=1.0), product of:
              0.043457903 = queryWeight, product of:
                1.0028028 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.017900618 = queryNorm
              0.18913643 = fieldWeight in 35, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.078125 = fieldNorm(doc=35)
          0.022233125 = weight(abstract_txt:will in 35) [ClassicSimilarity], result of:
            0.022233125 = score(doc=35,freq=1.0), product of:
              0.07370145 = queryWeight, product of:
                1.0662856 = boost
                3.8613079 = idf(docFreq=2528, maxDocs=44218)
                0.017900618 = queryNorm
              0.30166468 = fieldWeight in 35, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8613079 = idf(docFreq=2528, maxDocs=44218)
                0.078125 = fieldNorm(doc=35)
          0.026983041 = weight(abstract_txt:each in 35) [ClassicSimilarity], result of:
            0.026983041 = score(doc=35,freq=1.0), product of:
              0.08385641 = queryWeight, product of:
                1.137375 = boost
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.017900618 = queryNorm
              0.32177672 = fieldWeight in 35, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.078125 = fieldNorm(doc=35)
          0.02821418 = weight(abstract_txt:need in 35) [ClassicSimilarity], result of:
            0.02821418 = score(doc=35,freq=1.0), product of:
              0.08638811 = queryWeight, product of:
                1.1544166 = boost
                4.180454 = idf(docFreq=1837, maxDocs=44218)
                0.017900618 = queryNorm
              0.32659796 = fieldWeight in 35, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.180454 = idf(docFreq=1837, maxDocs=44218)
                0.078125 = fieldNorm(doc=35)
          0.056494538 = weight(abstract_txt:language in 35) [ClassicSimilarity], result of:
            0.056494538 = score(doc=35,freq=4.0), product of:
              0.08645564 = queryWeight, product of:
                1.1548676 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.017900618 = queryNorm
              0.65345114 = fieldWeight in 35, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.078125 = fieldNorm(doc=35)
          0.04319922 = weight(abstract_txt:document in 35) [ClassicSimilarity], result of:
            0.04319922 = score(doc=35,freq=2.0), product of:
              0.091085576 = queryWeight, product of:
                1.1853875 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.017900618 = queryNorm
              0.4742707 = fieldWeight in 35, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.078125 = fieldNorm(doc=35)
          0.05117917 = weight(abstract_txt:effectiveness in 35) [ClassicSimilarity], result of:
            0.05117917 = score(doc=35,freq=1.0), product of:
              0.12849055 = queryWeight, product of:
                1.4078978 = boost
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.017900618 = queryNorm
              0.39831078 = fieldWeight in 35, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.078125 = fieldNorm(doc=35)
          0.06355319 = weight(abstract_txt:model in 35) [ClassicSimilarity], result of:
            0.06355319 = score(doc=35,freq=3.0), product of:
              0.117821336 = queryWeight, product of:
                1.6511751 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.017900618 = queryNorm
              0.5394031 = fieldWeight in 35, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.078125 = fieldNorm(doc=35)
          0.28742912 = weight(abstract_txt:query in 35) [ClassicSimilarity], result of:
            0.28742912 = score(doc=35,freq=3.0), product of:
              0.44683012 = queryWeight, product of:
                5.2509365 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.017900618 = queryNorm
              0.6432626 = fieldWeight in 35, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.078125 = fieldNorm(doc=35)
        0.36 = coord(9/25)
    
  3. Shvartzshnaider, Y.; Sanfilippo, M.R.; Apthorpe, N.: GKC-CI : a unifying framework for contextual norms and information governance (2022) 0.20
    0.19939664 = sum of:
      0.19939664 = product of:
        0.83081937 = sum of:
          0.017083853 = weight(abstract_txt:information in 651) [ClassicSimilarity], result of:
            0.017083853 = score(doc=651,freq=3.0), product of:
              0.043457903 = queryWeight, product of:
                1.0028028 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.017900618 = queryNorm
              0.3931127 = fieldWeight in 651, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.09375 = fieldNorm(doc=651)
          0.033857014 = weight(abstract_txt:need in 651) [ClassicSimilarity], result of:
            0.033857014 = score(doc=651,freq=1.0), product of:
              0.08638811 = queryWeight, product of:
                1.1544166 = boost
                4.180454 = idf(docFreq=1837, maxDocs=44218)
                0.017900618 = queryNorm
              0.39191753 = fieldWeight in 651, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.180454 = idf(docFreq=1837, maxDocs=44218)
                0.09375 = fieldNorm(doc=651)
          0.06704284 = weight(abstract_txt:influence in 651) [ClassicSimilarity], result of:
            0.06704284 = score(doc=651,freq=1.0), product of:
              0.1362249 = queryWeight, product of:
                1.4496521 = boost
                5.2495813 = idf(docFreq=630, maxDocs=44218)
                0.017900618 = queryNorm
              0.49214825 = fieldWeight in 651, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2495813 = idf(docFreq=630, maxDocs=44218)
                0.09375 = fieldNorm(doc=651)
          0.0905358 = weight(abstract_txt:factor in 651) [ClassicSimilarity], result of:
            0.0905358 = score(doc=651,freq=1.0), product of:
              0.16643132 = queryWeight, product of:
                1.6023341 = boost
                5.8024845 = idf(docFreq=362, maxDocs=44218)
                0.017900618 = queryNorm
              0.5439829 = fieldWeight in 651, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8024845 = idf(docFreq=362, maxDocs=44218)
                0.09375 = fieldNorm(doc=651)
          0.11317338 = weight(abstract_txt:factors in 651) [ClassicSimilarity], result of:
            0.11317338 = score(doc=651,freq=1.0), product of:
              0.24332929 = queryWeight, product of:
                2.7399821 = boost
                4.9611073 = idf(docFreq=841, maxDocs=44218)
                0.017900618 = queryNorm
              0.4651038 = fieldWeight in 651, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9611073 = idf(docFreq=841, maxDocs=44218)
                0.09375 = fieldNorm(doc=651)
          0.5091265 = weight(abstract_txt:contextual in 651) [ClassicSimilarity], result of:
            0.5091265 = score(doc=651,freq=2.0), product of:
              0.60247046 = queryWeight, product of:
                5.2803664 = boost
                6.373877 = idf(docFreq=204, maxDocs=44218)
                0.017900618 = queryNorm
              0.84506464 = fieldWeight in 651, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.373877 = idf(docFreq=204, maxDocs=44218)
                0.09375 = fieldNorm(doc=651)
        0.24 = coord(6/25)
    
  4. Hofmann, K.; Balog, K.; Bogers, T.; Rijke, M. de: Contextual factors for finding similar experts (2010) 0.19
    0.18736348 = sum of:
      0.18736348 = product of:
        0.7806812 = sum of:
          0.021382522 = weight(abstract_txt:document in 3456) [ClassicSimilarity], result of:
            0.021382522 = score(doc=3456,freq=1.0), product of:
              0.091085576 = queryWeight, product of:
                1.1853875 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.017900618 = queryNorm
              0.23475201 = fieldWeight in 3456, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3456)
          0.028812833 = weight(abstract_txt:same in 3456) [ClassicSimilarity], result of:
            0.028812833 = score(doc=3456,freq=1.0), product of:
              0.111122236 = queryWeight, product of:
                1.3092905 = boost
                4.7412944 = idf(docFreq=1048, maxDocs=44218)
                0.017900618 = queryNorm
              0.25928953 = fieldWeight in 3456, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7412944 = idf(docFreq=1048, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3456)
          0.055307522 = weight(abstract_txt:influence in 3456) [ClassicSimilarity], result of:
            0.055307522 = score(doc=3456,freq=2.0), product of:
              0.1362249 = queryWeight, product of:
                1.4496521 = boost
                5.2495813 = idf(docFreq=630, maxDocs=44218)
                0.017900618 = queryNorm
              0.40600157 = fieldWeight in 3456, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2495813 = idf(docFreq=630, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3456)
          0.0684438 = weight(abstract_txt:integrate in 3456) [ClassicSimilarity], result of:
            0.0684438 = score(doc=3456,freq=1.0), product of:
              0.19783345 = queryWeight, product of:
                1.7469698 = boost
                6.326249 = idf(docFreq=214, maxDocs=44218)
                0.017900618 = queryNorm
              0.34596676 = fieldWeight in 3456, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.326249 = idf(docFreq=214, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3456)
          0.18672656 = weight(abstract_txt:factors in 3456) [ClassicSimilarity], result of:
            0.18672656 = score(doc=3456,freq=8.0), product of:
              0.24332929 = queryWeight, product of:
                2.7399821 = boost
                4.9611073 = idf(docFreq=841, maxDocs=44218)
                0.017900618 = queryNorm
              0.76738214 = fieldWeight in 3456, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.9611073 = idf(docFreq=841, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3456)
          0.42000794 = weight(abstract_txt:contextual in 3456) [ClassicSimilarity], result of:
            0.42000794 = score(doc=3456,freq=4.0), product of:
              0.60247046 = queryWeight, product of:
                5.2803664 = boost
                6.373877 = idf(docFreq=204, maxDocs=44218)
                0.017900618 = queryNorm
              0.6971428 = fieldWeight in 3456, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.373877 = idf(docFreq=204, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3456)
        0.24 = coord(6/25)
    
  5. Liu, J.; Belkin, N.J.: Personalizing information retrieval for multi-session tasks : examining the roles of task stage, task type, and topic knowledge on the interpretation of dwell time as an indicator of document usefulness (2015) 0.18
    0.18373366 = sum of:
      0.18373366 = product of:
        0.76555693 = sum of:
          0.011389235 = weight(abstract_txt:information in 1608) [ClassicSimilarity], result of:
            0.011389235 = score(doc=1608,freq=3.0), product of:
              0.043457903 = queryWeight, product of:
                1.0028028 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.017900618 = queryNorm
              0.26207513 = fieldWeight in 1608, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=1608)
          0.021586433 = weight(abstract_txt:each in 1608) [ClassicSimilarity], result of:
            0.021586433 = score(doc=1608,freq=1.0), product of:
              0.08385641 = queryWeight, product of:
                1.137375 = boost
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.017900618 = queryNorm
              0.25742137 = fieldWeight in 1608, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.0625 = fieldNorm(doc=1608)
          0.042326417 = weight(abstract_txt:document in 1608) [ClassicSimilarity], result of:
            0.042326417 = score(doc=1608,freq=3.0), product of:
              0.091085576 = queryWeight, product of:
                1.1853875 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.017900618 = queryNorm
              0.46468848 = fieldWeight in 1608, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=1608)
          0.059347965 = weight(abstract_txt:contexts in 1608) [ClassicSimilarity], result of:
            0.059347965 = score(doc=1608,freq=1.0), product of:
              0.16457085 = queryWeight, product of:
                1.593353 = boost
                5.7699614 = idf(docFreq=374, maxDocs=44218)
                0.017900618 = queryNorm
              0.36062258 = fieldWeight in 1608, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7699614 = idf(docFreq=374, maxDocs=44218)
                0.0625 = fieldNorm(doc=1608)
          0.15089783 = weight(abstract_txt:factors in 1608) [ClassicSimilarity], result of:
            0.15089783 = score(doc=1608,freq=4.0), product of:
              0.24332929 = queryWeight, product of:
                2.7399821 = boost
                4.9611073 = idf(docFreq=841, maxDocs=44218)
                0.017900618 = queryNorm
              0.6201384 = fieldWeight in 1608, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.9611073 = idf(docFreq=841, maxDocs=44218)
                0.0625 = fieldNorm(doc=1608)
          0.48000908 = weight(abstract_txt:contextual in 1608) [ClassicSimilarity], result of:
            0.48000908 = score(doc=1608,freq=4.0), product of:
              0.60247046 = queryWeight, product of:
                5.2803664 = boost
                6.373877 = idf(docFreq=204, maxDocs=44218)
                0.017900618 = queryNorm
              0.79673463 = fieldWeight in 1608, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.373877 = idf(docFreq=204, maxDocs=44218)
                0.0625 = fieldNorm(doc=1608)
        0.24 = coord(6/25)