Document (#37380)

Author
Makris, C.
Plegas, Y.
Stamou, S.
Title
Web query disambiguation using PageRank
Source
Journal of the American Society for Information Science and Technology. 63(2012) no.8, S.1581-1592
Year
2012
Abstract
In this article, we propose new word sense disambiguation strategies for resolving the senses of polysemous query terms issued to Web search engines, and we explore the application of those strategies when used in a query expansion framework. The novelty of our approach lies in the exploitation of the Web page PageRank values as indicators of the significance the different senses of a term carry when employed in search queries. We also aim at scalable query sense resolution techniques that can be applied without loss of efficiency to large data sets such as those on the Web. Our experimental findings validate that the proposed techniques perform more accurately than do the traditional disambiguation strategies and improve the quality of the search results, when involved in query expansion.
Theme
Suchmaschinen
Aid
PageRank

Similar documents (content)

  1. Krovetz, R.; Croft, W.B.: Lexical ambiguity and information retrieval (1992) 0.22
    0.22082508 = sum of:
      0.22082508 = product of:
        1.1041254 = sum of:
          0.14524236 = weight(abstract_txt:resolving in 4028) [ClassicSimilarity], result of:
            0.14524236 = score(doc=4028,freq=1.0), product of:
              0.18432955 = queryWeight, product of:
                1.362241 = boost
                8.404794 = idf(docFreq=25, maxDocs=42740)
                0.016099557 = queryNorm
              0.78794944 = fieldWeight in 4028, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.404794 = idf(docFreq=25, maxDocs=42740)
                0.09375 = fieldNorm(doc=4028)
          0.09773933 = weight(abstract_txt:sense in 4028) [ClassicSimilarity], result of:
            0.09773933 = score(doc=4028,freq=1.0), product of:
              0.17834283 = queryWeight, product of:
                1.8949567 = boost
                5.8457794 = idf(docFreq=335, maxDocs=42740)
                0.016099557 = queryNorm
              0.5480418 = fieldWeight in 4028, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8457794 = idf(docFreq=335, maxDocs=42740)
                0.09375 = fieldNorm(doc=4028)
          0.31319672 = weight(abstract_txt:senses in 4028) [ClassicSimilarity], result of:
            0.31319672 = score(doc=4028,freq=1.0), product of:
              0.38763312 = queryWeight, product of:
                2.7937138 = boost
                8.618368 = idf(docFreq=20, maxDocs=42740)
                0.016099557 = queryNorm
              0.807972 = fieldWeight in 4028, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.618368 = idf(docFreq=20, maxDocs=42740)
                0.09375 = fieldNorm(doc=4028)
          0.41823062 = weight(abstract_txt:disambiguation in 4028) [ClassicSimilarity], result of:
            0.41823062 = score(doc=4028,freq=2.0), product of:
              0.42707786 = queryWeight, product of:
                3.5914567 = boost
                7.3862243 = idf(docFreq=71, maxDocs=42740)
                0.016099557 = queryNorm
              0.9792843 = fieldWeight in 4028, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.3862243 = idf(docFreq=71, maxDocs=42740)
                0.09375 = fieldNorm(doc=4028)
          0.12971628 = weight(abstract_txt:query in 4028) [ClassicSimilarity], result of:
            0.12971628 = score(doc=4028,freq=1.0), product of:
              0.29231587 = queryWeight, product of:
                3.8359034 = boost
                4.7333736 = idf(docFreq=1021, maxDocs=42740)
                0.016099557 = queryNorm
              0.44375378 = fieldWeight in 4028, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7333736 = idf(docFreq=1021, maxDocs=42740)
                0.09375 = fieldNorm(doc=4028)
        0.2 = coord(5/25)
    
  2. Montejo-Ráez, A.; Martínez-Cámara, E.; Martín-Valdivia, M.T.; Ureña-López, L.A.: ¬A knowledge-based approach for polarity classification in Twitter (2014) 0.14
    0.13671207 = sum of:
      0.13671207 = product of:
        0.85445046 = sum of:
          0.15842524 = weight(abstract_txt:expansion in 3205) [ClassicSimilarity], result of:
            0.15842524 = score(doc=3205,freq=2.0), product of:
              0.19532104 = queryWeight, product of:
                1.9831063 = boost
                6.117713 = idf(docFreq=255, maxDocs=42740)
                0.016099557 = queryNorm
              0.8111018 = fieldWeight in 3205, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.117713 = idf(docFreq=255, maxDocs=42740)
                0.09375 = fieldNorm(doc=3205)
          0.29998145 = weight(abstract_txt:pagerank in 3205) [ClassicSimilarity], result of:
            0.29998145 = score(doc=3205,freq=2.0), product of:
              0.29894802 = queryWeight, product of:
                2.4534054 = boost
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.016099557 = queryNorm
              1.0034568 = fieldWeight in 3205, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.09375 = fieldNorm(doc=3205)
          0.1003101 = weight(abstract_txt:strategies in 3205) [ClassicSimilarity], result of:
            0.1003101 = score(doc=3205,freq=1.0), product of:
              0.20771584 = queryWeight, product of:
                2.504678 = boost
                5.151145 = idf(docFreq=672, maxDocs=42740)
                0.016099557 = queryNorm
              0.48291984 = fieldWeight in 3205, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.151145 = idf(docFreq=672, maxDocs=42740)
                0.09375 = fieldNorm(doc=3205)
          0.2957337 = weight(abstract_txt:disambiguation in 3205) [ClassicSimilarity], result of:
            0.2957337 = score(doc=3205,freq=1.0), product of:
              0.42707786 = queryWeight, product of:
                3.5914567 = boost
                7.3862243 = idf(docFreq=71, maxDocs=42740)
                0.016099557 = queryNorm
              0.6924585 = fieldWeight in 3205, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3862243 = idf(docFreq=71, maxDocs=42740)
                0.09375 = fieldNorm(doc=3205)
        0.16 = coord(4/25)
    
  3. Bando, L.L.; Scholer, F.; Turpin, A.: Query-biased summary generation assisted by query expansion : temporality (2015) 0.12
    0.12364379 = sum of:
      0.12364379 = product of:
        0.61821896 = sum of:
          0.0797599 = weight(abstract_txt:novelty in 3821) [ClassicSimilarity], result of:
            0.0797599 = score(doc=3821,freq=1.0), product of:
              0.16197576 = queryWeight, product of:
                1.2769723 = boost
                7.8787007 = idf(docFreq=43, maxDocs=42740)
                0.016099557 = queryNorm
              0.4924188 = fieldWeight in 3821, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8787007 = idf(docFreq=43, maxDocs=42740)
                0.0625 = fieldNorm(doc=3821)
          0.030233104 = weight(abstract_txt:techniques in 3821) [ClassicSimilarity], result of:
            0.030233104 = score(doc=3821,freq=1.0), product of:
              0.10688713 = queryWeight, product of:
                1.4670137 = boost
                4.525612 = idf(docFreq=1257, maxDocs=42740)
                0.016099557 = queryNorm
              0.28285074 = fieldWeight in 3821, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.525612 = idf(docFreq=1257, maxDocs=42740)
                0.0625 = fieldNorm(doc=3821)
          0.023821479 = weight(abstract_txt:search in 3821) [ClassicSimilarity], result of:
            0.023821479 = score(doc=3821,freq=1.0), product of:
              0.104379006 = queryWeight, product of:
                1.7755123 = boost
                3.6515355 = idf(docFreq=3014, maxDocs=42740)
                0.016099557 = queryNorm
              0.22822097 = fieldWeight in 3821, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6515355 = idf(docFreq=3014, maxDocs=42740)
                0.0625 = fieldNorm(doc=3821)
          0.197591 = weight(abstract_txt:expansion in 3821) [ClassicSimilarity], result of:
            0.197591 = score(doc=3821,freq=7.0), product of:
              0.19532104 = queryWeight, product of:
                1.9831063 = boost
                6.117713 = idf(docFreq=255, maxDocs=42740)
                0.016099557 = queryNorm
              1.0116217 = fieldWeight in 3821, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.117713 = idf(docFreq=255, maxDocs=42740)
                0.0625 = fieldNorm(doc=3821)
          0.28681347 = weight(abstract_txt:query in 3821) [ClassicSimilarity], result of:
            0.28681347 = score(doc=3821,freq=11.0), product of:
              0.29231587 = queryWeight, product of:
                3.8359034 = boost
                4.7333736 = idf(docFreq=1021, maxDocs=42740)
                0.016099557 = queryNorm
              0.98117656 = fieldWeight in 3821, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                4.7333736 = idf(docFreq=1021, maxDocs=42740)
                0.0625 = fieldNorm(doc=3821)
        0.2 = coord(5/25)
    
  4. Kelley, D.: Relevance feedback : getting to know your user (2008) 0.12
    0.12061054 = sum of:
      0.12061054 = product of:
        0.5025439 = sum of:
          0.059940502 = weight(abstract_txt:accurately in 3925) [ClassicSimilarity], result of:
            0.059940502 = score(doc=3925,freq=1.0), product of:
              0.13388765 = queryWeight, product of:
                1.1609854 = boost
                7.1630807 = idf(docFreq=89, maxDocs=42740)
                0.016099557 = queryNorm
              0.44769254 = fieldWeight in 3925, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1630807 = idf(docFreq=89, maxDocs=42740)
                0.0625 = fieldNorm(doc=3925)
          0.060466208 = weight(abstract_txt:techniques in 3925) [ClassicSimilarity], result of:
            0.060466208 = score(doc=3925,freq=4.0), product of:
              0.10688713 = queryWeight, product of:
                1.4670137 = boost
                4.525612 = idf(docFreq=1257, maxDocs=42740)
                0.016099557 = queryNorm
              0.5657015 = fieldWeight in 3925, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.525612 = idf(docFreq=1257, maxDocs=42740)
                0.0625 = fieldNorm(doc=3925)
          0.023821479 = weight(abstract_txt:search in 3925) [ClassicSimilarity], result of:
            0.023821479 = score(doc=3925,freq=1.0), product of:
              0.104379006 = queryWeight, product of:
                1.7755123 = boost
                3.6515355 = idf(docFreq=3014, maxDocs=42740)
                0.016099557 = queryNorm
              0.22822097 = fieldWeight in 3925, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6515355 = idf(docFreq=3014, maxDocs=42740)
                0.0625 = fieldNorm(doc=3925)
          0.07468238 = weight(abstract_txt:expansion in 3925) [ClassicSimilarity], result of:
            0.07468238 = score(doc=3925,freq=1.0), product of:
              0.19532104 = queryWeight, product of:
                1.9831063 = boost
                6.117713 = idf(docFreq=255, maxDocs=42740)
                0.016099557 = queryNorm
              0.38235706 = fieldWeight in 3925, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.117713 = idf(docFreq=255, maxDocs=42740)
                0.0625 = fieldNorm(doc=3925)
          0.1971558 = weight(abstract_txt:disambiguation in 3925) [ClassicSimilarity], result of:
            0.1971558 = score(doc=3925,freq=1.0), product of:
              0.42707786 = queryWeight, product of:
                3.5914567 = boost
                7.3862243 = idf(docFreq=71, maxDocs=42740)
                0.016099557 = queryNorm
              0.46163902 = fieldWeight in 3925, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3862243 = idf(docFreq=71, maxDocs=42740)
                0.0625 = fieldNorm(doc=3925)
          0.08647752 = weight(abstract_txt:query in 3925) [ClassicSimilarity], result of:
            0.08647752 = score(doc=3925,freq=1.0), product of:
              0.29231587 = queryWeight, product of:
                3.8359034 = boost
                4.7333736 = idf(docFreq=1021, maxDocs=42740)
                0.016099557 = queryNorm
              0.29583585 = fieldWeight in 3925, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7333736 = idf(docFreq=1021, maxDocs=42740)
                0.0625 = fieldNorm(doc=3925)
        0.24 = coord(6/25)
    
  5. Fidel, R.; Efthimiadis, E.N.: Terminological knowledge structure for intermediary expert systems (1995) 0.12
    0.119388595 = sum of:
      0.119388595 = product of:
        0.59694296 = sum of:
          0.06545659 = weight(abstract_txt:techniques in 611) [ClassicSimilarity], result of:
            0.06545659 = score(doc=611,freq=3.0), product of:
              0.10688713 = queryWeight, product of:
                1.4670137 = boost
                4.525612 = idf(docFreq=1257, maxDocs=42740)
                0.016099557 = queryNorm
              0.6123898 = fieldWeight in 611, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.525612 = idf(docFreq=1257, maxDocs=42740)
                0.078125 = fieldNorm(doc=611)
          0.09335297 = weight(abstract_txt:expansion in 611) [ClassicSimilarity], result of:
            0.09335297 = score(doc=611,freq=1.0), product of:
              0.19532104 = queryWeight, product of:
                1.9831063 = boost
                6.117713 = idf(docFreq=255, maxDocs=42740)
                0.016099557 = queryNorm
              0.47794634 = fieldWeight in 611, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.117713 = idf(docFreq=255, maxDocs=42740)
                0.078125 = fieldNorm(doc=611)
          0.08359175 = weight(abstract_txt:strategies in 611) [ClassicSimilarity], result of:
            0.08359175 = score(doc=611,freq=1.0), product of:
              0.20771584 = queryWeight, product of:
                2.504678 = boost
                5.151145 = idf(docFreq=672, maxDocs=42740)
                0.016099557 = queryNorm
              0.40243322 = fieldWeight in 611, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.151145 = idf(docFreq=672, maxDocs=42740)
                0.078125 = fieldNorm(doc=611)
          0.24644476 = weight(abstract_txt:disambiguation in 611) [ClassicSimilarity], result of:
            0.24644476 = score(doc=611,freq=1.0), product of:
              0.42707786 = queryWeight, product of:
                3.5914567 = boost
                7.3862243 = idf(docFreq=71, maxDocs=42740)
                0.016099557 = queryNorm
              0.5770488 = fieldWeight in 611, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3862243 = idf(docFreq=71, maxDocs=42740)
                0.078125 = fieldNorm(doc=611)
          0.10809689 = weight(abstract_txt:query in 611) [ClassicSimilarity], result of:
            0.10809689 = score(doc=611,freq=1.0), product of:
              0.29231587 = queryWeight, product of:
                3.8359034 = boost
                4.7333736 = idf(docFreq=1021, maxDocs=42740)
                0.016099557 = queryNorm
              0.36979482 = fieldWeight in 611, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7333736 = idf(docFreq=1021, maxDocs=42740)
                0.078125 = fieldNorm(doc=611)
        0.2 = coord(5/25)