Document (#30186)

Author
Widyantoro, D.H.
Ioerger, T.R.
Yen, J.
Title
Learning user Interest dynamics with a three-descriptor representation
Source
Journal of the American Society for Information Science and technology. 52(2001) no.3, S.212-225
Year
2001
Abstract
The use of documents ranked high by user feedback to profile user interests is commonly done with Rocchio's `s algorithm which uses a single list of attribute value pairs called a descriptor to carry term value weights for an individual. Negative feed back on old preferences or positive feedback on new preferences adjusts the descriptor at a fixed, predetermined, and often slow pace. Widyantoro, et alia, suggest a three descriptor model which adds two short term interest descriptors, one each for positive and negative feedback. User short term interest in a particular document is computed by subtracting the similarity measure with the negative descriptor from the similarity measure with the positive descriptor. Using a constant to represent the desired impact of long and short term interests these values may be summed for a single interest value. Using the Reuters 21578 1.0 test collection split into training and test sets, topics with at least 100 documents in a tight cluster were chosen. The TDR handles change well showing better recovery speed and accuracy than the single descriptor model. The nearest neighbor update strategy appears to keep the category concept relatively consistent when multiple TDRs are used.
Theme
Retrievalalgorithmen
Object
Rocchio-Algorithmus

Similar documents (content)

  1. Díaz, A.; Gervás, P.: User-model based personalized summarization (2007) 0.16
    0.16064528 = sum of:
      0.16064528 = product of:
        0.44623688 = sum of:
          0.03395147 = weight(abstract_txt:measure in 952) [ClassicSimilarity], result of:
            0.03395147 = score(doc=952,freq=1.0), product of:
              0.09990674 = queryWeight, product of:
                1.231221 = boost
                5.437306 = idf(docFreq=522, maxDocs=44218)
                0.014923649 = queryNorm
              0.33983162 = fieldWeight in 952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.437306 = idf(docFreq=522, maxDocs=44218)
                0.0625 = fieldNorm(doc=952)
          0.008247615 = weight(abstract_txt:with in 952) [ClassicSimilarity], result of:
            0.008247615 = score(doc=952,freq=1.0), product of:
              0.052790362 = queryWeight, product of:
                1.415096 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.014923649 = queryNorm
              0.15623334 = fieldWeight in 952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=952)
          0.05520013 = weight(abstract_txt:interests in 952) [ClassicSimilarity], result of:
            0.05520013 = score(doc=952,freq=1.0), product of:
              0.13813885 = queryWeight, product of:
                1.4477597 = boost
                6.3935823 = idf(docFreq=200, maxDocs=44218)
                0.014923649 = queryNorm
              0.3995989 = fieldWeight in 952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3935823 = idf(docFreq=200, maxDocs=44218)
                0.0625 = fieldNorm(doc=952)
          0.061672423 = weight(abstract_txt:preferences in 952) [ClassicSimilarity], result of:
            0.061672423 = score(doc=952,freq=1.0), product of:
              0.1487361 = queryWeight, product of:
                1.5022659 = boost
                6.634292 = idf(docFreq=157, maxDocs=44218)
                0.014923649 = queryNorm
              0.41464326 = fieldWeight in 952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.634292 = idf(docFreq=157, maxDocs=44218)
                0.0625 = fieldNorm(doc=952)
          0.05585782 = weight(abstract_txt:user in 952) [ClassicSimilarity], result of:
            0.05585782 = score(doc=952,freq=7.0), product of:
              0.09170416 = queryWeight, product of:
                1.6682 = boost
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.014923649 = queryNorm
              0.60910887 = fieldWeight in 952, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.0625 = fieldNorm(doc=952)
          0.061717194 = weight(abstract_txt:short in 952) [ClassicSimilarity], result of:
            0.061717194 = score(doc=952,freq=1.0), product of:
              0.17034273 = queryWeight, product of:
                1.9690015 = boost
                5.79699 = idf(docFreq=364, maxDocs=44218)
                0.014923649 = queryNorm
              0.36231187 = fieldWeight in 952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.79699 = idf(docFreq=364, maxDocs=44218)
                0.0625 = fieldNorm(doc=952)
          0.06654322 = weight(abstract_txt:feedback in 952) [ClassicSimilarity], result of:
            0.06654322 = score(doc=952,freq=1.0), product of:
              0.17911091 = queryWeight, product of:
                2.0190415 = boost
                5.9443145 = idf(docFreq=314, maxDocs=44218)
                0.014923649 = queryNorm
              0.37151965 = fieldWeight in 952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9443145 = idf(docFreq=314, maxDocs=44218)
                0.0625 = fieldNorm(doc=952)
          0.04675068 = weight(abstract_txt:term in 952) [ClassicSimilarity], result of:
            0.04675068 = score(doc=952,freq=1.0), product of:
              0.1557965 = queryWeight, product of:
                2.174365 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.014923649 = queryNorm
              0.3000753 = fieldWeight in 952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.0625 = fieldNorm(doc=952)
          0.05629631 = weight(abstract_txt:interest in 952) [ClassicSimilarity], result of:
            0.05629631 = score(doc=952,freq=1.0), product of:
              0.17634062 = queryWeight, product of:
                2.3132885 = boost
                5.1079607 = idf(docFreq=726, maxDocs=44218)
                0.014923649 = queryNorm
              0.31924754 = fieldWeight in 952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1079607 = idf(docFreq=726, maxDocs=44218)
                0.0625 = fieldNorm(doc=952)
        0.36 = coord(9/25)
    
  2. Chen, Z.; Meng, X.; Fowler, R.H.; Zhu, B.: Real-time adaptive feature and document learning for Web search (2001) 0.15
    0.1498296 = sum of:
      0.1498296 = product of:
        0.46821752 = sum of:
          0.07505844 = weight(abstract_txt:alia in 5209) [ClassicSimilarity], result of:
            0.07505844 = score(doc=5209,freq=1.0), product of:
              0.13456912 = queryWeight, product of:
                1.010407 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.014923649 = queryNorm
              0.55776864 = fieldWeight in 5209, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.0625 = fieldNorm(doc=5209)
          0.10262601 = weight(abstract_txt:summed in 5209) [ClassicSimilarity], result of:
            0.10262601 = score(doc=5209,freq=1.0), product of:
              0.16577436 = queryWeight, product of:
                1.1214561 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.014923649 = queryNorm
              0.6190705 = fieldWeight in 5209, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.0625 = fieldNorm(doc=5209)
          0.027146008 = weight(abstract_txt:test in 5209) [ClassicSimilarity], result of:
            0.027146008 = score(doc=5209,freq=1.0), product of:
              0.086064965 = queryWeight, product of:
                1.1427515 = boost
                5.046608 = idf(docFreq=772, maxDocs=44218)
                0.014923649 = queryNorm
              0.315413 = fieldWeight in 5209, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.046608 = idf(docFreq=772, maxDocs=44218)
                0.0625 = fieldNorm(doc=5209)
          0.008247615 = weight(abstract_txt:with in 5209) [ClassicSimilarity], result of:
            0.008247615 = score(doc=5209,freq=1.0), product of:
              0.052790362 = queryWeight, product of:
                1.415096 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.014923649 = queryNorm
              0.15623334 = fieldWeight in 5209, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=5209)
          0.04222454 = weight(abstract_txt:user in 5209) [ClassicSimilarity], result of:
            0.04222454 = score(doc=5209,freq=4.0), product of:
              0.09170416 = queryWeight, product of:
                1.6682 = boost
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.014923649 = queryNorm
              0.46044302 = fieldWeight in 5209, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.0625 = fieldNorm(doc=5209)
          0.065397136 = weight(abstract_txt:positive in 5209) [ClassicSimilarity], result of:
            0.065397136 = score(doc=5209,freq=1.0), product of:
              0.17704839 = queryWeight, product of:
                2.0073829 = boost
                5.90999 = idf(docFreq=325, maxDocs=44218)
                0.014923649 = queryNorm
              0.36937436 = fieldWeight in 5209, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.90999 = idf(docFreq=325, maxDocs=44218)
                0.0625 = fieldNorm(doc=5209)
          0.06654322 = weight(abstract_txt:feedback in 5209) [ClassicSimilarity], result of:
            0.06654322 = score(doc=5209,freq=1.0), product of:
              0.17911091 = queryWeight, product of:
                2.0190415 = boost
                5.9443145 = idf(docFreq=314, maxDocs=44218)
                0.014923649 = queryNorm
              0.37151965 = fieldWeight in 5209, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9443145 = idf(docFreq=314, maxDocs=44218)
                0.0625 = fieldNorm(doc=5209)
          0.08097455 = weight(abstract_txt:term in 5209) [ClassicSimilarity], result of:
            0.08097455 = score(doc=5209,freq=3.0), product of:
              0.1557965 = queryWeight, product of:
                2.174365 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.014923649 = queryNorm
              0.51974565 = fieldWeight in 5209, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.0625 = fieldNorm(doc=5209)
        0.32 = coord(8/25)
    
  3. Spiteri, L.F.: Word association testing and thesaurus construction : a pilot study (2005) 0.15
    0.14924584 = sum of:
      0.14924584 = product of:
        0.7462292 = sum of:
          0.05758538 = weight(abstract_txt:test in 5216) [ClassicSimilarity], result of:
            0.05758538 = score(doc=5216,freq=2.0), product of:
              0.086064965 = queryWeight, product of:
                1.1427515 = boost
                5.046608 = idf(docFreq=772, maxDocs=44218)
                0.014923649 = queryNorm
              0.669092 = fieldWeight in 5216, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.046608 = idf(docFreq=772, maxDocs=44218)
                0.09375 = fieldNorm(doc=5216)
          0.012371422 = weight(abstract_txt:with in 5216) [ClassicSimilarity], result of:
            0.012371422 = score(doc=5216,freq=1.0), product of:
              0.052790362 = queryWeight, product of:
                1.415096 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.014923649 = queryNorm
              0.23435001 = fieldWeight in 5216, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.09375 = fieldNorm(doc=5216)
          0.031668406 = weight(abstract_txt:user in 5216) [ClassicSimilarity], result of:
            0.031668406 = score(doc=5216,freq=1.0), product of:
              0.09170416 = queryWeight, product of:
                1.6682 = boost
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.014923649 = queryNorm
              0.34533226 = fieldWeight in 5216, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.09375 = fieldNorm(doc=5216)
          0.12146183 = weight(abstract_txt:term in 5216) [ClassicSimilarity], result of:
            0.12146183 = score(doc=5216,freq=3.0), product of:
              0.1557965 = queryWeight, product of:
                2.174365 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.014923649 = queryNorm
              0.7796185 = fieldWeight in 5216, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.09375 = fieldNorm(doc=5216)
          0.52314216 = weight(abstract_txt:descriptor in 5216) [ClassicSimilarity], result of:
            0.52314216 = score(doc=5216,freq=1.0), product of:
              0.716799 = queryWeight, product of:
                6.169803 = boost
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.014923649 = queryNorm
              0.72983104 = fieldWeight in 5216, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.09375 = fieldNorm(doc=5216)
        0.2 = coord(5/25)
    
  4. Fagni, T.; Sebastiani, F.: Selecting negative examples for hierarchical text classification: An experimental comparison (2010) 0.10
    0.10097615 = sum of:
      0.10097615 = product of:
        0.42073396 = sum of:
          0.025725601 = weight(abstract_txt:three in 4101) [ClassicSimilarity], result of:
            0.025725601 = score(doc=4101,freq=2.0), product of:
              0.06590567 = queryWeight, product of:
                4.41619 = idf(docFreq=1451, maxDocs=44218)
                0.014923649 = queryNorm
              0.39033973 = fieldWeight in 4101, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.41619 = idf(docFreq=1451, maxDocs=44218)
                0.0625 = fieldNorm(doc=4101)
          0.10262601 = weight(abstract_txt:21578 in 4101) [ClassicSimilarity], result of:
            0.10262601 = score(doc=4101,freq=1.0), product of:
              0.16577436 = queryWeight, product of:
                1.1214561 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.014923649 = queryNorm
              0.6190705 = fieldWeight in 4101, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.0625 = fieldNorm(doc=4101)
          0.011663889 = weight(abstract_txt:with in 4101) [ClassicSimilarity], result of:
            0.011663889 = score(doc=4101,freq=2.0), product of:
              0.052790362 = queryWeight, product of:
                1.415096 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.014923649 = queryNorm
              0.22094731 = fieldWeight in 4101, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=4101)
          0.065397136 = weight(abstract_txt:positive in 4101) [ClassicSimilarity], result of:
            0.065397136 = score(doc=4101,freq=1.0), product of:
              0.17704839 = queryWeight, product of:
                2.0073829 = boost
                5.90999 = idf(docFreq=325, maxDocs=44218)
                0.014923649 = queryNorm
              0.36937436 = fieldWeight in 4101, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.90999 = idf(docFreq=325, maxDocs=44218)
                0.0625 = fieldNorm(doc=4101)
          0.15902503 = weight(abstract_txt:negative in 4101) [ClassicSimilarity], result of:
            0.15902503 = score(doc=4101,freq=4.0), product of:
              0.20168634 = queryWeight, product of:
                2.142508 = boost
                6.3078156 = idf(docFreq=218, maxDocs=44218)
                0.014923649 = queryNorm
              0.78847694 = fieldWeight in 4101, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.3078156 = idf(docFreq=218, maxDocs=44218)
                0.0625 = fieldNorm(doc=4101)
          0.05629631 = weight(abstract_txt:interest in 4101) [ClassicSimilarity], result of:
            0.05629631 = score(doc=4101,freq=1.0), product of:
              0.17634062 = queryWeight, product of:
                2.3132885 = boost
                5.1079607 = idf(docFreq=726, maxDocs=44218)
                0.014923649 = queryNorm
              0.31924754 = fieldWeight in 4101, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1079607 = idf(docFreq=726, maxDocs=44218)
                0.0625 = fieldNorm(doc=4101)
        0.24 = coord(6/25)
    
  5. Cho, H.; Donovan, A.; Lee, J.H.: Art in an algorithm : a taxonomy for describing video game visual styles (2018) 0.10
    0.09612288 = sum of:
      0.09612288 = product of:
        0.4806144 = sum of:
          0.008247615 = weight(abstract_txt:with in 4218) [ClassicSimilarity], result of:
            0.008247615 = score(doc=4218,freq=1.0), product of:
              0.052790362 = queryWeight, product of:
                1.415096 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.014923649 = queryNorm
              0.15623334 = fieldWeight in 4218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=4218)
          0.037451774 = weight(abstract_txt:value in 4218) [ClassicSimilarity], result of:
            0.037451774 = score(doc=4218,freq=2.0), product of:
              0.096907586 = queryWeight, product of:
                1.4851254 = boost
                4.3723974 = idf(docFreq=1516, maxDocs=44218)
                0.014923649 = queryNorm
              0.38646898 = fieldWeight in 4218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3723974 = idf(docFreq=1516, maxDocs=44218)
                0.0625 = fieldNorm(doc=4218)
          0.02985726 = weight(abstract_txt:user in 4218) [ClassicSimilarity], result of:
            0.02985726 = score(doc=4218,freq=2.0), product of:
              0.09170416 = queryWeight, product of:
                1.6682 = boost
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.014923649 = queryNorm
              0.32558239 = fieldWeight in 4218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.0625 = fieldNorm(doc=4218)
          0.05629631 = weight(abstract_txt:interest in 4218) [ClassicSimilarity], result of:
            0.05629631 = score(doc=4218,freq=1.0), product of:
              0.17634062 = queryWeight, product of:
                2.3132885 = boost
                5.1079607 = idf(docFreq=726, maxDocs=44218)
                0.014923649 = queryNorm
              0.31924754 = fieldWeight in 4218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1079607 = idf(docFreq=726, maxDocs=44218)
                0.0625 = fieldNorm(doc=4218)
          0.34876144 = weight(abstract_txt:descriptor in 4218) [ClassicSimilarity], result of:
            0.34876144 = score(doc=4218,freq=1.0), product of:
              0.716799 = queryWeight, product of:
                6.169803 = boost
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.014923649 = queryNorm
              0.48655403 = fieldWeight in 4218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.0625 = fieldNorm(doc=4218)
        0.2 = coord(5/25)