Document (#32607)

Author
Liu, Y.
Huang, X.
An, A.
Title
Personalized recommendation with adaptive mixture of markov models
Source
Journal of the American Society for Information Science and Technology. 58(2007) no.12, S.1851-1870
Year
2007
Abstract
With more and more information available on the Internet, the task of making personalized recommendations to assist the user's navigation has become increasingly important. Considering there might be millions of users with different backgrounds accessing a Web site everyday, it is infeasible to build a separate recommendation system for each user. To address this problem, clustering techniques can first be employed to discover user groups. Then, user navigation patterns for each group can be discovered, to allow the adaptation of a Web site to the interest of each individual group. In this paper, we propose to model user access sequences as stochastic processes, and a mixture of Markov models based approach is taken to cluster users and to capture the sequential relationships inherent in user access histories. Several important issues that arise in constructing the Markov models are also addressed. The first issue lies in the complexity of the mixture of Markov models. To improve the efficiency of building/maintaining the mixture of Markov models, we develop a lightweight adapt-ive algorithm to update the model parameters without recomputing model parameters from scratch. The second issue concerns the proper selection of training data for building the mixture of Markov models. We investigate two different training data selection strategies and perform extensive experiments to compare their effectiveness on a real dataset that is generated by a Web-based knowledge management system, Livelink.
Footnote
Beitrag eines Themenschwerpunktes "Mining Web resources for enhancing information retrieval"
Theme
Data Mining
Object
WWW

Similar documents (author)

  1. Huang, G.W.: Accessing information in an information society (1989) 4.50
    4.4981737 = sum of:
      4.4981737 = weight(author_txt:huang in 2497) [ClassicSimilarity], result of:
        4.4981737 = fieldWeight in 2497, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.1970778 = idf(docFreq=89, maxDocs=44218)
          0.625 = fieldNorm(doc=2497)
    
  2. Huang, X.: Applying a generic function-based topical relevance typology to structure clinical questions and answers (2013) 4.50
    4.4981737 = sum of:
      4.4981737 = weight(author_txt:huang in 530) [ClassicSimilarity], result of:
        4.4981737 = fieldWeight in 530, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.1970778 = idf(docFreq=89, maxDocs=44218)
          0.625 = fieldNorm(doc=530)
    
  3. Huang, J. Xiangji => Xiangji Huang, J.: 3.82
    3.8168268 = sum of:
      3.8168268 = weight(author_txt:huang in 8235) [ClassicSimilarity], result of:
        3.8168268 = fieldWeight in 8235, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          7.1970778 = idf(docFreq=89, maxDocs=44218)
          0.375 = fieldNorm(doc=8235)
    
  4. Huang, M.-H.: Developing an ideal online thesaurus display format (1994) 3.60
    3.5985389 = sum of:
      3.5985389 = weight(author_txt:huang in 4030) [ClassicSimilarity], result of:
        3.5985389 = fieldWeight in 4030, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.1970778 = idf(docFreq=89, maxDocs=44218)
          0.5 = fieldNorm(doc=4030)
    
  5. Huang, M.-h.: End-users' searching behaviour : changes in search type over time (1996) 3.60
    3.5985389 = sum of:
      3.5985389 = weight(author_txt:huang in 5128) [ClassicSimilarity], result of:
        3.5985389 = fieldWeight in 5128, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.1970778 = idf(docFreq=89, maxDocs=44218)
          0.5 = fieldNorm(doc=5128)
    

Similar documents (content)

  1. Chen, H.-M.; Cooper, M.D.: Stochastic modeling of usage patterns in a Web-based information system (2002) 0.24
    0.24107327 = sum of:
      0.24107327 = product of:
        0.66964793 = sum of:
          0.09888526 = weight(abstract_txt:sequential in 577) [ClassicSimilarity], result of:
            0.09888526 = score(doc=577,freq=7.0), product of:
              0.101053625 = queryWeight, product of:
                1.0308851 = boost
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.012423738 = queryNorm
              0.97854245 = fieldWeight in 577, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.046875 = fieldNorm(doc=577)
          0.01101877 = weight(abstract_txt:first in 577) [ClassicSimilarity], result of:
            0.01101877 = score(doc=577,freq=1.0), product of:
              0.056397814 = queryWeight, product of:
                1.0891318 = boost
                4.168018 = idf(docFreq=1860, maxDocs=44218)
                0.012423738 = queryNorm
              0.19537583 = fieldWeight in 577, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.168018 = idf(docFreq=1860, maxDocs=44218)
                0.046875 = fieldNorm(doc=577)
          0.04636565 = weight(abstract_txt:stochastic in 577) [ClassicSimilarity], result of:
            0.04636565 = score(doc=577,freq=1.0), product of:
              0.11667051 = queryWeight, product of:
                1.1076814 = boost
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.012423738 = queryNorm
              0.39740676 = fieldWeight in 577, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.046875 = fieldNorm(doc=577)
          0.043381568 = weight(abstract_txt:group in 577) [ClassicSimilarity], result of:
            0.043381568 = score(doc=577,freq=6.0), product of:
              0.07738556 = queryWeight, product of:
                1.2757902 = boost
                4.8823442 = idf(docFreq=910, maxDocs=44218)
                0.012423738 = queryNorm
              0.56058997 = fieldWeight in 577, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.8823442 = idf(docFreq=910, maxDocs=44218)
                0.046875 = fieldNorm(doc=577)
          0.014458528 = weight(abstract_txt:model in 577) [ClassicSimilarity], result of:
            0.014458528 = score(doc=577,freq=1.0), product of:
              0.077378444 = queryWeight, product of:
                1.5624456 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.012423738 = queryNorm
              0.18685472 = fieldWeight in 577, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.046875 = fieldNorm(doc=577)
          0.022555085 = weight(abstract_txt:each in 577) [ClassicSimilarity], result of:
            0.022555085 = score(doc=577,freq=2.0), product of:
              0.08260828 = queryWeight, product of:
                1.6143835 = boost
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.012423738 = queryNorm
              0.2730366 = fieldWeight in 577, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.046875 = fieldNorm(doc=577)
          0.042517435 = weight(abstract_txt:user in 577) [ClassicSimilarity], result of:
            0.042517435 = score(doc=577,freq=5.0), product of:
              0.110122204 = queryWeight, product of:
                2.4063387 = boost
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.012423738 = queryNorm
              0.3860932 = fieldWeight in 577, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.046875 = fieldNorm(doc=577)
          0.045652185 = weight(abstract_txt:models in 577) [ClassicSimilarity], result of:
            0.045652185 = score(doc=577,freq=1.0), product of:
              0.20982392 = queryWeight, product of:
                3.6386259 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.012423738 = queryNorm
              0.21757379 = fieldWeight in 577, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.046875 = fieldNorm(doc=577)
          0.34481344 = weight(abstract_txt:markov in 577) [ClassicSimilarity], result of:
            0.34481344 = score(doc=577,freq=2.0), product of:
              0.6411014 = queryWeight, product of:
                6.3602366 = boost
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.012423738 = queryNorm
              0.5378454 = fieldWeight in 577, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.046875 = fieldNorm(doc=577)
        0.36 = coord(9/25)
    
  2. Bohlin, L.; Esquivel, A.V.; Lancichinetti, A.; Rosvall, M.: Robustness of journal rankings by network flows with different amounts of memory (2016) 0.19
    0.19281347 = sum of:
      0.19281347 = product of:
        0.8033895 = sum of:
          0.014691694 = weight(abstract_txt:first in 3125) [ClassicSimilarity], result of:
            0.014691694 = score(doc=3125,freq=1.0), product of:
              0.056397814 = queryWeight, product of:
                1.0891318 = boost
                4.168018 = idf(docFreq=1860, maxDocs=44218)
                0.012423738 = queryNorm
              0.26050112 = fieldWeight in 3125, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.168018 = idf(docFreq=1860, maxDocs=44218)
                0.0625 = fieldNorm(doc=3125)
          0.021484205 = weight(abstract_txt:important in 3125) [ClassicSimilarity], result of:
            0.021484205 = score(doc=3125,freq=2.0), product of:
              0.057670083 = queryWeight, product of:
                1.101348 = boost
                4.2147684 = idf(docFreq=1775, maxDocs=44218)
                0.012423738 = queryNorm
              0.37253642 = fieldWeight in 3125, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2147684 = idf(docFreq=1775, maxDocs=44218)
                0.0625 = fieldNorm(doc=3125)
          0.0631184 = weight(abstract_txt:selection in 3125) [ClassicSimilarity], result of:
            0.0631184 = score(doc=3125,freq=4.0), product of:
              0.09389266 = queryWeight, product of:
                1.4052873 = boost
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.012423738 = queryNorm
              0.6722399 = fieldWeight in 3125, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.0625 = fieldNorm(doc=3125)
          0.019278036 = weight(abstract_txt:model in 3125) [ClassicSimilarity], result of:
            0.019278036 = score(doc=3125,freq=1.0), product of:
              0.077378444 = queryWeight, product of:
                1.5624456 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.012423738 = queryNorm
              0.24913962 = fieldWeight in 3125, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.0625 = fieldNorm(doc=3125)
          0.121739164 = weight(abstract_txt:models in 3125) [ClassicSimilarity], result of:
            0.121739164 = score(doc=3125,freq=4.0), product of:
              0.20982392 = queryWeight, product of:
                3.6386259 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.012423738 = queryNorm
              0.5801968 = fieldWeight in 3125, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.0625 = fieldNorm(doc=3125)
          0.563078 = weight(abstract_txt:markov in 3125) [ClassicSimilarity], result of:
            0.563078 = score(doc=3125,freq=3.0), product of:
              0.6411014 = queryWeight, product of:
                6.3602366 = boost
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.012423738 = queryNorm
              0.87829787 = fieldWeight in 3125, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.0625 = fieldNorm(doc=3125)
        0.24 = coord(6/25)
    
  3. Xu, L.; Qiu, J.: Unsupervised multi-class sentiment classification approach (2019) 0.13
    0.1272166 = sum of:
      0.1272166 = product of:
        0.53006923 = sum of:
          0.014691694 = weight(abstract_txt:first in 5003) [ClassicSimilarity], result of:
            0.014691694 = score(doc=5003,freq=1.0), product of:
              0.056397814 = queryWeight, product of:
                1.0891318 = boost
                4.168018 = idf(docFreq=1860, maxDocs=44218)
                0.012423738 = queryNorm
              0.26050112 = fieldWeight in 5003, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.168018 = idf(docFreq=1860, maxDocs=44218)
                0.0625 = fieldNorm(doc=5003)
          0.02726326 = weight(abstract_txt:model in 5003) [ClassicSimilarity], result of:
            0.02726326 = score(doc=5003,freq=2.0), product of:
              0.077378444 = queryWeight, product of:
                1.5624456 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.012423738 = queryNorm
              0.35233662 = fieldWeight in 5003, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.0625 = fieldNorm(doc=5003)
          0.021265138 = weight(abstract_txt:each in 5003) [ClassicSimilarity], result of:
            0.021265138 = score(doc=5003,freq=1.0), product of:
              0.08260828 = queryWeight, product of:
                1.6143835 = boost
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.012423738 = queryNorm
              0.25742137 = fieldWeight in 5003, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.0625 = fieldNorm(doc=5003)
          0.06597896 = weight(abstract_txt:parameters in 5003) [ClassicSimilarity], result of:
            0.06597896 = score(doc=5003,freq=1.0), product of:
              0.15351517 = queryWeight, product of:
                1.7969043 = boost
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.012423738 = queryNorm
              0.42978784 = fieldWeight in 5003, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.0625 = fieldNorm(doc=5003)
          0.0253525 = weight(abstract_txt:user in 5003) [ClassicSimilarity], result of:
            0.0253525 = score(doc=5003,freq=1.0), product of:
              0.110122204 = queryWeight, product of:
                2.4063387 = boost
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.012423738 = queryNorm
              0.23022151 = fieldWeight in 5003, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.0625 = fieldNorm(doc=5003)
          0.37551767 = weight(abstract_txt:mixture in 5003) [ClassicSimilarity], result of:
            0.37551767 = score(doc=5003,freq=2.0), product of:
              0.52715456 = queryWeight, product of:
                5.264878 = boost
                8.059301 = idf(docFreq=37, maxDocs=44218)
                0.012423738 = queryNorm
              0.71234834 = fieldWeight in 5003, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.059301 = idf(docFreq=37, maxDocs=44218)
                0.0625 = fieldNorm(doc=5003)
        0.24 = coord(6/25)
    
  4. Sun, J.; Zhu, M.; Jiang, Y.; Liu, Y.; Wu, L.L.: Hierarchical attention model for personalized tag recommendation : peer effects on information value perception (2021) 0.12
    0.11628036 = sum of:
      0.11628036 = product of:
        0.41528702 = sum of:
          0.015191628 = weight(abstract_txt:important in 98) [ClassicSimilarity], result of:
            0.015191628 = score(doc=98,freq=1.0), product of:
              0.057670083 = queryWeight, product of:
                1.101348 = boost
                4.2147684 = idf(docFreq=1775, maxDocs=44218)
                0.012423738 = queryNorm
              0.26342303 = fieldWeight in 98, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2147684 = idf(docFreq=1775, maxDocs=44218)
                0.0625 = fieldNorm(doc=98)
          0.038556073 = weight(abstract_txt:model in 98) [ClassicSimilarity], result of:
            0.038556073 = score(doc=98,freq=4.0), product of:
              0.077378444 = queryWeight, product of:
                1.5624456 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.012423738 = queryNorm
              0.49827924 = fieldWeight in 98, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.0625 = fieldNorm(doc=98)
          0.021265138 = weight(abstract_txt:each in 98) [ClassicSimilarity], result of:
            0.021265138 = score(doc=98,freq=1.0), product of:
              0.08260828 = queryWeight, product of:
                1.6143835 = boost
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.012423738 = queryNorm
              0.25742137 = fieldWeight in 98, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.0625 = fieldNorm(doc=98)
          0.14234486 = weight(abstract_txt:recommendation in 98) [ClassicSimilarity], result of:
            0.14234486 = score(doc=98,freq=4.0), product of:
              0.16146891 = queryWeight, product of:
                1.842866 = boost
                7.0524964 = idf(docFreq=103, maxDocs=44218)
                0.012423738 = queryNorm
              0.88156205 = fieldWeight in 98, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.0524964 = idf(docFreq=103, maxDocs=44218)
                0.0625 = fieldNorm(doc=98)
          0.07599287 = weight(abstract_txt:personalized in 98) [ClassicSimilarity], result of:
            0.07599287 = score(doc=98,freq=1.0), product of:
              0.16867974 = queryWeight, product of:
                1.8835657 = boost
                7.208251 = idf(docFreq=88, maxDocs=44218)
                0.012423738 = queryNorm
              0.4505157 = fieldWeight in 98, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.208251 = idf(docFreq=88, maxDocs=44218)
                0.0625 = fieldNorm(doc=98)
          0.03585385 = weight(abstract_txt:user in 98) [ClassicSimilarity], result of:
            0.03585385 = score(doc=98,freq=2.0), product of:
              0.110122204 = queryWeight, product of:
                2.4063387 = boost
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.012423738 = queryNorm
              0.32558239 = fieldWeight in 98, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.0625 = fieldNorm(doc=98)
          0.08608259 = weight(abstract_txt:models in 98) [ClassicSimilarity], result of:
            0.08608259 = score(doc=98,freq=2.0), product of:
              0.20982392 = queryWeight, product of:
                3.6386259 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.012423738 = queryNorm
              0.4102611 = fieldWeight in 98, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.0625 = fieldNorm(doc=98)
        0.28 = coord(7/25)
    
  5. Zhang, Y.; Xu, W.: Fast exact maximum likelihood estimation for mixture of language model (2008) 0.12
    0.115956865 = sum of:
      0.115956865 = product of:
        0.72473043 = sum of:
          0.0315592 = weight(abstract_txt:selection in 2082) [ClassicSimilarity], result of:
            0.0315592 = score(doc=2082,freq=1.0), product of:
              0.09389266 = queryWeight, product of:
                1.4052873 = boost
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.012423738 = queryNorm
              0.33611995 = fieldWeight in 2082, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.0625 = fieldNorm(doc=2082)
          0.038556073 = weight(abstract_txt:model in 2082) [ClassicSimilarity], result of:
            0.038556073 = score(doc=2082,freq=4.0), product of:
              0.077378444 = queryWeight, product of:
                1.5624456 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.012423738 = queryNorm
              0.49827924 = fieldWeight in 2082, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.0625 = fieldNorm(doc=2082)
          0.060869582 = weight(abstract_txt:models in 2082) [ClassicSimilarity], result of:
            0.060869582 = score(doc=2082,freq=1.0), product of:
              0.20982392 = queryWeight, product of:
                3.6386259 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.012423738 = queryNorm
              0.2900984 = fieldWeight in 2082, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.0625 = fieldNorm(doc=2082)
          0.5937456 = weight(abstract_txt:mixture in 2082) [ClassicSimilarity], result of:
            0.5937456 = score(doc=2082,freq=5.0), product of:
              0.52715456 = queryWeight, product of:
                5.264878 = boost
                8.059301 = idf(docFreq=37, maxDocs=44218)
                0.012423738 = queryNorm
              1.1263217 = fieldWeight in 2082, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.059301 = idf(docFreq=37, maxDocs=44218)
                0.0625 = fieldNorm(doc=2082)
        0.16 = coord(4/25)