Search (3 results, page 1 of 1)

Vaughan, L.; Chen, Y.: Data mining from web search queries : a comparison of Google trends and Baidu index (2015) 0.01
```
0.01206966 = product of:
  0.04827864 = sum of:
    0.034856133 = weight(_text_:studies in 1605) [ClassicSimilarity], result of:
      0.034856133 = score(doc=1605,freq=2.0), product of:
        0.15812531 = queryWeight, product of:
          3.9902744 = idf(docFreq=2222, maxDocs=44218)
          0.03962768 = queryNorm
        0.22043361 = fieldWeight in 1605, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9902744 = idf(docFreq=2222, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1605)
    0.013422508 = product of:
      0.026845016 = sum of:
        0.026845016 = weight(_text_:22 in 1605) [ClassicSimilarity], result of:
          0.026845016 = score(doc=1605,freq=2.0), product of:
            0.13876937 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03962768 = queryNorm
            0.19345059 = fieldWeight in 1605, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1605)
      0.5 = coord(1/2)
  0.25 = coord(2/8)
```
Abstract

Numerous studies have explored the possibility of uncovering information from web search queries but few have examined the factors that affect web query data sources. We conducted a study that investigated this issue by comparing Google Trends and Baidu Index. Data from these two services are based on queries entered by users into Google and Baidu, two of the largest search engines in the world. We first compared the features and functions of the two services based on documents and extensive testing. We then carried out an empirical study that collected query volume data from the two sources. We found that data from both sources could be used to predict the quality of Chinese universities and companies. Despite the differences between the two services in terms of technology, such as differing methods of language processing, the search volume data from the two were highly correlated and combining the two data sources did not improve the predictive power of the data. However, there was a major difference between the two in terms of data availability. Baidu Index was able to provide more search volume data than Google Trends did. Our analysis showed that the disadvantage of Google Trends in this regard was due to Google's smaller user base in China. The implication of this finding goes beyond China. Google's user bases in many countries are smaller than that in China, so the search volume data related to those countries could result in the same issue as that related to China.

Source

Journal of the Association for Information Science and Technology. 66(2015) no.1, S.13-22
Ackerman, B.; Wang, C.; Chen, Y.: ¬A session-specific opportunity cost model for rank-oriented recommendation (2018) 0.01
```
0.00522842 = product of:
  0.04182736 = sum of:
    0.04182736 = weight(_text_:studies in 4468) [ClassicSimilarity], result of:
      0.04182736 = score(doc=4468,freq=2.0), product of:
        0.15812531 = queryWeight, product of:
          3.9902744 = idf(docFreq=2222, maxDocs=44218)
          0.03962768 = queryNorm
        0.26452032 = fieldWeight in 4468, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9902744 = idf(docFreq=2222, maxDocs=44218)
          0.046875 = fieldNorm(doc=4468)
  0.125 = coord(1/8)
```
Abstract

Recommender systems are changing the way that people find information, products, and even other people. This paper studies the problem of leveraging the context of the items presented to the user in a user/system interaction session to improve the recommender system's ranking prediction. We propose a novel model that incorporates the opportunity cost of giving up the other items in the session and computes session-specific relevance values for items for context-aware recommendation. The model can work on a variety of different problems settings with emphasis on implicit user feedback as it supports varying levels of ordinal relevance. Experimental evaluation demonstrates the advantages of our new model with respect to the ranking quality.
Wang, C.; Zhao, S.; Kalra, A.; Borcea, C.; Chen, Y.: Predictive models and analysis for webpage depth-level dwell time (2018) 0.00
```
0.0033217126 = product of:
  0.0265737 = sum of:
    0.0265737 = product of:
      0.0531474 = sum of:
        0.0531474 = weight(_text_:area in 4370) [ClassicSimilarity], result of:
          0.0531474 = score(doc=4370,freq=2.0), product of:
            0.1952553 = queryWeight, product of:
              4.927245 = idf(docFreq=870, maxDocs=44218)
              0.03962768 = queryNorm
            0.27219442 = fieldWeight in 4370, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.927245 = idf(docFreq=870, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4370)
      0.5 = coord(1/2)
  0.125 = coord(1/8)
```
Abstract

A half of online display ads are not rendered viewable because the users do not scroll deep enough or spend sufficient time at the page depth where the ads are placed. In order to increase the marketing efficiency and ad effectiveness, there is a strong demand for viewability prediction from both advertisers and publishers. This paper aims to predict the dwell time for a given urn:x-wiley:23301635:media:asi24025:asi24025-math-0001 triplet based on historic data collected by publishers. This problem is difficult because of user behavior variability and data sparsity. To solve it, we propose predictive models based on Factorization Machines and Field-aware Factorization Machines in order to overcome the data sparsity issue and provide flexibility to add auxiliary information such as the visible area of a user's browser. In addition, we leverage the prior dwell time behavior of the user within the current page view, that is, time series information, to further improve the proposed models. Experimental results using data from a large web publisher demonstrate that the proposed models outperform comparison models. Also, the results show that adding time series information further improves the performance.

Search (3 results, page 1 of 1)

Authors

Themes