Beitzel, S.M.; Jensen, E.C.; Chowdhury, A.; Frieder, O.; Grossman, D.: Temporal analysis of a very large topically categorized Web query log (2007)
- Abstract
- The authors review a log of billions of Web queries that constituted the total query traffic for a 6-month period of a general-purpose commercial Web search service. Previously, query logs were studied from a single, cumulative view. In contrast, this study builds on the authors' previous work, which showed changes in popularity and uniqueness of topically categorized queries across the hours in a day. To further their analysis, they examine query traffic on a daily, weekly, and monthly basis by matching it against lists of queries that have been topically precategorized by human editors. These lists represent 13% of the query traffic. They show that query traffic from particular topical categories differs both from the query stream as a whole and from other categories. Additionally, they show that certain categories of queries trend differently over varying periods. The authors' key contribution is twofold: They outline a method for studying both the static and topical properties of a very large query log over varying periods, and they identify and examine topical trends that may provide valuable insight for improving both retrieval effectiveness and efficiency.