Baeza-Yates, R.; Hurtado, C.; Mendoza, M.: Improving search engines by query clustering (2007)
- Abstract
- In this paper, we present a framework for clustering Web search engine queries whose aim is to identify groups of queries used to search for similar information on the Web. The framework is based on a novel term vector model of queries that integrates user selections and the content of selected documents extracted from the logs of a search engine. The query representation obtained allows us to treat query clustering similarly to standard document clustering. We study the application of the clustering framework to two problems: relevance ranking boosting and query recommendation. Finally, we evaluate with experiments the effectiveness of our approach.
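
The query representation described in the abstract can be illustrated with a small sketch. Everything here is an assumption made for illustration, since the abstract gives no formulas: the log is taken to be a list of (query, clicked URL) pairs, each clicked document contributes its term frequencies normalized by its most frequent term, and that contribution is weighted by the share of the query's clicks that went to the URL. The names `query_vectors`, `cosine`, and `group_queries` are hypothetical, and the greedy threshold grouping is a simple stand-in for a standard document clustering algorithm, not the authors' method.

```python
import math
from collections import Counter, defaultdict


def query_vectors(click_log, doc_text):
    """Build one term vector per query from the documents users clicked.

    click_log: iterable of (query, clicked_url) pairs (assumed log format).
    doc_text:  mapping url -> document text (assumed to be available).
    """
    clicks = defaultdict(Counter)
    for query, url in click_log:
        clicks[query][url] += 1

    vectors = {}
    for query, url_counts in clicks.items():
        total_clicks = sum(url_counts.values())
        vec = Counter()
        for url, n in url_counts.items():
            tf = Counter(doc_text[url].lower().split())
            if not tf:
                continue  # skip empty documents
            max_tf = max(tf.values())
            click_share = n / total_clicks  # user-selection weight (assumption)
            for term, freq in tf.items():
                vec[term] += click_share * freq / max_tf
        vectors[query] = dict(vec)
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse term vectors stored as dicts."""
    dot = sum(w * b.get(term, 0.0) for term, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def group_queries(vectors, threshold=0.5):
    """Greedy single-pass grouping: a query joins the first group whose
    first member is at least `threshold` similar, else starts a new group."""
    groups = []
    for query in sorted(vectors):
        for group in groups:
            if cosine(vectors[query], vectors[group[0]]) >= threshold:
                group.append(query)
                break
        else:
            groups.append([query])
    return groups
```

Because the vectors are built from clicked documents rather than from the query strings, two queries that share no words can still end up in the same group if users selected the same or similar pages for them. That is the sense in which query clustering can be treated like document clustering.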