Baeza-Yates, R.; Hurtado, C.; Mendoza, M.: Improving search engines by query clustering (2007)
0.18
0.18021733 = product of:
0.24028978 = sum of:
0.07053544 = weight(_text_:web in 601) [ClassicSimilarity], result of:
0.07053544 = score(doc=601,freq=6.0), product of:
0.16134618 = queryWeight, product of:
3.2635105 = idf(docFreq=4597, maxDocs=44218)
0.049439456 = queryNorm
0.43716836 = fieldWeight in 601, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
3.2635105 = idf(docFreq=4597, maxDocs=44218)
0.0546875 = fieldNorm(doc=601)
0.09238163 = weight(_text_:search in 601) [ClassicSimilarity], result of:
0.09238163 = score(doc=601,freq=8.0), product of:
0.17183559 = queryWeight, product of:
3.475677 = idf(docFreq=3718, maxDocs=44218)
0.049439456 = queryNorm
0.5376164 = fieldWeight in 601, product of:
2.828427 = tf(freq=8.0), with freq of:
8.0 = termFreq=8.0
3.475677 = idf(docFreq=3718, maxDocs=44218)
0.0546875 = fieldNorm(doc=601)
0.07737271 = product of:
0.15474541 = sum of:
0.15474541 = weight(_text_:engine in 601) [ClassicSimilarity], result of:
0.15474541 = score(doc=601,freq=4.0), product of:
0.26447627 = queryWeight, product of:
5.349498 = idf(docFreq=570, maxDocs=44218)
0.049439456 = queryNorm
0.5851013 = fieldWeight in 601, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
5.349498 = idf(docFreq=570, maxDocs=44218)
0.0546875 = fieldNorm(doc=601)
0.5 = coord(1/2)
0.75 = coord(3/4)
- Abstract
- In this paper, we present a framework for clustering Web search engine queries whose aim is to identify groups of queries used to search for similar information on the Web. The framework is based on a novel term vector model of queries that integrates user selections and the content of selected documents extracted from the logs of a search engine. The query representation obtained allows us to treat query clustering similarly to standard document clustering. We study the application of the clustering framework to two problems: relevance ranking boosting and query recommendation. Finally, we evaluate with experiments the effectiveness of our approach.
- Footnote
- Beitrag eines Themenschwerpunktes "Mining Web resources for enhancing information retrieval"