Baeza-Yates, R.; Hurtado, C.; Mendoza, M.: Improving search engines by query clustering (2007)
0.01
0.008298359 = product of:
0.030427314 = sum of:
0.005467103 = weight(_text_:a in 601) [ClassicSimilarity], result of:
0.005467103 = score(doc=601,freq=8.0), product of:
0.030653298 = queryWeight, product of:
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.026584605 = queryNorm
0.17835285 = fieldWeight in 601, product of:
2.828427 = tf(freq=8.0), with freq of:
8.0 = termFreq=8.0
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.0546875 = fieldNorm(doc=601)
0.022529786 = weight(_text_:r in 601) [ClassicSimilarity], result of:
0.022529786 = score(doc=601,freq=2.0), product of:
0.088001914 = queryWeight, product of:
3.3102584 = idf(docFreq=4387, maxDocs=44218)
0.026584605 = queryNorm
0.25601473 = fieldWeight in 601, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.3102584 = idf(docFreq=4387, maxDocs=44218)
0.0546875 = fieldNorm(doc=601)
0.0024304248 = weight(_text_:s in 601) [ClassicSimilarity], result of:
0.0024304248 = score(doc=601,freq=2.0), product of:
0.028903782 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.026584605 = queryNorm
0.08408674 = fieldWeight in 601, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.0546875 = fieldNorm(doc=601)
0.27272728 = coord(3/11)
- Abstract
- In this paper, we present a framework for clustering Web search engine queries whose aim is to identify groups of queries used to search for similar information on the Web. The framework is based on a novel term vector model of queries that integrates user selections and the content of selected documents extracted from the logs of a search engine. The query representation obtained allows us to treat query clustering similarly to standard document clustering. We study the application of the clustering framework to two problems: relevance ranking boosting and query recommendation. Finally, we evaluate with experiments the effectiveness of our approach.
- Source
- Journal of the American Society for Information Science and Technology. 58(2007) no.12, S.1793-1804
- Type
- a