Vilares, J.; Alonso, M.A.; Doval, Y.; Vilares, M.: Studying the effect and treatment of misspelled queries in Cross-Language Information Retrieval (2016)
0.00
0.0023375787 = product of:
0.0046751574 = sum of:
0.0046751574 = product of:
0.009350315 = sum of:
0.009350315 = weight(_text_:a in 2974) [ClassicSimilarity], result of:
0.009350315 = score(doc=2974,freq=8.0), product of:
0.06116359 = queryWeight, product of:
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.053045183 = queryNorm
0.15287387 = fieldWeight in 2974, product of:
2.828427 = tf(freq=8.0), with freq of:
8.0 = termFreq=8.0
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.046875 = fieldNorm(doc=2974)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Abstract
- General graph random walk has been successfully applied in multi-document summarization, but it has some limitations to process documents by this way. In this paper, we propose a novel hypergraph based vertex-reinforced random walk framework for multi-document summarization. The framework first exploits the Hierarchical Dirichlet Process (HDP) topic model to learn a word-topic probability distribution in sentences. Then the hypergraph is used to capture both cluster relationship based on the word-topic probability distribution and pairwise similarity among sentences. Finally, a time-variant random walk algorithm for hypergraphs is developed to rank sentences which ensures sentence diversity by vertex-reinforcement in summaries. Experimental results on the public available dataset demonstrate the effectiveness of our framework.
- Type
- a