Search (2 results, page 1 of 1)
- Did you mean:
- author's%3a%22Gilliland-swetland%2c A.%22 2
- author's%3a%22Gilliland-scotland%2c A.%22 2
- authors%3a%22Gilliland-swetland%2c A.%22 2
- author's%3a%22Gilliland-seland%2c A.%22 2
- authors%3a%22Gilliland-scotland%2c A.%22 2
-
Vechtomova, O.: ¬A method for automatic extraction of multiword units representing business aspects from user reviews (2014)
0.00
0.003159129 = product of: 0.006318258 = sum of: 0.006318258 = product of: 0.012636516 = sum of: 0.012636516 = weight(_text_:a in 1304) [ClassicSimilarity], result of: 0.012636516 = score(doc=1304,freq=24.0), product of: 0.04772363 = queryWeight, product of: 1.153047 = idf(docFreq=37942, maxDocs=44218) 0.041389145 = queryNorm 0.26478532 = fieldWeight in 1304, product of: 4.8989797 = tf(freq=24.0), with freq of: 24.0 = termFreq=24.0 1.153047 = idf(docFreq=37942, maxDocs=44218) 0.046875 = fieldNorm(doc=1304) 0.5 = coord(1/2) 0.5 = coord(1/2)
- Abstract
- The article describes a semi-supervised approach to extracting multiword aspects of user-written reviews that belong to a given category. The method starts with a small set of seed words, representing the target category, and calculates distributional similarity between the candidate and seed words. We compare 3 distributional similarity measures (Lin's, Weeds's, and balAPinc), and a document retrieval function, BM25, adapted as a word similarity measure. We then introduce a method for identifying multiword aspects by using a combination of syntactic rules and a co-occurrence association measure. Finally, we describe a method for ranking multiword aspects by the likelihood of belonging to the target aspect category. The task used for evaluation is extraction of restaurant dish names from a corpus of restaurant reviews.
- Type
- a
-
Vechtomova, O.; Karamuftuoglum, M.; Robertson, S.E.: On document relevance and lexical cohesion between query terms (2006)
0.00
0.0022338415 = product of: 0.004467683 = sum of: 0.004467683 = product of: 0.008935366 = sum of: 0.008935366 = weight(_text_:a in 987) [ClassicSimilarity], result of: 0.008935366 = score(doc=987,freq=12.0), product of: 0.04772363 = queryWeight, product of: 1.153047 = idf(docFreq=37942, maxDocs=44218) 0.041389145 = queryNorm 0.18723148 = fieldWeight in 987, product of: 3.4641016 = tf(freq=12.0), with freq of: 12.0 = termFreq=12.0 1.153047 = idf(docFreq=37942, maxDocs=44218) 0.046875 = fieldNorm(doc=987) 0.5 = coord(1/2) 0.5 = coord(1/2)
- Abstract
- Lexical cohesion is a property of text, achieved through lexical-semantic relations between words in text. Most information retrieval systems make use of lexical relations in text only to a limited extent. In this paper we empirically investigate whether the degree of lexical cohesion between the contexts of query terms' occurrences in a document is related to its relevance to the query. Lexical cohesion between distinct query terms in a document is estimated on the basis of the lexical-semantic relations (repetition, synonymy, hyponymy and sibling) that exist between there collocates - words that co-occur with them in the same windows of text. Experiments suggest significant differences between the lexical cohesion in relevant and non-relevant document sets exist. A document ranking method based on lexical cohesion shows some performance improvements.
- Type
- a