Search (2 results, page 1 of 1)

Did you mean:
rswk_00%3a%22World wide web %2f elektronische bibliothek %2f information retrieval %2f kongress %2f trondheim %3.2003%3E%22 2
rswk_00%3a%22World wide web %2f elektronische bibliothek %2f information retrieval %2f kongress %2f trondheim %32003%3E%22 2
rswk_00%3a%22World wide web %2f elektronische bibliothek %2f information retrieval %2f kongresse %2f trondheim %3.2003%3E%22 2
rswk_00%3a%22World wide web %2f elektronische bibliothek %2f information retrieval %2f kongress %2f trondheim %3.2008%3E%22 2
rswk_00%3a%22World wide web %2f elektronische bibliothek %2f information retrieval %2f kongresu %2f trondheim %3.2003%3E%22 2

Vechtomova, O.; Karamuftuoglum, M.; Robertson, S.E.: On document relevance and lexical cohesion between query terms (2006) 0.00

0.003790876 = product of:
  0.02653613 = sum of:
    0.00856136 = weight(_text_:information in 987) [ClassicSimilarity], result of:
      0.00856136 = score(doc=987,freq=4.0), product of:
        0.052020688 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.029633347 = queryNorm
        0.16457605 = fieldWeight in 987, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.046875 = fieldNorm(doc=987)
    0.01797477 = weight(_text_:retrieval in 987) [ClassicSimilarity], result of:
      0.01797477 = score(doc=987,freq=2.0), product of:
        0.08963835 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.029633347 = queryNorm
        0.20052543 = fieldWeight in 987, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.046875 = fieldNorm(doc=987)
  0.14285715 = coord(2/14)

Abstract: Lexical cohesion is a property of text, achieved through lexical-semantic relations between words in text. Most information retrieval systems make use of lexical relations in text only to a limited extent. In this paper we empirically investigate whether the degree of lexical cohesion between the contexts of query terms' occurrences in a document is related to its relevance to the query. Lexical cohesion between distinct query terms in a document is estimated on the basis of the lexical-semantic relations (repetition, synonymy, hyponymy and sibling) that exist between there collocates - words that co-occur with them in the same windows of text. Experiments suggest significant differences between the lexical cohesion in relevant and non-relevant document sets exist. A document ranking method based on lexical cohesion shows some performance improvements.
Source: Information processing and management. 42(2006) no.5, S.1230-1247

Vechtomova, O.: ¬A method for automatic extraction of multiword units representing business aspects from user reviews (2014) 0.00

0.0034326524 = product of:
  0.024028566 = sum of:
    0.0060537956 = weight(_text_:information in 1304) [ClassicSimilarity], result of:
      0.0060537956 = score(doc=1304,freq=2.0), product of:
        0.052020688 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.029633347 = queryNorm
        0.116372846 = fieldWeight in 1304, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.046875 = fieldNorm(doc=1304)
    0.01797477 = weight(_text_:retrieval in 1304) [ClassicSimilarity], result of:
      0.01797477 = score(doc=1304,freq=2.0), product of:
        0.08963835 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.029633347 = queryNorm
        0.20052543 = fieldWeight in 1304, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.046875 = fieldNorm(doc=1304)
  0.14285715 = coord(2/14)

Abstract: The article describes a semi-supervised approach to extracting multiword aspects of user-written reviews that belong to a given category. The method starts with a small set of seed words, representing the target category, and calculates distributional similarity between the candidate and seed words. We compare 3 distributional similarity measures (Lin's, Weeds's, and balAPinc), and a document retrieval function, BM25, adapted as a word similarity measure. We then introduce a method for identifying multiword aspects by using a combination of syntactic rules and a co-occurrence association measure. Finally, we describe a method for ranking multiword aspects by the likelihood of belonging to the target aspect category. The task used for evaluation is extraction of restaurant dish names from a corpus of restaurant reviews.
Source: Journal of the Association for Information Science and Technology. 65(2014) no.7, S.1463-1477

Search (2 results, page 1 of 1)

Authors

Years