Search (1 result, page 1 of 1)

Moohebat, M.; Raj, R.G.; Kareem, S.B.A.; Thorleuchter, D.: Identifying ISI-indexed articles by their lexical usage : a text analysis approach (2015) 0.01

0.013986527 = product of:
  0.027973054 = sum of:
    0.027973054 = product of:
      0.04195958 = sum of:
        0.038397755 = weight(_text_:k in 1664) [ClassicSimilarity], result of:
          0.038397755 = score(doc=1664,freq=2.0), product of:
            0.16225883 = queryWeight, product of:
              3.569778 = idf(docFreq=3384, maxDocs=44218)
              0.04545348 = queryNorm
            0.23664509 = fieldWeight in 1664, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.569778 = idf(docFreq=3384, maxDocs=44218)
              0.046875 = fieldNorm(doc=1664)
        0.003561823 = weight(_text_:s in 1664) [ClassicSimilarity], result of:
          0.003561823 = score(doc=1664,freq=2.0), product of:
            0.049418733 = queryWeight, product of:
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.04545348 = queryNorm
            0.072074346 = fieldWeight in 1664, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.046875 = fieldNorm(doc=1664)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)
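
The score tree above can be checked by hand. The following is a minimal sketch, assuming the standard ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); the function and constant names are illustrative and not part of the search system, and the queryNorm, fieldNorm, and coord values are copied from the listing rather than derived. Lucene computes in single precision, so the last digits may differ slightly.

```python
import math

# Values copied from the explanation listing above.
MAX_DOCS = 44218
QUERY_NORM = 0.04545348
FIELD_NORM = 0.046875


def idf(doc_freq, max_docs):
    """Inverse document frequency as defined by ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, query_norm, field_norm):
    """Score of one query term: queryWeight * fieldWeight."""
    tf = math.sqrt(freq)
    term_idf = idf(doc_freq, max_docs)
    query_weight = term_idf * query_norm          # idf * queryNorm
    field_weight = tf * term_idf * field_norm     # tf * idf * fieldNorm
    return query_weight * field_weight


# Term "k": freq=2.0, docFreq=3384; term "s": freq=2.0, docFreq=40523.
score_k = term_score(2.0, 3384, MAX_DOCS, QUERY_NORM, FIELD_NORM)
score_s = term_score(2.0, 40523, MAX_DOCS, QUERY_NORM, FIELD_NORM)

# Inner coord(2/3): two of three optional clauses matched.
# Outer coord(1/2): one of two top-level clauses matched.
total = (score_k + score_s) * (2 / 3) * (1 / 2)

print(f"k: {score_k:.9f}  s: {score_s:.9f}  total: {total:.9f}")
```

The rare term "k" (docFreq=3384) contributes roughly ten times as much as the near-ubiquitous term "s" (docFreq=40523), because idf enters the product twice: once in queryWeight and once in fieldWeight.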

Abstract: This research creates an architecture for investigating the existence of probable lexical divergences between articles, categorized as Institute for Scientific Information (ISI) and non-ISI, and consequently, if such a difference is discovered, to propose the best available classification method. Based on a collection of ISI- and non-ISI-indexed articles in the areas of business and computer science, three classification models are trained. A sensitivity analysis is applied to demonstrate the impact of words in different syntactical forms on the classification decision. The results demonstrate that the lexical domains of ISI and non-ISI articles are distinguishable by machine learning techniques. Our findings indicate that the support vector machine identifies ISI-indexed articles in both disciplines with higher precision than do the Naïve Bayesian and K-Nearest Neighbors techniques.
Source: Journal of the Association for Information Science and Technology. 66(2015) no.3, pp. 501-511