-
Moohebat, M.; Raj, R.G.; Kareem, S.B.A.; Thorleuchter, D.: Identifying ISI-indexed articles by their lexical usage : a text analysis approach (2015)
0.00
5.06297E-4 = product of:
0.0070881573 = sum of:
0.0070881573 = weight(_text_:information in 1664) [ClassicSimilarity], result of:
0.0070881573 = score(doc=1664,freq=4.0), product of:
0.04306919 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.02453417 = queryNorm
0.16457605 = fieldWeight in 1664, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.046875 = fieldNorm(doc=1664)
0.071428575 = coord(1/14)
- Abstract
- This research creates an architecture for investigating the existence of probable lexical divergences between articles, categorized as Institute for Scientific Information (ISI) and non-ISI, and consequently, if such a difference is discovered, to propose the best available classification method. Based on a collection of ISI- and non-ISI-indexed articles in the areas of business and computer science, three classification models are trained. A sensitivity analysis is applied to demonstrate the impact of words in different syntactical forms on the classification decision. The results demonstrate that the lexical domains of ISI and non-ISI articles are distinguishable by machine learning techniques. Our findings indicate that the support vector machine identifies ISI-indexed articles in both disciplines with higher precision than do the Naïve Bayesian and K-Nearest Neighbors techniques.
- Source
- Journal of the Association for Information Science and Technology. 66(2015) no.3, S.501-511
-
Radev, D.R.; Joseph, M.T.; Gibson, B.; Muthukrishnan, P.: ¬A bibliometric and network analysis of the field of computational linguistics (2016)
0.00
4.176737E-4 = product of:
0.0058474317 = sum of:
0.0058474317 = weight(_text_:information in 2764) [ClassicSimilarity], result of:
0.0058474317 = score(doc=2764,freq=2.0), product of:
0.04306919 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.02453417 = queryNorm
0.13576832 = fieldWeight in 2764, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0546875 = fieldNorm(doc=2764)
0.071428575 = coord(1/14)
- Source
- Journal of the Association for Information Science and Technology. 67(2016) no.3, S.683-706
-
Levin, M.; Krawczyk, S.; Bethard, S.; Jurafsky, D.: Citation-based bootstrapping for large-scale author disambiguation (2012)
0.00
2.9833836E-4 = product of:
0.004176737 = sum of:
0.004176737 = weight(_text_:information in 246) [ClassicSimilarity], result of:
0.004176737 = score(doc=246,freq=2.0), product of:
0.04306919 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.02453417 = queryNorm
0.09697737 = fieldWeight in 246, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0390625 = fieldNorm(doc=246)
0.071428575 = coord(1/14)
- Source
- Journal of the American Society for Information Science and Technology. 63(2012) no.5, S.1030-1047