Chen, L.; Fang, H.: ¬An automatic method for ex-tracting innovative ideas based on the Scopus® database (2019)
0.01
0.00884113 = product of:
0.04420565 = sum of:
0.04420565 = weight(_text_:semantic in 5310) [ClassicSimilarity], result of:
0.04420565 = score(doc=5310,freq=2.0), product of:
0.19245663 = queryWeight, product of:
4.1578603 = idf(docFreq=1879, maxDocs=44218)
0.04628742 = queryNorm
0.22969149 = fieldWeight in 5310, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.1578603 = idf(docFreq=1879, maxDocs=44218)
0.0390625 = fieldNorm(doc=5310)
0.2 = coord(1/5)
- Abstract
- The novelty of knowledge claims in a research paper can be considered an evaluation criterion for papers to supplement citations. To provide a foundation for research evaluation from the perspective of innovativeness, we propose an automatic approach for extracting innovative ideas from the abstracts of technology and engineering papers. The approach extracts N-grams as candidates based on part-of-speech tagging and determines whether they are novel by checking the Scopus® database to determine whether they had ever been presented previously. Moreover, we discussed the distributions of innovative ideas in different abstract structures. To improve the performance by excluding noisy N-grams, a list of stopwords and a list of research description characteristics were developed. We selected abstracts of articles published from 2011 to 2017 with the topic of semantic analysis as the experimental texts. Excluding noisy N-grams, considering the distribution of innovative ideas in abstracts, and suitably combining N-grams can effectively improve the performance of automatic innovative idea extraction. Unlike co-word and co-citation analysis, innovative-idea extraction aims to identify the differences in a paper from all previously published papers.