Chen, L.; Fang, H.: ¬An automatic method for ex-tracting innovative ideas based on the Scopus® database (2019)
0.02
0.024293922 = product of:
0.048587844 = sum of:
0.048587844 = product of:
0.09717569 = sum of:
0.09717569 = weight(_text_:tagging in 5310) [ClassicSimilarity], result of:
0.09717569 = score(doc=5310,freq=2.0), product of:
0.2979515 = queryWeight, product of:
5.9038734 = idf(docFreq=327, maxDocs=44218)
0.05046712 = queryNorm
0.326146 = fieldWeight in 5310, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
5.9038734 = idf(docFreq=327, maxDocs=44218)
0.0390625 = fieldNorm(doc=5310)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Abstract
- The novelty of knowledge claims in a research paper can be considered an evaluation criterion for papers to supplement citations. To provide a foundation for research evaluation from the perspective of innovativeness, we propose an automatic approach for extracting innovative ideas from the abstracts of technology and engineering papers. The approach extracts N-grams as candidates based on part-of-speech tagging and determines whether they are novel by checking the Scopus® database to determine whether they had ever been presented previously. Moreover, we discussed the distributions of innovative ideas in different abstract structures. To improve the performance by excluding noisy N-grams, a list of stopwords and a list of research description characteristics were developed. We selected abstracts of articles published from 2011 to 2017 with the topic of semantic analysis as the experimental texts. Excluding noisy N-grams, considering the distribution of innovative ideas in abstracts, and suitably combining N-grams can effectively improve the performance of automatic innovative idea extraction. Unlike co-word and co-citation analysis, innovative-idea extraction aims to identify the differences in a paper from all previously published papers.