Egghe, L.; Ravichandra Rao, I.K.: ¬The influence of the broadness of a query of a topic on its h-index : models and examples of the h-index of n-grams (2008)
0.00
0.0025503722 = product of:
0.0051007443 = sum of:
0.0051007443 = product of:
0.010201489 = sum of:
0.010201489 = weight(_text_:d in 2009) [ClassicSimilarity], result of:
0.010201489 = score(doc=2009,freq=2.0), product of:
0.09719954 = queryWeight, product of:
1.899872 = idf(docFreq=17979, maxDocs=44218)
0.0511611 = queryNorm
0.104954086 = fieldWeight in 2009, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.899872 = idf(docFreq=17979, maxDocs=44218)
0.0390625 = fieldNorm(doc=2009)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Abstract
- The article studies the influence of the query formulation of a topic on its h-index. In order to generate pure random sets of documents, we used N-grams (N variable) to measure this influence: strings of zeros, truncated at the end. The used databases are WoS and Scopus. The formula h=T**1/alpha, proved in Egghe and Rousseau (2006) where T is the number of retrieved documents and is Lotka's exponent, is confirmed being a concavely increasing function of T. We also give a formula for the relation between h and N the length of the N-gram: h=D10**(-N/alpha) where D is a constant, a convexly decreasing function, which is found in our experiments. Nonlinear regression on h=T**1/alpha gives an estimation of , which can then be used to estimate the h-index of the entire database (Web of Science [WoS] and Scopus): h=S**1/alpha, , where S is the total number of documents in the database.