Document (#34011)

Author
Egghe, L.
Ravichandra Rao, I.K.
Title
¬The influence of the broadness of a query of a topic on its h-index : models and examples of the h-index of n-grams
Source
Journal of the American Society for Information Science and Technology. 59(2008) no.10, S.1688-1693
Year
2008
Series
Brief communication
Abstract
The article studies the influence of the query formulation of a topic on its h-index. In order to generate pure random sets of documents, we used N-grams (N variable) to measure this influence: strings of zeros, truncated at the end. The used databases are WoS and Scopus. The formula h=T**1/alpha, proved in Egghe and Rousseau (2006) where T is the number of retrieved documents and is Lotka's exponent, is confirmed being a concavely increasing function of T. We also give a formula for the relation between h and N the length of the N-gram: h=D10**(-N/alpha) where D is a constant, a convexly decreasing function, which is found in our experiments. Nonlinear regression on h=T**1/alpha gives an estimation of , which can then be used to estimate the h-index of the entire database (Web of Science [WoS] and Scopus): h=S**1/alpha, , where S is the total number of documents in the database.
Theme
Informetrie
Object
h-index

Similar documents (author)

  1. Egghe, L.; Ravichandra Rao, I.K.: Duality revisited : construction of fractional frequency distributions based on two dual Lotka laws (2002) 5.26
    5.2614017 = sum of:
      5.2614017 = sum of:
        1.7710968 = weight(author_txt:egghe in 2007) [ClassicSimilarity], result of:
          1.7710968 = score(doc=2007,freq=1.0), product of:
            0.53677046 = queryWeight, product of:
              7.5418105 = idf(docFreq=60, maxDocs=42306)
              0.071172625 = queryNorm
            3.2995422 = fieldWeight in 2007, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.5418105 = idf(docFreq=60, maxDocs=42306)
              0.4375 = fieldNorm(doc=2007)
        3.4903047 = weight(author_txt:ravichandra in 2007) [ClassicSimilarity], result of:
          3.4903047 = score(doc=2007,freq=1.0), product of:
            0.84372836 = queryWeight, product of:
              1.2537386 = boost
              9.45546 = idf(docFreq=8, maxDocs=42306)
              0.071172625 = queryNorm
            4.1367636 = fieldWeight in 2007, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.45546 = idf(docFreq=8, maxDocs=42306)
              0.4375 = fieldNorm(doc=2007)
    
  2. Egghe, L.; Ravichandra Rao, I.K.: Study of different h-indices for groups of authors (2008) 5.26
    5.2614017 = sum of:
      5.2614017 = sum of:
        1.7710968 = weight(author_txt:egghe in 3879) [ClassicSimilarity], result of:
          1.7710968 = score(doc=3879,freq=1.0), product of:
            0.53677046 = queryWeight, product of:
              7.5418105 = idf(docFreq=60, maxDocs=42306)
              0.071172625 = queryNorm
            3.2995422 = fieldWeight in 3879, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.5418105 = idf(docFreq=60, maxDocs=42306)
              0.4375 = fieldNorm(doc=3879)
        3.4903047 = weight(author_txt:ravichandra in 3879) [ClassicSimilarity], result of:
          3.4903047 = score(doc=3879,freq=1.0), product of:
            0.84372836 = queryWeight, product of:
              1.2537386 = boost
              9.45546 = idf(docFreq=8, maxDocs=42306)
              0.071172625 = queryNorm
            4.1367636 = fieldWeight in 3879, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.45546 = idf(docFreq=8, maxDocs=42306)
              0.4375 = fieldNorm(doc=3879)
    
  3. Rao, I.K.Ravichandra -> Ravichandra Rao, I.K.: 1.75
    1.7451524 = sum of:
      1.7451524 = product of:
        3.4903047 = sum of:
          3.4903047 = weight(author_txt:ravichandra in 241) [ClassicSimilarity], result of:
            3.4903047 = score(doc=241,freq=1.0), product of:
              0.84372836 = queryWeight, product of:
                1.2537386 = boost
                9.45546 = idf(docFreq=8, maxDocs=42306)
                0.071172625 = queryNorm
              4.1367636 = fieldWeight in 241, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.45546 = idf(docFreq=8, maxDocs=42306)
                0.4375 = fieldNorm(doc=241)
        0.5 = coord(1/2)
    
  4. Rao, I.K.R. -> Ravichandra Rao, I.K.: 1.75
    1.7451524 = sum of:
      1.7451524 = product of:
        3.4903047 = sum of:
          3.4903047 = weight(author_txt:ravichandra in 2795) [ClassicSimilarity], result of:
            3.4903047 = score(doc=2795,freq=1.0), product of:
              0.84372836 = queryWeight, product of:
                1.2537386 = boost
                9.45546 = idf(docFreq=8, maxDocs=42306)
                0.071172625 = queryNorm
              4.1367636 = fieldWeight in 2795, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.45546 = idf(docFreq=8, maxDocs=42306)
                0.4375 = fieldNorm(doc=2795)
        0.5 = coord(1/2)
    
  5. Ravichandra Rao, I.K.; Neelameghan, A.: From librametry to informetrcis : an overview and Ranganathan's contributions (1992) 1.75
    1.7451524 = sum of:
      1.7451524 = product of:
        3.4903047 = sum of:
          3.4903047 = weight(author_txt:ravichandra in 2964) [ClassicSimilarity], result of:
            3.4903047 = score(doc=2964,freq=1.0), product of:
              0.84372836 = queryWeight, product of:
                1.2537386 = boost
                9.45546 = idf(docFreq=8, maxDocs=42306)
                0.071172625 = queryNorm
              4.1367636 = fieldWeight in 2964, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.45546 = idf(docFreq=8, maxDocs=42306)
                0.4375 = fieldNorm(doc=2964)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Egghe, L.: ¬A new short proof of Naranan's theorem, explaining Lotka's law and Zipf's law (2010) 0.13
    0.1290665 = sum of:
      0.1290665 = product of:
        0.64533246 = sum of:
          0.039893206 = weight(abstract_txt:number in 433) [ClassicSimilarity], result of:
            0.039893206 = score(doc=433,freq=2.0), product of:
              0.07293628 = queryWeight, product of:
                1.0603193 = boost
                4.125428 = idf(docFreq=1857, maxDocs=42306)
                0.016673926 = queryNorm
              0.5469597 = fieldWeight in 433, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.125428 = idf(docFreq=1857, maxDocs=42306)
                0.09375 = fieldNorm(doc=433)
          0.21836275 = weight(abstract_txt:lotka's in 433) [ClassicSimilarity], result of:
            0.21836275 = score(doc=433,freq=3.0), product of:
              0.15706868 = queryWeight, product of:
                1.1002584 = boost
                8.561642 = idf(docFreq=21, maxDocs=42306)
                0.016673926 = queryNorm
              1.3902373 = fieldWeight in 433, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.561642 = idf(docFreq=21, maxDocs=42306)
                0.09375 = fieldNorm(doc=433)
          0.1406694 = weight(abstract_txt:exponent in 433) [ClassicSimilarity], result of:
            0.1406694 = score(doc=433,freq=1.0), product of:
              0.16897045 = queryWeight, product of:
                1.1411829 = boost
                8.8800955 = idf(docFreq=15, maxDocs=42306)
                0.016673926 = queryNorm
              0.8325089 = fieldWeight in 433, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.8800955 = idf(docFreq=15, maxDocs=42306)
                0.09375 = fieldNorm(doc=433)
          0.0705796 = weight(abstract_txt:function in 433) [ClassicSimilarity], result of:
            0.0705796 = score(doc=433,freq=1.0), product of:
              0.13442306 = queryWeight, product of:
                1.4394672 = boost
                5.600595 = idf(docFreq=424, maxDocs=42306)
                0.016673926 = queryNorm
              0.52505577 = fieldWeight in 433, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.600595 = idf(docFreq=424, maxDocs=42306)
                0.09375 = fieldNorm(doc=433)
          0.17582747 = weight(abstract_txt:formula in 433) [ClassicSimilarity], result of:
            0.17582747 = score(doc=433,freq=1.0), product of:
              0.24702759 = queryWeight, product of:
                1.951361 = boost
                7.5922413 = idf(docFreq=57, maxDocs=42306)
                0.016673926 = queryNorm
              0.7117726 = fieldWeight in 433, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5922413 = idf(docFreq=57, maxDocs=42306)
                0.09375 = fieldNorm(doc=433)
        0.2 = coord(5/25)
    
  2. Burrell, Q.L.: Formulae for the h-index : a lack of robustness in Lotkaian informetrics? (2013) 0.11
    0.11394723 = sum of:
      0.11394723 = product of:
        0.5697361 = sum of:
          0.122875564 = weight(abstract_txt:egghe in 2978) [ClassicSimilarity], result of:
            0.122875564 = score(doc=2978,freq=2.0), product of:
              0.1605852 = queryWeight, product of:
                1.1125066 = boost
                8.656952 = idf(docFreq=19, maxDocs=42306)
                0.016673926 = queryNorm
              0.7651737 = fieldWeight in 2978, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.656952 = idf(docFreq=19, maxDocs=42306)
                0.0625 = fieldNorm(doc=2978)
          0.13553712 = weight(abstract_txt:rousseau in 2978) [ClassicSimilarity], result of:
            0.13553712 = score(doc=2978,freq=2.0), product of:
              0.17143546 = queryWeight, product of:
                1.1494768 = boost
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.016673926 = queryNorm
              0.79060143 = fieldWeight in 2978, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.0625 = fieldNorm(doc=2978)
          0.04705307 = weight(abstract_txt:function in 2978) [ClassicSimilarity], result of:
            0.04705307 = score(doc=2978,freq=1.0), product of:
              0.13442306 = queryWeight, product of:
                1.4394672 = boost
                5.600595 = idf(docFreq=424, maxDocs=42306)
                0.016673926 = queryNorm
              0.3500372 = fieldWeight in 2978, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.600595 = idf(docFreq=424, maxDocs=42306)
                0.0625 = fieldNorm(doc=2978)
          0.16577172 = weight(abstract_txt:formula in 2978) [ClassicSimilarity], result of:
            0.16577172 = score(doc=2978,freq=2.0), product of:
              0.24702759 = queryWeight, product of:
                1.951361 = boost
                7.5922413 = idf(docFreq=57, maxDocs=42306)
                0.016673926 = queryNorm
              0.6710656 = fieldWeight in 2978, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5922413 = idf(docFreq=57, maxDocs=42306)
                0.0625 = fieldNorm(doc=2978)
          0.09849863 = weight(abstract_txt:index in 2978) [ClassicSimilarity], result of:
            0.09849863 = score(doc=2978,freq=3.0), product of:
              0.19216378 = queryWeight, product of:
                2.4339724 = boost
                4.7349787 = idf(docFreq=1009, maxDocs=42306)
                0.016673926 = queryNorm
              0.51257646 = fieldWeight in 2978, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7349787 = idf(docFreq=1009, maxDocs=42306)
                0.0625 = fieldNorm(doc=2978)
        0.2 = coord(5/25)
    
  3. Carterette, B.; Can, F.: Comparing inverted files and signature files for searching a large lexicon (2005) 0.11
    0.108382165 = sum of:
      0.108382165 = product of:
        0.5419108 = sum of:
          0.085361645 = weight(abstract_txt:gram in 3030) [ClassicSimilarity], result of:
            0.085361645 = score(doc=3030,freq=1.0), product of:
              0.1367646 = queryWeight, product of:
                1.0266838 = boost
                7.9891224 = idf(docFreq=38, maxDocs=42306)
                0.016673926 = queryNorm
              0.62415016 = fieldWeight in 3030, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9891224 = idf(docFreq=38, maxDocs=42306)
                0.078125 = fieldNorm(doc=3030)
          0.023507295 = weight(abstract_txt:number in 3030) [ClassicSimilarity], result of:
            0.023507295 = score(doc=3030,freq=1.0), product of:
              0.07293628 = queryWeight, product of:
                1.0603193 = boost
                4.125428 = idf(docFreq=1857, maxDocs=42306)
                0.016673926 = queryNorm
              0.32229906 = fieldWeight in 3030, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.125428 = idf(docFreq=1857, maxDocs=42306)
                0.078125 = fieldNorm(doc=3030)
          0.05301484 = weight(abstract_txt:where in 3030) [ClassicSimilarity], result of:
            0.05301484 = score(doc=3030,freq=1.0), product of:
              0.14358328 = queryWeight, product of:
                1.822059 = boost
                4.726107 = idf(docFreq=1018, maxDocs=42306)
                0.016673926 = queryNorm
              0.3692271 = fieldWeight in 3030, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.726107 = idf(docFreq=1018, maxDocs=42306)
                0.078125 = fieldNorm(doc=3030)
          0.25690374 = weight(abstract_txt:grams in 3030) [ClassicSimilarity], result of:
            0.25690374 = score(doc=3030,freq=2.0), product of:
              0.28508788 = queryWeight, product of:
                2.096304 = boost
                8.156177 = idf(docFreq=32, maxDocs=42306)
                0.016673926 = queryNorm
              0.9011387 = fieldWeight in 3030, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.156177 = idf(docFreq=32, maxDocs=42306)
                0.078125 = fieldNorm(doc=3030)
          0.123123296 = weight(abstract_txt:index in 3030) [ClassicSimilarity], result of:
            0.123123296 = score(doc=3030,freq=3.0), product of:
              0.19216378 = queryWeight, product of:
                2.4339724 = boost
                4.7349787 = idf(docFreq=1009, maxDocs=42306)
                0.016673926 = queryNorm
              0.6407206 = fieldWeight in 3030, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7349787 = idf(docFreq=1009, maxDocs=42306)
                0.078125 = fieldNorm(doc=3030)
        0.2 = coord(5/25)
    
  4. Cohen, J.D.: Highlights: language- and domain-independent automatic indexing terms for abstracting (1995) 0.11
    0.10635399 = sum of:
      0.10635399 = product of:
        0.53176993 = sum of:
          0.0534352 = weight(abstract_txt:topic in 1862) [ClassicSimilarity], result of:
            0.0534352 = score(doc=1862,freq=1.0), product of:
              0.111662135 = queryWeight, product of:
                1.3119516 = boost
                5.104465 = idf(docFreq=697, maxDocs=42306)
                0.016673926 = queryNorm
              0.47854358 = fieldWeight in 1862, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.104465 = idf(docFreq=697, maxDocs=42306)
                0.09375 = fieldNorm(doc=1862)
          0.0705796 = weight(abstract_txt:function in 1862) [ClassicSimilarity], result of:
            0.0705796 = score(doc=1862,freq=1.0), product of:
              0.13442306 = queryWeight, product of:
                1.4394672 = boost
                5.600595 = idf(docFreq=424, maxDocs=42306)
                0.016673926 = queryNorm
              0.52505577 = fieldWeight in 1862, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.600595 = idf(docFreq=424, maxDocs=42306)
                0.09375 = fieldNorm(doc=1862)
          0.042017158 = weight(abstract_txt:documents in 1862) [ClassicSimilarity], result of:
            0.042017158 = score(doc=1862,freq=1.0), product of:
              0.10889364 = queryWeight, product of:
                1.5867618 = boost
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.016673926 = queryNorm
              0.38585502 = fieldWeight in 1862, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.09375 = fieldNorm(doc=1862)
          0.21799004 = weight(abstract_txt:grams in 1862) [ClassicSimilarity], result of:
            0.21799004 = score(doc=1862,freq=1.0), product of:
              0.28508788 = queryWeight, product of:
                2.096304 = boost
                8.156177 = idf(docFreq=32, maxDocs=42306)
                0.016673926 = queryNorm
              0.7646415 = fieldWeight in 1862, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.156177 = idf(docFreq=32, maxDocs=42306)
                0.09375 = fieldNorm(doc=1862)
          0.14774795 = weight(abstract_txt:index in 1862) [ClassicSimilarity], result of:
            0.14774795 = score(doc=1862,freq=3.0), product of:
              0.19216378 = queryWeight, product of:
                2.4339724 = boost
                4.7349787 = idf(docFreq=1009, maxDocs=42306)
                0.016673926 = queryNorm
              0.7688647 = fieldWeight in 1862, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7349787 = idf(docFreq=1009, maxDocs=42306)
                0.09375 = fieldNorm(doc=1862)
        0.2 = coord(5/25)
    
  5. Bodoff, D.: Test theory for evaluating reliability of IR test collections (2008) 0.10
    0.098081395 = sum of:
      0.098081395 = product of:
        0.49040696 = sum of:
          0.06524248 = weight(abstract_txt:estimation in 4086) [ClassicSimilarity], result of:
            0.06524248 = score(doc=4086,freq=1.0), product of:
              0.13266574 = queryWeight, product of:
                1.0111818 = boost
                7.8684945 = idf(docFreq=43, maxDocs=42306)
                0.016673926 = queryNorm
              0.4917809 = fieldWeight in 4086, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8684945 = idf(docFreq=43, maxDocs=42306)
                0.0625 = fieldNorm(doc=4086)
          0.018805837 = weight(abstract_txt:number in 4086) [ClassicSimilarity], result of:
            0.018805837 = score(doc=4086,freq=1.0), product of:
              0.07293628 = queryWeight, product of:
                1.0603193 = boost
                4.125428 = idf(docFreq=1857, maxDocs=42306)
                0.016673926 = queryNorm
              0.25783926 = fieldWeight in 4086, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.125428 = idf(docFreq=1857, maxDocs=42306)
                0.0625 = fieldNorm(doc=4086)
          0.04705307 = weight(abstract_txt:function in 4086) [ClassicSimilarity], result of:
            0.04705307 = score(doc=4086,freq=1.0), product of:
              0.13442306 = queryWeight, product of:
                1.4394672 = boost
                5.600595 = idf(docFreq=424, maxDocs=42306)
                0.016673926 = queryNorm
              0.3500372 = fieldWeight in 4086, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.600595 = idf(docFreq=424, maxDocs=42306)
                0.0625 = fieldNorm(doc=4086)
          0.042411875 = weight(abstract_txt:where in 4086) [ClassicSimilarity], result of:
            0.042411875 = score(doc=4086,freq=1.0), product of:
              0.14358328 = queryWeight, product of:
                1.822059 = boost
                4.726107 = idf(docFreq=1018, maxDocs=42306)
                0.016673926 = queryNorm
              0.2953817 = fieldWeight in 4086, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.726107 = idf(docFreq=1018, maxDocs=42306)
                0.0625 = fieldNorm(doc=4086)
          0.3168937 = weight(abstract_txt:alpha in 4086) [ClassicSimilarity], result of:
            0.3168937 = score(doc=4086,freq=1.0), product of:
              0.6039962 = queryWeight, product of:
                4.3151608 = boost
                8.3945875 = idf(docFreq=25, maxDocs=42306)
                0.016673926 = queryNorm
              0.5246617 = fieldWeight in 4086, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.3945875 = idf(docFreq=25, maxDocs=42306)
                0.0625 = fieldNorm(doc=4086)
        0.2 = coord(5/25)