Document (#34010)

Author
Egghe, L.
Ravichandra Rao, I.K.
Title
¬The influence of the broadness of a query of a topic on its h-index : models and examples of the h-index of n-grams
Source
Journal of the American Society for Information Science and Technology. 59(2008) no.10, S.1688-1693
Year
2008
Series
Brief communication
Abstract
The article studies the influence of the query formulation of a topic on its h-index. In order to generate pure random sets of documents, we used N-grams (N variable) to measure this influence: strings of zeros, truncated at the end. The used databases are WoS and Scopus. The formula h=T**1/alpha, proved in Egghe and Rousseau (2006) where T is the number of retrieved documents and is Lotka's exponent, is confirmed being a concavely increasing function of T. We also give a formula for the relation between h and N the length of the N-gram: h=D10**(-N/alpha) where D is a constant, a convexly decreasing function, which is found in our experiments. Nonlinear regression on h=T**1/alpha gives an estimation of , which can then be used to estimate the h-index of the entire database (Web of Science [WoS] and Scopus): h=S**1/alpha, , where S is the total number of documents in the database.
Theme
Informetrie
Object
h-index

Similar documents (author)

  1. Egghe, L.; Ravichandra Rao, I.K.: Duality revisited : construction of fractional frequency distributions based on two dual Lotka laws (2002) 5.29
    5.288704 = sum of:
      5.288704 = sum of:
        1.7844702 = weight(author_txt:egghe in 1006) [ClassicSimilarity], result of:
          1.7844702 = score(doc=1006,freq=1.0), product of:
            0.5376723 = queryWeight, product of:
              7.5860133 = idf(docFreq=60, maxDocs=44218)
              0.07087679 = queryNorm
            3.3188808 = fieldWeight in 1006, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.5860133 = idf(docFreq=60, maxDocs=44218)
              0.4375 = fieldNorm(doc=1006)
        3.5042336 = weight(author_txt:ravichandra in 1006) [ClassicSimilarity], result of:
          3.5042336 = score(doc=1006,freq=1.0), product of:
            0.84315383 = queryWeight, product of:
              1.2522602 = boost
              9.499662 = idf(docFreq=8, maxDocs=44218)
              0.07087679 = queryNorm
            4.156102 = fieldWeight in 1006, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.499662 = idf(docFreq=8, maxDocs=44218)
              0.4375 = fieldNorm(doc=1006)
    
  2. Egghe, L.; Ravichandra Rao, I.K.: Study of different h-indices for groups of authors (2008) 5.29
    5.288704 = sum of:
      5.288704 = sum of:
        1.7844702 = weight(author_txt:egghe in 1878) [ClassicSimilarity], result of:
          1.7844702 = score(doc=1878,freq=1.0), product of:
            0.5376723 = queryWeight, product of:
              7.5860133 = idf(docFreq=60, maxDocs=44218)
              0.07087679 = queryNorm
            3.3188808 = fieldWeight in 1878, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.5860133 = idf(docFreq=60, maxDocs=44218)
              0.4375 = fieldNorm(doc=1878)
        3.5042336 = weight(author_txt:ravichandra in 1878) [ClassicSimilarity], result of:
          3.5042336 = score(doc=1878,freq=1.0), product of:
            0.84315383 = queryWeight, product of:
              1.2522602 = boost
              9.499662 = idf(docFreq=8, maxDocs=44218)
              0.07087679 = queryNorm
            4.156102 = fieldWeight in 1878, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.499662 = idf(docFreq=8, maxDocs=44218)
              0.4375 = fieldNorm(doc=1878)
    
  3. Rao, I.K.Ravichandra -> Ravichandra Rao, I.K.: 1.75
    1.7521168 = sum of:
      1.7521168 = product of:
        3.5042336 = sum of:
          3.5042336 = weight(author_txt:ravichandra in 241) [ClassicSimilarity], result of:
            3.5042336 = score(doc=241,freq=1.0), product of:
              0.84315383 = queryWeight, product of:
                1.2522602 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.07087679 = queryNorm
              4.156102 = fieldWeight in 241, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.4375 = fieldNorm(doc=241)
        0.5 = coord(1/2)
    
  4. Rao, I.K.R. -> Ravichandra Rao, I.K.: 1.75
    1.7521168 = sum of:
      1.7521168 = product of:
        3.5042336 = sum of:
          3.5042336 = weight(author_txt:ravichandra in 2795) [ClassicSimilarity], result of:
            3.5042336 = score(doc=2795,freq=1.0), product of:
              0.84315383 = queryWeight, product of:
                1.2522602 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.07087679 = queryNorm
              4.156102 = fieldWeight in 2795, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.4375 = fieldNorm(doc=2795)
        0.5 = coord(1/2)
    
  5. Ravichandra Rao, I.K.; Neelameghan, A.: From librametry to informetrcis : an overview and Ranganathan's contributions (1992) 1.75
    1.7521168 = sum of:
      1.7521168 = product of:
        3.5042336 = sum of:
          3.5042336 = weight(author_txt:ravichandra in 2964) [ClassicSimilarity], result of:
            3.5042336 = score(doc=2964,freq=1.0), product of:
              0.84315383 = queryWeight, product of:
                1.2522602 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.07087679 = queryNorm
              4.156102 = fieldWeight in 2964, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.4375 = fieldNorm(doc=2964)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Egghe, L.: ¬A new short proof of Naranan's theorem, explaining Lotka's law and Zipf's law (2010) 0.13
    0.12965468 = sum of:
      0.12965468 = product of:
        0.64827335 = sum of:
          0.040048916 = weight(abstract_txt:number in 3432) [ClassicSimilarity], result of:
            0.040048916 = score(doc=3432,freq=2.0), product of:
              0.073093034 = queryWeight, product of:
                1.0617138 = boost
                4.132649 = idf(docFreq=1927, maxDocs=44218)
                0.016658658 = queryNorm
              0.547917 = fieldWeight in 3432, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.132649 = idf(docFreq=1927, maxDocs=44218)
                0.09375 = fieldNorm(doc=3432)
          0.22146307 = weight(abstract_txt:lotka's in 3432) [ClassicSimilarity], result of:
            0.22146307 = score(doc=3432,freq=3.0), product of:
              0.15848054 = queryWeight, product of:
                1.1054585 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.016658658 = queryNorm
              1.3974149 = fieldWeight in 3432, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.09375 = fieldNorm(doc=3432)
          0.13970166 = weight(abstract_txt:exponent in 3432) [ClassicSimilarity], result of:
            0.13970166 = score(doc=3432,freq=1.0), product of:
              0.1681189 = queryWeight, product of:
                1.1385778 = boost
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.016658658 = queryNorm
              0.83096945 = fieldWeight in 3432, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.09375 = fieldNorm(doc=3432)
          0.069581896 = weight(abstract_txt:function in 3432) [ClassicSimilarity], result of:
            0.069581896 = score(doc=3432,freq=1.0), product of:
              0.13309333 = queryWeight, product of:
                1.4326748 = boost
                5.5765896 = idf(docFreq=454, maxDocs=44218)
                0.016658658 = queryNorm
              0.5228053 = fieldWeight in 3432, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5765896 = idf(docFreq=454, maxDocs=44218)
                0.09375 = fieldNorm(doc=3432)
          0.17747778 = weight(abstract_txt:formula in 3432) [ClassicSimilarity], result of:
            0.17747778 = score(doc=3432,freq=1.0), product of:
              0.24845904 = queryWeight, product of:
                1.9574779 = boost
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.016658658 = queryNorm
              0.71431404 = fieldWeight in 3432, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.09375 = fieldNorm(doc=3432)
        0.2 = coord(5/25)
    
  2. Burrell, Q.L.: Formulae for the h-index : a lack of robustness in Lotkaian informetrics? (2013) 0.11
    0.11498489 = sum of:
      0.11498489 = product of:
        0.57492447 = sum of:
          0.12459904 = weight(abstract_txt:egghe in 977) [ClassicSimilarity], result of:
            0.12459904 = score(doc=977,freq=2.0), product of:
              0.16201036 = queryWeight, product of:
                1.1177015 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.016658658 = queryNorm
              0.7690807 = fieldWeight in 977, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.0625 = fieldNorm(doc=977)
          0.13737082 = weight(abstract_txt:rousseau in 977) [ClassicSimilarity], result of:
            0.13737082 = score(doc=977,freq=2.0), product of:
              0.1729004 = queryWeight, product of:
                1.1546556 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.016658658 = queryNorm
              0.79450846 = fieldWeight in 977, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.0625 = fieldNorm(doc=977)
          0.04638793 = weight(abstract_txt:function in 977) [ClassicSimilarity], result of:
            0.04638793 = score(doc=977,freq=1.0), product of:
              0.13309333 = queryWeight, product of:
                1.4326748 = boost
                5.5765896 = idf(docFreq=454, maxDocs=44218)
                0.016658658 = queryNorm
              0.34853685 = fieldWeight in 977, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5765896 = idf(docFreq=454, maxDocs=44218)
                0.0625 = fieldNorm(doc=977)
          0.16732766 = weight(abstract_txt:formula in 977) [ClassicSimilarity], result of:
            0.16732766 = score(doc=977,freq=2.0), product of:
              0.24845904 = queryWeight, product of:
                1.9574779 = boost
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.016658658 = queryNorm
              0.67346174 = fieldWeight in 977, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.0625 = fieldNorm(doc=977)
          0.09923901 = weight(abstract_txt:index in 977) [ClassicSimilarity], result of:
            0.09923901 = score(doc=977,freq=3.0), product of:
              0.1930386 = queryWeight, product of:
                2.440094 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.016658658 = queryNorm
              0.5140889 = fieldWeight in 977, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.0625 = fieldNorm(doc=977)
        0.2 = coord(5/25)
    
  3. Carterette, B.; Can, F.: Comparing inverted files and signature files for searching a large lexicon (2005) 0.11
    0.10870247 = sum of:
      0.10870247 = product of:
        0.54351234 = sum of:
          0.08354733 = weight(abstract_txt:gram in 1029) [ClassicSimilarity], result of:
            0.08354733 = score(doc=1029,freq=1.0), product of:
              0.13475907 = queryWeight, product of:
                1.0193738 = boost
                7.935687 = idf(docFreq=42, maxDocs=44218)
                0.016658658 = queryNorm
              0.61997557 = fieldWeight in 1029, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.935687 = idf(docFreq=42, maxDocs=44218)
                0.078125 = fieldNorm(doc=1029)
          0.023599051 = weight(abstract_txt:number in 1029) [ClassicSimilarity], result of:
            0.023599051 = score(doc=1029,freq=1.0), product of:
              0.073093034 = queryWeight, product of:
                1.0617138 = boost
                4.132649 = idf(docFreq=1927, maxDocs=44218)
                0.016658658 = queryNorm
              0.3228632 = fieldWeight in 1029, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.132649 = idf(docFreq=1927, maxDocs=44218)
                0.078125 = fieldNorm(doc=1029)
          0.05156626 = weight(abstract_txt:where in 1029) [ClassicSimilarity], result of:
            0.05156626 = score(doc=1029,freq=1.0), product of:
              0.14089227 = queryWeight, product of:
                1.8053385 = boost
                4.684772 = idf(docFreq=1109, maxDocs=44218)
                0.016658658 = queryNorm
              0.36599782 = fieldWeight in 1029, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.684772 = idf(docFreq=1109, maxDocs=44218)
                0.078125 = fieldNorm(doc=1029)
          0.26075095 = weight(abstract_txt:grams in 1029) [ClassicSimilarity], result of:
            0.26075095 = score(doc=1029,freq=2.0), product of:
              0.28779742 = queryWeight, product of:
                2.1067495 = boost
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.016658658 = queryNorm
              0.9060225 = fieldWeight in 1029, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.078125 = fieldNorm(doc=1029)
          0.124048755 = weight(abstract_txt:index in 1029) [ClassicSimilarity], result of:
            0.124048755 = score(doc=1029,freq=3.0), product of:
              0.1930386 = queryWeight, product of:
                2.440094 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.016658658 = queryNorm
              0.64261115 = fieldWeight in 1029, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.078125 = fieldNorm(doc=1029)
        0.2 = coord(5/25)
    
  4. Cohen, J.D.: Highlights: language- and domain-independent automatic indexing terms for abstracting (1995) 0.11
    0.106774904 = sum of:
      0.106774904 = product of:
        0.5338745 = sum of:
          0.052050155 = weight(abstract_txt:topic in 1793) [ClassicSimilarity], result of:
            0.052050155 = score(doc=1793,freq=1.0), product of:
              0.10967479 = queryWeight, product of:
                1.3005375 = boost
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.016658658 = queryNorm
              0.4745863 = fieldWeight in 1793, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.09375 = fieldNorm(doc=1793)
          0.069581896 = weight(abstract_txt:function in 1793) [ClassicSimilarity], result of:
            0.069581896 = score(doc=1793,freq=1.0), product of:
              0.13309333 = queryWeight, product of:
                1.4326748 = boost
                5.5765896 = idf(docFreq=454, maxDocs=44218)
                0.016658658 = queryNorm
              0.5228053 = fieldWeight in 1793, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5765896 = idf(docFreq=454, maxDocs=44218)
                0.09375 = fieldNorm(doc=1793)
          0.042129375 = weight(abstract_txt:documents in 1793) [ClassicSimilarity], result of:
            0.042129375 = score(doc=1793,freq=1.0), product of:
              0.10903834 = queryWeight, product of:
                1.5881982 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.016658658 = queryNorm
              0.38637212 = fieldWeight in 1793, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.09375 = fieldNorm(doc=1793)
          0.22125451 = weight(abstract_txt:grams in 1793) [ClassicSimilarity], result of:
            0.22125451 = score(doc=1793,freq=1.0), product of:
              0.28779742 = queryWeight, product of:
                2.1067495 = boost
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.016658658 = queryNorm
              0.7687856 = fieldWeight in 1793, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.09375 = fieldNorm(doc=1793)
          0.14885852 = weight(abstract_txt:index in 1793) [ClassicSimilarity], result of:
            0.14885852 = score(doc=1793,freq=3.0), product of:
              0.1930386 = queryWeight, product of:
                2.440094 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.016658658 = queryNorm
              0.7711334 = fieldWeight in 1793, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.09375 = fieldNorm(doc=1793)
        0.2 = coord(5/25)
    
  5. Bodoff, D.: Test theory for evaluating reliability of IR test collections (2008) 0.10
    0.098741494 = sum of:
      0.098741494 = product of:
        0.49370745 = sum of:
          0.06569573 = weight(abstract_txt:estimation in 2085) [ClassicSimilarity], result of:
            0.06569573 = score(doc=2085,freq=1.0), product of:
              0.13321948 = queryWeight, product of:
                1.0135341 = boost
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.016658658 = queryNorm
              0.49313906 = fieldWeight in 2085, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.0625 = fieldNorm(doc=2085)
          0.01887924 = weight(abstract_txt:number in 2085) [ClassicSimilarity], result of:
            0.01887924 = score(doc=2085,freq=1.0), product of:
              0.073093034 = queryWeight, product of:
                1.0617138 = boost
                4.132649 = idf(docFreq=1927, maxDocs=44218)
                0.016658658 = queryNorm
              0.25829056 = fieldWeight in 2085, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.132649 = idf(docFreq=1927, maxDocs=44218)
                0.0625 = fieldNorm(doc=2085)
          0.04638793 = weight(abstract_txt:function in 2085) [ClassicSimilarity], result of:
            0.04638793 = score(doc=2085,freq=1.0), product of:
              0.13309333 = queryWeight, product of:
                1.4326748 = boost
                5.5765896 = idf(docFreq=454, maxDocs=44218)
                0.016658658 = queryNorm
              0.34853685 = fieldWeight in 2085, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5765896 = idf(docFreq=454, maxDocs=44218)
                0.0625 = fieldNorm(doc=2085)
          0.041253008 = weight(abstract_txt:where in 2085) [ClassicSimilarity], result of:
            0.041253008 = score(doc=2085,freq=1.0), product of:
              0.14089227 = queryWeight, product of:
                1.8053385 = boost
                4.684772 = idf(docFreq=1109, maxDocs=44218)
                0.016658658 = queryNorm
              0.29279825 = fieldWeight in 2085, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.684772 = idf(docFreq=1109, maxDocs=44218)
                0.0625 = fieldNorm(doc=2085)
          0.32149154 = weight(abstract_txt:alpha in 2085) [ClassicSimilarity], result of:
            0.32149154 = score(doc=2085,freq=1.0), product of:
              0.60955 = queryWeight, product of:
                4.3359985 = boost
                8.43879 = idf(docFreq=25, maxDocs=44218)
                0.016658658 = queryNorm
              0.5274244 = fieldWeight in 2085, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.43879 = idf(docFreq=25, maxDocs=44218)
                0.0625 = fieldNorm(doc=2085)
        0.2 = coord(5/25)