Document (#41547)

Author
Ferrer-i-Cancho, R.
Vitevitch, M.S.
Title
¬The origins of Zipf's meaning-frequency law
Source
Journal of the Association for Information Science and Technology. 69(2018) no.11, S.1369-1379
Year
2018
Abstract
In his pioneering research, G.K. Zipf observed that more frequent words tend to have more meanings, and showed that the number of meanings of a word grows as the square root of its frequency. He derived this relationship from two assumptions: that words follow Zipf's law for word frequencies (a power law dependency between frequency and rank) and Zipf's law of meaning distribution (a power law dependency between number of meanings and rank). Here we show that a single assumption on the joint probability of a word and a meaning suffices to infer Zipf's meaning-frequency law or relaxed versions. Interestingly, this assumption can be justified as the outcome of a biased random walk in the process of mental exploration.
Content
Vgl.: https://onlinelibrary.wiley.com/doi/10.1002/asi.24057.
Theme
Informetrie
Object
Zipf-Gesetz

Similar documents (author)

  1. Sapena, A. Ferrer- => Ferrer-Sapena, A.: 5.04
    5.0379567 = sum of:
      5.0379567 = weight(author_txt:ferrer in 5771) [ClassicSimilarity], result of:
        5.0379567 = fieldWeight in 5771, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.375 = fieldNorm(doc=5771)
    
  2. Miro, A.B.; Sahun, X.B.; Ferrer, M.E.: ¬La Library of Congress Classification à la Biblioteca de la Universitat Pompeu Fabra (1993) 3.56
    3.5623734 = sum of:
      3.5623734 = weight(author_txt:ferrer in 7090) [ClassicSimilarity], result of:
        3.5623734 = fieldWeight in 7090, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.375 = fieldNorm(doc=7090)
    
  3. Ferrer Morillo, L.M.; Portillo de Hernández, R.: Tesauros transdisciplinarios : del reduccionismo científico a la unidad del conocimiento (2007) 3.56
    3.5623734 = sum of:
      3.5623734 = weight(author_txt:ferrer in 1107) [ClassicSimilarity], result of:
        3.5623734 = fieldWeight in 1107, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.375 = fieldNorm(doc=1107)
    
  4. Ferrer-i-Cancho, R.; Gavaldà, R.: ¬The frequency spectrum of finite samples from the intermittent silence process (2009) 3.56
    3.5623734 = sum of:
      3.5623734 = weight(author_txt:ferrer in 2762) [ClassicSimilarity], result of:
        3.5623734 = fieldWeight in 2762, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.375 = fieldNorm(doc=2762)
    
  5. Cole, T.W.; Mischo, W.H.; Habing, T.G.; Ferrer, R.H.: Using XML and XSLT to process and render online journals (2001) 2.97
    2.9686446 = sum of:
      2.9686446 = weight(author_txt:ferrer in 4802) [ClassicSimilarity], result of:
        2.9686446 = fieldWeight in 4802, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.3125 = fieldNorm(doc=4802)
    

Similar documents (content)

  1. Sun, Q.; Shaw, D.; Davis, C.H.: ¬A model for estimating the occurence of same-frequency words and the boundary between high- and low-frequency words in texts (1999) 0.20
    0.19508778 = sum of:
      0.19508778 = product of:
        0.81286573 = sum of:
          0.08709279 = weight(abstract_txt:root in 3063) [ClassicSimilarity], result of:
            0.08709279 = score(doc=3063,freq=1.0), product of:
              0.116718724 = queryWeight, product of:
                1.1477356 = boost
                7.9592175 = idf(docFreq=41, maxDocs=44218)
                0.012776982 = queryNorm
              0.74617666 = fieldWeight in 3063, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9592175 = idf(docFreq=41, maxDocs=44218)
                0.09375 = fieldNorm(doc=3063)
          0.02438297 = weight(abstract_txt:number in 3063) [ClassicSimilarity], result of:
            0.02438297 = score(doc=3063,freq=1.0), product of:
              0.06293421 = queryWeight, product of:
                1.1918731 = boost
                4.132649 = idf(docFreq=1927, maxDocs=44218)
                0.012776982 = queryNorm
              0.38743585 = fieldWeight in 3063, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.132649 = idf(docFreq=1927, maxDocs=44218)
                0.09375 = fieldNorm(doc=3063)
          0.10380342 = weight(abstract_txt:square in 3063) [ClassicSimilarity], result of:
            0.10380342 = score(doc=3063,freq=1.0), product of:
              0.13120796 = queryWeight, product of:
                1.216891 = boost
                8.43879 = idf(docFreq=25, maxDocs=44218)
                0.012776982 = queryNorm
              0.7911366 = fieldWeight in 3063, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.43879 = idf(docFreq=25, maxDocs=44218)
                0.09375 = fieldNorm(doc=3063)
          0.12979862 = weight(abstract_txt:words in 3063) [ClassicSimilarity], result of:
            0.12979862 = score(doc=3063,freq=6.0), product of:
              0.10559063 = queryWeight, product of:
                1.5438293 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.012776982 = queryNorm
              1.2292627 = fieldWeight in 3063, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.09375 = fieldNorm(doc=3063)
          0.083211966 = weight(abstract_txt:word in 3063) [ClassicSimilarity], result of:
            0.083211966 = score(doc=3063,freq=1.0), product of:
              0.16329893 = queryWeight, product of:
                2.3513858 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.012776982 = queryNorm
              0.50956833 = fieldWeight in 3063, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.09375 = fieldNorm(doc=3063)
          0.38457596 = weight(abstract_txt:frequency in 3063) [ClassicSimilarity], result of:
            0.38457596 = score(doc=3063,freq=7.0), product of:
              0.26069206 = queryWeight, product of:
                3.4305637 = boost
                5.947494 = idf(docFreq=313, maxDocs=44218)
                0.012776982 = queryNorm
              1.4752116 = fieldWeight in 3063, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.947494 = idf(docFreq=313, maxDocs=44218)
                0.09375 = fieldNorm(doc=3063)
        0.24 = coord(6/25)
    
  2. Arsenault, C.: Aggregation consistency and frequency of Chinese words and characters (2006) 0.19
    0.18749216 = sum of:
      0.18749216 = product of:
        0.78121734 = sum of:
          0.07339395 = weight(abstract_txt:zipf in 609) [ClassicSimilarity], result of:
            0.07339395 = score(doc=609,freq=1.0), product of:
              0.13645415 = queryWeight, product of:
                1.2409805 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.012776982 = queryNorm
              0.5378653 = fieldWeight in 609, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.0625 = fieldNorm(doc=609)
          0.013701915 = weight(abstract_txt:that in 609) [ClassicSimilarity], result of:
            0.013701915 = score(doc=609,freq=5.0), product of:
              0.041377485 = queryWeight, product of:
                1.3667328 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.012776982 = queryNorm
              0.3311442 = fieldWeight in 609, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=609)
          0.04995951 = weight(abstract_txt:words in 609) [ClassicSimilarity], result of:
            0.04995951 = score(doc=609,freq=2.0), product of:
              0.10559063 = queryWeight, product of:
                1.5438293 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.012776982 = queryNorm
              0.47314343 = fieldWeight in 609, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.0625 = fieldNorm(doc=609)
          0.055474646 = weight(abstract_txt:word in 609) [ClassicSimilarity], result of:
            0.055474646 = score(doc=609,freq=1.0), product of:
              0.16329893 = queryWeight, product of:
                2.3513858 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.012776982 = queryNorm
              0.33971223 = fieldWeight in 609, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.0625 = fieldNorm(doc=609)
          0.19380806 = weight(abstract_txt:frequency in 609) [ClassicSimilarity], result of:
            0.19380806 = score(doc=609,freq=4.0), product of:
              0.26069206 = queryWeight, product of:
                3.4305637 = boost
                5.947494 = idf(docFreq=313, maxDocs=44218)
                0.012776982 = queryNorm
              0.74343675 = fieldWeight in 609, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.947494 = idf(docFreq=313, maxDocs=44218)
                0.0625 = fieldNorm(doc=609)
          0.39487925 = weight(abstract_txt:zipf's in 609) [ClassicSimilarity], result of:
            0.39487925 = score(doc=609,freq=1.0), product of:
              0.6650834 = queryWeight, product of:
                5.4794836 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.012776982 = queryNorm
              0.5937289 = fieldWeight in 609, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.0625 = fieldNorm(doc=609)
        0.24 = coord(6/25)
    
  3. Egghe, L.: Zipfian and Lotkaian continuous concentration theory (2005) 0.16
    0.1563972 = sum of:
      0.1563972 = product of:
        0.97748244 = sum of:
          0.15890257 = weight(abstract_txt:zipf in 3678) [ClassicSimilarity], result of:
            0.15890257 = score(doc=3678,freq=3.0), product of:
              0.13645415 = queryWeight, product of:
                1.2409805 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.012776982 = queryNorm
              1.1645125 = fieldWeight in 3678, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.078125 = fieldNorm(doc=3678)
          0.010832314 = weight(abstract_txt:that in 3678) [ClassicSimilarity], result of:
            0.010832314 = score(doc=3678,freq=2.0), product of:
              0.041377485 = queryWeight, product of:
                1.3667328 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.012776982 = queryNorm
              0.26179248 = fieldWeight in 3678, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=3678)
          0.10969309 = weight(abstract_txt:power in 3678) [ClassicSimilarity], result of:
            0.10969309 = score(doc=3678,freq=4.0), product of:
              0.12200644 = queryWeight, product of:
                1.6595027 = boost
                5.754088 = idf(docFreq=380, maxDocs=44218)
                0.012776982 = queryNorm
              0.8990762 = fieldWeight in 3678, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.754088 = idf(docFreq=380, maxDocs=44218)
                0.078125 = fieldNorm(doc=3678)
          0.6980545 = weight(abstract_txt:zipf's in 3678) [ClassicSimilarity], result of:
            0.6980545 = score(doc=3678,freq=2.0), product of:
              0.6650834 = queryWeight, product of:
                5.4794836 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.012776982 = queryNorm
              1.0495744 = fieldWeight in 3678, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.078125 = fieldNorm(doc=3678)
        0.16 = coord(4/25)
    
  4. Egghe, L.: ¬A new short proof of Naranan's theorem, explaining Lotka's law and Zipf's law (2010) 0.12
    0.12303147 = sum of:
      0.12303147 = product of:
        0.7689467 = sum of:
          0.12914637 = weight(abstract_txt:grows in 3432) [ClassicSimilarity], result of:
            0.12914637 = score(doc=3432,freq=2.0), product of:
              0.12046584 = queryWeight, product of:
                1.1660135 = boost
                8.085969 = idf(docFreq=36, maxDocs=44218)
                0.012776982 = queryNorm
              1.0720581 = fieldWeight in 3432, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.085969 = idf(docFreq=36, maxDocs=44218)
                0.09375 = fieldNorm(doc=3432)
          0.034482725 = weight(abstract_txt:number in 3432) [ClassicSimilarity], result of:
            0.034482725 = score(doc=3432,freq=2.0), product of:
              0.06293421 = queryWeight, product of:
                1.1918731 = boost
                4.132649 = idf(docFreq=1927, maxDocs=44218)
                0.012776982 = queryNorm
              0.547917 = fieldWeight in 3432, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.132649 = idf(docFreq=1927, maxDocs=44218)
                0.09375 = fieldNorm(doc=3432)
          0.012998777 = weight(abstract_txt:that in 3432) [ClassicSimilarity], result of:
            0.012998777 = score(doc=3432,freq=2.0), product of:
              0.041377485 = queryWeight, product of:
                1.3667328 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.012776982 = queryNorm
              0.314151 = fieldWeight in 3432, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.09375 = fieldNorm(doc=3432)
          0.59231883 = weight(abstract_txt:zipf's in 3432) [ClassicSimilarity], result of:
            0.59231883 = score(doc=3432,freq=1.0), product of:
              0.6650834 = queryWeight, product of:
                5.4794836 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.012776982 = queryNorm
              0.89059335 = fieldWeight in 3432, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.09375 = fieldNorm(doc=3432)
        0.16 = coord(4/25)
    
  5. Symonds, M.; Bruza, P.; Zuccon, G.; Koopman, B.; Sitbon, L.; Turner, I.: Automatic query expansion : a structural linguistic perspective (2014) 0.12
    0.12261982 = sum of:
      0.12261982 = product of:
        0.51091594 = sum of:
          0.056093633 = weight(abstract_txt:infer in 1338) [ClassicSimilarity], result of:
            0.056093633 = score(doc=1338,freq=1.0), product of:
              0.11406585 = queryWeight, product of:
                1.1346173 = boost
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.012776982 = queryNorm
              0.49176535 = fieldWeight in 1338, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.0625 = fieldNorm(doc=1338)
          0.013701915 = weight(abstract_txt:that in 1338) [ClassicSimilarity], result of:
            0.013701915 = score(doc=1338,freq=5.0), product of:
              0.041377485 = queryWeight, product of:
                1.3667328 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.012776982 = queryNorm
              0.3311442 = fieldWeight in 1338, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=1338)
          0.12428805 = weight(abstract_txt:dependency in 1338) [ClassicSimilarity], result of:
            0.12428805 = score(doc=1338,freq=1.0), product of:
              0.24425465 = queryWeight, product of:
                2.3480537 = boost
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.012776982 = queryNorm
              0.5088462 = fieldWeight in 1338, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.0625 = fieldNorm(doc=1338)
          0.0960849 = weight(abstract_txt:word in 1338) [ClassicSimilarity], result of:
            0.0960849 = score(doc=1338,freq=3.0), product of:
              0.16329893 = queryWeight, product of:
                2.3513858 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.012776982 = queryNorm
              0.5883988 = fieldWeight in 1338, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.0625 = fieldNorm(doc=1338)
          0.10683319 = weight(abstract_txt:meanings in 1338) [ClassicSimilarity], result of:
            0.10683319 = score(doc=1338,freq=1.0), product of:
              0.25276938 = queryWeight, product of:
                2.925462 = boost
                6.7624135 = idf(docFreq=138, maxDocs=44218)
                0.012776982 = queryNorm
              0.42265084 = fieldWeight in 1338, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7624135 = idf(docFreq=138, maxDocs=44218)
                0.0625 = fieldNorm(doc=1338)
          0.11391427 = weight(abstract_txt:meaning in 1338) [ClassicSimilarity], result of:
            0.11391427 = score(doc=1338,freq=2.0), product of:
              0.23046696 = queryWeight, product of:
                3.225566 = boost
                5.592094 = idf(docFreq=447, maxDocs=44218)
                0.012776982 = queryNorm
              0.49427593 = fieldWeight in 1338, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.592094 = idf(docFreq=447, maxDocs=44218)
                0.0625 = fieldNorm(doc=1338)
        0.24 = coord(6/25)