Document (#38734)

Author
Wasserman, M.
Mukherjee, S.
Scott, K.
Zeng, X.H.T.
Radicchi, F.
Amaral, L.A.N.
Title
Correlations between user voting data, budget, and box office for films in the internet movie database
Source
Journal of the Association for Information Science and Technology. 66(2015) no.4, pp. 858-868
Year
2015
Abstract
The Internet Movie Database (IMDb) is one of the most-visited websites in the world and the premier source for information on films. Similar to Wikipedia, much of IMDb's information is user contributed. IMDb also allows users to voice their opinion on the quality of films through voting. We investigate whether there is a connection between user voting data and economic film characteristics. We perform distribution and correlation analysis on a set of films chosen to mitigate effects of bias due to the language and country of origin of films. Production budget, box office gross, and total number of user votes for films are consistent with double-log normal distributions for certain time periods. Both total gross and user votes are consistent with a double-log normal distribution from the late 1980s onward while for budget it extends from 1935 to 1979. In addition, we find a strong correlation between number of user votes and the economic statistics, particularly budget. Remarkably, we find no evidence for a correlation between number of votes and average user rating. Our results suggest that total user votes is an indicator of a film's prominence or notability, which can be quantified by its promotional costs.
Content
Cf.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23213/abstract.
Form
Films
Object
Internet Movie Database

Similar documents (author)

  1. Amaral, L.A.N. -> Nunes Amaral, L.A.: 1.43
    1.4264998 = sum of:
      1.4264998 = product of:
        4.2794995 = sum of:
          4.2794995 = weight(author_txt:amaral in 2186) [ClassicSimilarity], result of:
            4.2794995 = score(doc=2186,freq=2.0), product of:
              0.72173554 = queryWeight, product of:
                1.2634151 = boost
                9.583449 = idf(docFreq=7, maxDocs=42740)
                0.05960877 = queryNorm
              5.9294567 = fieldWeight in 2186, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.583449 = idf(docFreq=7, maxDocs=42740)
                0.4375 = fieldNorm(doc=2186)
        0.33333334 = coord(1/3)
    
  2. Morato Amaral, R. -> Amaral, R.M.: 1.43
    1.4264998 = sum of:
      1.4264998 = product of:
        4.2794995 = sum of:
          4.2794995 = weight(author_txt:amaral in 2892) [ClassicSimilarity], result of:
            4.2794995 = score(doc=2892,freq=2.0), product of:
              0.72173554 = queryWeight, product of:
                1.2634151 = boost
                9.583449 = idf(docFreq=7, maxDocs=42740)
                0.05960877 = queryNorm
              5.9294567 = fieldWeight in 2892, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.583449 = idf(docFreq=7, maxDocs=42740)
                0.4375 = fieldNorm(doc=2892)
        0.33333334 = coord(1/3)
    
  3. Amaral, L.A. Nunes -> Nunes Amaral, L.A.: 1.22
    1.2227142 = sum of:
      1.2227142 = product of:
        3.6681423 = sum of:
          3.6681423 = weight(author_txt:amaral in 4143) [ClassicSimilarity], result of:
            3.6681423 = score(doc=4143,freq=2.0), product of:
              0.72173554 = queryWeight, product of:
                1.2634151 = boost
                9.583449 = idf(docFreq=7, maxDocs=42740)
                0.05960877 = queryNorm
              5.0823913 = fieldWeight in 4143, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.583449 = idf(docFreq=7, maxDocs=42740)
                0.375 = fieldNorm(doc=4143)
        0.33333334 = coord(1/3)
    
  4. Scott, D.S.: Subject classification and natural-language processing for retrieval in large databases (1989) 0.89
    0.89162517 = sum of:
      0.89162517 = product of:
        2.6748755 = sum of:
          2.6748755 = weight(author_txt:scott in 967) [ClassicSimilarity], result of:
            2.6748755 = score(doc=967,freq=1.0), product of:
              0.52407545 = queryWeight, product of:
                1.0765989 = boost
                8.166383 = idf(docFreq=32, maxDocs=42740)
                0.05960877 = queryNorm
              5.103989 = fieldWeight in 967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.166383 = idf(docFreq=32, maxDocs=42740)
                0.625 = fieldNorm(doc=967)
        0.33333334 = coord(1/3)
    
  5. Scott, E.: ¬The evolution of bibliographic systems in the USA, 1876-1945 (1976/77) 0.89
    0.89162517 = sum of:
      0.89162517 = product of:
        2.6748755 = sum of:
          2.6748755 = weight(author_txt:scott in 4365) [ClassicSimilarity], result of:
            2.6748755 = score(doc=4365,freq=1.0), product of:
              0.52407545 = queryWeight, product of:
                1.0765989 = boost
                8.166383 = idf(docFreq=32, maxDocs=42740)
                0.05960877 = queryNorm
              5.103989 = fieldWeight in 4365, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.166383 = idf(docFreq=32, maxDocs=42740)
                0.625 = fieldNorm(doc=4365)
        0.33333334 = coord(1/3)
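
Note on the score trees: each tree above multiplies a queryWeight (boost x idf x queryNorm) by a fieldWeight (tf x idf x fieldNorm) and then scales by coord. The sketch below reproduces the first author hit (doc=2186) from the figures printed in its tree. It assumes tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which is consistent with the values shown; the function names are illustrative and not part of any library.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Assumed idf form; it matches the idf values printed in the trees."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    """One weight(...) node: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm           # 0.72173554 in the tree
    field_weight = math.sqrt(freq) * idf * field_norm  # 5.9294567 in the tree
    return query_weight * field_weight


# Figures copied from the tree for hit 1 (author_txt:amaral in 2186).
raw = term_score(
    freq=2.0,
    doc_freq=7,
    max_docs=42740,
    boost=1.2634151,
    query_norm=0.05960877,
    field_norm=0.4375,
)

# coord(1/3): one of the three query clauses matched.
final = raw * (1 / 3)

print(round(raw, 4), round(final, 4))
```

The tree values come from single-precision arithmetic, so a double-precision recomputation should agree only to about four or five decimal places.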
    

Similar documents (content)

  1. Collins, B.R.: Webwatch (1996) 0.12
    0.121832296 = sum of:
      0.121832296 = product of:
        1.0152692 = sum of:
          0.047716975 = weight(abstract_txt:number in 26) [ClassicSimilarity], result of:
            0.047716975 = score(doc=26,freq=1.0), product of:
              0.07401579 = queryWeight, product of:
                1.4197452 = boost
                4.1259933 = idf(docFreq=1875, maxDocs=42740)
                0.012635296 = queryNorm
              0.64468646 = fieldWeight in 26, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1259933 = idf(docFreq=1875, maxDocs=42740)
                0.15625 = fieldNorm(doc=26)
          0.26528567 = weight(abstract_txt:movie in 26) [ClassicSimilarity], result of:
            0.26528567 = score(doc=26,freq=1.0), product of:
              0.20291829 = queryWeight, product of:
                1.91939 = boost
                8.367054 = idf(docFreq=26, maxDocs=42740)
                0.012635296 = queryNorm
              1.3073522 = fieldWeight in 26, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.367054 = idf(docFreq=26, maxDocs=42740)
                0.15625 = fieldNorm(doc=26)
          0.7022665 = weight(abstract_txt:films in 26) [ClassicSimilarity], result of:
            0.7022665 = score(doc=26,freq=1.0), product of:
              0.5600417 = queryWeight, product of:
                5.5229793 = boost
                8.025305 = idf(docFreq=37, maxDocs=42740)
                0.012635296 = queryNorm
              1.2539539 = fieldWeight in 26, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.025305 = idf(docFreq=37, maxDocs=42740)
                0.15625 = fieldNorm(doc=26)
        0.12 = coord(3/25)
    
  2. Yee, M.M.: Manifestations and near-equivalents of moving image works : a research project (1994) 0.07
    0.066333555 = sum of:
      0.066333555 = product of:
        0.5527796 = sum of:
          0.015377494 = weight(abstract_txt:between in 931) [ClassicSimilarity], result of:
            0.015377494 = score(doc=931,freq=1.0), product of:
              0.07053517 = queryWeight, product of:
                1.6003702 = boost
                3.4881876 = idf(docFreq=3549, maxDocs=42740)
                0.012635296 = queryNorm
              0.21801172 = fieldWeight in 931, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4881876 = idf(docFreq=3549, maxDocs=42740)
                0.0625 = fieldNorm(doc=931)
          0.050857645 = weight(abstract_txt:total in 931) [ClassicSimilarity], result of:
            0.050857645 = score(doc=931,freq=1.0), product of:
              0.14225687 = queryWeight, product of:
                1.9682709 = boost
                5.7200913 = idf(docFreq=380, maxDocs=42740)
                0.012635296 = queryNorm
              0.3575057 = fieldWeight in 931, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7200913 = idf(docFreq=380, maxDocs=42740)
                0.0625 = fieldNorm(doc=931)
          0.4865445 = weight(abstract_txt:films in 931) [ClassicSimilarity], result of:
            0.4865445 = score(doc=931,freq=3.0), product of:
              0.5600417 = queryWeight, product of:
                5.5229793 = boost
                8.025305 = idf(docFreq=37, maxDocs=42740)
                0.012635296 = queryNorm
              0.8687647 = fieldWeight in 931, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.025305 = idf(docFreq=37, maxDocs=42740)
                0.0625 = fieldNorm(doc=931)
        0.12 = coord(3/25)
    
  3. Száva-Kováts, E.: Indirect-collective referencing (ICR) in the elite journal literature of physics : II: a literature science study on the level of communications (2002) 0.06
    0.06090807 = sum of:
      0.06090807 = product of:
        0.25378364 = sum of:
          0.016134983 = weight(abstract_txt:find in 1181) [ClassicSimilarity], result of:
            0.016134983 = score(doc=1181,freq=1.0), product of:
              0.07002883 = queryWeight, product of:
                1.1275636 = boost
                4.915304 = idf(docFreq=851, maxDocs=42740)
                0.012635296 = queryNorm
              0.23040488 = fieldWeight in 1181, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.915304 = idf(docFreq=851, maxDocs=42740)
                0.046875 = fieldNorm(doc=1181)
          0.014315092 = weight(abstract_txt:number in 1181) [ClassicSimilarity], result of:
            0.014315092 = score(doc=1181,freq=1.0), product of:
              0.07401579 = queryWeight, product of:
                1.4197452 = boost
                4.1259933 = idf(docFreq=1875, maxDocs=42740)
                0.012635296 = queryNorm
              0.19340593 = fieldWeight in 1181, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1259933 = idf(docFreq=1875, maxDocs=42740)
                0.046875 = fieldNorm(doc=1181)
          0.02306624 = weight(abstract_txt:between in 1181) [ClassicSimilarity], result of:
            0.02306624 = score(doc=1181,freq=4.0), product of:
              0.07053517 = queryWeight, product of:
                1.6003702 = boost
                3.4881876 = idf(docFreq=3549, maxDocs=42740)
                0.012635296 = queryNorm
              0.32701758 = fieldWeight in 1181, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4881876 = idf(docFreq=3549, maxDocs=42740)
                0.046875 = fieldNorm(doc=1181)
          0.057773482 = weight(abstract_txt:normal in 1181) [ClassicSimilarity], result of:
            0.057773482 = score(doc=1181,freq=1.0), product of:
              0.16390173 = queryWeight, product of:
                1.725021 = boost
                7.519756 = idf(docFreq=62, maxDocs=42740)
                0.012635296 = queryNorm
              0.35248855 = fieldWeight in 1181, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.519756 = idf(docFreq=62, maxDocs=42740)
                0.046875 = fieldNorm(doc=1181)
          0.038143232 = weight(abstract_txt:total in 1181) [ClassicSimilarity], result of:
            0.038143232 = score(doc=1181,freq=1.0), product of:
              0.14225687 = queryWeight, product of:
                1.9682709 = boost
                5.7200913 = idf(docFreq=380, maxDocs=42740)
                0.012635296 = queryNorm
              0.2681293 = fieldWeight in 1181, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7200913 = idf(docFreq=380, maxDocs=42740)
                0.046875 = fieldNorm(doc=1181)
          0.10435063 = weight(abstract_txt:correlation in 1181) [ClassicSimilarity], result of:
            0.10435063 = score(doc=1181,freq=4.0), product of:
              0.17529586 = queryWeight, product of:
                2.1849124 = boost
                6.3496847 = idf(docFreq=202, maxDocs=42740)
                0.012635296 = queryNorm
              0.5952829 = fieldWeight in 1181, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.3496847 = idf(docFreq=202, maxDocs=42740)
                0.046875 = fieldNorm(doc=1181)
        0.24 = coord(6/25)
    
  4. Hidalgo, C.: Why information grows : the evolution of order, from atoms to economies (2015) 0.06
    0.057690006 = sum of:
      0.057690006 = product of:
        0.48075005 = sum of:
          0.09107806 = weight(abstract_txt:economic in 4155) [ClassicSimilarity], result of:
            0.09107806 = score(doc=4155,freq=9.0), product of:
              0.10673156 = queryWeight, product of:
                1.3920314 = boost
                6.068179 = idf(docFreq=268, maxDocs=42740)
                0.012635296 = queryNorm
              0.85333765 = fieldWeight in 4155, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                6.068179 = idf(docFreq=268, maxDocs=42740)
                0.046875 = fieldNorm(doc=4155)
          0.09172554 = weight(abstract_txt:gross in 4155) [ClassicSimilarity], result of:
            0.09172554 = score(doc=4155,freq=1.0), product of:
              0.22306153 = queryWeight, product of:
                2.012403 = boost
                8.772519 = idf(docFreq=17, maxDocs=42740)
                0.012635296 = queryNorm
              0.41121185 = fieldWeight in 4155, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.772519 = idf(docFreq=17, maxDocs=42740)
                0.046875 = fieldNorm(doc=4155)
          0.29794645 = weight(abstract_txt:films in 4155) [ClassicSimilarity], result of:
            0.29794645 = score(doc=4155,freq=2.0), product of:
              0.5600417 = queryWeight, product of:
                5.5229793 = boost
                8.025305 = idf(docFreq=37, maxDocs=42740)
                0.012635296 = queryNorm
              0.5320076 = fieldWeight in 4155, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.025305 = idf(docFreq=37, maxDocs=42740)
                0.046875 = fieldNorm(doc=4155)
        0.12 = coord(3/25)
    
  5. Chowdhury, S.; Gibb, F.: Relationship among activities and problems causing uncertainty in information seeking and retrieval (2009) 0.05
    0.05353274 = sum of:
      0.05353274 = product of:
        0.2676637 = sum of:
          0.014315092 = weight(abstract_txt:number in 4845) [ClassicSimilarity], result of:
            0.014315092 = score(doc=4845,freq=1.0), product of:
              0.07401579 = queryWeight, product of:
                1.4197452 = boost
                4.1259933 = idf(docFreq=1875, maxDocs=42740)
                0.012635296 = queryNorm
              0.19340593 = fieldWeight in 4845, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1259933 = idf(docFreq=1875, maxDocs=42740)
                0.046875 = fieldNorm(doc=4845)
          0.02306624 = weight(abstract_txt:between in 4845) [ClassicSimilarity], result of:
            0.02306624 = score(doc=4845,freq=4.0), product of:
              0.07053517 = queryWeight, product of:
                1.6003702 = boost
                3.4881876 = idf(docFreq=3549, maxDocs=42740)
                0.012635296 = queryNorm
              0.32701758 = fieldWeight in 4845, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4881876 = idf(docFreq=3549, maxDocs=42740)
                0.046875 = fieldNorm(doc=4845)
          0.053942677 = weight(abstract_txt:total in 4845) [ClassicSimilarity], result of:
            0.053942677 = score(doc=4845,freq=2.0), product of:
              0.14225687 = queryWeight, product of:
                1.9682709 = boost
                5.7200913 = idf(docFreq=380, maxDocs=42740)
                0.012635296 = queryNorm
              0.37919205 = fieldWeight in 4845, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7200913 = idf(docFreq=380, maxDocs=42740)
                0.046875 = fieldNorm(doc=4845)
          0.13804291 = weight(abstract_txt:correlation in 4845) [ClassicSimilarity], result of:
            0.13804291 = score(doc=4845,freq=7.0), product of:
              0.17529586 = queryWeight, product of:
                2.1849124 = boost
                6.3496847 = idf(docFreq=202, maxDocs=42740)
                0.012635296 = queryNorm
              0.7874853 = fieldWeight in 4845, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.3496847 = idf(docFreq=202, maxDocs=42740)
                0.046875 = fieldNorm(doc=4845)
          0.03829676 = weight(abstract_txt:user in 4845) [ClassicSimilarity], result of:
            0.03829676 = score(doc=4845,freq=2.0), product of:
              0.1569938 = queryWeight, product of:
                3.3765552 = boost
                3.6797917 = idf(docFreq=2930, maxDocs=42740)
                0.012635296 = queryNorm
              0.24393803 = fieldWeight in 4845, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6797917 = idf(docFreq=2930, maxDocs=42740)
                0.046875 = fieldNorm(doc=4845)
        0.2 = coord(5/25)
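
Note on multi-term hits: for the content matches, the per-term scores are summed and the sum is multiplied by coord(matched/total), so a hit that matches few of the 25 query terms is scaled down sharply. The sketch below recomputes content hit 1 (doc=26) from the boost, idf, queryNorm and fieldNorm values printed in its tree. The helper names are illustrative, and tf = sqrt(freq) is assumed, as shown in the tree.

```python
import math

QUERY_NORM = 0.012635296  # shared by all terms of the content query
FIELD_NORM = 0.15625      # fieldNorm(doc=26)

# (term, freq, boost, idf) copied from the tree for doc=26.
TERMS = [
    ("number", 1.0, 1.4197452, 4.1259933),
    ("movie", 1.0, 1.91939, 8.367054),
    ("films", 1.0, 5.5229793, 8.025305),
]


def term_score(freq, boost, idf, query_norm, field_norm):
    """One weight(...) node: queryWeight * fieldWeight."""
    query_weight = boost * idf * query_norm
    field_weight = math.sqrt(freq) * idf * field_norm
    return query_weight * field_weight


def combined_score(terms, matched, total, query_norm, field_norm):
    """Sum the per-term scores, then apply coord(matched/total)."""
    raw = sum(
        term_score(freq, boost, idf, query_norm, field_norm)
        for _, freq, boost, idf in terms
    )
    return raw, raw * matched / total


raw_sum, score = combined_score(TERMS, 3, 25, QUERY_NORM, FIELD_NORM)
print(round(raw_sum, 4), round(score, 4))
```

In this hit, "films" contributes about 0.70 of the raw sum of about 1.02, which reflects its high boost and idf relative to "number" and "movie".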