Document (#38812)

Author
Frické, M.
Title
Big data and its epistemology
Source
Journal of the Association for Information Science and Technology. 66(2015) no.4, S.651-661
Year
2015
Abstract
The article considers whether Big Data, in the form of data-driven science, will enable the discovery, or appraisal, of universal scientific theories, instrumentalist tools, or inductive inferences. It points out, initially, that such aspirations are similar to the now-discredited inductivist approach to science. On the positive side, Big Data may permit larger sample sizes, cheaper and more extensive testing of theories, and the continuous assessment of theories. On the negative side, data-driven science encourages passive data collection, as opposed to experimentation and testing, and hornswoggling ("unsound statistical fiddling"). The roles of theory and data in inductive algorithms, statistical modeling, and scientific discoveries are analyzed, and it is argued that theory is needed at every turn. Data-driven science is a chimera.
Content
Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23212/abstract.
Theme
Data Mining

Similar documents (author)

  1. Frické, M.: Faceted classification : orthogonal facets and graphs of foci? (2011) 5.76
    5.7574883 = sum of:
      5.7574883 = weight(author_txt:frické in 4850) [ClassicSimilarity], result of:
        5.7574883 = fieldWeight in 4850, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.211981 = idf(docFreq=11, maxDocs=44218)
          0.625 = fieldNorm(doc=4850)
    
  2. Frické, M.: Reflections on classification : Thomas Reid and bibliographic description (2013) 5.76
    5.7574883 = sum of:
      5.7574883 = weight(author_txt:frické in 1766) [ClassicSimilarity], result of:
        5.7574883 = fieldWeight in 1766, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.211981 = idf(docFreq=11, maxDocs=44218)
          0.625 = fieldNorm(doc=1766)
    
  3. Frické, M.: Logic and the organization of information (2012) 5.76
    5.7574883 = sum of:
      5.7574883 = weight(author_txt:frické in 1782) [ClassicSimilarity], result of:
        5.7574883 = fieldWeight in 1782, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.211981 = idf(docFreq=11, maxDocs=44218)
          0.625 = fieldNorm(doc=1782)
    
  4. Frické, M.: Logical division (2016) 5.76
    5.7574883 = sum of:
      5.7574883 = weight(author_txt:frické in 3183) [ClassicSimilarity], result of:
        5.7574883 = fieldWeight in 3183, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.211981 = idf(docFreq=11, maxDocs=44218)
          0.625 = fieldNorm(doc=3183)
    
  5. Frické, M.: Logic and librarianship (2017) 5.76
    5.7574883 = sum of:
      5.7574883 = weight(author_txt:frické in 3504) [ClassicSimilarity], result of:
        5.7574883 = fieldWeight in 3504, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.211981 = idf(docFreq=11, maxDocs=44218)
          0.625 = fieldNorm(doc=3504)
    

Similar documents (content)

  1. Fonseca, F.; Marcinkowski, M.; Davis, C.: Cyber-human systems of thought and understanding (2019) 0.10
    0.09918119 = sum of:
      0.09918119 = product of:
        0.49590594 = sum of:
          0.08478656 = weight(abstract_txt:encourages in 5011) [ClassicSimilarity], result of:
            0.08478656 = score(doc=5011,freq=1.0), product of:
              0.17144404 = queryWeight, product of:
                1.1972083 = boost
                7.912698 = idf(docFreq=43, maxDocs=44218)
                0.018097896 = queryNorm
              0.4945436 = fieldWeight in 5011, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.912698 = idf(docFreq=43, maxDocs=44218)
                0.0625 = fieldNorm(doc=5011)
          0.05928431 = weight(abstract_txt:scientific in 5011) [ClassicSimilarity], result of:
            0.05928431 = score(doc=5011,freq=3.0), product of:
              0.11798692 = queryWeight, product of:
                1.4045604 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.018097896 = queryNorm
              0.5024651 = fieldWeight in 5011, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.0625 = fieldNorm(doc=5011)
          0.055718333 = weight(abstract_txt:science in 5011) [ClassicSimilarity], result of:
            0.055718333 = score(doc=5011,freq=2.0), product of:
              0.16327253 = queryWeight, product of:
                2.3366575 = boost
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.018097896 = queryNorm
              0.3412597 = fieldWeight in 5011, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.0625 = fieldNorm(doc=5011)
          0.1274795 = weight(abstract_txt:driven in 5011) [ClassicSimilarity], result of:
            0.1274795 = score(doc=5011,freq=1.0), product of:
              0.3245177 = queryWeight, product of:
                2.8529117 = boost
                6.285241 = idf(docFreq=223, maxDocs=44218)
                0.018097896 = queryNorm
              0.39282757 = fieldWeight in 5011, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.285241 = idf(docFreq=223, maxDocs=44218)
                0.0625 = fieldNorm(doc=5011)
          0.16863726 = weight(abstract_txt:data in 5011) [ClassicSimilarity], result of:
            0.16863726 = score(doc=5011,freq=11.0), product of:
              0.2438405 = queryWeight, product of:
                4.038373 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.018097896 = queryNorm
              0.6915884 = fieldWeight in 5011, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=5011)
        0.2 = coord(5/25)
    
  2. Szostak, R.: Classifying science : phenomena, data, theory, method, practice (2004) 0.09
    0.08774907 = sum of:
      0.08774907 = product of:
        0.43874535 = sum of:
          0.04517823 = weight(abstract_txt:theory in 325) [ClassicSimilarity], result of:
            0.04517823 = score(doc=325,freq=2.0), product of:
              0.11268269 = queryWeight, product of:
                1.3726257 = boost
                4.5360413 = idf(docFreq=1287, maxDocs=44218)
                0.018097896 = queryNorm
              0.40093318 = fieldWeight in 325, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5360413 = idf(docFreq=1287, maxDocs=44218)
                0.0625 = fieldNorm(doc=325)
          0.048405442 = weight(abstract_txt:scientific in 325) [ClassicSimilarity], result of:
            0.048405442 = score(doc=325,freq=2.0), product of:
              0.11798692 = queryWeight, product of:
                1.4045604 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.018097896 = queryNorm
              0.4102611 = fieldWeight in 325, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.0625 = fieldNorm(doc=325)
          0.088098414 = weight(abstract_txt:science in 325) [ClassicSimilarity], result of:
            0.088098414 = score(doc=325,freq=5.0), product of:
              0.16327253 = queryWeight, product of:
                2.3366575 = boost
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.018097896 = queryNorm
              0.5395789 = fieldWeight in 325, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.0625 = fieldNorm(doc=325)
          0.1851561 = weight(abstract_txt:theories in 325) [ClassicSimilarity], result of:
            0.1851561 = score(doc=325,freq=4.0), product of:
              0.26219043 = queryWeight, product of:
                2.5643516 = boost
                5.649515 = idf(docFreq=422, maxDocs=44218)
                0.018097896 = queryNorm
              0.7061894 = fieldWeight in 325, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.649515 = idf(docFreq=422, maxDocs=44218)
                0.0625 = fieldNorm(doc=325)
          0.07190717 = weight(abstract_txt:data in 325) [ClassicSimilarity], result of:
            0.07190717 = score(doc=325,freq=2.0), product of:
              0.2438405 = queryWeight, product of:
                4.038373 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.018097896 = queryNorm
              0.29489428 = fieldWeight in 325, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=325)
        0.2 = coord(5/25)
    
  3. Wu, D.; Xu, H.; Sun, Y.; Lv, S.: What should we teach? : A human-centered data science graduate curriculum model design for iField schools (2023) 0.09
    0.085347325 = sum of:
      0.085347325 = product of:
        0.5334208 = sum of:
          0.10423946 = weight(abstract_txt:science in 961) [ClassicSimilarity], result of:
            0.10423946 = score(doc=961,freq=7.0), product of:
              0.16327253 = queryWeight, product of:
                2.3366575 = boost
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.018097896 = queryNorm
              0.6384384 = fieldWeight in 961, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.0625 = fieldNorm(doc=961)
          0.15788749 = weight(abstract_txt:inductive in 961) [ClassicSimilarity], result of:
            0.15788749 = score(doc=961,freq=1.0), product of:
              0.32694864 = queryWeight, product of:
                2.3381011 = boost
                7.7265954 = idf(docFreq=52, maxDocs=44218)
                0.018097896 = queryNorm
              0.4829122 = fieldWeight in 961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7265954 = idf(docFreq=52, maxDocs=44218)
                0.0625 = fieldNorm(doc=961)
          0.1274795 = weight(abstract_txt:driven in 961) [ClassicSimilarity], result of:
            0.1274795 = score(doc=961,freq=1.0), product of:
              0.3245177 = queryWeight, product of:
                2.8529117 = boost
                6.285241 = idf(docFreq=223, maxDocs=44218)
                0.018097896 = queryNorm
              0.39282757 = fieldWeight in 961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.285241 = idf(docFreq=223, maxDocs=44218)
                0.0625 = fieldNorm(doc=961)
          0.14381434 = weight(abstract_txt:data in 961) [ClassicSimilarity], result of:
            0.14381434 = score(doc=961,freq=8.0), product of:
              0.2438405 = queryWeight, product of:
                4.038373 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.018097896 = queryNorm
              0.58978856 = fieldWeight in 961, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=961)
        0.16 = coord(4/25)
    
  4. Vakkari, P.; Kuokkanen, M.: Theory growth in information science : applications of the theory of science to a theory of information seeking (1997) 0.08
    0.083754234 = sum of:
      0.083754234 = product of:
        0.52346396 = sum of:
          0.13693924 = weight(abstract_txt:theory in 4710) [ClassicSimilarity], result of:
            0.13693924 = score(doc=4710,freq=6.0), product of:
              0.11268269 = queryWeight, product of:
                1.3726257 = boost
                4.5360413 = idf(docFreq=1287, maxDocs=44218)
                0.018097896 = queryNorm
              1.2152642 = fieldWeight in 4710, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.5360413 = idf(docFreq=1287, maxDocs=44218)
                0.109375 = fieldNorm(doc=4710)
          0.05989868 = weight(abstract_txt:scientific in 4710) [ClassicSimilarity], result of:
            0.05989868 = score(doc=4710,freq=1.0), product of:
              0.11798692 = queryWeight, product of:
                1.4045604 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.018097896 = queryNorm
              0.5076722 = fieldWeight in 4710, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.109375 = fieldNorm(doc=4710)
          0.09750708 = weight(abstract_txt:science in 4710) [ClassicSimilarity], result of:
            0.09750708 = score(doc=4710,freq=2.0), product of:
              0.16327253 = queryWeight, product of:
                2.3366575 = boost
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.018097896 = queryNorm
              0.59720445 = fieldWeight in 4710, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.109375 = fieldNorm(doc=4710)
          0.22911899 = weight(abstract_txt:theories in 4710) [ClassicSimilarity], result of:
            0.22911899 = score(doc=4710,freq=2.0), product of:
              0.26219043 = queryWeight, product of:
                2.5643516 = boost
                5.649515 = idf(docFreq=422, maxDocs=44218)
                0.018097896 = queryNorm
              0.87386477 = fieldWeight in 4710, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.649515 = idf(docFreq=422, maxDocs=44218)
                0.109375 = fieldNorm(doc=4710)
        0.16 = coord(4/25)
    
  5. Fattahi, R.: Towards developing theories about data : a philosophical and scientific approach (2022) 0.08
    0.08292668 = sum of:
      0.08292668 = product of:
        0.5182918 = sum of:
          0.048405442 = weight(abstract_txt:scientific in 1101) [ClassicSimilarity], result of:
            0.048405442 = score(doc=1101,freq=2.0), product of:
              0.11798692 = queryWeight, product of:
                1.4045604 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.018097896 = queryNorm
              0.4102611 = fieldWeight in 1101, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.0625 = fieldNorm(doc=1101)
          0.03939881 = weight(abstract_txt:science in 1101) [ClassicSimilarity], result of:
            0.03939881 = score(doc=1101,freq=1.0), product of:
              0.16327253 = queryWeight, product of:
                2.3366575 = boost
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.018097896 = queryNorm
              0.24130704 = fieldWeight in 1101, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.0625 = fieldNorm(doc=1101)
          0.26185027 = weight(abstract_txt:theories in 1101) [ClassicSimilarity], result of:
            0.26185027 = score(doc=1101,freq=8.0), product of:
              0.26219043 = queryWeight, product of:
                2.5643516 = boost
                5.649515 = idf(docFreq=422, maxDocs=44218)
                0.018097896 = queryNorm
              0.9987026 = fieldWeight in 1101, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.649515 = idf(docFreq=422, maxDocs=44218)
                0.0625 = fieldNorm(doc=1101)
          0.16863726 = weight(abstract_txt:data in 1101) [ClassicSimilarity], result of:
            0.16863726 = score(doc=1101,freq=11.0), product of:
              0.2438405 = queryWeight, product of:
                4.038373 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.018097896 = queryNorm
              0.6915884 = fieldWeight in 1101, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=1101)
        0.16 = coord(4/25)