Document (#38813)

Author
Frické, M.
Title
Big data and its epistemology
Source
Journal of the Association for Information Science and Technology. 66(2015) no.4, p.651-661
Year
2015
Abstract
The article considers whether Big Data, in the form of data-driven science, will enable the discovery, or appraisal, of universal scientific theories, instrumentalist tools, or inductive inferences. It points out, initially, that such aspirations are similar to the now-discredited inductivist approach to science. On the positive side, Big Data may permit larger sample sizes, cheaper and more extensive testing of theories, and the continuous assessment of theories. On the negative side, data-driven science encourages passive data collection, as opposed to experimentation and testing, and hornswoggling ("unsound statistical fiddling"). The roles of theory and data in inductive algorithms, statistical modeling, and scientific discoveries are analyzed, and it is argued that theory is needed at every turn. Data-driven science is a chimera.
Content
Cf.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23212/abstract.
Theme
Data Mining

Similar documents (author)

  1. Frické, M.: Faceted classification : orthogonal facets and graphs of foci? (2011) 5.78
    5.7842436 = sum of:
      5.7842436 = weight(author_txt:frické in 1851) [ClassicSimilarity], result of:
        5.7842436 = fieldWeight in 1851, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.254789 = idf(docFreq=10, maxDocs=42306)
          0.625 = fieldNorm(doc=1851)
    
  2. Frické, M.: Reflections on classification : Thomas Reid and bibliographic description (2013) 5.78
    5.7842436 = sum of:
      5.7842436 = weight(author_txt:frické in 3767) [ClassicSimilarity], result of:
        5.7842436 = fieldWeight in 3767, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.254789 = idf(docFreq=10, maxDocs=42306)
          0.625 = fieldNorm(doc=3767)
    
  3. Frické, M.: Logic and the organization of information (2012) 5.78
    5.7842436 = sum of:
      5.7842436 = weight(author_txt:frické in 3783) [ClassicSimilarity], result of:
        5.7842436 = fieldWeight in 3783, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.254789 = idf(docFreq=10, maxDocs=42306)
          0.625 = fieldNorm(doc=3783)
    
  4. Frické, M.: Logical division (2016) 5.78
    5.7842436 = sum of:
      5.7842436 = weight(author_txt:frické in 102) [ClassicSimilarity], result of:
        5.7842436 = fieldWeight in 102, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.254789 = idf(docFreq=10, maxDocs=42306)
          0.625 = fieldNorm(doc=102)
    
  5. Frické, M.: Logic and librarianship (2017) 5.78
    5.7842436 = sum of:
      5.7842436 = weight(author_txt:frické in 423) [ClassicSimilarity], result of:
        5.7842436 = fieldWeight in 423, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.254789 = idf(docFreq=10, maxDocs=42306)
          0.625 = fieldNorm(doc=423)
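
Note on the author scores: all five author matches receive the same value because the term frequency, document frequency, and field norm are identical in each explanation. The following is a minimal sketch of that arithmetic, assuming the standard ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The function names are illustrative only and are not part of any library API.

```python
import math

def classic_tf(freq: float) -> float:
    # Assumed ClassicSimilarity term-frequency factor: square root of the raw count.
    return math.sqrt(freq)

def classic_idf(doc_freq: int, max_docs: int) -> float:
    # Assumed ClassicSimilarity inverse document frequency.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    # fieldWeight = tf * idf * fieldNorm, as listed in the explanations above.
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm

# Values copied from the author explanations: freq=1, docFreq=10, maxDocs=42306, fieldNorm=0.625.
author_score = field_weight(freq=1.0, doc_freq=10, max_docs=42306, field_norm=0.625)
print(f"{author_score:.4f}")
```

The result agrees with the listed 5.7842436 up to single-precision rounding.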
    

Similar documents (content)

  1. Fonseca, F.; Marcinkowski, M.; Davis, C.: Cyber-human systems of thought and understanding (2019) 0.10
    0.101536505 = sum of:
      0.101536505 = product of:
        0.5076825 = sum of:
          0.08509245 = weight(abstract_txt:encourages in 1930) [ClassicSimilarity], result of:
            0.08509245 = score(doc=1930,freq=1.0), product of:
              0.17095838 = queryWeight, product of:
                1.2025836 = boost
                7.9638047 = idf(docFreq=39, maxDocs=42306)
                0.017850671 = queryNorm
              0.4977378 = fieldWeight in 1930, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9638047 = idf(docFreq=39, maxDocs=42306)
                0.0625 = fieldNorm(doc=1930)
          0.060664073 = weight(abstract_txt:scientific in 1930) [ClassicSimilarity], result of:
            0.060664073 = score(doc=1930,freq=3.0), product of:
              0.11918465 = queryWeight, product of:
                1.4200225 = boost
                4.7018695 = idf(docFreq=1043, maxDocs=42306)
                0.017850671 = queryNorm
              0.5089923 = fieldWeight in 1930, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7018695 = idf(docFreq=1043, maxDocs=42306)
                0.0625 = fieldNorm(doc=1930)
          0.05767203 = weight(abstract_txt:science in 1930) [ClassicSimilarity], result of:
            0.05767203 = score(doc=1930,freq=2.0), product of:
              0.16619447 = queryWeight, product of:
                2.3714192 = boost
                3.9260306 = idf(docFreq=2267, maxDocs=42306)
                0.017850671 = queryNorm
              0.34701535 = fieldWeight in 1930, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9260306 = idf(docFreq=2267, maxDocs=42306)
                0.0625 = fieldNorm(doc=1930)
          0.13123453 = weight(abstract_txt:driven in 1930) [ClassicSimilarity], result of:
            0.13123453 = score(doc=1930,freq=1.0), product of:
              0.32913107 = queryWeight, product of:
                2.890115 = boost
                6.3796844 = idf(docFreq=194, maxDocs=42306)
                0.017850671 = queryNorm
              0.39873028 = fieldWeight in 1930, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3796844 = idf(docFreq=194, maxDocs=42306)
                0.0625 = fieldNorm(doc=1930)
          0.17301944 = weight(abstract_txt:data in 1930) [ClassicSimilarity], result of:
            0.17301944 = score(doc=1930,freq=11.0), product of:
              0.24675089 = queryWeight, product of:
                4.0864334 = boost
                3.382671 = idf(docFreq=3904, maxDocs=42306)
                0.017850671 = queryNorm
              0.7011907 = fieldWeight in 1930, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                3.382671 = idf(docFreq=3904, maxDocs=42306)
                0.0625 = fieldNorm(doc=1930)
        0.2 = coord(5/25)
    
  2. Szostak, R.: Classifying science : phenomena, data, theory, method, practice (2004) 0.09
    0.09007815 = sum of:
      0.09007815 = product of:
        0.45039076 = sum of:
          0.046146512 = weight(abstract_txt:theory in 2326) [ClassicSimilarity], result of:
            0.046146512 = score(doc=2326,freq=2.0), product of:
              0.11369001 = queryWeight, product of:
                1.3869034 = boost
                4.592208 = idf(docFreq=1164, maxDocs=42306)
                0.017850671 = queryNorm
              0.40589768 = fieldWeight in 2326, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.592208 = idf(docFreq=1164, maxDocs=42306)
                0.0625 = fieldNorm(doc=2326)
          0.049532004 = weight(abstract_txt:scientific in 2326) [ClassicSimilarity], result of:
            0.049532004 = score(doc=2326,freq=2.0), product of:
              0.11918465 = queryWeight, product of:
                1.4200225 = boost
                4.7018695 = idf(docFreq=1043, maxDocs=42306)
                0.017850671 = queryNorm
              0.41559047 = fieldWeight in 2326, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7018695 = idf(docFreq=1043, maxDocs=42306)
                0.0625 = fieldNorm(doc=2326)
          0.09118749 = weight(abstract_txt:science in 2326) [ClassicSimilarity], result of:
            0.09118749 = score(doc=2326,freq=5.0), product of:
              0.16619447 = queryWeight, product of:
                2.3714192 = boost
                3.9260306 = idf(docFreq=2267, maxDocs=42306)
                0.017850671 = queryNorm
              0.5486795 = fieldWeight in 2326, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.9260306 = idf(docFreq=2267, maxDocs=42306)
                0.0625 = fieldNorm(doc=2326)
          0.189749 = weight(abstract_txt:theories in 2326) [ClassicSimilarity], result of:
            0.189749 = score(doc=2326,freq=4.0), product of:
              0.26511633 = queryWeight, product of:
                2.5938745 = boost
                5.725758 = idf(docFreq=374, maxDocs=42306)
                0.017850671 = queryNorm
              0.71571976 = fieldWeight in 2326, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.725758 = idf(docFreq=374, maxDocs=42306)
                0.0625 = fieldNorm(doc=2326)
          0.07377573 = weight(abstract_txt:data in 2326) [ClassicSimilarity], result of:
            0.07377573 = score(doc=2326,freq=2.0), product of:
              0.24675089 = queryWeight, product of:
                4.0864334 = boost
                3.382671 = idf(docFreq=3904, maxDocs=42306)
                0.017850671 = queryNorm
              0.2989887 = fieldWeight in 2326, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.382671 = idf(docFreq=3904, maxDocs=42306)
                0.0625 = fieldNorm(doc=2326)
        0.2 = coord(5/25)
    
  3. Vakkari, P.; Kuokkanen, M.: Theory growth in information science : applications of the theory of science to a theory of information seeking (1997) 0.09
    0.085903265 = sum of:
      0.085903265 = product of:
        0.5368954 = sum of:
          0.13987419 = weight(abstract_txt:theory in 711) [ClassicSimilarity], result of:
            0.13987419 = score(doc=711,freq=6.0), product of:
              0.11369001 = queryWeight, product of:
                1.3869034 = boost
                4.592208 = idf(docFreq=1164, maxDocs=42306)
                0.017850671 = queryNorm
              1.230312 = fieldWeight in 711, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.592208 = idf(docFreq=1164, maxDocs=42306)
                0.109375 = fieldNorm(doc=711)
          0.06129273 = weight(abstract_txt:scientific in 711) [ClassicSimilarity], result of:
            0.06129273 = score(doc=711,freq=1.0), product of:
              0.11918465 = queryWeight, product of:
                1.4200225 = boost
                4.7018695 = idf(docFreq=1043, maxDocs=42306)
                0.017850671 = queryNorm
              0.51426697 = fieldWeight in 711, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7018695 = idf(docFreq=1043, maxDocs=42306)
                0.109375 = fieldNorm(doc=711)
          0.10092606 = weight(abstract_txt:science in 711) [ClassicSimilarity], result of:
            0.10092606 = score(doc=711,freq=2.0), product of:
              0.16619447 = queryWeight, product of:
                2.3714192 = boost
                3.9260306 = idf(docFreq=2267, maxDocs=42306)
                0.017850671 = queryNorm
              0.60727686 = fieldWeight in 711, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9260306 = idf(docFreq=2267, maxDocs=42306)
                0.109375 = fieldNorm(doc=711)
          0.23480241 = weight(abstract_txt:theories in 711) [ClassicSimilarity], result of:
            0.23480241 = score(doc=711,freq=2.0), product of:
              0.26511633 = queryWeight, product of:
                2.5938745 = boost
                5.725758 = idf(docFreq=374, maxDocs=42306)
                0.017850671 = queryNorm
              0.885658 = fieldWeight in 711, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.725758 = idf(docFreq=374, maxDocs=42306)
                0.109375 = fieldNorm(doc=711)
        0.16 = coord(4/25)
    
  4. Teich, E.; Degaetano-Ortlieb, S.; Fankhauser, P.; Kermes, H.; Lapshinova-Koltunski, E.: The linguistic construal of disciplinarity : a data-mining approach using register features (2016) 0.08
    0.076586165 = sum of:
      0.076586165 = product of:
        0.38293082 = sum of:
          0.04078814 = weight(abstract_txt:theory in 16) [ClassicSimilarity], result of:
            0.04078814 = score(doc=16,freq=1.0), product of:
              0.11369001 = queryWeight, product of:
                1.3869034 = boost
                4.592208 = idf(docFreq=1164, maxDocs=42306)
                0.017850671 = queryNorm
              0.35876626 = fieldWeight in 16, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.592208 = idf(docFreq=1164, maxDocs=42306)
                0.078125 = fieldNorm(doc=16)
          0.061915006 = weight(abstract_txt:scientific in 16) [ClassicSimilarity], result of:
            0.061915006 = score(doc=16,freq=2.0), product of:
              0.11918465 = queryWeight, product of:
                1.4200225 = boost
                4.7018695 = idf(docFreq=1043, maxDocs=42306)
                0.017850671 = queryNorm
              0.5194881 = fieldWeight in 16, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7018695 = idf(docFreq=1043, maxDocs=42306)
                0.078125 = fieldNorm(doc=16)
          0.05097536 = weight(abstract_txt:science in 16) [ClassicSimilarity], result of:
            0.05097536 = score(doc=16,freq=1.0), product of:
              0.16619447 = queryWeight, product of:
                2.3714192 = boost
                3.9260306 = idf(docFreq=2267, maxDocs=42306)
                0.017850671 = queryNorm
              0.30672115 = fieldWeight in 16, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9260306 = idf(docFreq=2267, maxDocs=42306)
                0.078125 = fieldNorm(doc=16)
          0.16404316 = weight(abstract_txt:driven in 16) [ClassicSimilarity], result of:
            0.16404316 = score(doc=16,freq=1.0), product of:
              0.32913107 = queryWeight, product of:
                2.890115 = boost
                6.3796844 = idf(docFreq=194, maxDocs=42306)
                0.017850671 = queryNorm
              0.49841285 = fieldWeight in 16, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3796844 = idf(docFreq=194, maxDocs=42306)
                0.078125 = fieldNorm(doc=16)
          0.06520915 = weight(abstract_txt:data in 16) [ClassicSimilarity], result of:
            0.06520915 = score(doc=16,freq=1.0), product of:
              0.24675089 = queryWeight, product of:
                4.0864334 = boost
                3.382671 = idf(docFreq=3904, maxDocs=42306)
                0.017850671 = queryNorm
              0.26427117 = fieldWeight in 16, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.382671 = idf(docFreq=3904, maxDocs=42306)
                0.078125 = fieldNorm(doc=16)
        0.2 = coord(5/25)
    
  5. White, H.D.: Combining bibliometrics, information retrieval, and relevance theory : part 1: first examples of a synthesis (2007) 0.08
    0.07645551 = sum of:
      0.07645551 = product of:
        0.38227755 = sum of:
          0.056517698 = weight(abstract_txt:theory in 2437) [ClassicSimilarity], result of:
            0.056517698 = score(doc=2437,freq=3.0), product of:
              0.11369001 = queryWeight, product of:
                1.3869034 = boost
                4.592208 = idf(docFreq=1164, maxDocs=42306)
                0.017850671 = queryNorm
              0.49712107 = fieldWeight in 2437, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.592208 = idf(docFreq=1164, maxDocs=42306)
                0.0625 = fieldNorm(doc=2437)
          0.057605993 = weight(abstract_txt:statistical in 2437) [ClassicSimilarity], result of:
            0.057605993 = score(doc=2437,freq=1.0), product of:
              0.16606757 = queryWeight, product of:
                1.6762064 = boost
                5.5501256 = idf(docFreq=446, maxDocs=42306)
                0.017850671 = queryNorm
              0.34688285 = fieldWeight in 2437, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5501256 = idf(docFreq=446, maxDocs=42306)
                0.0625 = fieldNorm(doc=2437)
          0.17520626 = weight(abstract_txt:side in 2437) [ClassicSimilarity], result of:
            0.17520626 = score(doc=2437,freq=2.0), product of:
              0.2766917 = queryWeight, product of:
                2.1636307 = boost
                7.1640477 = idf(docFreq=88, maxDocs=42306)
                0.017850671 = queryNorm
              0.63321835 = fieldWeight in 2437, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1640477 = idf(docFreq=88, maxDocs=42306)
                0.0625 = fieldNorm(doc=2437)
          0.040780287 = weight(abstract_txt:science in 2437) [ClassicSimilarity], result of:
            0.040780287 = score(doc=2437,freq=1.0), product of:
              0.16619447 = queryWeight, product of:
                2.3714192 = boost
                3.9260306 = idf(docFreq=2267, maxDocs=42306)
                0.017850671 = queryNorm
              0.24537691 = fieldWeight in 2437, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9260306 = idf(docFreq=2267, maxDocs=42306)
                0.0625 = fieldNorm(doc=2437)
          0.05216732 = weight(abstract_txt:data in 2437) [ClassicSimilarity], result of:
            0.05216732 = score(doc=2437,freq=1.0), product of:
              0.24675089 = queryWeight, product of:
                4.0864334 = boost
                3.382671 = idf(docFreq=3904, maxDocs=42306)
                0.017850671 = queryNorm
              0.21141694 = fieldWeight in 2437, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.382671 = idf(docFreq=3904, maxDocs=42306)
                0.0625 = fieldNorm(doc=2437)
        0.2 = coord(5/25)
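
Note on the content scores: each content match sums one score per matching abstract term and then scales the sum by coord, the fraction of the 25 query terms found in the document. The sketch below recomputes the first result (doc 1930) from the boost, docFreq, freq, fieldNorm, and queryNorm values shown in its explanation. It assumes the ClassicSimilarity formulas queryWeight = boost * idf * queryNorm and fieldWeight = sqrt(freq) * idf * fieldNorm. The helper names are hypothetical.

```python
import math

QUERY_NORM = 0.017850671  # queryNorm as printed in the explanations
MAX_DOCS = 42306          # maxDocs as printed in the explanations

def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    # Assumed ClassicSimilarity inverse document frequency.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(boost: float, doc_freq: int, freq: float, field_norm: float) -> float:
    # Per-term score = queryWeight * fieldWeight.
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight

# (term, boost, docFreq, freq) copied from the explanation for doc 1930.
terms = [
    ("encourages", 1.2025836, 39, 1.0),
    ("scientific", 1.4200225, 1043, 3.0),
    ("science", 2.3714192, 2267, 2.0),
    ("driven", 2.890115, 194, 1.0),
    ("data", 4.0864334, 3904, 11.0),
]
FIELD_NORM = 0.0625        # fieldNorm(doc=1930)
coord = len(terms) / 25    # coord(5/25): 5 of 25 query terms matched

total = coord * sum(term_score(b, df, f, FIELD_NORM) for _, b, df, f in terms)
print(f"{total:.4f}")
```

The total agrees with the listed 0.101536505 up to single-precision rounding. The same helper applies to the other four results once their own term lists, field norms, and coord factors are substituted.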