Document (#38297)

Author
Badia, A.
Title
Data, information, knowledge : an information science analysis
Source
Journal of the Association for Information Science and Technology. 65(2014) no.6, pp. 1279-1287
Year
2014
Abstract
I analyze the text of an article that appeared in this journal in 2007 that published the results of a questionnaire in which a number of experts were asked to define the concepts of data, information, and knowledge. I apply standard information retrieval techniques to build a list of the most frequent terms in each set of definitions. I then apply information extraction techniques to analyze how the top terms are used in the definitions. As a result, I draw data-driven conclusions about the aggregate opinion of the experts. I contrast this with the original analysis of the data to provide readers with an alternative viewpoint on what the data tell us.
Theme
Information

Similar documents (content)

  1. Kashyap, M.M.: Likeness between Ranganathan's postulations based approach to knowledge classification and entity relationship data modelling approach (2003) 0.16
    0.16324155 = sum of:
      0.16324155 = product of:
        0.58300555 = sum of:
          0.0577403 = weight(abstract_txt:knowledge in 2045) [ClassicSimilarity], result of:
            0.0577403 = score(doc=2045,freq=4.0), product of:
              0.10401349 = queryWeight, product of:
                1.1651418 = boost
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.02512705 = queryNorm
              0.5551232 = fieldWeight in 2045, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.078125 = fieldNorm(doc=2045)
          0.044401675 = weight(abstract_txt:analysis in 2045) [ClassicSimilarity], result of:
            0.044401675 = score(doc=2045,freq=2.0), product of:
              0.10999675 = queryWeight, product of:
                1.1981851 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.02512705 = queryNorm
              0.40366352 = fieldWeight in 2045, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.078125 = fieldNorm(doc=2045)
          0.0598401 = weight(abstract_txt:techniques in 2045) [ClassicSimilarity], result of:
            0.0598401 = score(doc=2045,freq=1.0), product of:
              0.16909023 = queryWeight, product of:
                1.4855703 = boost
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.02512705 = queryNorm
              0.3538945 = fieldWeight in 2045, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.078125 = fieldNorm(doc=2045)
          0.13330828 = weight(abstract_txt:apply in 2045) [ClassicSimilarity], result of:
            0.13330828 = score(doc=2045,freq=1.0), product of:
              0.288422 = queryWeight, product of:
                1.9402074 = boost
                5.916144 = idf(docFreq=323, maxDocs=44218)
                0.02512705 = queryNorm
              0.46219873 = fieldWeight in 2045, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.916144 = idf(docFreq=323, maxDocs=44218)
                0.078125 = fieldNorm(doc=2045)
          0.13587579 = weight(abstract_txt:experts in 2045) [ClassicSimilarity], result of:
            0.13587579 = score(doc=2045,freq=1.0), product of:
              0.29211354 = queryWeight, product of:
                1.9525844 = boost
                5.953884 = idf(docFreq=311, maxDocs=44218)
                0.02512705 = queryNorm
              0.4651472 = fieldWeight in 2045, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.953884 = idf(docFreq=311, maxDocs=44218)
                0.078125 = fieldNorm(doc=2045)
          0.0322962 = weight(abstract_txt:information in 2045) [ClassicSimilarity], result of:
            0.0322962 = score(doc=2045,freq=2.0), product of:
              0.1207428 = queryWeight, product of:
                1.9848815 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.02512705 = queryNorm
              0.2674793 = fieldWeight in 2045, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.078125 = fieldNorm(doc=2045)
          0.119543225 = weight(abstract_txt:data in 2045) [ClassicSimilarity], result of:
            0.119543225 = score(doc=2045,freq=4.0), product of:
              0.2293156 = queryWeight, product of:
                2.7353995 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.02512705 = queryNorm
              0.52130437 = fieldWeight in 2045, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.078125 = fieldNorm(doc=2045)
        0.28 = coord(7/25)
    
  2. Börner, K.; Chen, C.; Boyack, K.W.: Visualizing knowledge domains (2002) 0.15
    0.15238476 = sum of:
      0.15238476 = product of:
        0.3809619 = sum of:
          0.039961956 = weight(abstract_txt:driven in 4286) [ClassicSimilarity], result of:
            0.039961956 = score(doc=4286,freq=1.0), product of:
              0.16276641 = queryWeight, product of:
                1.0306267 = boost
                6.285241 = idf(docFreq=223, maxDocs=44218)
                0.02512705 = queryNorm
              0.24551722 = fieldWeight in 4286, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.285241 = idf(docFreq=223, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4286)
          0.04177257 = weight(abstract_txt:conclusions in 4286) [ClassicSimilarity], result of:
            0.04177257 = score(doc=4286,freq=1.0), product of:
              0.16764647 = queryWeight, product of:
                1.0459627 = boost
                6.378767 = idf(docFreq=203, maxDocs=44218)
                0.02512705 = queryNorm
              0.24917059 = fieldWeight in 4286, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.378767 = idf(docFreq=203, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4286)
          0.04329104 = weight(abstract_txt:contrast in 4286) [ClassicSimilarity], result of:
            0.04329104 = score(doc=4286,freq=1.0), product of:
              0.17168498 = queryWeight, product of:
                1.058486 = boost
                6.45514 = idf(docFreq=188, maxDocs=44218)
                0.02512705 = queryNorm
              0.2521539 = fieldWeight in 4286, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.45514 = idf(docFreq=188, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4286)
          0.02887015 = weight(abstract_txt:knowledge in 4286) [ClassicSimilarity], result of:
            0.02887015 = score(doc=4286,freq=4.0), product of:
              0.10401349 = queryWeight, product of:
                1.1651418 = boost
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.02512705 = queryNorm
              0.2775616 = fieldWeight in 4286, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4286)
          0.022200838 = weight(abstract_txt:analysis in 4286) [ClassicSimilarity], result of:
            0.022200838 = score(doc=4286,freq=2.0), product of:
              0.10999675 = queryWeight, product of:
                1.1981851 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.02512705 = queryNorm
              0.20183176 = fieldWeight in 4286, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4286)
          0.02128636 = weight(abstract_txt:terms in 4286) [ClassicSimilarity], result of:
            0.02128636 = score(doc=4286,freq=1.0), product of:
              0.1347549 = queryWeight, product of:
                1.3261915 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.02512705 = queryNorm
              0.15796354 = fieldWeight in 4286, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4286)
          0.05182305 = weight(abstract_txt:techniques in 4286) [ClassicSimilarity], result of:
            0.05182305 = score(doc=4286,freq=3.0), product of:
              0.16909023 = queryWeight, product of:
                1.4855703 = boost
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.02512705 = queryNorm
              0.30648163 = fieldWeight in 4286, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4286)
          0.06665414 = weight(abstract_txt:apply in 4286) [ClassicSimilarity], result of:
            0.06665414 = score(doc=4286,freq=1.0), product of:
              0.288422 = queryWeight, product of:
                1.9402074 = boost
                5.916144 = idf(docFreq=323, maxDocs=44218)
                0.02512705 = queryNorm
              0.23109937 = fieldWeight in 4286, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.916144 = idf(docFreq=323, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4286)
          0.022836862 = weight(abstract_txt:information in 4286) [ClassicSimilarity], result of:
            0.022836862 = score(doc=4286,freq=4.0), product of:
              0.1207428 = queryWeight, product of:
                1.9848815 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.02512705 = queryNorm
              0.18913643 = fieldWeight in 4286, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4286)
          0.04226491 = weight(abstract_txt:data in 4286) [ClassicSimilarity], result of:
            0.04226491 = score(doc=4286,freq=2.0), product of:
              0.2293156 = queryWeight, product of:
                2.7353995 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.02512705 = queryNorm
              0.18430892 = fieldWeight in 4286, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4286)
        0.4 = coord(10/25)
    
  3. Chen, Y.-N.; Ke, H.-R.: A study on mental models of taggers and experts for article indexing based on analysis of keyword usage (2014) 0.14
    0.14062822 = sum of:
      0.14062822 = product of:
        0.5859509 = sum of:
          0.1717578 = weight(abstract_txt:frequent in 1334) [ClassicSimilarity], result of:
            0.1717578 = score(doc=1334,freq=4.0), product of:
              0.19814265 = queryWeight, product of:
                1.1371243 = boost
                6.9347134 = idf(docFreq=116, maxDocs=44218)
                0.02512705 = queryNorm
              0.8668392 = fieldWeight in 1334, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.9347134 = idf(docFreq=116, maxDocs=44218)
                0.0625 = fieldNorm(doc=1334)
          0.043504577 = weight(abstract_txt:analysis in 1334) [ClassicSimilarity], result of:
            0.043504577 = score(doc=1334,freq=3.0), product of:
              0.10999675 = queryWeight, product of:
                1.1981851 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.02512705 = queryNorm
              0.39550784 = fieldWeight in 1334, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0625 = fieldNorm(doc=1334)
          0.048165537 = weight(abstract_txt:terms in 1334) [ClassicSimilarity], result of:
            0.048165537 = score(doc=1334,freq=2.0), product of:
              0.1347549 = queryWeight, product of:
                1.3261915 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.02512705 = queryNorm
              0.3574307 = fieldWeight in 1334, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=1334)
          0.24306202 = weight(abstract_txt:experts in 1334) [ClassicSimilarity], result of:
            0.24306202 = score(doc=1334,freq=5.0), product of:
              0.29211354 = queryWeight, product of:
                1.9525844 = boost
                5.953884 = idf(docFreq=311, maxDocs=44218)
                0.02512705 = queryNorm
              0.8320806 = fieldWeight in 1334, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.953884 = idf(docFreq=311, maxDocs=44218)
                0.0625 = fieldNorm(doc=1334)
          0.031643685 = weight(abstract_txt:information in 1334) [ClassicSimilarity], result of:
            0.031643685 = score(doc=1334,freq=3.0), product of:
              0.1207428 = queryWeight, product of:
                1.9848815 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.02512705 = queryNorm
              0.26207513 = fieldWeight in 1334, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=1334)
          0.047817286 = weight(abstract_txt:data in 1334) [ClassicSimilarity], result of:
            0.047817286 = score(doc=1334,freq=1.0), product of:
              0.2293156 = queryWeight, product of:
                2.7353995 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.02512705 = queryNorm
              0.20852174 = fieldWeight in 1334, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=1334)
        0.24 = coord(6/25)
    
  4. Varathan, K.D.; Giachanou, A.; Crestani, F.: Comparative opinion mining : a review (2017) 0.13
    0.12686114 = sum of:
      0.12686114 = product of:
        0.5285881 = sum of:
          0.277728 = weight(abstract_txt:opinion in 3540) [ClassicSimilarity], result of:
            0.277728 = score(doc=3540,freq=11.0), product of:
              0.19483598 = queryWeight, product of:
                1.127596 = boost
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.02512705 = queryNorm
              1.4254451 = fieldWeight in 3540, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.0625 = fieldNorm(doc=3540)
          0.03552134 = weight(abstract_txt:analysis in 3540) [ClassicSimilarity], result of:
            0.03552134 = score(doc=3540,freq=2.0), product of:
              0.10999675 = queryWeight, product of:
                1.1981851 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.02512705 = queryNorm
              0.3229308 = fieldWeight in 3540, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0625 = fieldNorm(doc=3540)
          0.04787208 = weight(abstract_txt:techniques in 3540) [ClassicSimilarity], result of:
            0.04787208 = score(doc=3540,freq=1.0), product of:
              0.16909023 = queryWeight, product of:
                1.4855703 = boost
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.02512705 = queryNorm
              0.2831156 = fieldWeight in 3540, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.0625 = fieldNorm(doc=3540)
          0.09381243 = weight(abstract_txt:analyze in 3540) [ClassicSimilarity], result of:
            0.09381243 = score(doc=3540,freq=1.0), product of:
              0.2647914 = queryWeight, product of:
                1.8590279 = boost
                5.6686087 = idf(docFreq=414, maxDocs=44218)
                0.02512705 = queryNorm
              0.35428804 = fieldWeight in 3540, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6686087 = idf(docFreq=414, maxDocs=44218)
                0.0625 = fieldNorm(doc=3540)
          0.02583696 = weight(abstract_txt:information in 3540) [ClassicSimilarity], result of:
            0.02583696 = score(doc=3540,freq=2.0), product of:
              0.1207428 = queryWeight, product of:
                1.9848815 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.02512705 = queryNorm
              0.21398345 = fieldWeight in 3540, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=3540)
          0.047817286 = weight(abstract_txt:data in 3540) [ClassicSimilarity], result of:
            0.047817286 = score(doc=3540,freq=1.0), product of:
              0.2293156 = queryWeight, product of:
                2.7353995 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.02512705 = queryNorm
              0.20852174 = fieldWeight in 3540, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=3540)
        0.24 = coord(6/25)
    
  5. Hjoerland, B.: Information (2023) 0.12
    0.11531687 = sum of:
      0.11531687 = product of:
        0.48048696 = sum of:
          0.060005486 = weight(abstract_txt:knowledge in 1118) [ClassicSimilarity], result of:
            0.060005486 = score(doc=1118,freq=3.0), product of:
              0.10401349 = queryWeight, product of:
                1.1651418 = boost
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.02512705 = queryNorm
              0.576901 = fieldWeight in 1118, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.09375 = fieldNorm(doc=1118)
          0.03767607 = weight(abstract_txt:analysis in 1118) [ClassicSimilarity], result of:
            0.03767607 = score(doc=1118,freq=1.0), product of:
              0.10999675 = queryWeight, product of:
                1.1981851 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.02512705 = queryNorm
              0.34251985 = fieldWeight in 1118, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.09375 = fieldNorm(doc=1118)
          0.051087264 = weight(abstract_txt:terms in 1118) [ClassicSimilarity], result of:
            0.051087264 = score(doc=1118,freq=1.0), product of:
              0.1347549 = queryWeight, product of:
                1.3261915 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.02512705 = queryNorm
              0.37911248 = fieldWeight in 1118, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.09375 = fieldNorm(doc=1118)
          0.0866598 = weight(abstract_txt:information in 1118) [ClassicSimilarity], result of:
            0.0866598 = score(doc=1118,freq=10.0), product of:
              0.1207428 = queryWeight, product of:
                1.9848815 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.02512705 = queryNorm
              0.7177223 = fieldWeight in 1118, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.09375 = fieldNorm(doc=1118)
          0.17333244 = weight(abstract_txt:definitions in 1118) [ClassicSimilarity], result of:
            0.17333244 = score(doc=1118,freq=1.0), product of:
              0.30426782 = queryWeight, product of:
                1.992792 = boost
                6.0764866 = idf(docFreq=275, maxDocs=44218)
                0.02512705 = queryNorm
              0.5696706 = fieldWeight in 1118, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0764866 = idf(docFreq=275, maxDocs=44218)
                0.09375 = fieldNorm(doc=1118)
          0.071725935 = weight(abstract_txt:data in 1118) [ClassicSimilarity], result of:
            0.071725935 = score(doc=1118,freq=1.0), product of:
              0.2293156 = queryWeight, product of:
                2.7353995 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.02512705 = queryNorm
              0.31278262 = fieldWeight in 1118, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.09375 = fieldNorm(doc=1118)
        0.24 = coord(6/25)
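
The score trees above can be read as plain arithmetic. The following is a minimal Python sketch that recomputes the first result (doc 2045) from the values printed in its tree. It assumes the classic Lucene TF-IDF formulas, tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which agree with the listed numbers. The function and variable names are illustrative only and are not part of any Lucene API. The search engine works in 32-bit floats, so the last digits may differ slightly.

```python
import math

# Constants copied from the explain output above.
MAX_DOCS = 44218
QUERY_NORM = 0.02512705
QUERY_TERMS = 25          # denominator of coord(7/25)
FIELD_NORM_2045 = 0.078125


def classic_idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency, assuming the classic TF-IDF form."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, boost, field_norm, query_norm=QUERY_NORM):
    """score = queryWeight * fieldWeight, as in each 'product of' node."""
    idf = classic_idf(doc_freq)
    query_weight = boost * idf * query_norm          # boost * idf * queryNorm
    field_weight = math.sqrt(freq) * idf * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight


# (term, freq, docFreq, boost) as listed for doc 2045.
TERMS_2045 = [
    ("knowledge", 4.0, 3442, 1.1651418),
    ("analysis", 2.0, 3112, 1.1981851),
    ("techniques", 1.0, 1295, 1.4855703),
    ("apply", 1.0, 323, 1.9402074),
    ("experts", 1.0, 311, 1.9525844),
    ("information", 2.0, 10677, 1.9848815),
    ("data", 4.0, 4274, 2.7353995),
]

per_term = {
    term: term_score(freq, doc_freq, boost, FIELD_NORM_2045)
    for term, freq, doc_freq, boost in TERMS_2045
}

# coord rewards documents that match more of the 25 query terms.
coord = len(TERMS_2045) / QUERY_TERMS
total = sum(per_term.values()) * coord

for term, value in per_term.items():
    print(f"{term:12s} {value:.7f}")
print(f"coord = {coord:.2f}, total = {total:.8f}")
```

The total should land close to the 0.16324155 shown for the first result. The same steps apply to the other four results once their own term lists, fieldNorm values, and coord fractions are substituted.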