Document (#29926)

Author
Daizadeh, I.
Title
¬An example of information management in biology : qualitative data economizing theory applied to the Human Genome Project databases
Source
Journal of the American Society for Information Science and Technology. 57(2006) no.2, S.244-250
Year
2006
Abstract
Ironically, although much work has been done an elucidating algorithms for enabling scientists to efficiently retrieve relevant information from the glut of data derived from the efforts of the Human Genome Project and other similar projects, little has been performed an optimizing the levels of data economy across databases. One technique to qualify the degree of data economization is that constructed by Boisot. Boisot's Information Space (I-Space) takes into account the degree to which data are written (codification), the degree to which the data can be understood (abstraction), and the degree to which the data are effectively communicated to an audience (diffusion). A data system is said to be more data economical if it is relatively high in these dimensions. Application of the approach to entries in two popular, publicly available biological data repositories, the Protein DataBank (PDB) and GenBank, leads to the recommendation that PDB increases its level of abstraction through establishing a larger set of detailed keywords, diffusion through constructing hyperlinks to other databases, and codification through constructing additional subsections. With these recommendations in place, PDB would achieve the greater data economies currently enjoyed by GenBank. A discussion of the limitations of the approach is presented.
Field
Molekularbiologie

Similar documents (content)

  1. Rapp, B.A.; Wheeler, D.L.: Bioinformatics resources from the National Center for Biotechnology Information : an integrated foundation for discovery (2005) 0.24
    0.24238136 = sum of:
      0.24238136 = product of:
        0.86564773 = sum of:
          0.010394922 = weight(abstract_txt:which in 266) [ClassicSimilarity], result of:
            0.010394922 = score(doc=266,freq=1.0), product of:
              0.056812696 = queryWeight, product of:
                1.1691724 = boost
                2.9274929 = idf(docFreq=6293, maxDocs=43254)
                0.016598582 = queryNorm
              0.1829683 = fieldWeight in 266, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9274929 = idf(docFreq=6293, maxDocs=43254)
                0.0625 = fieldNorm(doc=266)
          0.21437804 = weight(abstract_txt:protein in 266) [ClassicSimilarity], result of:
            0.21437804 = score(doc=266,freq=4.0), product of:
              0.18661979 = queryWeight, product of:
                1.2234159 = boost
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.016598582 = queryNorm
              1.1487423 = fieldWeight in 266, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.0625 = fieldNorm(doc=266)
          0.043359455 = weight(abstract_txt:space in 266) [ClassicSimilarity], result of:
            0.043359455 = score(doc=266,freq=1.0), product of:
              0.12860465 = queryWeight, product of:
                1.4362782 = boost
                5.394449 = idf(docFreq=533, maxDocs=43254)
                0.016598582 = queryNorm
              0.33715308 = fieldWeight in 266, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.394449 = idf(docFreq=533, maxDocs=43254)
                0.0625 = fieldNorm(doc=266)
          0.03849858 = weight(abstract_txt:through in 266) [ClassicSimilarity], result of:
            0.03849858 = score(doc=266,freq=2.0), product of:
              0.107940495 = queryWeight, product of:
                1.611566 = boost
                4.0352025 = idf(docFreq=2078, maxDocs=43254)
                0.016598582 = queryNorm
              0.35666487 = fieldWeight in 266, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0352025 = idf(docFreq=2078, maxDocs=43254)
                0.0625 = fieldNorm(doc=266)
          0.03536891 = weight(abstract_txt:databases in 266) [ClassicSimilarity], result of:
            0.03536891 = score(doc=266,freq=1.0), product of:
              0.12852246 = queryWeight, product of:
                1.7585123 = boost
                4.4031415 = idf(docFreq=1438, maxDocs=43254)
                0.016598582 = queryNorm
              0.27519634 = fieldWeight in 266, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4031415 = idf(docFreq=1438, maxDocs=43254)
                0.0625 = fieldNorm(doc=266)
          0.37131366 = weight(abstract_txt:genome in 266) [ClassicSimilarity], result of:
            0.37131366 = score(doc=266,freq=3.0), product of:
              0.37323958 = queryWeight, product of:
                2.4468317 = boost
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.016598582 = queryNorm
              0.99484 = fieldWeight in 266, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.0625 = fieldNorm(doc=266)
          0.15233417 = weight(abstract_txt:data in 266) [ClassicSimilarity], result of:
            0.15233417 = score(doc=266,freq=7.0), product of:
              0.2742546 = queryWeight, product of:
                4.918906 = boost
                3.3590338 = idf(docFreq=4087, maxDocs=43254)
                0.016598582 = queryNorm
              0.555448 = fieldWeight in 266, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.3590338 = idf(docFreq=4087, maxDocs=43254)
                0.0625 = fieldNorm(doc=266)
        0.28 = coord(7/25)
    
  2. Shachak, A.: Diffusion pattern of the use of genomic databases and analysis of biological sequences from 1970-2003 : bibliographic record analysis of 12 journals (2006) 0.10
    0.10295769 = sum of:
      0.10295769 = product of:
        0.51478845 = sum of:
          0.01463417 = weight(abstract_txt:approach in 907) [ClassicSimilarity], result of:
            0.01463417 = score(doc=907,freq=1.0), product of:
              0.062341828 = queryWeight, product of:
                3.7558525 = idf(docFreq=2748, maxDocs=43254)
                0.016598582 = queryNorm
              0.23474078 = fieldWeight in 907, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7558525 = idf(docFreq=2748, maxDocs=43254)
                0.0625 = fieldNorm(doc=907)
          0.10718902 = weight(abstract_txt:protein in 907) [ClassicSimilarity], result of:
            0.10718902 = score(doc=907,freq=1.0), product of:
              0.18661979 = queryWeight, product of:
                1.2234159 = boost
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.016598582 = queryNorm
              0.57437116 = fieldWeight in 907, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.0625 = fieldNorm(doc=907)
          0.07073782 = weight(abstract_txt:databases in 907) [ClassicSimilarity], result of:
            0.07073782 = score(doc=907,freq=4.0), product of:
              0.12852246 = queryWeight, product of:
                1.7585123 = boost
                4.4031415 = idf(docFreq=1438, maxDocs=43254)
                0.016598582 = queryNorm
              0.5503927 = fieldWeight in 907, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4031415 = idf(docFreq=1438, maxDocs=43254)
                0.0625 = fieldNorm(doc=907)
          0.2408014 = weight(abstract_txt:diffusion in 907) [ClassicSimilarity], result of:
            0.2408014 = score(doc=907,freq=5.0), product of:
              0.23585774 = queryWeight, product of:
                1.9450703 = boost
                7.305397 = idf(docFreq=78, maxDocs=43254)
                0.016598582 = queryNorm
              1.0209603 = fieldWeight in 907, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.305397 = idf(docFreq=78, maxDocs=43254)
                0.0625 = fieldNorm(doc=907)
          0.08142603 = weight(abstract_txt:data in 907) [ClassicSimilarity], result of:
            0.08142603 = score(doc=907,freq=2.0), product of:
              0.2742546 = queryWeight, product of:
                4.918906 = boost
                3.3590338 = idf(docFreq=4087, maxDocs=43254)
                0.016598582 = queryNorm
              0.29689944 = fieldWeight in 907, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3590338 = idf(docFreq=4087, maxDocs=43254)
                0.0625 = fieldNorm(doc=907)
        0.2 = coord(5/25)
    
  3. Ingwersen, P.: Cognitive perspectives of information retrieval interaction : elements of a cognitive IR theory (1996) 0.10
    0.10209279 = sum of:
      0.10209279 = product of:
        0.42538664 = sum of:
          0.01463417 = weight(abstract_txt:approach in 4685) [ClassicSimilarity], result of:
            0.01463417 = score(doc=4685,freq=1.0), product of:
              0.062341828 = queryWeight, product of:
                3.7558525 = idf(docFreq=2748, maxDocs=43254)
                0.016598582 = queryNorm
              0.23474078 = fieldWeight in 4685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7558525 = idf(docFreq=2748, maxDocs=43254)
                0.0625 = fieldNorm(doc=4685)
          0.07510078 = weight(abstract_txt:space in 4685) [ClassicSimilarity], result of:
            0.07510078 = score(doc=4685,freq=3.0), product of:
              0.12860465 = queryWeight, product of:
                1.4362782 = boost
                5.394449 = idf(docFreq=533, maxDocs=43254)
                0.016598582 = queryNorm
              0.58396626 = fieldWeight in 4685, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.394449 = idf(docFreq=533, maxDocs=43254)
                0.0625 = fieldNorm(doc=4685)
          0.02722261 = weight(abstract_txt:through in 4685) [ClassicSimilarity], result of:
            0.02722261 = score(doc=4685,freq=1.0), product of:
              0.107940495 = queryWeight, product of:
                1.611566 = boost
                4.0352025 = idf(docFreq=2078, maxDocs=43254)
                0.016598582 = queryNorm
              0.25220016 = fieldWeight in 4685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0352025 = idf(docFreq=2078, maxDocs=43254)
                0.0625 = fieldNorm(doc=4685)
          0.10768965 = weight(abstract_txt:diffusion in 4685) [ClassicSimilarity], result of:
            0.10768965 = score(doc=4685,freq=1.0), product of:
              0.23585774 = queryWeight, product of:
                1.9450703 = boost
                7.305397 = idf(docFreq=78, maxDocs=43254)
                0.016598582 = queryNorm
              0.4565873 = fieldWeight in 4685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.305397 = idf(docFreq=78, maxDocs=43254)
                0.0625 = fieldNorm(doc=4685)
          0.10101331 = weight(abstract_txt:degree in 4685) [ClassicSimilarity], result of:
            0.10101331 = score(doc=4685,freq=1.0), product of:
              0.28474966 = queryWeight, product of:
                3.0224342 = boost
                5.6759086 = idf(docFreq=402, maxDocs=43254)
                0.016598582 = queryNorm
              0.3547443 = fieldWeight in 4685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6759086 = idf(docFreq=402, maxDocs=43254)
                0.0625 = fieldNorm(doc=4685)
          0.09972612 = weight(abstract_txt:data in 4685) [ClassicSimilarity], result of:
            0.09972612 = score(doc=4685,freq=3.0), product of:
              0.2742546 = queryWeight, product of:
                4.918906 = boost
                3.3590338 = idf(docFreq=4087, maxDocs=43254)
                0.016598582 = queryNorm
              0.36362606 = fieldWeight in 4685, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3590338 = idf(docFreq=4087, maxDocs=43254)
                0.0625 = fieldNorm(doc=4685)
        0.24 = coord(6/25)
    
  4. Vries, A.P. de: Content independence in multimedia databases (2001) 0.10
    0.09566524 = sum of:
      0.09566524 = product of:
        0.4783262 = sum of:
          0.02205096 = weight(abstract_txt:which in 1535) [ClassicSimilarity], result of:
            0.02205096 = score(doc=1535,freq=2.0), product of:
              0.056812696 = queryWeight, product of:
                1.1691724 = boost
                2.9274929 = idf(docFreq=6293, maxDocs=43254)
                0.016598582 = queryNorm
              0.38813436 = fieldWeight in 1535, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.9274929 = idf(docFreq=6293, maxDocs=43254)
                0.09375 = fieldNorm(doc=1535)
          0.053053368 = weight(abstract_txt:databases in 1535) [ClassicSimilarity], result of:
            0.053053368 = score(doc=1535,freq=1.0), product of:
              0.12852246 = queryWeight, product of:
                1.7585123 = boost
                4.4031415 = idf(docFreq=1438, maxDocs=43254)
                0.016598582 = queryNorm
              0.41279453 = fieldWeight in 1535, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4031415 = idf(docFreq=1438, maxDocs=43254)
                0.09375 = fieldNorm(doc=1535)
          0.14517021 = weight(abstract_txt:constructing in 1535) [ClassicSimilarity], result of:
            0.14517021 = score(doc=1535,freq=1.0), product of:
              0.21964686 = queryWeight, product of:
                1.8770366 = boost
                7.0498724 = idf(docFreq=101, maxDocs=43254)
                0.016598582 = queryNorm
              0.6609255 = fieldWeight in 1535, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0498724 = idf(docFreq=101, maxDocs=43254)
                0.09375 = fieldNorm(doc=1535)
          0.1716863 = weight(abstract_txt:abstraction in 1535) [ClassicSimilarity], result of:
            0.1716863 = score(doc=1535,freq=1.0), product of:
              0.24563886 = queryWeight, product of:
                1.9849921 = boost
                7.4553375 = idf(docFreq=67, maxDocs=43254)
                0.016598582 = queryNorm
              0.6989379 = fieldWeight in 1535, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4553375 = idf(docFreq=67, maxDocs=43254)
                0.09375 = fieldNorm(doc=1535)
          0.08636536 = weight(abstract_txt:data in 1535) [ClassicSimilarity], result of:
            0.08636536 = score(doc=1535,freq=1.0), product of:
              0.2742546 = queryWeight, product of:
                4.918906 = boost
                3.3590338 = idf(docFreq=4087, maxDocs=43254)
                0.016598582 = queryNorm
              0.31490943 = fieldWeight in 1535, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3590338 = idf(docFreq=4087, maxDocs=43254)
                0.09375 = fieldNorm(doc=1535)
        0.2 = coord(5/25)
    
  5. Acker, W. van; Uyttenhove, P.: Analogous spaces : an introduction to spatial metaphors for the organization of knowledge (2012) 0.08
    0.0781058 = sum of:
      0.0781058 = product of:
        0.32544085 = sum of:
          0.010394922 = weight(abstract_txt:which in 564) [ClassicSimilarity], result of:
            0.010394922 = score(doc=564,freq=1.0), product of:
              0.056812696 = queryWeight, product of:
                1.1691724 = boost
                2.9274929 = idf(docFreq=6293, maxDocs=43254)
                0.016598582 = queryNorm
              0.1829683 = fieldWeight in 564, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9274929 = idf(docFreq=6293, maxDocs=43254)
                0.0625 = fieldNorm(doc=564)
          0.043359455 = weight(abstract_txt:space in 564) [ClassicSimilarity], result of:
            0.043359455 = score(doc=564,freq=1.0), product of:
              0.12860465 = queryWeight, product of:
                1.4362782 = boost
                5.394449 = idf(docFreq=533, maxDocs=43254)
                0.016598582 = queryNorm
              0.33715308 = fieldWeight in 564, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.394449 = idf(docFreq=533, maxDocs=43254)
                0.0625 = fieldNorm(doc=564)
          0.02722261 = weight(abstract_txt:through in 564) [ClassicSimilarity], result of:
            0.02722261 = score(doc=564,freq=1.0), product of:
              0.107940495 = queryWeight, product of:
                1.611566 = boost
                4.0352025 = idf(docFreq=2078, maxDocs=43254)
                0.016598582 = queryNorm
              0.25220016 = fieldWeight in 564, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0352025 = idf(docFreq=2078, maxDocs=43254)
                0.0625 = fieldNorm(doc=564)
          0.05001919 = weight(abstract_txt:databases in 564) [ClassicSimilarity], result of:
            0.05001919 = score(doc=564,freq=2.0), product of:
              0.12852246 = queryWeight, product of:
                1.7585123 = boost
                4.4031415 = idf(docFreq=1438, maxDocs=43254)
                0.016598582 = queryNorm
              0.38918638 = fieldWeight in 564, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4031415 = idf(docFreq=1438, maxDocs=43254)
                0.0625 = fieldNorm(doc=564)
          0.13686779 = weight(abstract_txt:constructing in 564) [ClassicSimilarity], result of:
            0.13686779 = score(doc=564,freq=2.0), product of:
              0.21964686 = queryWeight, product of:
                1.8770366 = boost
                7.0498724 = idf(docFreq=101, maxDocs=43254)
                0.016598582 = queryNorm
              0.62312657 = fieldWeight in 564, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.0498724 = idf(docFreq=101, maxDocs=43254)
                0.0625 = fieldNorm(doc=564)
          0.057576902 = weight(abstract_txt:data in 564) [ClassicSimilarity], result of:
            0.057576902 = score(doc=564,freq=1.0), product of:
              0.2742546 = queryWeight, product of:
                4.918906 = boost
                3.3590338 = idf(docFreq=4087, maxDocs=43254)
                0.016598582 = queryNorm
              0.20993961 = fieldWeight in 564, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3590338 = idf(docFreq=4087, maxDocs=43254)
                0.0625 = fieldNorm(doc=564)
        0.24 = coord(6/25)