Document (#29926)

Author
Daizadeh, I.
Title
¬An example of information management in biology : qualitative data economizing theory applied to the Human Genome Project databases
Source
Journal of the American Society for Information Science and Technology. 57(2006) no.2, S.244-250
Year
2006
Abstract
Ironically, although much work has been done an elucidating algorithms for enabling scientists to efficiently retrieve relevant information from the glut of data derived from the efforts of the Human Genome Project and other similar projects, little has been performed an optimizing the levels of data economy across databases. One technique to qualify the degree of data economization is that constructed by Boisot. Boisot's Information Space (I-Space) takes into account the degree to which data are written (codification), the degree to which the data can be understood (abstraction), and the degree to which the data are effectively communicated to an audience (diffusion). A data system is said to be more data economical if it is relatively high in these dimensions. Application of the approach to entries in two popular, publicly available biological data repositories, the Protein DataBank (PDB) and GenBank, leads to the recommendation that PDB increases its level of abstraction through establishing a larger set of detailed keywords, diffusion through constructing hyperlinks to other databases, and codification through constructing additional subsections. With these recommendations in place, PDB would achieve the greater data economies currently enjoyed by GenBank. A discussion of the limitations of the approach is presented.
Field
Molekularbiologie

Similar documents (content)

  1. Rapp, B.A.; Wheeler, D.L.: Bioinformatics resources from the National Center for Biotechnology Information : an integrated foundation for discovery (2005) 0.24
    0.24322292 = sum of:
      0.24322292 = product of:
        0.8686533 = sum of:
          0.010434915 = weight(abstract_txt:which in 266) [ClassicSimilarity], result of:
            0.010434915 = score(doc=266,freq=1.0), product of:
              0.056901474 = queryWeight, product of:
                1.1711105 = boost
                2.9341707 = idf(docFreq=6156, maxDocs=42596)
                0.016559234 = queryNorm
              0.18338567 = fieldWeight in 266, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9341707 = idf(docFreq=6156, maxDocs=42596)
                0.0625 = fieldNorm(doc=266)
          0.21877737 = weight(abstract_txt:protein in 266) [ClassicSimilarity], result of:
            0.21877737 = score(doc=266,freq=4.0), product of:
              0.18897545 = queryWeight, product of:
                1.2321914 = boost
                9.2616205 = idf(docFreq=10, maxDocs=42596)
                0.016559234 = queryNorm
              1.1577026 = fieldWeight in 266, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.2616205 = idf(docFreq=10, maxDocs=42596)
                0.0625 = fieldNorm(doc=266)
          0.04331576 = weight(abstract_txt:space in 266) [ClassicSimilarity], result of:
            0.04331576 = score(doc=266,freq=1.0), product of:
              0.12838997 = queryWeight, product of:
                1.4363359 = boost
                5.398024 = idf(docFreq=523, maxDocs=42596)
                0.016559234 = queryNorm
              0.3373765 = fieldWeight in 266, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.398024 = idf(docFreq=523, maxDocs=42596)
                0.0625 = fieldNorm(doc=266)
          0.03884006 = weight(abstract_txt:through in 266) [ClassicSimilarity], result of:
            0.03884006 = score(doc=266,freq=2.0), product of:
              0.10846946 = queryWeight, product of:
                1.6169249 = boost
                4.0511413 = idf(docFreq=2014, maxDocs=42596)
                0.016559234 = queryNorm
              0.35807368 = fieldWeight in 266, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0511413 = idf(docFreq=2014, maxDocs=42596)
                0.0625 = fieldNorm(doc=266)
          0.034979198 = weight(abstract_txt:databases in 266) [ClassicSimilarity], result of:
            0.034979198 = score(doc=266,freq=1.0), product of:
              0.1274493 = queryWeight, product of:
                1.752689 = boost
                4.3912926 = idf(docFreq=1433, maxDocs=42596)
                0.016559234 = queryNorm
              0.2744558 = fieldWeight in 266, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3912926 = idf(docFreq=1433, maxDocs=42596)
                0.0625 = fieldNorm(doc=266)
          0.36835355 = weight(abstract_txt:genome in 266) [ClassicSimilarity], result of:
            0.36835355 = score(doc=266,freq=3.0), product of:
              0.37088275 = queryWeight, product of:
                2.4412305 = boost
                9.174609 = idf(docFreq=11, maxDocs=42596)
                0.016559234 = queryNorm
              0.9931806 = fieldWeight in 266, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.174609 = idf(docFreq=11, maxDocs=42596)
                0.0625 = fieldNorm(doc=266)
          0.15395245 = weight(abstract_txt:data in 266) [ClassicSimilarity], result of:
            0.15395245 = score(doc=266,freq=7.0), product of:
              0.27591783 = queryWeight, product of:
                4.938121 = boost
                3.3742545 = idf(docFreq=3964, maxDocs=42596)
                0.016559234 = queryNorm
              0.55796486 = fieldWeight in 266, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.3742545 = idf(docFreq=3964, maxDocs=42596)
                0.0625 = fieldNorm(doc=266)
        0.28 = coord(7/25)
    
  2. Shachak, A.: Diffusion pattern of the use of genomic databases and analysis of biological sequences from 1970-2003 : bibliographic record analysis of 12 journals (2006) 0.10
    0.103770114 = sum of:
      0.103770114 = product of:
        0.51885056 = sum of:
          0.0148184 = weight(abstract_txt:approach in 5907) [ClassicSimilarity], result of:
            0.0148184 = score(doc=5907,freq=1.0), product of:
              0.06280121 = queryWeight, product of:
                1.0045568 = boost
                3.7753158 = idf(docFreq=2654, maxDocs=42596)
                0.016559234 = queryNorm
              0.23595724 = fieldWeight in 5907, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7753158 = idf(docFreq=2654, maxDocs=42596)
                0.0625 = fieldNorm(doc=5907)
          0.10938869 = weight(abstract_txt:protein in 5907) [ClassicSimilarity], result of:
            0.10938869 = score(doc=5907,freq=1.0), product of:
              0.18897545 = queryWeight, product of:
                1.2321914 = boost
                9.2616205 = idf(docFreq=10, maxDocs=42596)
                0.016559234 = queryNorm
              0.5788513 = fieldWeight in 5907, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.2616205 = idf(docFreq=10, maxDocs=42596)
                0.0625 = fieldNorm(doc=5907)
          0.069958396 = weight(abstract_txt:databases in 5907) [ClassicSimilarity], result of:
            0.069958396 = score(doc=5907,freq=4.0), product of:
              0.1274493 = queryWeight, product of:
                1.752689 = boost
                4.3912926 = idf(docFreq=1433, maxDocs=42596)
                0.016559234 = queryNorm
              0.5489116 = fieldWeight in 5907, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.3912926 = idf(docFreq=1433, maxDocs=42596)
                0.0625 = fieldNorm(doc=5907)
          0.24239402 = weight(abstract_txt:diffusion in 5907) [ClassicSimilarity], result of:
            0.24239402 = score(doc=5907,freq=5.0), product of:
              0.23666011 = queryWeight, product of:
                1.9500827 = boost
                7.328782 = idf(docFreq=75, maxDocs=42596)
                0.016559234 = queryNorm
              1.0242285 = fieldWeight in 5907, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.328782 = idf(docFreq=75, maxDocs=42596)
                0.0625 = fieldNorm(doc=5907)
          0.08229105 = weight(abstract_txt:data in 5907) [ClassicSimilarity], result of:
            0.08229105 = score(doc=5907,freq=2.0), product of:
              0.27591783 = queryWeight, product of:
                4.938121 = boost
                3.3742545 = idf(docFreq=3964, maxDocs=42596)
                0.016559234 = queryNorm
              0.29824477 = fieldWeight in 5907, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3742545 = idf(docFreq=3964, maxDocs=42596)
                0.0625 = fieldNorm(doc=5907)
        0.2 = coord(5/25)
    
  3. Ingwersen, P.: Cognitive perspectives of information retrieval interaction : elements of a cognitive IR theory (1996) 0.10
    0.10272032 = sum of:
      0.10272032 = product of:
        0.42800134 = sum of:
          0.0148184 = weight(abstract_txt:approach in 3685) [ClassicSimilarity], result of:
            0.0148184 = score(doc=3685,freq=1.0), product of:
              0.06280121 = queryWeight, product of:
                1.0045568 = boost
                3.7753158 = idf(docFreq=2654, maxDocs=42596)
                0.016559234 = queryNorm
              0.23595724 = fieldWeight in 3685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7753158 = idf(docFreq=2654, maxDocs=42596)
                0.0625 = fieldNorm(doc=3685)
          0.07502509 = weight(abstract_txt:space in 3685) [ClassicSimilarity], result of:
            0.07502509 = score(doc=3685,freq=3.0), product of:
              0.12838997 = queryWeight, product of:
                1.4363359 = boost
                5.398024 = idf(docFreq=523, maxDocs=42596)
                0.016559234 = queryNorm
              0.5843532 = fieldWeight in 3685, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.398024 = idf(docFreq=523, maxDocs=42596)
                0.0625 = fieldNorm(doc=3685)
          0.027464068 = weight(abstract_txt:through in 3685) [ClassicSimilarity], result of:
            0.027464068 = score(doc=3685,freq=1.0), product of:
              0.10846946 = queryWeight, product of:
                1.6169249 = boost
                4.0511413 = idf(docFreq=2014, maxDocs=42596)
                0.016559234 = queryNorm
              0.25319633 = fieldWeight in 3685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0511413 = idf(docFreq=2014, maxDocs=42596)
                0.0625 = fieldNorm(doc=3685)
          0.108401895 = weight(abstract_txt:diffusion in 3685) [ClassicSimilarity], result of:
            0.108401895 = score(doc=3685,freq=1.0), product of:
              0.23666011 = queryWeight, product of:
                1.9500827 = boost
                7.328782 = idf(docFreq=75, maxDocs=42596)
                0.016559234 = queryNorm
              0.45804888 = fieldWeight in 3685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.328782 = idf(docFreq=75, maxDocs=42596)
                0.0625 = fieldNorm(doc=3685)
          0.10150637 = weight(abstract_txt:degree in 3685) [ClassicSimilarity], result of:
            0.10150637 = score(doc=3685,freq=1.0), product of:
              0.28539038 = queryWeight, product of:
                3.0284832 = boost
                5.6908083 = idf(docFreq=390, maxDocs=42596)
                0.016559234 = queryNorm
              0.35567552 = fieldWeight in 3685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6908083 = idf(docFreq=390, maxDocs=42596)
                0.0625 = fieldNorm(doc=3685)
          0.10078554 = weight(abstract_txt:data in 3685) [ClassicSimilarity], result of:
            0.10078554 = score(doc=3685,freq=3.0), product of:
              0.27591783 = queryWeight, product of:
                4.938121 = boost
                3.3742545 = idf(docFreq=3964, maxDocs=42596)
                0.016559234 = queryNorm
              0.36527374 = fieldWeight in 3685, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3742545 = idf(docFreq=3964, maxDocs=42596)
                0.0625 = fieldNorm(doc=3685)
        0.24 = coord(6/25)
    
  4. Vries, A.P. de: Content independence in multimedia databases (2001) 0.10
    0.09536373 = sum of:
      0.09536373 = product of:
        0.47681865 = sum of:
          0.022135796 = weight(abstract_txt:which in 535) [ClassicSimilarity], result of:
            0.022135796 = score(doc=535,freq=2.0), product of:
              0.056901474 = queryWeight, product of:
                1.1711105 = boost
                2.9341707 = idf(docFreq=6156, maxDocs=42596)
                0.016559234 = queryNorm
              0.38901973 = fieldWeight in 535, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.9341707 = idf(docFreq=6156, maxDocs=42596)
                0.09375 = fieldNorm(doc=535)
          0.0524688 = weight(abstract_txt:databases in 535) [ClassicSimilarity], result of:
            0.0524688 = score(doc=535,freq=1.0), product of:
              0.1274493 = queryWeight, product of:
                1.752689 = boost
                4.3912926 = idf(docFreq=1433, maxDocs=42596)
                0.016559234 = queryNorm
              0.41168368 = fieldWeight in 535, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3912926 = idf(docFreq=1433, maxDocs=42596)
                0.09375 = fieldNorm(doc=535)
          0.1437939 = weight(abstract_txt:constructing in 535) [ClassicSimilarity], result of:
            0.1437939 = score(doc=535,freq=1.0), product of:
              0.21803853 = queryWeight, product of:
                1.8717899 = boost
                7.034543 = idf(docFreq=101, maxDocs=42596)
                0.016559234 = queryNorm
              0.65948844 = fieldWeight in 535, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.034543 = idf(docFreq=101, maxDocs=42596)
                0.09375 = fieldNorm(doc=535)
          0.17113733 = weight(abstract_txt:abstraction in 535) [ClassicSimilarity], result of:
            0.17113733 = score(doc=535,freq=1.0), product of:
              0.24487032 = queryWeight, product of:
                1.9836204 = boost
                7.454823 = idf(docFreq=66, maxDocs=42596)
                0.016559234 = queryNorm
              0.6988897 = fieldWeight in 535, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.454823 = idf(docFreq=66, maxDocs=42596)
                0.09375 = fieldNorm(doc=535)
          0.087282844 = weight(abstract_txt:data in 535) [ClassicSimilarity], result of:
            0.087282844 = score(doc=535,freq=1.0), product of:
              0.27591783 = queryWeight, product of:
                4.938121 = boost
                3.3742545 = idf(docFreq=3964, maxDocs=42596)
                0.016559234 = queryNorm
              0.31633636 = fieldWeight in 535, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3742545 = idf(docFreq=3964, maxDocs=42596)
                0.09375 = fieldNorm(doc=535)
        0.2 = coord(5/25)
    
  5. Acker, W. van; Uyttenhove, P.: Analogous spaces : an introduction to spatial metaphors for the organization of knowledge (2012) 0.08
    0.077865966 = sum of:
      0.077865966 = product of:
        0.32444152 = sum of:
          0.010434915 = weight(abstract_txt:which in 1161) [ClassicSimilarity], result of:
            0.010434915 = score(doc=1161,freq=1.0), product of:
              0.056901474 = queryWeight, product of:
                1.1711105 = boost
                2.9341707 = idf(docFreq=6156, maxDocs=42596)
                0.016559234 = queryNorm
              0.18338567 = fieldWeight in 1161, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9341707 = idf(docFreq=6156, maxDocs=42596)
                0.0625 = fieldNorm(doc=1161)
          0.04331576 = weight(abstract_txt:space in 1161) [ClassicSimilarity], result of:
            0.04331576 = score(doc=1161,freq=1.0), product of:
              0.12838997 = queryWeight, product of:
                1.4363359 = boost
                5.398024 = idf(docFreq=523, maxDocs=42596)
                0.016559234 = queryNorm
              0.3373765 = fieldWeight in 1161, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.398024 = idf(docFreq=523, maxDocs=42596)
                0.0625 = fieldNorm(doc=1161)
          0.027464068 = weight(abstract_txt:through in 1161) [ClassicSimilarity], result of:
            0.027464068 = score(doc=1161,freq=1.0), product of:
              0.10846946 = queryWeight, product of:
                1.6169249 = boost
                4.0511413 = idf(docFreq=2014, maxDocs=42596)
                0.016559234 = queryNorm
              0.25319633 = fieldWeight in 1161, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0511413 = idf(docFreq=2014, maxDocs=42596)
                0.0625 = fieldNorm(doc=1161)
          0.04946806 = weight(abstract_txt:databases in 1161) [ClassicSimilarity], result of:
            0.04946806 = score(doc=1161,freq=2.0), product of:
              0.1274493 = queryWeight, product of:
                1.752689 = boost
                4.3912926 = idf(docFreq=1433, maxDocs=42596)
                0.016559234 = queryNorm
              0.3881391 = fieldWeight in 1161, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3912926 = idf(docFreq=1433, maxDocs=42596)
                0.0625 = fieldNorm(doc=1161)
          0.13557017 = weight(abstract_txt:constructing in 1161) [ClassicSimilarity], result of:
            0.13557017 = score(doc=1161,freq=2.0), product of:
              0.21803853 = queryWeight, product of:
                1.8717899 = boost
                7.034543 = idf(docFreq=101, maxDocs=42596)
                0.016559234 = queryNorm
              0.62177163 = fieldWeight in 1161, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.034543 = idf(docFreq=101, maxDocs=42596)
                0.0625 = fieldNorm(doc=1161)
          0.05818856 = weight(abstract_txt:data in 1161) [ClassicSimilarity], result of:
            0.05818856 = score(doc=1161,freq=1.0), product of:
              0.27591783 = queryWeight, product of:
                4.938121 = boost
                3.3742545 = idf(docFreq=3964, maxDocs=42596)
                0.016559234 = queryNorm
              0.2108909 = fieldWeight in 1161, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3742545 = idf(docFreq=3964, maxDocs=42596)
                0.0625 = fieldNorm(doc=1161)
        0.24 = coord(6/25)