Document (#40196)

Author
Lamb, I.
Larson, C.
Title
Shining a light on scientific data : building a data catalog to foster data sharing and reuse
Source
Code4Lib journal. Issue 32(2016), [http://journal.code4lib.org]
Year
2016
Abstract
The scientific community's growing eagerness to make research data available to the public provides libraries - with our expertise in metadata and discovery - an interesting new opportunity. This paper details the in-house creation of a "data catalog" which describes datasets ranging from population-level studies like the US Census to small, specialized datasets created by researchers at our own institution. Based on Symfony2 and Solr, the data catalog provides a powerful search interface to help researchers locate the data that can help them, and an administrative interface so librarians can add, edit, and manage metadata elements at will. This paper will outline the successes, failures, and total redos that culminated in the current manifestation of our data catalog.
Content
Vgl.: http://journal.code4lib.org/articles/11421.
Theme
Informetrie
Visualisierung

Similar documents (author)

  1. Larson, R.R.: Between Scylla and Charybdis : searching in the online catalog (1991) 5.23
    5.2279267 = sum of:
      5.2279267 = weight(author_txt:larson in 462) [ClassicSimilarity], result of:
        5.2279267 = score(doc=462,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.364683 = idf(docFreq=27, maxDocs=44218)
            0.11955025 = queryNorm
          5.227927 = fieldWeight in 462, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.364683 = idf(docFreq=27, maxDocs=44218)
            0.625 = fieldNorm(doc=462)
    
  2. Larson, R.R.: Evaluation of advanced retrieval techniques in an experimental online catalog (1992) 5.23
    5.2279267 = sum of:
      5.2279267 = weight(author_txt:larson in 481) [ClassicSimilarity], result of:
        5.2279267 = score(doc=481,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.364683 = idf(docFreq=27, maxDocs=44218)
            0.11955025 = queryNorm
          5.227927 = fieldWeight in 481, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.364683 = idf(docFreq=27, maxDocs=44218)
            0.625 = fieldNorm(doc=481)
    
  3. Larson, R.R.: Experiments in automatic Library of Congress Classification (1992) 5.23
    5.2279267 = sum of:
      5.2279267 = weight(author_txt:larson in 1054) [ClassicSimilarity], result of:
        5.2279267 = score(doc=1054,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.364683 = idf(docFreq=27, maxDocs=44218)
            0.11955025 = queryNorm
          5.227927 = fieldWeight in 1054, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.364683 = idf(docFreq=27, maxDocs=44218)
            0.625 = fieldNorm(doc=1054)
    
  4. Larson, R.R.: Classification clustering, probabilistic information retrieval, and the online catalog (1991) 5.23
    5.2279267 = sum of:
      5.2279267 = weight(author_txt:larson in 1070) [ClassicSimilarity], result of:
        5.2279267 = score(doc=1070,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.364683 = idf(docFreq=27, maxDocs=44218)
            0.11955025 = queryNorm
          5.227927 = fieldWeight in 1070, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.364683 = idf(docFreq=27, maxDocs=44218)
            0.625 = fieldNorm(doc=1070)
    
  5. Larson, R.R.: ¬The decline of subject searching : long-term trends and patterns of index use in an online catalog (1991) 5.23
    5.2279267 = sum of:
      5.2279267 = weight(author_txt:larson in 1104) [ClassicSimilarity], result of:
        5.2279267 = score(doc=1104,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.364683 = idf(docFreq=27, maxDocs=44218)
            0.11955025 = queryNorm
          5.227927 = fieldWeight in 1104, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.364683 = idf(docFreq=27, maxDocs=44218)
            0.625 = fieldNorm(doc=1104)
    

Similar documents (content)

  1. Senzig, D.: Library catalogs for library users (1984) 0.18
    0.17542516 = sum of:
      0.17542516 = product of:
        0.87712574 = sum of:
          0.055823725 = weight(abstract_txt:will in 831) [ClassicSimilarity], result of:
            0.055823725 = score(doc=831,freq=2.0), product of:
              0.09346549 = queryWeight, product of:
                1.140773 = boost
                3.8613079 = idf(docFreq=2528, maxDocs=44218)
                0.021218644 = queryNorm
              0.5972656 = fieldWeight in 831, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.8613079 = idf(docFreq=2528, maxDocs=44218)
                0.109375 = fieldNorm(doc=831)
          0.15931001 = weight(abstract_txt:failures in 831) [ClassicSimilarity], result of:
            0.15931001 = score(doc=831,freq=1.0), product of:
              0.18804747 = queryWeight, product of:
                1.1441747 = boost
                7.7456436 = idf(docFreq=51, maxDocs=44218)
                0.021218644 = queryNorm
              0.8471798 = fieldWeight in 831, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7456436 = idf(docFreq=51, maxDocs=44218)
                0.109375 = fieldNorm(doc=831)
          0.20326908 = weight(abstract_txt:successes in 831) [ClassicSimilarity], result of:
            0.20326908 = score(doc=831,freq=1.0), product of:
              0.22121759 = queryWeight, product of:
                1.2409904 = boost
                8.401051 = idf(docFreq=26, maxDocs=44218)
                0.021218644 = queryNorm
              0.9188649 = fieldWeight in 831, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.401051 = idf(docFreq=26, maxDocs=44218)
                0.109375 = fieldNorm(doc=831)
          0.07651393 = weight(abstract_txt:help in 831) [ClassicSimilarity], result of:
            0.07651393 = score(doc=831,freq=1.0), product of:
              0.1453034 = queryWeight, product of:
                1.4223664 = boost
                4.81445 = idf(docFreq=974, maxDocs=44218)
                0.021218644 = queryNorm
              0.52658045 = fieldWeight in 831, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.81445 = idf(docFreq=974, maxDocs=44218)
                0.109375 = fieldNorm(doc=831)
          0.38220897 = weight(abstract_txt:catalog in 831) [ClassicSimilarity], result of:
            0.38220897 = score(doc=831,freq=3.0), product of:
              0.3709246 = queryWeight, product of:
                3.213893 = boost
                5.4392195 = idf(docFreq=521, maxDocs=44218)
                0.021218644 = queryNorm
              1.0304223 = fieldWeight in 831, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4392195 = idf(docFreq=521, maxDocs=44218)
                0.109375 = fieldNorm(doc=831)
        0.2 = coord(5/25)
    
  2. Daniel Jr., R.; Lagoze, C.: Extending the Warwick framework : from metadata containers to active digital objects (1997) 0.14
    0.1411934 = sum of:
      0.1411934 = product of:
        0.50426215 = sum of:
          0.017680965 = weight(abstract_txt:paper in 1264) [ClassicSimilarity], result of:
            0.017680965 = score(doc=1264,freq=3.0), product of:
              0.07536754 = queryWeight, product of:
                1.024391 = boost
                3.467376 = idf(docFreq=3749, maxDocs=44218)
                0.021218644 = queryNorm
              0.23459655 = fieldWeight in 1264, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.467376 = idf(docFreq=3749, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1264)
          0.014097619 = weight(abstract_txt:will in 1264) [ClassicSimilarity], result of:
            0.014097619 = score(doc=1264,freq=1.0), product of:
              0.09346549 = queryWeight, product of:
                1.140773 = boost
                3.8613079 = idf(docFreq=2528, maxDocs=44218)
                0.021218644 = queryNorm
              0.15083234 = fieldWeight in 1264, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8613079 = idf(docFreq=2528, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1264)
          0.025949372 = weight(abstract_txt:provides in 1264) [ClassicSimilarity], result of:
            0.025949372 = score(doc=1264,freq=2.0), product of:
              0.111419715 = queryWeight, product of:
                1.2455312 = boost
                4.215895 = idf(docFreq=1773, maxDocs=44218)
                0.021218644 = queryNorm
              0.23289749 = fieldWeight in 1264, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.215895 = idf(docFreq=1773, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1264)
          0.098656446 = weight(abstract_txt:metadata in 1264) [ClassicSimilarity], result of:
            0.098656446 = score(doc=1264,freq=12.0), product of:
              0.14936334 = queryWeight, product of:
                1.4421008 = boost
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.021218644 = queryNorm
              0.6605131 = fieldWeight in 1264, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1264)
          0.069727 = weight(abstract_txt:datasets in 1264) [ClassicSimilarity], result of:
            0.069727 = score(doc=1264,freq=1.0), product of:
              0.27132395 = queryWeight, product of:
                1.9436482 = boost
                6.578893 = idf(docFreq=166, maxDocs=44218)
                0.021218644 = queryNorm
              0.25698802 = fieldWeight in 1264, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.578893 = idf(docFreq=166, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1264)
          0.111454405 = weight(abstract_txt:catalog in 1264) [ClassicSimilarity], result of:
            0.111454405 = score(doc=1264,freq=2.0), product of:
              0.3709246 = queryWeight, product of:
                3.213893 = boost
                5.4392195 = idf(docFreq=521, maxDocs=44218)
                0.021218644 = queryNorm
              0.30047727 = fieldWeight in 1264, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4392195 = idf(docFreq=521, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1264)
          0.16669632 = weight(abstract_txt:data in 1264) [ClassicSimilarity], result of:
            0.16669632 = score(doc=1264,freq=21.0), product of:
              0.2791162 = queryWeight, product of:
                3.9427218 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.021218644 = queryNorm
              0.5972291 = fieldWeight in 1264, product of:
                4.582576 = tf(freq=21.0), with freq of:
                  21.0 = termFreq=21.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1264)
        0.28 = coord(7/25)
    
  3. Baker, T.; Bermès, E.; Coyle, K.; Dunsire, G.; Isaac, A.; Murray, P.; Panzer, M.; Schneider, J.; Singer, R.; Summers, E.; Waites, W.; Young, J.; Zeng, M.: Library Linked Data Incubator Group Final Report (2011) 0.13
    0.13255824 = sum of:
      0.13255824 = product of:
        0.552326 = sum of:
          0.047280166 = weight(abstract_txt:ranging in 4796) [ClassicSimilarity], result of:
            0.047280166 = score(doc=4796,freq=1.0), product of:
              0.1471892 = queryWeight, product of:
                1.0122705 = boost
                6.8527 = idf(docFreq=126, maxDocs=44218)
                0.021218644 = queryNorm
              0.32122034 = fieldWeight in 4796, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8527 = idf(docFreq=126, maxDocs=44218)
                0.046875 = fieldNorm(doc=4796)
          0.06633468 = weight(abstract_txt:foster in 4796) [ClassicSimilarity], result of:
            0.06633468 = score(doc=4796,freq=1.0), product of:
              0.18446632 = queryWeight, product of:
                1.1332276 = boost
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.021218644 = queryNorm
              0.35960323 = fieldWeight in 4796, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.046875 = fieldNorm(doc=4796)
          0.032791685 = weight(abstract_txt:help in 4796) [ClassicSimilarity], result of:
            0.032791685 = score(doc=4796,freq=1.0), product of:
              0.1453034 = queryWeight, product of:
                1.4223664 = boost
                4.81445 = idf(docFreq=974, maxDocs=44218)
                0.021218644 = queryNorm
              0.22567734 = fieldWeight in 4796, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.81445 = idf(docFreq=974, maxDocs=44218)
                0.046875 = fieldNorm(doc=4796)
          0.034175597 = weight(abstract_txt:metadata in 4796) [ClassicSimilarity], result of:
            0.034175597 = score(doc=4796,freq=1.0), product of:
              0.14936334 = queryWeight, product of:
                1.4421008 = boost
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.021218644 = queryNorm
              0.22880846 = fieldWeight in 4796, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.046875 = fieldNorm(doc=4796)
          0.14492485 = weight(abstract_txt:datasets in 4796) [ClassicSimilarity], result of:
            0.14492485 = score(doc=4796,freq=3.0), product of:
              0.27132395 = queryWeight, product of:
                1.9436482 = boost
                6.578893 = idf(docFreq=166, maxDocs=44218)
                0.021218644 = queryNorm
              0.5341395 = fieldWeight in 4796, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.578893 = idf(docFreq=166, maxDocs=44218)
                0.046875 = fieldNorm(doc=4796)
          0.22681904 = weight(abstract_txt:data in 4796) [ClassicSimilarity], result of:
            0.22681904 = score(doc=4796,freq=27.0), product of:
              0.2791162 = queryWeight, product of:
                3.9427218 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.021218644 = queryNorm
              0.812633 = fieldWeight in 4796, product of:
                5.196152 = tf(freq=27.0), with freq of:
                  27.0 = termFreq=27.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.046875 = fieldNorm(doc=4796)
        0.24 = coord(6/25)
    
  4. McCutcheon, S.; Kreyche, M.; Maurer, M.B.; Nickerson, J.: Morphing metadata : maximizing access to electronic theses and dissertations (2008) 0.13
    0.1314156 = sum of:
      0.1314156 = product of:
        0.54756504 = sum of:
          0.016332975 = weight(abstract_txt:paper in 2394) [ClassicSimilarity], result of:
            0.016332975 = score(doc=2394,freq=1.0), product of:
              0.07536754 = queryWeight, product of:
                1.024391 = boost
                3.467376 = idf(docFreq=3749, maxDocs=44218)
                0.021218644 = queryNorm
              0.216711 = fieldWeight in 2394, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.467376 = idf(docFreq=3749, maxDocs=44218)
                0.0625 = fieldNorm(doc=2394)
          0.088446245 = weight(abstract_txt:foster in 2394) [ClassicSimilarity], result of:
            0.088446245 = score(doc=2394,freq=1.0), product of:
              0.18446632 = queryWeight, product of:
                1.1332276 = boost
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.021218644 = queryNorm
              0.47947097 = fieldWeight in 2394, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.0625 = fieldNorm(doc=2394)
          0.029358365 = weight(abstract_txt:provides in 2394) [ClassicSimilarity], result of:
            0.029358365 = score(doc=2394,freq=1.0), product of:
              0.111419715 = queryWeight, product of:
                1.2455312 = boost
                4.215895 = idf(docFreq=1773, maxDocs=44218)
                0.021218644 = queryNorm
              0.26349345 = fieldWeight in 2394, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.215895 = idf(docFreq=1773, maxDocs=44218)
                0.0625 = fieldNorm(doc=2394)
          0.078925155 = weight(abstract_txt:metadata in 2394) [ClassicSimilarity], result of:
            0.078925155 = score(doc=2394,freq=3.0), product of:
              0.14936334 = queryWeight, product of:
                1.4421008 = boost
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.021218644 = queryNorm
              0.5284105 = fieldWeight in 2394, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.0625 = fieldNorm(doc=2394)
          0.25219253 = weight(abstract_txt:catalog in 2394) [ClassicSimilarity], result of:
            0.25219253 = score(doc=2394,freq=4.0), product of:
              0.3709246 = queryWeight, product of:
                3.213893 = boost
                5.4392195 = idf(docFreq=521, maxDocs=44218)
                0.021218644 = queryNorm
              0.67990243 = fieldWeight in 2394, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.4392195 = idf(docFreq=521, maxDocs=44218)
                0.0625 = fieldNorm(doc=2394)
          0.082309775 = weight(abstract_txt:data in 2394) [ClassicSimilarity], result of:
            0.082309775 = score(doc=2394,freq=2.0), product of:
              0.2791162 = queryWeight, product of:
                3.9427218 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.021218644 = queryNorm
              0.29489428 = fieldWeight in 2394, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=2394)
        0.24 = coord(6/25)
    
  5. Jiao, H.; Qiu, Y.; Ma, X.; Yang, B.: Dissmination effect of data papers on scientific datasets (2024) 0.13
    0.12827228 = sum of:
      0.12827228 = product of:
        0.6413614 = sum of:
          0.060775425 = weight(abstract_txt:reuse in 1204) [ClassicSimilarity], result of:
            0.060775425 = score(doc=1204,freq=1.0), product of:
              0.14364246 = queryWeight, product of:
                6.769634 = idf(docFreq=137, maxDocs=44218)
                0.021218644 = queryNorm
              0.4231021 = fieldWeight in 1204, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.769634 = idf(docFreq=137, maxDocs=44218)
                0.0625 = fieldNorm(doc=1204)
          0.016332975 = weight(abstract_txt:paper in 1204) [ClassicSimilarity], result of:
            0.016332975 = score(doc=1204,freq=1.0), product of:
              0.07536754 = queryWeight, product of:
                1.024391 = boost
                3.467376 = idf(docFreq=3749, maxDocs=44218)
                0.021218644 = queryNorm
              0.216711 = fieldWeight in 1204, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.467376 = idf(docFreq=3749, maxDocs=44218)
                0.0625 = fieldNorm(doc=1204)
          0.067860804 = weight(abstract_txt:scientific in 1204) [ClassicSimilarity], result of:
            0.067860804 = score(doc=1204,freq=3.0), product of:
              0.13505575 = queryWeight, product of:
                1.3712926 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.021218644 = queryNorm
              0.5024651 = fieldWeight in 1204, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.0625 = fieldNorm(doc=1204)
          0.24946292 = weight(abstract_txt:datasets in 1204) [ClassicSimilarity], result of:
            0.24946292 = score(doc=1204,freq=5.0), product of:
              0.27132395 = queryWeight, product of:
                1.9436482 = boost
                6.578893 = idf(docFreq=166, maxDocs=44218)
                0.021218644 = queryNorm
              0.9194283 = fieldWeight in 1204, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.578893 = idf(docFreq=166, maxDocs=44218)
                0.0625 = fieldNorm(doc=1204)
          0.2469293 = weight(abstract_txt:data in 1204) [ClassicSimilarity], result of:
            0.2469293 = score(doc=1204,freq=18.0), product of:
              0.2791162 = queryWeight, product of:
                3.9427218 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.021218644 = queryNorm
              0.8846828 = fieldWeight in 1204, product of:
                4.2426405 = tf(freq=18.0), with freq of:
                  18.0 = termFreq=18.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=1204)
        0.2 = coord(5/25)