Document (#37470)

Author
Rauber, A.
Title
Digital preservation in data-driven science : on the importance of process capture, preservation and validation
Source
Proceedings of the 2nd International Workshop on Semantic Digital Archives held in conjunction with the 16th Int. Conference on Theory and Practice of Digital Libraries (TPDL) on September 27, 2012 in Paphos, Cyprus [http://ceur-ws.org/Vol-912/proceedings.pdf]. Eds.: A. Mitschik et al.
Year
2012
Pages
pp. 7-17
Abstract
Current digital preservation is strongly biased towards data objects: digital files of document-style objects, or encapsulated and largely self-contained objects. To provide authenticity and provenance information, comprehensive metadata models are deployed to document information on an object's context. Yet, we claim that simply documenting an object's context may not be sufficient to ensure proper provenance and to fulfill the stated preservation goals. Specifically in e-Science and business settings, capturing, documenting and preserving entire processes may be necessary to meet the preservation goals. We thus present an approach for capturing, documenting and preserving processes, and means to assess their authenticity upon re-execution. We will discuss options as well as limitations and open challenges to achieve sound preservation, specifically within scientific processes.
Content
See also: http://sda2012.dke-research.de.

Similar documents (author)

  1. Rauch, C.; Rauber, A.: Anwendung der Nutzwertanalyse zur Bewertung von Strategien zur langfristigen Erhaltung digitaler Objekte (2005) 4.88
    4.8754888 = sum of:
      4.8754888 = weight(author_txt:rauber in 3859) [ClassicSimilarity], result of:
        4.8754888 = fieldWeight in 3859, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.5 = fieldNorm(doc=3859)
    
  2. Becker, C.; Rauber, A.: Decision criteria in digital preservation : what to measure and how (2011) 4.88
    4.8754888 = sum of:
      4.8754888 = weight(author_txt:rauber in 4456) [ClassicSimilarity], result of:
        4.8754888 = fieldWeight in 4456, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.5 = fieldNorm(doc=4456)
    
  3. Rauber, K.; Nilges, A.: Was hieß noch mal schnell "Unterbegriff" auf Englisch? : Finden Sie die Antwort im Glossary to Terms of Information Literacy (2011) 4.88
    4.8754888 = sum of:
      4.8754888 = weight(author_txt:rauber in 4518) [ClassicSimilarity], result of:
        4.8754888 = fieldWeight in 4518, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.5 = fieldNorm(doc=4518)
    
  4. Bashir, S.; Rauber, A.: On the relationship between query characteristics and IR functions retrieval bias (2011) 4.88
    4.8754888 = sum of:
      4.8754888 = weight(author_txt:rauber in 4628) [ClassicSimilarity], result of:
        4.8754888 = fieldWeight in 4628, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.5 = fieldNorm(doc=4628)
    
  5. Klein, A.; Mitschang, J.; Nilges, A.; Oberhausen, B.; Rauber, K.; Weiß, A.: "Aus der Praxis für die Praxis" : ein Glossar zu Begriffen der Informationskompetenz (2008) 2.44
    2.4377444 = sum of:
      2.4377444 = weight(author_txt:rauber in 1282) [ClassicSimilarity], result of:
        2.4377444 = fieldWeight in 1282, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.25 = fieldNorm(doc=1282)
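
The author scores above can be reproduced by hand. The following is a minimal sketch, assuming the usual ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which are consistent with the numbers in the listing. The function names are illustrative only, and fieldNorm is copied from the listing rather than derived, since it is a stored, length-dependent value.

```python
import math


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency, assumed form: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the explain tree."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Entries 1 to 4: fieldNorm = 0.5 (values copied from the listing)
score_norm_half = field_weight(freq=1.0, doc_freq=6, max_docs=44218, field_norm=0.5)

# Entry 5: fieldNorm = 0.25, hence half the score of the others
score_norm_quarter = field_weight(freq=1.0, doc_freq=6, max_docs=44218, field_norm=0.25)

print(round(score_norm_half, 4), round(score_norm_quarter, 4))
```

Under these assumptions, the only factor that separates entry 5 from entries 1 to 4 is the smaller fieldNorm, which would be expected for a record with a longer author field.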
    

Similar documents (content)

  1. Tognoli, N.; Chaves-Guimarães, J.A.: Provenance as a knowledge organization principle (2019) 0.28
    0.2808732 = sum of:
      0.2808732 = product of:
        0.87772876 = sum of:
          0.029980129 = weight(abstract_txt:science in 5489) [ClassicSimilarity], result of:
            0.029980129 = score(doc=5489,freq=4.0), product of:
              0.062120296 = queryWeight, product of:
                1.1560298 = boost
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.013917925 = queryNorm
              0.48261407 = fieldWeight in 5489, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.0625 = fieldNorm(doc=5489)
          0.020601429 = weight(abstract_txt:document in 5489) [ClassicSimilarity], result of:
            0.020601429 = score(doc=5489,freq=1.0), product of:
              0.07678848 = queryWeight, product of:
                1.2852876 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.013917925 = queryNorm
              0.26828802 = fieldWeight in 5489, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=5489)
          0.021290872 = weight(abstract_txt:context in 5489) [ClassicSimilarity], result of:
            0.021290872 = score(doc=5489,freq=1.0), product of:
              0.078492254 = queryWeight, product of:
                1.2994683 = boost
                4.339969 = idf(docFreq=1566, maxDocs=44218)
                0.013917925 = queryNorm
              0.27124807 = fieldWeight in 5489, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.339969 = idf(docFreq=1566, maxDocs=44218)
                0.0625 = fieldNorm(doc=5489)
          0.06356427 = weight(abstract_txt:digital in 5489) [ClassicSimilarity], result of:
            0.06356427 = score(doc=5489,freq=4.0), product of:
              0.117359154 = queryWeight, product of:
                1.9460609 = boost
                4.332974 = idf(docFreq=1577, maxDocs=44218)
                0.013917925 = queryNorm
              0.54162174 = fieldWeight in 5489, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.332974 = idf(docFreq=1577, maxDocs=44218)
                0.0625 = fieldNorm(doc=5489)
          0.29870594 = weight(abstract_txt:provenance in 5489) [ClassicSimilarity], result of:
            0.29870594 = score(doc=5489,freq=6.0), product of:
              0.25127155 = queryWeight, product of:
                2.3250053 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.013917925 = queryNorm
              1.1887774 = fieldWeight in 5489, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.0625 = fieldNorm(doc=5489)
          0.14869317 = weight(abstract_txt:authenticity in 5489) [ClassicSimilarity], result of:
            0.14869317 = score(doc=5489,freq=1.0), product of:
              0.28678638 = queryWeight, product of:
                2.4838853 = boost
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.013917925 = queryNorm
              0.5184806 = fieldWeight in 5489, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.0625 = fieldNorm(doc=5489)
          0.08472243 = weight(abstract_txt:objects in 5489) [ClassicSimilarity], result of:
            0.08472243 = score(doc=5489,freq=1.0), product of:
              0.24833624 = queryWeight, product of:
                3.2687924 = boost
                5.4585624 = idf(docFreq=511, maxDocs=44218)
                0.013917925 = queryNorm
              0.34116015 = fieldWeight in 5489, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4585624 = idf(docFreq=511, maxDocs=44218)
                0.0625 = fieldNorm(doc=5489)
          0.21017055 = weight(abstract_txt:preservation in 5489) [ClassicSimilarity], result of:
            0.21017055 = score(doc=5489,freq=1.0), product of:
              0.52093816 = queryWeight, product of:
                5.7983713 = boost
                6.45514 = idf(docFreq=188, maxDocs=44218)
                0.013917925 = queryNorm
              0.40344626 = fieldWeight in 5489, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.45514 = idf(docFreq=188, maxDocs=44218)
                0.0625 = fieldNorm(doc=5489)
        0.32 = coord(8/25)
    
  2. Dobratz, S.; Neuroth, H.: nestor: Network of Expertise in long-term STOrage of digital Resources : a digital preservation initiative for Germany (2004) 0.19
    0.19032855 = sum of:
      0.19032855 = product of:
        0.67974484 = sum of:
          0.007495032 = weight(abstract_txt:science in 1195) [ClassicSimilarity], result of:
            0.007495032 = score(doc=1195,freq=1.0), product of:
              0.062120296 = queryWeight, product of:
                1.1560298 = boost
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.013917925 = queryNorm
              0.12065352 = fieldWeight in 1195, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.03125 = fieldNorm(doc=1195)
          0.01456741 = weight(abstract_txt:document in 1195) [ClassicSimilarity], result of:
            0.01456741 = score(doc=1195,freq=2.0), product of:
              0.07678848 = queryWeight, product of:
                1.2852876 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.013917925 = queryNorm
              0.18970828 = fieldWeight in 1195, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.03125 = fieldNorm(doc=1195)
          0.010645436 = weight(abstract_txt:context in 1195) [ClassicSimilarity], result of:
            0.010645436 = score(doc=1195,freq=1.0), product of:
              0.078492254 = queryWeight, product of:
                1.2994683 = boost
                4.339969 = idf(docFreq=1566, maxDocs=44218)
                0.013917925 = queryNorm
              0.13562404 = fieldWeight in 1195, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.339969 = idf(docFreq=1566, maxDocs=44218)
                0.03125 = fieldNorm(doc=1195)
          0.06742009 = weight(abstract_txt:digital in 1195) [ClassicSimilarity], result of:
            0.06742009 = score(doc=1195,freq=18.0), product of:
              0.117359154 = queryWeight, product of:
                1.9460609 = boost
                4.332974 = idf(docFreq=1577, maxDocs=44218)
                0.013917925 = queryNorm
              0.5744766 = fieldWeight in 1195, product of:
                4.2426405 = tf(freq=18.0), with freq of:
                  18.0 = termFreq=18.0
                4.332974 = idf(docFreq=1577, maxDocs=44218)
                0.03125 = fieldNorm(doc=1195)
          0.07434659 = weight(abstract_txt:authenticity in 1195) [ClassicSimilarity], result of:
            0.07434659 = score(doc=1195,freq=1.0), product of:
              0.28678638 = queryWeight, product of:
                2.4838853 = boost
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.013917925 = queryNorm
              0.2592403 = fieldWeight in 1195, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.03125 = fieldNorm(doc=1195)
          0.112077236 = weight(abstract_txt:objects in 1195) [ClassicSimilarity], result of:
            0.112077236 = score(doc=1195,freq=7.0), product of:
              0.24833624 = queryWeight, product of:
                3.2687924 = boost
                5.4585624 = idf(docFreq=511, maxDocs=44218)
                0.013917925 = queryNorm
              0.45131245 = fieldWeight in 1195, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.4585624 = idf(docFreq=511, maxDocs=44218)
                0.03125 = fieldNorm(doc=1195)
          0.3931931 = weight(abstract_txt:preservation in 1195) [ClassicSimilarity], result of:
            0.3931931 = score(doc=1195,freq=14.0), product of:
              0.52093816 = queryWeight, product of:
                5.7983713 = boost
                6.45514 = idf(docFreq=188, maxDocs=44218)
                0.013917925 = queryNorm
              0.75477886 = fieldWeight in 1195, product of:
                3.7416575 = tf(freq=14.0), with freq of:
                  14.0 = termFreq=14.0
                6.45514 = idf(docFreq=188, maxDocs=44218)
                0.03125 = fieldNorm(doc=1195)
        0.28 = coord(7/25)
    
  3. Hendley, T.: The preservation of digital material (1996) 0.19
    0.18609269 = sum of:
      0.18609269 = product of:
        0.93046343 = sum of:
          0.08495916 = weight(abstract_txt:stated in 5085) [ClassicSimilarity], result of:
            0.08495916 = score(doc=5085,freq=1.0), product of:
              0.107928455 = queryWeight, product of:
                1.0774702 = boost
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.013917925 = queryNorm
              0.78718036 = fieldWeight in 5085, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.109375 = fieldNorm(doc=5085)
          0.037259027 = weight(abstract_txt:context in 5085) [ClassicSimilarity], result of:
            0.037259027 = score(doc=5085,freq=1.0), product of:
              0.078492254 = queryWeight, product of:
                1.2994683 = boost
                4.339969 = idf(docFreq=1566, maxDocs=44218)
                0.013917925 = queryNorm
              0.47468412 = fieldWeight in 5085, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.339969 = idf(docFreq=1566, maxDocs=44218)
                0.109375 = fieldNorm(doc=5085)
          0.07865677 = weight(abstract_txt:digital in 5085) [ClassicSimilarity], result of:
            0.07865677 = score(doc=5085,freq=2.0), product of:
              0.117359154 = queryWeight, product of:
                1.9460609 = boost
                4.332974 = idf(docFreq=1577, maxDocs=44218)
                0.013917925 = queryNorm
              0.6702227 = fieldWeight in 5085, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.332974 = idf(docFreq=1577, maxDocs=44218)
                0.109375 = fieldNorm(doc=5085)
          0.09254286 = weight(abstract_txt:processes in 5085) [ClassicSimilarity], result of:
            0.09254286 = score(doc=5085,freq=1.0), product of:
              0.16479024 = queryWeight, product of:
                2.3060231 = boost
                5.1344433 = idf(docFreq=707, maxDocs=44218)
                0.013917925 = queryNorm
              0.5615797 = fieldWeight in 5085, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1344433 = idf(docFreq=707, maxDocs=44218)
                0.109375 = fieldNorm(doc=5085)
          0.6370456 = weight(abstract_txt:preservation in 5085) [ClassicSimilarity], result of:
            0.6370456 = score(doc=5085,freq=3.0), product of:
              0.52093816 = queryWeight, product of:
                5.7983713 = boost
                6.45514 = idf(docFreq=188, maxDocs=44218)
                0.013917925 = queryNorm
              1.2228814 = fieldWeight in 5085, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.45514 = idf(docFreq=188, maxDocs=44218)
                0.109375 = fieldNorm(doc=5085)
        0.2 = coord(5/25)
    
  4. Dalkir, K.: Knowledge management (2009) 0.17
    0.16956092 = sum of:
      0.16956092 = product of:
        0.70650387 = sum of:
          0.025963552 = weight(abstract_txt:science in 3832) [ClassicSimilarity], result of:
            0.025963552 = score(doc=3832,freq=3.0), product of:
              0.062120296 = queryWeight, product of:
                1.1560298 = boost
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.013917925 = queryNorm
              0.41795602 = fieldWeight in 3832, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.0625 = fieldNorm(doc=3832)
          0.107968524 = weight(abstract_txt:encapsulated in 3832) [ClassicSimilarity], result of:
            0.107968524 = score(doc=3832,freq=1.0), product of:
              0.18388768 = queryWeight, product of:
                1.4064153 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.013917925 = queryNorm
              0.5871439 = fieldWeight in 3832, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.0625 = fieldNorm(doc=3832)
          0.108888455 = weight(abstract_txt:capturing in 3832) [ClassicSimilarity], result of:
            0.108888455 = score(doc=3832,freq=1.0), product of:
              0.23299812 = queryWeight, product of:
                2.238868 = boost
                7.4773793 = idf(docFreq=67, maxDocs=44218)
                0.013917925 = queryNorm
              0.4673362 = fieldWeight in 3832, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4773793 = idf(docFreq=67, maxDocs=44218)
                0.0625 = fieldNorm(doc=3832)
          0.052881636 = weight(abstract_txt:processes in 3832) [ClassicSimilarity], result of:
            0.052881636 = score(doc=3832,freq=1.0), product of:
              0.16479024 = queryWeight, product of:
                2.3060231 = boost
                5.1344433 = idf(docFreq=707, maxDocs=44218)
                0.013917925 = queryNorm
              0.3209027 = fieldWeight in 3832, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1344433 = idf(docFreq=707, maxDocs=44218)
                0.0625 = fieldNorm(doc=3832)
          0.20063113 = weight(abstract_txt:documenting in 3832) [ClassicSimilarity], result of:
            0.20063113 = score(doc=3832,freq=1.0), product of:
              0.400861 = queryWeight, product of:
                3.5966222 = boost
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.013917925 = queryNorm
              0.5005005 = fieldWeight in 3832, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.0625 = fieldNorm(doc=3832)
          0.21017055 = weight(abstract_txt:preservation in 3832) [ClassicSimilarity], result of:
            0.21017055 = score(doc=3832,freq=1.0), product of:
              0.52093816 = queryWeight, product of:
                5.7983713 = boost
                6.45514 = idf(docFreq=188, maxDocs=44218)
                0.013917925 = queryNorm
              0.40344626 = fieldWeight in 3832, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.45514 = idf(docFreq=188, maxDocs=44218)
                0.0625 = fieldNorm(doc=3832)
        0.24 = coord(6/25)
    
  5. Maemura, E.; Moles, N.; Becker, C.: Organizational assessment frameworks for digital preservation : a literature review and mapping (2017) 0.13
    0.1330012 = sum of:
      0.1330012 = product of:
        0.6650059 = sum of:
          0.07100077 = weight(abstract_txt:validation in 3743) [ClassicSimilarity], result of:
            0.07100077 = score(doc=3743,freq=2.0), product of:
              0.110370554 = queryWeight, product of:
                1.089592 = boost
                7.2780466 = idf(docFreq=82, maxDocs=44218)
                0.013917925 = queryNorm
              0.6432945 = fieldWeight in 3743, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.2780466 = idf(docFreq=82, maxDocs=44218)
                0.0625 = fieldNorm(doc=3743)
          0.057218134 = weight(abstract_txt:goals in 3743) [ClassicSimilarity], result of:
            0.057218134 = score(doc=3743,freq=1.0), product of:
              0.15172377 = queryWeight, product of:
                1.8066711 = boost
                6.033927 = idf(docFreq=287, maxDocs=44218)
                0.013917925 = queryNorm
              0.37712044 = fieldWeight in 3743, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.033927 = idf(docFreq=287, maxDocs=44218)
                0.0625 = fieldNorm(doc=3743)
          0.06356427 = weight(abstract_txt:digital in 3743) [ClassicSimilarity], result of:
            0.06356427 = score(doc=3743,freq=4.0), product of:
              0.117359154 = queryWeight, product of:
                1.9460609 = boost
                4.332974 = idf(docFreq=1577, maxDocs=44218)
                0.013917925 = queryNorm
              0.54162174 = fieldWeight in 3743, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.332974 = idf(docFreq=1577, maxDocs=44218)
                0.0625 = fieldNorm(doc=3743)
          0.052881636 = weight(abstract_txt:processes in 3743) [ClassicSimilarity], result of:
            0.052881636 = score(doc=3743,freq=1.0), product of:
              0.16479024 = queryWeight, product of:
                2.3060231 = boost
                5.1344433 = idf(docFreq=707, maxDocs=44218)
                0.013917925 = queryNorm
              0.3209027 = fieldWeight in 3743, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1344433 = idf(docFreq=707, maxDocs=44218)
                0.0625 = fieldNorm(doc=3743)
          0.4203411 = weight(abstract_txt:preservation in 3743) [ClassicSimilarity], result of:
            0.4203411 = score(doc=3743,freq=4.0), product of:
              0.52093816 = queryWeight, product of:
                5.7983713 = boost
                6.45514 = idf(docFreq=188, maxDocs=44218)
                0.013917925 = queryNorm
              0.8068925 = fieldWeight in 3743, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.45514 = idf(docFreq=188, maxDocs=44218)
                0.0625 = fieldNorm(doc=3743)
        0.2 = coord(5/25)
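
The content scores combine more factors than the author scores: each matched term contributes queryWeight * fieldWeight, and the sum is scaled by the coord factor (matched terms / query terms). The sketch below recomputes entry 5 (doc 3743) under the same assumed tf and idf definitions as before. The boost, queryNorm, and fieldNorm values are copied from the listing, not derived, and all names are illustrative.

```python
import math

MAX_DOCS = 44218
QUERY_NORM = 0.013917925  # copied from the listing; depends on the whole query
QUERY_TERMS = 25          # denominator of coord(5/25)
FIELD_NORM = 0.0625       # fieldNorm(doc=3743), copied from the listing


def idf(doc_freq: int) -> float:
    """Assumed ClassicSimilarity idf: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(MAX_DOCS / (doc_freq + 1))


def term_score(boost: float, freq: float, doc_freq: int) -> float:
    """score = queryWeight * fieldWeight for one matched term."""
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    field_weight = math.sqrt(freq) * idf(doc_freq) * FIELD_NORM
    return query_weight * field_weight


# (term, boost, freq, docFreq) for the five terms matched in doc 3743
matched = [
    ("validation",   1.089592,  2.0, 82),
    ("goals",        1.8066711, 1.0, 287),
    ("digital",      1.9460609, 4.0, 1577),
    ("processes",    2.3060231, 1.0, 707),
    ("preservation", 5.7983713, 4.0, 188),
]

total = sum(term_score(boost, freq, df) for _, boost, freq, df in matched)
coord = len(matched) / QUERY_TERMS
score = coord * total

print(round(total, 5), round(score, 5))
```

The result should agree with the listed sum (0.6650059) and final score (0.1330012) up to floating-point rounding, since the listing uses single-precision values.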