Document (#39494)

Author
Sánchez, D.
Batet, M.
Title
C-sanitized : a privacy model for document redaction and sanitization
Source
Journal of the Association for Information Science and Technology. 67(2016) no.1, S.148-163
Year
2016
Abstract
Vast amounts of information are daily exchanged and/or released. The sensitive nature of much of this information creates a serious privacy threat when documents are uncontrollably made available to untrusted third parties. In such cases, appropriate data protection measures should be undertaken by the responsible organization, especially under the umbrella of current legislation on data privacy. To do so, human experts are usually requested to redact or sanitize document contents. To relieve this burdensome task, this paper presents a privacy model for document redaction/sanitization, which offers several advantages over other models available in the literature. Based on the well-established foundations of data semantics and information theory, our model provides a framework to develop and implement automated and inherently semantic redaction/sanitization tools. Moreover, contrary to ad-hoc redaction methods, our proposal provides a priori privacy guarantees which can be intuitively defined according to current legislations on data privacy. Empirical tests performed within the context of several use cases illustrate the applicability of our model and its ability to mimic the reasoning of human sanitizers.
Content
Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23363/abstract.

Similar documents (author)

  1. Sánchez, M.F.: Semantically enhanced Information Retrieval : an ontology-based approach (2006) 5.07
    5.070855 = sum of:
      5.070855 = weight(author_txt:sánchez in 4327) [ClassicSimilarity], result of:
        5.070855 = fieldWeight in 4327, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.113368 = idf(docFreq=35, maxDocs=44218)
          0.625 = fieldNorm(doc=4327)
    
  2. Sánchez, R. Rodriguez- -> Rodriguez-Sánchez, R.: 4.30
    4.302763 = sum of:
      4.302763 = weight(author_txt:sánchez in 3567) [ClassicSimilarity], result of:
        4.302763 = fieldWeight in 3567, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.113368 = idf(docFreq=35, maxDocs=44218)
          0.375 = fieldNorm(doc=3567)
    
  3. Sánchez, R. Rodríguez -> Rodríguez-Sánchez, R.: 4.30
    4.302763 = sum of:
      4.302763 = weight(author_txt:sánchez in 501) [ClassicSimilarity], result of:
        4.302763 = fieldWeight in 501, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.113368 = idf(docFreq=35, maxDocs=44218)
          0.375 = fieldNorm(doc=501)
    
  4. Casabón, A.I. Sánchez- => Sánchez-Casabón, A.I.: 4.30
    4.302763 = sum of:
      4.302763 = weight(author_txt:sánchez in 4787) [ClassicSimilarity], result of:
        4.302763 = fieldWeight in 4787, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.113368 = idf(docFreq=35, maxDocs=44218)
          0.375 = fieldNorm(doc=4787)
    
  5. Sánchez, J.A. Pastor => Pastor Sánchez, J.A.: 4.30
    4.302763 = sum of:
      4.302763 = weight(author_txt:sánchez in 4791) [ClassicSimilarity], result of:
        4.302763 = fieldWeight in 4791, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.113368 = idf(docFreq=35, maxDocs=44218)
          0.375 = fieldNorm(doc=4791)
    

Similar documents (content)

  1. Wu, Z.; Xie, J.; Pan, J.; Su, X.: ¬An effective approach for the protection of user privacy in a digital library (2019) 0.25
    0.24672888 = sum of:
      0.24672888 = product of:
        1.2336444 = sum of:
          0.08483062 = weight(abstract_txt:protection in 5782) [ClassicSimilarity], result of:
            0.08483062 = score(doc=5782,freq=2.0), product of:
              0.13120535 = queryWeight, product of:
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.01793682 = queryNorm
              0.64654845 = fieldWeight in 5782, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.0625 = fieldNorm(doc=5782)
          0.022967767 = weight(abstract_txt:provides in 5782) [ClassicSimilarity], result of:
            0.022967767 = score(doc=5782,freq=1.0), product of:
              0.08716637 = queryWeight, product of:
                1.1526932 = boost
                4.215895 = idf(docFreq=1773, maxDocs=44218)
                0.01793682 = queryNorm
              0.26349345 = fieldWeight in 5782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.215895 = idf(docFreq=1773, maxDocs=44218)
                0.0625 = fieldNorm(doc=5782)
          0.25796467 = weight(abstract_txt:untrusted in 5782) [ClassicSimilarity], result of:
            0.25796467 = score(doc=5782,freq=3.0), product of:
              0.24058002 = queryWeight, product of:
                1.3541102 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.01793682 = queryNorm
              1.0722615 = fieldWeight in 5782, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.0625 = fieldNorm(doc=5782)
          0.055765934 = weight(abstract_txt:data in 5782) [ClassicSimilarity], result of:
            0.055765934 = score(doc=5782,freq=6.0), product of:
              0.10917973 = queryWeight, product of:
                1.8244218 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.01793682 = queryNorm
              0.5107719 = fieldWeight in 5782, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=5782)
          0.8121154 = weight(abstract_txt:privacy in 5782) [ClassicSimilarity], result of:
            0.8121154 = score(doc=5782,freq=8.0), product of:
              0.67716116 = queryWeight, product of:
                5.5647526 = boost
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.01793682 = queryNorm
              1.1992941 = fieldWeight in 5782, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.0625 = fieldNorm(doc=5782)
        0.2 = coord(5/25)
    
  2. Chen, H.; Beaudoin, C.E.; Hong, H.: Teen online information disclosure : empirical testing of a protection motivation and social capital model (2016) 0.20
    0.20156814 = sum of:
      0.20156814 = product of:
        1.2598009 = sum of:
          0.16766122 = weight(abstract_txt:protection in 3203) [ClassicSimilarity], result of:
            0.16766122 = score(doc=3203,freq=5.0), product of:
              0.13120535 = queryWeight, product of:
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.01793682 = queryNorm
              1.2778535 = fieldWeight in 3203, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.078125 = fieldNorm(doc=3203)
          0.028457934 = weight(abstract_txt:data in 3203) [ClassicSimilarity], result of:
            0.028457934 = score(doc=3203,freq=1.0), product of:
              0.10917973 = queryWeight, product of:
                1.8244218 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.01793682 = queryNorm
              0.26065218 = fieldWeight in 3203, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.078125 = fieldNorm(doc=3203)
          0.048537537 = weight(abstract_txt:model in 3203) [ClassicSimilarity], result of:
            0.048537537 = score(doc=3203,freq=1.0), product of:
              0.1558565 = queryWeight, product of:
                2.1798003 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.01793682 = queryNorm
              0.31142452 = fieldWeight in 3203, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.078125 = fieldNorm(doc=3203)
          1.0151442 = weight(abstract_txt:privacy in 3203) [ClassicSimilarity], result of:
            1.0151442 = score(doc=3203,freq=8.0), product of:
              0.67716116 = queryWeight, product of:
                5.5647526 = boost
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.01793682 = queryNorm
              1.4991176 = fieldWeight in 3203, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.078125 = fieldNorm(doc=3203)
        0.16 = coord(4/25)
    
  3. Harvey, M.J.; Harvey, M.G.: Privacy and security issues for mobile health platforms (2014) 0.18
    0.1837436 = sum of:
      0.1837436 = product of:
        0.918718 = sum of:
          0.10603827 = weight(abstract_txt:protection in 1260) [ClassicSimilarity], result of:
            0.10603827 = score(doc=1260,freq=2.0), product of:
              0.13120535 = queryWeight, product of:
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.01793682 = queryNorm
              0.8081856 = fieldWeight in 1260, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.078125 = fieldNorm(doc=1260)
          0.09108763 = weight(abstract_txt:legislation in 1260) [ClassicSimilarity], result of:
            0.09108763 = score(doc=1260,freq=1.0), product of:
              0.14938009 = queryWeight, product of:
                1.0670152 = boost
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.01793682 = queryNorm
              0.6097709 = fieldWeight in 1260, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.078125 = fieldNorm(doc=1260)
          0.030422106 = weight(abstract_txt:current in 1260) [ClassicSimilarity], result of:
            0.030422106 = score(doc=1260,freq=1.0), product of:
              0.09059884 = queryWeight, product of:
                1.1751696 = boost
                4.298101 = idf(docFreq=1633, maxDocs=44218)
                0.01793682 = queryNorm
              0.33578914 = fieldWeight in 1260, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.298101 = idf(docFreq=1633, maxDocs=44218)
                0.078125 = fieldNorm(doc=1260)
          0.069523595 = weight(abstract_txt:cases in 1260) [ClassicSimilarity], result of:
            0.069523595 = score(doc=1260,freq=1.0), product of:
              0.15718748 = queryWeight, product of:
                1.5479189 = boost
                5.6614056 = idf(docFreq=417, maxDocs=44218)
                0.01793682 = queryNorm
              0.4422973 = fieldWeight in 1260, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6614056 = idf(docFreq=417, maxDocs=44218)
                0.078125 = fieldNorm(doc=1260)
          0.62164634 = weight(abstract_txt:privacy in 1260) [ClassicSimilarity], result of:
            0.62164634 = score(doc=1260,freq=3.0), product of:
              0.67716116 = queryWeight, product of:
                5.5647526 = boost
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.01793682 = queryNorm
              0.9180183 = fieldWeight in 1260, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.078125 = fieldNorm(doc=1260)
        0.2 = coord(5/25)
    
  4. Yao, M.Z.; Rice, R.E.; Wallis, K.: Predicting user concerns about online privacy (2007) 0.16
    0.16411293 = sum of:
      0.16411293 = product of:
        1.3676077 = sum of:
          0.040245596 = weight(abstract_txt:data in 205) [ClassicSimilarity], result of:
            0.040245596 = score(doc=205,freq=2.0), product of:
              0.10917973 = queryWeight, product of:
                1.8244218 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.01793682 = queryNorm
              0.36861783 = fieldWeight in 205, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.078125 = fieldNorm(doc=205)
          0.084069476 = weight(abstract_txt:model in 205) [ClassicSimilarity], result of:
            0.084069476 = score(doc=205,freq=3.0), product of:
              0.1558565 = queryWeight, product of:
                2.1798003 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.01793682 = queryNorm
              0.5394031 = fieldWeight in 205, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.078125 = fieldNorm(doc=205)
          1.2432927 = weight(abstract_txt:privacy in 205) [ClassicSimilarity], result of:
            1.2432927 = score(doc=205,freq=12.0), product of:
              0.67716116 = queryWeight, product of:
                5.5647526 = boost
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.01793682 = queryNorm
              1.8360366 = fieldWeight in 205, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.078125 = fieldNorm(doc=205)
        0.12 = coord(3/25)
    
  5. Wu, Z.; Li, R.; Zhou, Z.; Guo, J.; Jiang, J.; Su, X.: ¬A user sensitive subject protection approach for book search service (2020) 0.15
    0.14891426 = sum of:
      0.14891426 = product of:
        0.9307142 = sum of:
          0.08483062 = weight(abstract_txt:protection in 5617) [ClassicSimilarity], result of:
            0.08483062 = score(doc=5617,freq=2.0), product of:
              0.13120535 = queryWeight, product of:
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.01793682 = queryNorm
              0.64654845 = fieldWeight in 5617, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.0625 = fieldNorm(doc=5617)
          0.14893599 = weight(abstract_txt:untrusted in 5617) [ClassicSimilarity], result of:
            0.14893599 = score(doc=5617,freq=1.0), product of:
              0.24058002 = queryWeight, product of:
                1.3541102 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.01793682 = queryNorm
              0.6190705 = fieldWeight in 5617, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.0625 = fieldNorm(doc=5617)
          0.054913953 = weight(abstract_txt:model in 5617) [ClassicSimilarity], result of:
            0.054913953 = score(doc=5617,freq=2.0), product of:
              0.1558565 = queryWeight, product of:
                2.1798003 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.01793682 = queryNorm
              0.35233662 = fieldWeight in 5617, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.0625 = fieldNorm(doc=5617)
          0.64203364 = weight(abstract_txt:privacy in 5617) [ClassicSimilarity], result of:
            0.64203364 = score(doc=5617,freq=5.0), product of:
              0.67716116 = queryWeight, product of:
                5.5647526 = boost
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.01793682 = queryNorm
              0.9481253 = fieldWeight in 5617, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.0625 = fieldNorm(doc=5617)
        0.16 = coord(4/25)