Document (#42672)

Author
Menkov, V.
Ginsparg, P.
Kantor, P.B.
Title
Recommendations and privacy in the arXiv system : a simulation experiment using historical data
Source
Journal of the Association for Information Science and Technology. 71(2020) no.3, S.300-313
Year
2020
Abstract
Recommender systems may accelerate knowledge discovery in many fields. However, their users may be competitors guarding their ideas before publication or for other reasons. We describe a simulation experiment to assess user privacy against targeted attacks, modeling recommendations based on co-access data. The analysis uses an unusually long (14?years) set of anonymized historical data on user-item accesses. We introduce the notions of "visibility" and "discoverability." We find, based on historical data, that the majority of the actions of arXiv users would be potentially "visible" under targeted attack. However, "discoverability," which incorporates the difficulty of actually seeing a "visible" effect, is very much lower for nearly all users. We consider the effect of changes to the settings of the recommender algorithm on the visibility and discoverability of user actions and propose mitigation strategies that reduce both measures of risk.
Content
Vgl.: https://asistdl.onlinelibrary.wiley.com/doi/10.1002/asi.24236.
Object
arXiv

Similar documents (author)

  1. Ginsparg, P.: Winners and losers in the global research village (1998) 2.56
    2.5591238 = sum of:
      2.5591238 = product of:
        5.1182475 = sum of:
          5.1182475 = weight(author_txt:ginsparg in 1146) [ClassicSimilarity], result of:
            5.1182475 = score(doc=1146,freq=1.0), product of:
              0.8267633 = queryWeight, product of:
                1.2122998 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.06885113 = queryNorm
              6.190705 = fieldWeight in 1146, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.625 = fieldNorm(doc=1146)
        0.5 = coord(1/2)
    
  2. Haque, A.-u.; Ginsparg, P.: Positional effects on citation and readership in arXiv (2009) 1.79
    1.7913865 = sum of:
      1.7913865 = product of:
        3.582773 = sum of:
          3.582773 = weight(author_txt:ginsparg in 3160) [ClassicSimilarity], result of:
            3.582773 = score(doc=3160,freq=1.0), product of:
              0.8267633 = queryWeight, product of:
                1.2122998 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.06885113 = queryNorm
              4.333493 = fieldWeight in 3160, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.4375 = fieldNorm(doc=3160)
        0.5 = coord(1/2)
    
  3. Haque, A.-ul; Ginsparg, P.: Last but not least : additional positional effects on citation and readership in arXiv (2010) 1.79
    1.7913865 = sum of:
      1.7913865 = product of:
        3.582773 = sum of:
          3.582773 = weight(author_txt:ginsparg in 4110) [ClassicSimilarity], result of:
            3.582773 = score(doc=4110,freq=1.0), product of:
              0.8267633 = queryWeight, product of:
                1.2122998 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.06885113 = queryNorm
              4.333493 = fieldWeight in 4110, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.4375 = fieldNorm(doc=4110)
        0.5 = coord(1/2)
    
  4. Collins, H.M.; Reyes-Galindo, L.; Ginsparg, P.: ¬A note concerning primary source knowledge (2017) 1.54
    1.5354742 = sum of:
      1.5354742 = product of:
        3.0709484 = sum of:
          3.0709484 = weight(author_txt:ginsparg in 3592) [ClassicSimilarity], result of:
            3.0709484 = score(doc=3592,freq=1.0), product of:
              0.8267633 = queryWeight, product of:
                1.2122998 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.06885113 = queryNorm
              3.7144227 = fieldWeight in 3592, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.375 = fieldNorm(doc=3592)
        0.5 = coord(1/2)
    
  5. Kantor, P.B.: ¬The Adaptive Network Library Interface : a historical overview and interim report (1993) 1.44
    1.4363528 = sum of:
      1.4363528 = product of:
        2.8727057 = sum of:
          2.8727057 = weight(author_txt:kantor in 6976) [ClassicSimilarity], result of:
            2.8727057 = score(doc=6976,freq=1.0), product of:
              0.56254995 = queryWeight, product of:
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.06885113 = queryNorm
              5.106579 = fieldWeight in 6976, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.625 = fieldNorm(doc=6976)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Can, O.; Yilmazer, D.: ¬A privacy-aware semantic model for provenance management (2014) 0.13
    0.1340849 = sum of:
      0.1340849 = product of:
        0.5586871 = sum of:
          0.12445798 = weight(abstract_txt:accesses in 1580) [ClassicSimilarity], result of:
            0.12445798 = score(doc=1580,freq=1.0), product of:
              0.17293371 = queryWeight, product of:
                1.0965272 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.017120136 = queryNorm
              0.71968603 = fieldWeight in 1580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.078125 = fieldNorm(doc=1580)
          0.023871658 = weight(abstract_txt:user in 1580) [ClassicSimilarity], result of:
            0.023871658 = score(doc=1580,freq=1.0), product of:
              0.08295196 = queryWeight, product of:
                1.315387 = boost
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.017120136 = queryNorm
              0.2877769 = fieldWeight in 1580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.078125 = fieldNorm(doc=1580)
          0.092449546 = weight(abstract_txt:actions in 1580) [ClassicSimilarity], result of:
            0.092449546 = score(doc=1580,freq=1.0), product of:
              0.17870817 = queryWeight, product of:
                1.5764014 = boost
                6.6217136 = idf(docFreq=159, maxDocs=44218)
                0.017120136 = queryNorm
              0.51732135 = fieldWeight in 1580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6217136 = idf(docFreq=159, maxDocs=44218)
                0.078125 = fieldNorm(doc=1580)
          0.06257295 = weight(abstract_txt:data in 1580) [ClassicSimilarity], result of:
            0.06257295 = score(doc=1580,freq=7.0), product of:
              0.090735294 = queryWeight, product of:
                1.5885384 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.017120136 = queryNorm
              0.68962085 = fieldWeight in 1580, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.078125 = fieldNorm(doc=1580)
          0.17220919 = weight(abstract_txt:privacy in 1580) [ClassicSimilarity], result of:
            0.17220919 = score(doc=1580,freq=3.0), product of:
              0.18758798 = queryWeight, product of:
                1.6150913 = boost
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.017120136 = queryNorm
              0.9180183 = fieldWeight in 1580, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.078125 = fieldNorm(doc=1580)
          0.08312577 = weight(abstract_txt:historical in 1580) [ClassicSimilarity], result of:
            0.08312577 = score(doc=1580,freq=1.0), product of:
              0.19057329 = queryWeight, product of:
                1.9937524 = boost
                5.583205 = idf(docFreq=451, maxDocs=44218)
                0.017120136 = queryNorm
              0.43618792 = fieldWeight in 1580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.583205 = idf(docFreq=451, maxDocs=44218)
                0.078125 = fieldNorm(doc=1580)
        0.24 = coord(6/25)
    
  2. Ghosh, I.; Singh, V.: "Not all my friends are friends" : audience-group-based nudges for managing location privacy (2022) 0.09
    0.0911102 = sum of:
      0.0911102 = product of:
        0.37962583 = sum of:
          0.03886722 = weight(abstract_txt:users in 561) [ClassicSimilarity], result of:
            0.03886722 = score(doc=561,freq=5.0), product of:
              0.07790714 = queryWeight, product of:
                1.2747612 = boost
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.017120136 = queryNorm
              0.49889165 = fieldWeight in 561, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.0625 = fieldNorm(doc=561)
          0.04077631 = weight(abstract_txt:effect in 561) [ClassicSimilarity], result of:
            0.04077631 = score(doc=561,freq=1.0), product of:
              0.12015812 = queryWeight, product of:
                1.2926209 = boost
                5.4296865 = idf(docFreq=526, maxDocs=44218)
                0.017120136 = queryNorm
              0.3393554 = fieldWeight in 561, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4296865 = idf(docFreq=526, maxDocs=44218)
                0.0625 = fieldNorm(doc=561)
          0.04833214 = weight(abstract_txt:recommendations in 561) [ClassicSimilarity], result of:
            0.04833214 = score(doc=561,freq=1.0), product of:
              0.13457732 = queryWeight, product of:
                1.3679825 = boost
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.017120136 = queryNorm
              0.3591403 = fieldWeight in 561, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.0625 = fieldNorm(doc=561)
          0.018920282 = weight(abstract_txt:data in 561) [ClassicSimilarity], result of:
            0.018920282 = score(doc=561,freq=1.0), product of:
              0.090735294 = queryWeight, product of:
                1.5885384 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.017120136 = queryNorm
              0.20852174 = fieldWeight in 561, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=561)
          0.13776736 = weight(abstract_txt:privacy in 561) [ClassicSimilarity], result of:
            0.13776736 = score(doc=561,freq=3.0), product of:
              0.18758798 = queryWeight, product of:
                1.6150913 = boost
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.017120136 = queryNorm
              0.73441464 = fieldWeight in 561, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.0625 = fieldNorm(doc=561)
          0.094962515 = weight(abstract_txt:visible in 561) [ClassicSimilarity], result of:
            0.094962515 = score(doc=561,freq=1.0), product of:
              0.2111135 = queryWeight, product of:
                1.7133757 = boost
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.017120136 = queryNorm
              0.44981736 = fieldWeight in 561, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.0625 = fieldNorm(doc=561)
        0.24 = coord(6/25)
    
  3. Smets, A.; Vannieuwenhuyze, J.; Ballon, P.: Serendipity in the city : user evaluations of urban recommender systems (2022) 0.08
    0.07810019 = sum of:
      0.07810019 = product of:
        0.39050093 = sum of:
          0.019095331 = weight(abstract_txt:however in 458) [ClassicSimilarity], result of:
            0.019095331 = score(doc=458,freq=1.0), product of:
              0.07246017 = queryWeight, product of:
                1.0037932 = boost
                4.216459 = idf(docFreq=1772, maxDocs=44218)
                0.017120136 = queryNorm
              0.26352867 = fieldWeight in 458, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.216459 = idf(docFreq=1772, maxDocs=44218)
                0.0625 = fieldNorm(doc=458)
          0.01738195 = weight(abstract_txt:users in 458) [ClassicSimilarity], result of:
            0.01738195 = score(doc=458,freq=1.0), product of:
              0.07790714 = queryWeight, product of:
                1.2747612 = boost
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.017120136 = queryNorm
              0.22311112 = fieldWeight in 458, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.0625 = fieldNorm(doc=458)
          0.027007697 = weight(abstract_txt:user in 458) [ClassicSimilarity], result of:
            0.027007697 = score(doc=458,freq=2.0), product of:
              0.08295196 = queryWeight, product of:
                1.315387 = boost
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.017120136 = queryNorm
              0.32558239 = fieldWeight in 458, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.0625 = fieldNorm(doc=458)
          0.083713725 = weight(abstract_txt:recommendations in 458) [ClassicSimilarity], result of:
            0.083713725 = score(doc=458,freq=3.0), product of:
              0.13457732 = queryWeight, product of:
                1.3679825 = boost
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.017120136 = queryNorm
              0.6220493 = fieldWeight in 458, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.0625 = fieldNorm(doc=458)
          0.24330223 = weight(abstract_txt:recommender in 458) [ClassicSimilarity], result of:
            0.24330223 = score(doc=458,freq=3.0), product of:
              0.2740763 = queryWeight, product of:
                1.9522271 = boost
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.017120136 = queryNorm
              0.88771707 = fieldWeight in 458, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.0625 = fieldNorm(doc=458)
        0.2 = coord(5/25)
    
  4. Huang, Z.; Chung, Z.W.; Chen, H.: ¬A graph model for e-commerce recommender systems (2004) 0.08
    0.0767924 = sum of:
      0.0767924 = product of:
        0.38396198 = sum of:
          0.023869166 = weight(abstract_txt:however in 501) [ClassicSimilarity], result of:
            0.023869166 = score(doc=501,freq=1.0), product of:
              0.07246017 = queryWeight, product of:
                1.0037932 = boost
                4.216459 = idf(docFreq=1772, maxDocs=44218)
                0.017120136 = queryNorm
              0.32941085 = fieldWeight in 501, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.216459 = idf(docFreq=1772, maxDocs=44218)
                0.078125 = fieldNorm(doc=501)
          0.060415175 = weight(abstract_txt:recommendations in 501) [ClassicSimilarity], result of:
            0.060415175 = score(doc=501,freq=1.0), product of:
              0.13457732 = queryWeight, product of:
                1.3679825 = boost
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.017120136 = queryNorm
              0.44892538 = fieldWeight in 501, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.078125 = fieldNorm(doc=501)
          0.04096361 = weight(abstract_txt:data in 501) [ClassicSimilarity], result of:
            0.04096361 = score(doc=501,freq=3.0), product of:
              0.090735294 = queryWeight, product of:
                1.5885384 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.017120136 = queryNorm
              0.4514628 = fieldWeight in 501, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.078125 = fieldNorm(doc=501)
          0.17558825 = weight(abstract_txt:recommender in 501) [ClassicSimilarity], result of:
            0.17558825 = score(doc=501,freq=1.0), product of:
              0.2740763 = queryWeight, product of:
                1.9522271 = boost
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.017120136 = queryNorm
              0.6406546 = fieldWeight in 501, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.078125 = fieldNorm(doc=501)
          0.08312577 = weight(abstract_txt:historical in 501) [ClassicSimilarity], result of:
            0.08312577 = score(doc=501,freq=1.0), product of:
              0.19057329 = queryWeight, product of:
                1.9937524 = boost
                5.583205 = idf(docFreq=451, maxDocs=44218)
                0.017120136 = queryNorm
              0.43618792 = fieldWeight in 501, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.583205 = idf(docFreq=451, maxDocs=44218)
                0.078125 = fieldNorm(doc=501)
        0.2 = coord(5/25)
    
  5. Vishwanath, A.; Xu, W.; Ngoh, Z.: How people protect their privacy on facebook : a cost-benefit view (2018) 0.08
    0.07579693 = sum of:
      0.07579693 = product of:
        0.37898466 = sum of:
          0.019095331 = weight(abstract_txt:however in 4223) [ClassicSimilarity], result of:
            0.019095331 = score(doc=4223,freq=1.0), product of:
              0.07246017 = queryWeight, product of:
                1.0037932 = boost
                4.216459 = idf(docFreq=1772, maxDocs=44218)
                0.017120136 = queryNorm
              0.26352867 = fieldWeight in 4223, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.216459 = idf(docFreq=1772, maxDocs=44218)
                0.0625 = fieldNorm(doc=4223)
          0.09052653 = weight(abstract_txt:attacks in 4223) [ClassicSimilarity], result of:
            0.09052653 = score(doc=4223,freq=1.0), product of:
              0.16230121 = queryWeight, product of:
                1.0622836 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.017120136 = queryNorm
              0.55776864 = fieldWeight in 4223, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.0625 = fieldNorm(doc=4223)
          0.01738195 = weight(abstract_txt:users in 4223) [ClassicSimilarity], result of:
            0.01738195 = score(doc=4223,freq=1.0), product of:
              0.07790714 = queryWeight, product of:
                1.2747612 = boost
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.017120136 = queryNorm
              0.22311112 = fieldWeight in 4223, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.0625 = fieldNorm(doc=4223)
          0.027007697 = weight(abstract_txt:user in 4223) [ClassicSimilarity], result of:
            0.027007697 = score(doc=4223,freq=2.0), product of:
              0.08295196 = queryWeight, product of:
                1.315387 = boost
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.017120136 = queryNorm
              0.32558239 = fieldWeight in 4223, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.0625 = fieldNorm(doc=4223)
          0.22497316 = weight(abstract_txt:privacy in 4223) [ClassicSimilarity], result of:
            0.22497316 = score(doc=4223,freq=8.0), product of:
              0.18758798 = queryWeight, product of:
                1.6150913 = boost
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.017120136 = queryNorm
              1.1992941 = fieldWeight in 4223, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.0625 = fieldNorm(doc=4223)
        0.2 = coord(5/25)