Document (#31100)

Author
Thelwall, M.
Stuart, D.
Title
Web crawling ethics revisited : cost, privacy, and denial of service
Source
Journal of the American Society for Information Science and Technology. 57(2006) no.13, S.1771-1779
Year
2006
Abstract
Ethical aspects of the employment of Web crawlers for information science research and other contexts are reviewed. The difference between legal and ethical uses of communications technologies is emphasized as well as the changing boundary between ethical and unethical conduct. A review of the potential impacts on Web site owners is used to underpin a new framework for ethical crawling, and it is argued that delicate human judgment is required for each individual case, with verdicts likely to change over time. Decisions can be based upon an approximate cost-benefit analysis, but it is crucial that crawler owners find out about the technological issues affecting the owners of the sites being crawled in order to produce an informed assessment.
Theme
Suchmaschinen

Similar documents (author)

  1. Angus, E.; Thelwall, M.; Stuart, D.: General patterns of tag usage among university groups in Flickr (2008) 4.24
    4.241468 = sum of:
      4.241468 = sum of:
        1.3346748 = weight(author_txt:thelwall in 3734) [ClassicSimilarity], result of:
          1.3346748 = score(doc=3734,freq=1.0), product of:
            0.5114406 = queryWeight, product of:
              6.9590354 = idf(docFreq=109, maxDocs=42596)
              0.073493026 = queryNorm
            2.6096382 = fieldWeight in 3734, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.9590354 = idf(docFreq=109, maxDocs=42596)
              0.375 = fieldNorm(doc=3734)
        2.906793 = weight(author_txt:stuart in 3734) [ClassicSimilarity], result of:
          2.906793 = score(doc=3734,freq=1.0), product of:
            0.8593187 = queryWeight, product of:
              1.2962224 = boost
              9.020458 = idf(docFreq=13, maxDocs=42596)
              0.073493026 = queryNorm
            3.3826718 = fieldWeight in 3734, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.020458 = idf(docFreq=13, maxDocs=42596)
              0.375 = fieldNorm(doc=3734)
    
  2. Thelwall, M.; Klitkou, A.; Verbeek, A.; Stuart, D.; Vincent, C.: Policy-relevant Webometrics for individual scientific fields (2010) 3.53
    3.5345566 = sum of:
      3.5345566 = sum of:
        1.1122291 = weight(author_txt:thelwall in 4754) [ClassicSimilarity], result of:
          1.1122291 = score(doc=4754,freq=1.0), product of:
            0.5114406 = queryWeight, product of:
              6.9590354 = idf(docFreq=109, maxDocs=42596)
              0.073493026 = queryNorm
            2.1746986 = fieldWeight in 4754, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.9590354 = idf(docFreq=109, maxDocs=42596)
              0.3125 = fieldNorm(doc=4754)
        2.4223275 = weight(author_txt:stuart in 4754) [ClassicSimilarity], result of:
          2.4223275 = score(doc=4754,freq=1.0), product of:
            0.8593187 = queryWeight, product of:
              1.2962224 = boost
              9.020458 = idf(docFreq=13, maxDocs=42596)
              0.073493026 = queryNorm
            2.8188932 = fieldWeight in 4754, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.020458 = idf(docFreq=13, maxDocs=42596)
              0.3125 = fieldNorm(doc=4754)
    
  3. Thelwall, M.; Goriunova, O.; Vis, F.; Faulkner, S.; Burns, A.; Aulich, J.; Mas-Bleda, A.; Stuart, E.; D'Orazio, F.: Chatting through pictures : a classification of images tweeted in one week in the UK and USA (2016) 2.47
    2.4741898 = sum of:
      2.4741898 = sum of:
        0.7785604 = weight(author_txt:thelwall in 4216) [ClassicSimilarity], result of:
          0.7785604 = score(doc=4216,freq=1.0), product of:
            0.5114406 = queryWeight, product of:
              6.9590354 = idf(docFreq=109, maxDocs=42596)
              0.073493026 = queryNorm
            1.522289 = fieldWeight in 4216, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.9590354 = idf(docFreq=109, maxDocs=42596)
              0.21875 = fieldNorm(doc=4216)
        1.6956292 = weight(author_txt:stuart in 4216) [ClassicSimilarity], result of:
          1.6956292 = score(doc=4216,freq=1.0), product of:
            0.8593187 = queryWeight, product of:
              1.2962224 = boost
              9.020458 = idf(docFreq=13, maxDocs=42596)
              0.073493026 = queryNorm
            1.9732252 = fieldWeight in 4216, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.020458 = idf(docFreq=13, maxDocs=42596)
              0.21875 = fieldNorm(doc=4216)
    
  4. Stuart, D.: Web metrics for library and information professionals (2014) 2.42
    2.4223275 = sum of:
      2.4223275 = product of:
        4.844655 = sum of:
          4.844655 = weight(author_txt:stuart in 3275) [ClassicSimilarity], result of:
            4.844655 = score(doc=3275,freq=1.0), product of:
              0.8593187 = queryWeight, product of:
                1.2962224 = boost
                9.020458 = idf(docFreq=13, maxDocs=42596)
                0.073493026 = queryNorm
              5.6377864 = fieldWeight in 3275, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.020458 = idf(docFreq=13, maxDocs=42596)
                0.625 = fieldNorm(doc=3275)
        0.5 = coord(1/2)
    
  5. Stuart, D.: Practical ontologies for information professionals (2016) 2.42
    2.4223275 = sum of:
      2.4223275 = product of:
        4.844655 = sum of:
          4.844655 = weight(author_txt:stuart in 750) [ClassicSimilarity], result of:
            4.844655 = score(doc=750,freq=1.0), product of:
              0.8593187 = queryWeight, product of:
                1.2962224 = boost
                9.020458 = idf(docFreq=13, maxDocs=42596)
                0.073493026 = queryNorm
              5.6377864 = fieldWeight in 750, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.020458 = idf(docFreq=13, maxDocs=42596)
                0.625 = fieldNorm(doc=750)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Frohmann, B.: Subjectivity and information ethics (2008) 0.15
    0.14767602 = sum of:
      0.14767602 = product of:
        0.9229752 = sum of:
          0.015797958 = weight(abstract_txt:between in 2540) [ClassicSimilarity], result of:
            0.015797958 = score(doc=2540,freq=2.0), product of:
              0.058564153 = queryWeight, product of:
                1.1364892 = boost
                3.4879162 = idf(docFreq=3538, maxDocs=42596)
                0.014774082 = queryNorm
              0.26975474 = fieldWeight in 2540, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4879162 = idf(docFreq=3538, maxDocs=42596)
                0.0546875 = fieldNorm(doc=2540)
          0.06400302 = weight(abstract_txt:privacy in 2540) [ClassicSimilarity], result of:
            0.06400302 = score(doc=2540,freq=2.0), product of:
              0.118128546 = queryWeight, product of:
                1.1413314 = boost
                7.005555 = idf(docFreq=104, maxDocs=42596)
                0.014774082 = queryNorm
              0.54180825 = fieldWeight in 2540, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.005555 = idf(docFreq=104, maxDocs=42596)
                0.0546875 = fieldNorm(doc=2540)
          0.45685342 = weight(title_txt:ethics in 2540) [ClassicSimilarity], result of:
            0.45685342 = score(doc=2540,freq=1.0), product of:
              0.12619084 = queryWeight, product of:
                1.1796367 = boost
                7.240675 = idf(docFreq=82, maxDocs=42596)
                0.014774082 = queryNorm
              3.6203375 = fieldWeight in 2540, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.240675 = idf(docFreq=82, maxDocs=42596)
                0.5 = fieldNorm(doc=2540)
          0.38632074 = weight(abstract_txt:ethical in 2540) [ClassicSimilarity], result of:
            0.38632074 = score(doc=2540,freq=5.0), product of:
              0.45802927 = queryWeight, product of:
                4.4948063 = boost
                6.8973417 = idf(docFreq=116, maxDocs=42596)
                0.014774082 = queryNorm
              0.8434412 = fieldWeight in 2540, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.8973417 = idf(docFreq=116, maxDocs=42596)
                0.0546875 = fieldNorm(doc=2540)
        0.16 = coord(4/25)
    
  2. Danielson, E.S.: Ethics and reference services (1997) 0.13
    0.13425218 = sum of:
      0.13425218 = product of:
        1.1187681 = sum of:
          0.1034445 = weight(abstract_txt:privacy in 483) [ClassicSimilarity], result of:
            0.1034445 = score(doc=483,freq=1.0), product of:
              0.118128546 = queryWeight, product of:
                1.1413314 = boost
                7.005555 = idf(docFreq=104, maxDocs=42596)
                0.014774082 = queryNorm
              0.8756944 = fieldWeight in 483, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.005555 = idf(docFreq=104, maxDocs=42596)
                0.125 = fieldNorm(doc=483)
          0.45685342 = weight(title_txt:ethics in 483) [ClassicSimilarity], result of:
            0.45685342 = score(doc=483,freq=1.0), product of:
              0.12619084 = queryWeight, product of:
                1.1796367 = boost
                7.240675 = idf(docFreq=82, maxDocs=42596)
                0.014774082 = queryNorm
              3.6203375 = fieldWeight in 483, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.240675 = idf(docFreq=82, maxDocs=42596)
                0.5 = fieldNorm(doc=483)
          0.5584702 = weight(abstract_txt:ethical in 483) [ClassicSimilarity], result of:
            0.5584702 = score(doc=483,freq=2.0), product of:
              0.45802927 = queryWeight, product of:
                4.4948063 = boost
                6.8973417 = idf(docFreq=116, maxDocs=42596)
                0.014774082 = queryNorm
              1.2192893 = fieldWeight in 483, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8973417 = idf(docFreq=116, maxDocs=42596)
                0.125 = fieldNorm(doc=483)
        0.12 = coord(3/25)
    
  3. Polat, H.; Du, W.: Privacy-preserving top-N recommendation on distributed data (2008) 0.12
    0.12490883 = sum of:
      0.12490883 = product of:
        0.62454414 = sum of:
          0.046667818 = weight(abstract_txt:legal in 3044) [ClassicSimilarity], result of:
            0.046667818 = score(doc=3044,freq=1.0), product of:
              0.095054984 = queryWeight, product of:
                1.0238158 = boost
                6.2842374 = idf(docFreq=215, maxDocs=42596)
                0.014774082 = queryNorm
              0.49095604 = fieldWeight in 3044, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2842374 = idf(docFreq=215, maxDocs=42596)
                0.078125 = fieldNorm(doc=3044)
          0.05665018 = weight(abstract_txt:conduct in 3044) [ClassicSimilarity], result of:
            0.05665018 = score(doc=3044,freq=1.0), product of:
              0.108167656 = queryWeight, product of:
                1.092152 = boost
                6.7036886 = idf(docFreq=141, maxDocs=42596)
                0.014774082 = queryNorm
              0.5237257 = fieldWeight in 3044, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7036886 = idf(docFreq=141, maxDocs=42596)
                0.078125 = fieldNorm(doc=3044)
          0.015958348 = weight(abstract_txt:between in 3044) [ClassicSimilarity], result of:
            0.015958348 = score(doc=3044,freq=1.0), product of:
              0.058564153 = queryWeight, product of:
                1.1364892 = boost
                3.4879162 = idf(docFreq=3538, maxDocs=42596)
                0.014774082 = queryNorm
              0.27249345 = fieldWeight in 3044, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4879162 = idf(docFreq=3538, maxDocs=42596)
                0.078125 = fieldNorm(doc=3044)
          0.1445681 = weight(abstract_txt:privacy in 3044) [ClassicSimilarity], result of:
            0.1445681 = score(doc=3044,freq=5.0), product of:
              0.118128546 = queryWeight, product of:
                1.1413314 = boost
                7.005555 = idf(docFreq=104, maxDocs=42596)
                0.014774082 = queryNorm
              1.2238202 = fieldWeight in 3044, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.005555 = idf(docFreq=104, maxDocs=42596)
                0.078125 = fieldNorm(doc=3044)
          0.36069968 = weight(abstract_txt:owners in 3044) [ClassicSimilarity], result of:
            0.36069968 = score(doc=3044,freq=1.0), product of:
              0.53592104 = queryWeight, product of:
                4.2106137 = boost
                8.614993 = idf(docFreq=20, maxDocs=42596)
                0.014774082 = queryNorm
              0.67304635 = fieldWeight in 3044, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.614993 = idf(docFreq=20, maxDocs=42596)
                0.078125 = fieldNorm(doc=3044)
        0.2 = coord(5/25)
    
  4. Himma, K.E.: Foundational issues in information ethics (2007) 0.12
    0.12288614 = sum of:
      0.12288614 = product of:
        0.7680384 = sum of:
          0.037334256 = weight(abstract_txt:legal in 3771) [ClassicSimilarity], result of:
            0.037334256 = score(doc=3771,freq=1.0), product of:
              0.095054984 = queryWeight, product of:
                1.0238158 = boost
                6.2842374 = idf(docFreq=215, maxDocs=42596)
                0.014774082 = queryNorm
              0.39276484 = fieldWeight in 3771, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2842374 = idf(docFreq=215, maxDocs=42596)
                0.0625 = fieldNorm(doc=3771)
          0.05172225 = weight(abstract_txt:privacy in 3771) [ClassicSimilarity], result of:
            0.05172225 = score(doc=3771,freq=1.0), product of:
              0.118128546 = queryWeight, product of:
                1.1413314 = boost
                7.005555 = idf(docFreq=104, maxDocs=42596)
                0.014774082 = queryNorm
              0.4378472 = fieldWeight in 3771, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.005555 = idf(docFreq=104, maxDocs=42596)
                0.0625 = fieldNorm(doc=3771)
          0.39974675 = weight(title_txt:ethics in 3771) [ClassicSimilarity], result of:
            0.39974675 = score(doc=3771,freq=1.0), product of:
              0.12619084 = queryWeight, product of:
                1.1796367 = boost
                7.240675 = idf(docFreq=82, maxDocs=42596)
                0.014774082 = queryNorm
              3.1677952 = fieldWeight in 3771, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.240675 = idf(docFreq=82, maxDocs=42596)
                0.4375 = fieldNorm(doc=3771)
          0.2792351 = weight(abstract_txt:ethical in 3771) [ClassicSimilarity], result of:
            0.2792351 = score(doc=3771,freq=2.0), product of:
              0.45802927 = queryWeight, product of:
                4.4948063 = boost
                6.8973417 = idf(docFreq=116, maxDocs=42596)
                0.014774082 = queryNorm
              0.60964465 = fieldWeight in 3771, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8973417 = idf(docFreq=116, maxDocs=42596)
                0.0625 = fieldNorm(doc=3771)
        0.16 = coord(4/25)
    
  5. Fernández-Molina, J.C.; Chaves Guimaraes, J.A.: Ethical aspects of knowledge organization and representation in the digital environment : their articulation in professional codes of ethics (2003) 0.12
    0.11978657 = sum of:
      0.11978657 = product of:
        0.74866605 = sum of:
          0.05665018 = weight(abstract_txt:conduct in 3766) [ClassicSimilarity], result of:
            0.05665018 = score(doc=3766,freq=1.0), product of:
              0.108167656 = queryWeight, product of:
                1.092152 = boost
                6.7036886 = idf(docFreq=141, maxDocs=42596)
                0.014774082 = queryNorm
              0.5237257 = fieldWeight in 3766, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7036886 = idf(docFreq=141, maxDocs=42596)
                0.078125 = fieldNorm(doc=3766)
          0.064652815 = weight(abstract_txt:privacy in 3766) [ClassicSimilarity], result of:
            0.064652815 = score(doc=3766,freq=1.0), product of:
              0.118128546 = queryWeight, product of:
                1.1413314 = boost
                7.005555 = idf(docFreq=104, maxDocs=42596)
                0.014774082 = queryNorm
              0.547309 = fieldWeight in 3766, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.005555 = idf(docFreq=104, maxDocs=42596)
                0.078125 = fieldNorm(doc=3766)
          0.19987337 = weight(title_txt:ethics in 3766) [ClassicSimilarity], result of:
            0.19987337 = score(doc=3766,freq=1.0), product of:
              0.12619084 = queryWeight, product of:
                1.1796367 = boost
                7.240675 = idf(docFreq=82, maxDocs=42596)
                0.014774082 = queryNorm
              1.5838976 = fieldWeight in 3766, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.240675 = idf(docFreq=82, maxDocs=42596)
                0.21875 = fieldNorm(doc=3766)
          0.4274897 = weight(abstract_txt:ethical in 3766) [ClassicSimilarity], result of:
            0.4274897 = score(doc=3766,freq=3.0), product of:
              0.45802927 = queryWeight, product of:
                4.4948063 = boost
                6.8973417 = idf(docFreq=116, maxDocs=42596)
                0.014774082 = queryNorm
              0.933324 = fieldWeight in 3766, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.8973417 = idf(docFreq=116, maxDocs=42596)
                0.078125 = fieldNorm(doc=3766)
        0.16 = coord(4/25)