Search (1 result, page 1 of 1)

  • author_ss:"Gokhale, S.S."
  • theme_ss:"Data Mining"
  • theme_ss:"Internet"
  1. Doran, D.; Gokhale, S.S.: A classification framework for web robots (2012) 0.03
    0.031546462 = product of:
      0.15773231 = sum of:
        0.053868517 = weight(_text_:web in 505) [ClassicSimilarity], result of:
          0.053868517 = score(doc=505,freq=8.0), product of:
            0.0933738 = queryWeight, product of:
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.028611459 = queryNorm
            0.5769126 = fieldWeight in 505, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.0625 = fieldNorm(doc=505)
        0.10386378 = weight(_text_:log in 505) [ClassicSimilarity], result of:
          0.10386378 = score(doc=505,freq=2.0), product of:
            0.18335998 = queryWeight, product of:
              6.4086204 = idf(docFreq=197, maxDocs=44218)
              0.028611459 = queryNorm
            0.5664474 = fieldWeight in 505, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              6.4086204 = idf(docFreq=197, maxDocs=44218)
              0.0625 = fieldNorm(doc=505)
      0.2 = coord(2/10)
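
The score tree above can be reproduced by hand. The following is a minimal sketch, assuming the usual ClassicSimilarity formulas: tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = idf × queryNorm, and fieldWeight = tf × idf × fieldNorm. The idf form is inferred from the values listed, not read from the index configuration. The function and constant names are illustrative only; queryNorm, fieldNorm, and the coord factor are copied from the explanation rather than derived.

```python
import math

# Values copied from the explain output above; not derived here.
QUERY_NORM = 0.028611459
FIELD_NORM = 0.0625
MAX_DOCS = 44218


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency in the form that matches the listed values."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int) -> float:
    """Score contribution of one query term: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq, MAX_DOCS)
    tf = math.sqrt(freq)                      # tf(freq) = sqrt(termFreq)
    query_weight = idf * QUERY_NORM
    field_weight = tf * idf * FIELD_NORM
    return query_weight * field_weight


web_score = term_score(freq=8.0, doc_freq=4597)   # term "web"
log_score = term_score(freq=2.0, doc_freq=197)    # term "log"

coord = 2 / 10                                    # 2 of 10 query terms matched
total = (web_score + log_score) * coord

print(f"web={web_score:.9f} log={log_score:.9f} total={total:.9f}")
```

Comparing the printed values with the tree shows where the final 0.03 comes from: the rarer term "log" contributes roughly twice as much as "web" despite occurring less often, and the coord factor then scales the sum down because only two of the ten query terms matched.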
    
    Abstract
    The behavior of modern web robots varies widely when they crawl for different purposes. In this article, we present a framework to classify these web robots from two orthogonal perspectives, namely, their functionality and the types of resources they consume. Applying the classification framework to a year-long access log from the UConn SoE web server, we present trends that point to significant differences in their crawling behavior.