Li, L.; Shang, Y.; Zhang, W.: Improvement of HITS-based algorithms on Web documents
- Abstract
- In this paper, we present two ways to improve the precision of HITS-based algorithms on Web documents. First, by analyzing the limitations of current HITS-based algorithms, we propose a new weighted HITS-based method that assigns appropriate weights to in-links of root documents. Then, we combine content analysis with HITS-based algorithms and study the effects of four representative relevance scoring methods, VSM, Okapi, TLS, and CDR, using a set of broad topic queries. Our experimental results show that our weighted HITS-based method performs significantly better than Bharat's improved HITS algorithm. When we combine our weighted HITS-based method or Bharat's HITS algorithm with any of the four relevance scoring methods, the combined methods are only marginally better than our weighted HITS-based method. Among the four relevance scoring methods, there is no significant quality difference when they are combined with a HITS-based algorithm.
- Content
- See: http://delab.csd.auth.gr/~dimitris/courses/ir_spring06/page_rank_computing/p527-li.pdf. See also: http://www2002.org/CDROM/refereed/643/.
- Source
- WWW '02: Proceedings of the 11th International Conference on World Wide Web, May 7-11, 2002, Honolulu, Hawaii, USA