Document (#29922)

Author
Calado, P.
Cristo, M.
Gonçalves, M.A.
Moura, E.S. de
Ribeiro-Neto, B.
Ziviani, N.
Title
Link-based similarity measures for the classification of Web documents
Source
Journal of the American Society for Information Science and Technology. 57(2006) no.2, S.208-221
Year
2006
Abstract
Traditional text-based document classifiers tend to perform poorly an the Web. Text in Web documents is usually noisy and often does not contain enough information to determine their topic. However, the Web provides a different source that can be useful to document classification: its hyperlink structure. In this work, the authors evaluate how the link structure of the Web can be used to determine a measure of similarity appropriate for document classification. They experiment with five different similarity measures and determine their adequacy for predicting the topic of a Web page. Tests performed an a Web directory Show that link information alone allows classifying documents with an average precision of 86%. Further, when combined with a traditional textbased classifier, precision increases to values of up to 90%, representing gains that range from 63 to 132% over the use of text-based classification alone. Because the measures proposed in this article are straightforward to compute, they provide a practical and effective solution for Web classification and related information retrieval tasks. Further, the authors provide an important set of guidelines an how link structure can be used effectively to classify Web documents.
Theme
Automatisches Klassifizieren

Similar documents (author)

  1. Couto, T.; Cristo, M.; Gonçalves, M.A.; Calado, P.; Ziviani, N.; Moura, E.; Ribeiro-Neto, B.: ¬A comparative study of citations and links in document classification (2006) 5.62
    5.6152787 = sum of:
      5.6152787 = sum of:
        0.7548182 = weight(author_txt:gonçalves in 2531) [ClassicSimilarity], result of:
          0.7548182 = score(doc=2531,freq=1.0), product of:
            0.3526614 = queryWeight, product of:
              8.561393 = idf(docFreq=22, maxDocs=44218)
              0.04119206 = queryNorm
            2.1403482 = fieldWeight in 2531, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.561393 = idf(docFreq=22, maxDocs=44218)
              0.25 = fieldNorm(doc=2531)
        0.79239136 = weight(author_txt:moura in 2531) [ClassicSimilarity], result of:
          0.79239136 = score(doc=2531,freq=1.0), product of:
            0.36426952 = queryWeight, product of:
              1.0163246 = boost
              8.701155 = idf(docFreq=19, maxDocs=44218)
              0.04119206 = queryNorm
            2.1752887 = fieldWeight in 2531, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.701155 = idf(docFreq=19, maxDocs=44218)
              0.25 = fieldNorm(doc=2531)
        0.80648756 = weight(author_txt:ribeiro in 2531) [ClassicSimilarity], result of:
          0.80648756 = score(doc=2531,freq=1.0), product of:
            0.3685769 = queryWeight, product of:
              1.0223159 = boost
              8.752448 = idf(docFreq=18, maxDocs=44218)
              0.04119206 = queryNorm
            2.188112 = fieldWeight in 2531, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.752448 = idf(docFreq=18, maxDocs=44218)
              0.25 = fieldNorm(doc=2531)
        1.0311779 = weight(author_txt:neto in 2531) [ClassicSimilarity], result of:
          1.0311779 = score(doc=2531,freq=1.0), product of:
            0.4341956 = queryWeight, product of:
              1.1095932 = boost
              9.499662 = idf(docFreq=8, maxDocs=44218)
              0.04119206 = queryNorm
            2.3749156 = fieldWeight in 2531, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.499662 = idf(docFreq=8, maxDocs=44218)
              0.25 = fieldNorm(doc=2531)
        1.1152021 = weight(author_txt:cristo in 2531) [ClassicSimilarity], result of:
          1.1152021 = score(doc=2531,freq=1.0), product of:
            0.45747292 = queryWeight, product of:
              1.1389476 = boost
              9.7509775 = idf(docFreq=6, maxDocs=44218)
              0.04119206 = queryNorm
            2.4377444 = fieldWeight in 2531, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.7509775 = idf(docFreq=6, maxDocs=44218)
              0.25 = fieldNorm(doc=2531)
        1.1152021 = weight(author_txt:ziviani in 2531) [ClassicSimilarity], result of:
          1.1152021 = score(doc=2531,freq=1.0), product of:
            0.45747292 = queryWeight, product of:
              1.1389476 = boost
              9.7509775 = idf(docFreq=6, maxDocs=44218)
              0.04119206 = queryNorm
            2.4377444 = fieldWeight in 2531, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.7509775 = idf(docFreq=6, maxDocs=44218)
              0.25 = fieldNorm(doc=2531)
    
  2. Pereira, D.A.; Ribeiro-Neto, B.; Ziviani, N.; Laender, A.H.F.; Gonçalves, M.A.: ¬A generic Web-based entity resolution framework (2011) 2.47
    2.4717903 = sum of:
      2.4717903 = product of:
        3.7076855 = sum of:
          0.7548182 = weight(author_txt:gonçalves in 4450) [ClassicSimilarity], result of:
            0.7548182 = score(doc=4450,freq=1.0), product of:
              0.3526614 = queryWeight, product of:
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.04119206 = queryNorm
              2.1403482 = fieldWeight in 4450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.25 = fieldNorm(doc=4450)
          0.80648756 = weight(author_txt:ribeiro in 4450) [ClassicSimilarity], result of:
            0.80648756 = score(doc=4450,freq=1.0), product of:
              0.3685769 = queryWeight, product of:
                1.0223159 = boost
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.04119206 = queryNorm
              2.188112 = fieldWeight in 4450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.25 = fieldNorm(doc=4450)
          1.0311779 = weight(author_txt:neto in 4450) [ClassicSimilarity], result of:
            1.0311779 = score(doc=4450,freq=1.0), product of:
              0.4341956 = queryWeight, product of:
                1.1095932 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.04119206 = queryNorm
              2.3749156 = fieldWeight in 4450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.25 = fieldNorm(doc=4450)
          1.1152021 = weight(author_txt:ziviani in 4450) [ClassicSimilarity], result of:
            1.1152021 = score(doc=4450,freq=1.0), product of:
              0.45747292 = queryWeight, product of:
                1.1389476 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.04119206 = queryNorm
              2.4377444 = fieldWeight in 4450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.25 = fieldNorm(doc=4450)
        0.6666667 = coord(4/6)
    
  3. Moura, E.S. de; Fernandes, D.; Ribeiro-Neto, B.; Silva, A.S. da; Gonçalves, M.A.: Using structural information to improve search in Web collections (2010) 2.26
    2.2565832 = sum of:
      2.2565832 = product of:
        3.3848748 = sum of:
          0.7548182 = weight(author_txt:gonçalves in 4119) [ClassicSimilarity], result of:
            0.7548182 = score(doc=4119,freq=1.0), product of:
              0.3526614 = queryWeight, product of:
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.04119206 = queryNorm
              2.1403482 = fieldWeight in 4119, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.25 = fieldNorm(doc=4119)
          0.79239136 = weight(author_txt:moura in 4119) [ClassicSimilarity], result of:
            0.79239136 = score(doc=4119,freq=1.0), product of:
              0.36426952 = queryWeight, product of:
                1.0163246 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.04119206 = queryNorm
              2.1752887 = fieldWeight in 4119, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.25 = fieldNorm(doc=4119)
          0.80648756 = weight(author_txt:ribeiro in 4119) [ClassicSimilarity], result of:
            0.80648756 = score(doc=4119,freq=1.0), product of:
              0.3685769 = queryWeight, product of:
                1.0223159 = boost
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.04119206 = queryNorm
              2.188112 = fieldWeight in 4119, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.25 = fieldNorm(doc=4119)
          1.0311779 = weight(author_txt:neto in 4119) [ClassicSimilarity], result of:
            1.0311779 = score(doc=4119,freq=1.0), product of:
              0.4341956 = queryWeight, product of:
                1.1095932 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.04119206 = queryNorm
              2.3749156 = fieldWeight in 4119, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.25 = fieldNorm(doc=4119)
        0.6666667 = coord(4/6)
    
  4. Silva, A.J.C.; Gonçalves, M.A.; Laender, A.H.F.; Modesto, M.A.B.; Cristo, M.; Ziviani, N.: Finding what is missing from a digital library : a case study in the computer science field (2009) 1.49
    1.4926112 = sum of:
      1.4926112 = product of:
        2.9852223 = sum of:
          0.7548182 = weight(author_txt:gonçalves in 4219) [ClassicSimilarity], result of:
            0.7548182 = score(doc=4219,freq=1.0), product of:
              0.3526614 = queryWeight, product of:
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.04119206 = queryNorm
              2.1403482 = fieldWeight in 4219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.25 = fieldNorm(doc=4219)
          1.1152021 = weight(author_txt:cristo in 4219) [ClassicSimilarity], result of:
            1.1152021 = score(doc=4219,freq=1.0), product of:
              0.45747292 = queryWeight, product of:
                1.1389476 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.04119206 = queryNorm
              2.4377444 = fieldWeight in 4219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.25 = fieldNorm(doc=4219)
          1.1152021 = weight(author_txt:ziviani in 4219) [ClassicSimilarity], result of:
            1.1152021 = score(doc=4219,freq=1.0), product of:
              0.45747292 = queryWeight, product of:
                1.1389476 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.04119206 = queryNorm
              2.4377444 = fieldWeight in 4219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.25 = fieldNorm(doc=4219)
        0.5 = coord(3/6)
    
  5. Silveira, M.; Ribeiro-Neto, B.: Concept-based ranking : a case study in the juridical domain (2004) 1.07
    1.0719715 = sum of:
      1.0719715 = product of:
        3.2159145 = sum of:
          1.4113532 = weight(author_txt:ribeiro in 2339) [ClassicSimilarity], result of:
            1.4113532 = score(doc=2339,freq=1.0), product of:
              0.3685769 = queryWeight, product of:
                1.0223159 = boost
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.04119206 = queryNorm
              3.829196 = fieldWeight in 2339, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.4375 = fieldNorm(doc=2339)
          1.8045613 = weight(author_txt:neto in 2339) [ClassicSimilarity], result of:
            1.8045613 = score(doc=2339,freq=1.0), product of:
              0.4341956 = queryWeight, product of:
                1.1095932 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.04119206 = queryNorm
              4.156102 = fieldWeight in 2339, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.4375 = fieldNorm(doc=2339)
        0.33333334 = coord(2/6)
    

Similar documents (content)

  1. Couto, T.; Cristo, M.; Gonçalves, M.A.; Calado, P.; Ziviani, N.; Moura, E.; Ribeiro-Neto, B.: ¬A comparative study of citations and links in document classification (2006) 0.34
    0.34389755 = sum of:
      0.34389755 = product of:
        0.85974383 = sum of:
          0.11168065 = weight(abstract_txt:classifiers in 2531) [ClassicSimilarity], result of:
            0.11168065 = score(doc=2531,freq=2.0), product of:
              0.16796575 = queryWeight, product of:
                1.0020337 = boost
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.022283131 = queryNorm
              0.6649013 = fieldWeight in 2531, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.0625 = fieldNorm(doc=2531)
          0.12887228 = weight(abstract_txt:gains in 2531) [ClassicSimilarity], result of:
            0.12887228 = score(doc=2531,freq=2.0), product of:
              0.18478857 = queryWeight, product of:
                1.0510164 = boost
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.022283131 = queryNorm
              0.6974039 = fieldWeight in 2531, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.0625 = fieldNorm(doc=2531)
          0.037415687 = weight(abstract_txt:traditional in 2531) [ClassicSimilarity], result of:
            0.037415687 = score(doc=2531,freq=1.0), product of:
              0.12861489 = queryWeight, product of:
                1.2400311 = boost
                4.654601 = idf(docFreq=1143, maxDocs=44218)
                0.022283131 = queryNorm
              0.29091257 = fieldWeight in 2531, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.654601 = idf(docFreq=1143, maxDocs=44218)
                0.0625 = fieldNorm(doc=2531)
          0.037821025 = weight(abstract_txt:further in 2531) [ClassicSimilarity], result of:
            0.037821025 = score(doc=2531,freq=1.0), product of:
              0.12954211 = queryWeight, product of:
                1.2444929 = boost
                4.671349 = idf(docFreq=1124, maxDocs=44218)
                0.022283131 = queryNorm
              0.29195932 = fieldWeight in 2531, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.671349 = idf(docFreq=1124, maxDocs=44218)
                0.0625 = fieldNorm(doc=2531)
          0.047705892 = weight(abstract_txt:based in 2531) [ClassicSimilarity], result of:
            0.047705892 = score(doc=2531,freq=7.0), product of:
              0.09049708 = queryWeight, product of:
                1.2739426 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.022283131 = queryNorm
              0.52715397 = fieldWeight in 2531, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=2531)
          0.06374538 = weight(abstract_txt:text in 2531) [ClassicSimilarity], result of:
            0.06374538 = score(doc=2531,freq=3.0), product of:
              0.14561673 = queryWeight, product of:
                1.6159883 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.022283131 = queryNorm
              0.4377614 = fieldWeight in 2531, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=2531)
          0.12638807 = weight(abstract_txt:measures in 2531) [ClassicSimilarity], result of:
            0.12638807 = score(doc=2531,freq=2.0), product of:
              0.2630752 = queryWeight, product of:
                2.1720636 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.022283131 = queryNorm
              0.48042563 = fieldWeight in 2531, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.0625 = fieldNorm(doc=2531)
          0.05194454 = weight(abstract_txt:documents in 2531) [ClassicSimilarity], result of:
            0.05194454 = score(doc=2531,freq=1.0), product of:
              0.2016626 = queryWeight, product of:
                2.1959105 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.022283131 = queryNorm
              0.2575814 = fieldWeight in 2531, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=2531)
          0.059012298 = weight(abstract_txt:classification in 2531) [ClassicSimilarity], result of:
            0.059012298 = score(doc=2531,freq=1.0), product of:
              0.23651777 = queryWeight, product of:
                2.6588194 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.022283131 = queryNorm
              0.2495047 = fieldWeight in 2531, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=2531)
          0.19515797 = weight(abstract_txt:link in 2531) [ClassicSimilarity], result of:
            0.19515797 = score(doc=2531,freq=2.0), product of:
              0.38682362 = queryWeight, product of:
                3.041294 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.022283131 = queryNorm
              0.5045141 = fieldWeight in 2531, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=2531)
        0.4 = coord(10/25)
    
  2. Yang, P.; Gao, W.; Tan, Q.; Wong, K.-F.: ¬A link-bridged topic model for cross-domain document classification (2013) 0.31
    0.30683562 = sum of:
      0.30683562 = product of:
        0.85232115 = sum of:
          0.091126464 = weight(abstract_txt:hyperlink in 2706) [ClassicSimilarity], result of:
            0.091126464 = score(doc=2706,freq=1.0), product of:
              0.18478857 = queryWeight, product of:
                1.0510164 = boost
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.022283131 = queryNorm
              0.49313906 = fieldWeight in 2706, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.0625 = fieldNorm(doc=2706)
          0.018031133 = weight(abstract_txt:based in 2706) [ClassicSimilarity], result of:
            0.018031133 = score(doc=2706,freq=1.0), product of:
              0.09049708 = queryWeight, product of:
                1.2739426 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.022283131 = queryNorm
              0.19924548 = fieldWeight in 2706, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=2706)
          0.08336787 = weight(abstract_txt:topic in 2706) [ClassicSimilarity], result of:
            0.08336787 = score(doc=2706,freq=3.0), product of:
              0.15212975 = queryWeight, product of:
                1.3486338 = boost
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.022283131 = queryNorm
              0.54800504 = fieldWeight in 2706, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.0625 = fieldNorm(doc=2706)
          0.036803413 = weight(abstract_txt:text in 2706) [ClassicSimilarity], result of:
            0.036803413 = score(doc=2706,freq=1.0), product of:
              0.14561673 = queryWeight, product of:
                1.6159883 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.022283131 = queryNorm
              0.25274166 = fieldWeight in 2706, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=2706)
          0.062255304 = weight(abstract_txt:document in 2706) [ClassicSimilarity], result of:
            0.062255304 = score(doc=2706,freq=2.0), product of:
              0.16408168 = queryWeight, product of:
                1.715389 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.022283131 = queryNorm
              0.37941656 = fieldWeight in 2706, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=2706)
          0.04606373 = weight(abstract_txt:structure in 2706) [ClassicSimilarity], result of:
            0.04606373 = score(doc=2706,freq=1.0), product of:
              0.1691188 = queryWeight, product of:
                1.7415202 = boost
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.022283131 = queryNorm
              0.27237496 = fieldWeight in 2706, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.0625 = fieldNorm(doc=2706)
          0.10388908 = weight(abstract_txt:documents in 2706) [ClassicSimilarity], result of:
            0.10388908 = score(doc=2706,freq=4.0), product of:
              0.2016626 = queryWeight, product of:
                2.1959105 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.022283131 = queryNorm
              0.5151628 = fieldWeight in 2706, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=2706)
          0.102212295 = weight(abstract_txt:classification in 2706) [ClassicSimilarity], result of:
            0.102212295 = score(doc=2706,freq=3.0), product of:
              0.23651777 = queryWeight, product of:
                2.6588194 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.022283131 = queryNorm
              0.4321548 = fieldWeight in 2706, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=2706)
          0.30857188 = weight(abstract_txt:link in 2706) [ClassicSimilarity], result of:
            0.30857188 = score(doc=2706,freq=5.0), product of:
              0.38682362 = queryWeight, product of:
                3.041294 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.022283131 = queryNorm
              0.7977069 = fieldWeight in 2706, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=2706)
        0.36 = coord(9/25)
    
  3. Sun, A.; Lim, E.-P.; Ng, W.-K.: Performance measurement framework for hierarchical text classification (2003) 0.27
    0.27105132 = sum of:
      0.27105132 = product of:
        0.84703535 = sum of:
          0.17658262 = weight(abstract_txt:classifiers in 1808) [ClassicSimilarity], result of:
            0.17658262 = score(doc=1808,freq=5.0), product of:
              0.16796575 = queryWeight, product of:
                1.0020337 = boost
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.022283131 = queryNorm
              1.0513014 = fieldWeight in 1808, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.025499873 = weight(abstract_txt:based in 1808) [ClassicSimilarity], result of:
            0.025499873 = score(doc=1808,freq=2.0), product of:
              0.09049708 = queryWeight, product of:
                1.2739426 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.022283131 = queryNorm
              0.28177565 = fieldWeight in 1808, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.036803413 = weight(abstract_txt:text in 1808) [ClassicSimilarity], result of:
            0.036803413 = score(doc=1808,freq=1.0), product of:
              0.14561673 = queryWeight, product of:
                1.6159883 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.022283131 = queryNorm
              0.25274166 = fieldWeight in 1808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.04402115 = weight(abstract_txt:document in 1808) [ClassicSimilarity], result of:
            0.04402115 = score(doc=1808,freq=1.0), product of:
              0.16408168 = queryWeight, product of:
                1.715389 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.022283131 = queryNorm
              0.26828802 = fieldWeight in 1808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.23645043 = weight(abstract_txt:measures in 1808) [ClassicSimilarity], result of:
            0.23645043 = score(doc=1808,freq=7.0), product of:
              0.2630752 = queryWeight, product of:
                2.1720636 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.022283131 = queryNorm
              0.89879405 = fieldWeight in 1808, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.07346067 = weight(abstract_txt:documents in 1808) [ClassicSimilarity], result of:
            0.07346067 = score(doc=1808,freq=2.0), product of:
              0.2016626 = queryWeight, product of:
                2.1959105 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.022283131 = queryNorm
              0.36427513 = fieldWeight in 1808, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.10966716 = weight(abstract_txt:similarity in 1808) [ClassicSimilarity], result of:
            0.10966716 = score(doc=1808,freq=1.0), product of:
              0.30153444 = queryWeight, product of:
                2.325418 = boost
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.022283131 = queryNorm
              0.36369696 = fieldWeight in 1808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.14455001 = weight(abstract_txt:classification in 1808) [ClassicSimilarity], result of:
            0.14455001 = score(doc=1808,freq=6.0), product of:
              0.23651777 = queryWeight, product of:
                2.6588194 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.022283131 = queryNorm
              0.6111592 = fieldWeight in 1808, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
        0.32 = coord(8/25)
    
  4. Haveliwala, T.: Context-Sensitive Web search (2005) 0.22
    0.21890867 = sum of:
      0.21890867 = product of:
        0.6080796 = sum of:
          0.03647848 = weight(abstract_txt:provide in 2567) [ClassicSimilarity], result of:
            0.03647848 = score(doc=2567,freq=3.0), product of:
              0.09584455 = queryWeight, product of:
                1.0704606 = boost
                4.0180984 = idf(docFreq=2161, maxDocs=44218)
                0.022283131 = queryNorm
              0.38060042 = fieldWeight in 2567, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0180984 = idf(docFreq=2161, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2567)
          0.032610174 = weight(abstract_txt:authors in 2567) [ClassicSimilarity], result of:
            0.032610174 = score(doc=2567,freq=1.0), product of:
              0.12827799 = queryWeight, product of:
                1.238406 = boost
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.022283131 = queryNorm
              0.25421488 = fieldWeight in 2567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2567)
          0.056705136 = weight(abstract_txt:traditional in 2567) [ClassicSimilarity], result of:
            0.056705136 = score(doc=2567,freq=3.0), product of:
              0.12861489 = queryWeight, product of:
                1.2400311 = boost
                4.654601 = idf(docFreq=1143, maxDocs=44218)
                0.022283131 = queryNorm
              0.4408909 = fieldWeight in 2567, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.654601 = idf(docFreq=1143, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2567)
          0.015777241 = weight(abstract_txt:based in 2567) [ClassicSimilarity], result of:
            0.015777241 = score(doc=2567,freq=1.0), product of:
              0.09049708 = queryWeight, product of:
                1.2739426 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.022283131 = queryNorm
              0.1743398 = fieldWeight in 2567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2567)
          0.057000954 = weight(abstract_txt:structure in 2567) [ClassicSimilarity], result of:
            0.057000954 = score(doc=2567,freq=2.0), product of:
              0.1691188 = queryWeight, product of:
                1.7415202 = boost
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.022283131 = queryNorm
              0.3370468 = fieldWeight in 2567, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2567)
          0.097334184 = weight(abstract_txt:alone in 2567) [ClassicSimilarity], result of:
            0.097334184 = score(doc=2567,freq=1.0), product of:
              0.2659257 = queryWeight, product of:
                1.7830647 = boost
                6.6929407 = idf(docFreq=148, maxDocs=44218)
                0.022283131 = queryNorm
              0.3660202 = fieldWeight in 2567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6929407 = idf(docFreq=148, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2567)
          0.04545147 = weight(abstract_txt:documents in 2567) [ClassicSimilarity], result of:
            0.04545147 = score(doc=2567,freq=1.0), product of:
              0.2016626 = queryWeight, product of:
                2.1959105 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.022283131 = queryNorm
              0.22538373 = fieldWeight in 2567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2567)
          0.09595876 = weight(abstract_txt:similarity in 2567) [ClassicSimilarity], result of:
            0.09595876 = score(doc=2567,freq=1.0), product of:
              0.30153444 = queryWeight, product of:
                2.325418 = boost
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.022283131 = queryNorm
              0.31823483 = fieldWeight in 2567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2567)
          0.17076322 = weight(abstract_txt:link in 2567) [ClassicSimilarity], result of:
            0.17076322 = score(doc=2567,freq=2.0), product of:
              0.38682362 = queryWeight, product of:
                3.041294 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.022283131 = queryNorm
              0.44144982 = fieldWeight in 2567, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2567)
        0.36 = coord(9/25)
    
  5. Addison, E.R.; Nelson, P.E.: Intelligent hypertext (1992) 0.22
    0.21566221 = sum of:
      0.21566221 = product of:
        0.6739444 = sum of:
          0.05590316 = weight(abstract_txt:authors in 2026) [ClassicSimilarity], result of:
            0.05590316 = score(doc=2026,freq=1.0), product of:
              0.12827799 = queryWeight, product of:
                1.238406 = boost
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.022283131 = queryNorm
              0.43579698 = fieldWeight in 2026, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.09375 = fieldNorm(doc=2026)
          0.056123532 = weight(abstract_txt:traditional in 2026) [ClassicSimilarity], result of:
            0.056123532 = score(doc=2026,freq=1.0), product of:
              0.12861489 = queryWeight, product of:
                1.2400311 = boost
                4.654601 = idf(docFreq=1143, maxDocs=44218)
                0.022283131 = queryNorm
              0.43636885 = fieldWeight in 2026, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.654601 = idf(docFreq=1143, maxDocs=44218)
                0.09375 = fieldNorm(doc=2026)
          0.027046703 = weight(abstract_txt:based in 2026) [ClassicSimilarity], result of:
            0.027046703 = score(doc=2026,freq=1.0), product of:
              0.09049708 = queryWeight, product of:
                1.2739426 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.022283131 = queryNorm
              0.29886824 = fieldWeight in 2026, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.09375 = fieldNorm(doc=2026)
          0.055205118 = weight(abstract_txt:text in 2026) [ClassicSimilarity], result of:
            0.055205118 = score(doc=2026,freq=1.0), product of:
              0.14561673 = queryWeight, product of:
                1.6159883 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.022283131 = queryNorm
              0.37911248 = fieldWeight in 2026, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.09375 = fieldNorm(doc=2026)
          0.093382955 = weight(abstract_txt:document in 2026) [ClassicSimilarity], result of:
            0.093382955 = score(doc=2026,freq=2.0), product of:
              0.16408168 = queryWeight, product of:
                1.715389 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.022283131 = queryNorm
              0.5691248 = fieldWeight in 2026, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.09375 = fieldNorm(doc=2026)
          0.06909559 = weight(abstract_txt:structure in 2026) [ClassicSimilarity], result of:
            0.06909559 = score(doc=2026,freq=1.0), product of:
              0.1691188 = queryWeight, product of:
                1.7415202 = boost
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.022283131 = queryNorm
              0.40856242 = fieldWeight in 2026, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.09375 = fieldNorm(doc=2026)
          0.11019101 = weight(abstract_txt:documents in 2026) [ClassicSimilarity], result of:
            0.11019101 = score(doc=2026,freq=2.0), product of:
              0.2016626 = queryWeight, product of:
                2.1959105 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.022283131 = queryNorm
              0.5464127 = fieldWeight in 2026, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.09375 = fieldNorm(doc=2026)
          0.2069963 = weight(abstract_txt:link in 2026) [ClassicSimilarity], result of:
            0.2069963 = score(doc=2026,freq=1.0), product of:
              0.38682362 = queryWeight, product of:
                3.041294 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.022283131 = queryNorm
              0.53511804 = fieldWeight in 2026, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.09375 = fieldNorm(doc=2026)
        0.32 = coord(8/25)