Document (#32609)

Author
Liu, Y.
Zhang, M.
Cen, R.
Ru, L.
Ma, S.
Title
Data cleansing for Web information retrieval using query independent features
Source
Journal of the American Society for Information Science and Technology. 58(2007) no.12, pp. 1884-1898
Year
2007
Abstract
Understanding what kinds of Web pages are the most useful for Web search engine users is a critical task in Web information retrieval (IR). Most previous work used hyperlink analysis algorithms to solve this problem. However, little research has focused on query-independent Web data cleansing for Web IR. In this paper, we first provide analysis of the differences between retrieval target pages and ordinary ones based on more than 30 million Web pages obtained from both the Text Retrieval Conference (TREC) and a widely used Chinese search engine, SOGOU (www.sogou.com). We further propose a learning-based data cleansing algorithm for reducing Web pages that are unlikely to be useful for user requests. We found that there exists a large proportion of low-quality Web pages in both the English and the Chinese Web page corpus, and retrieval target pages can be identified using query-independent features and cleansing algorithms. The experimental results showed that our algorithm is effective in reducing a large portion of Web pages with a small loss in retrieval target pages. This makes it possible for Web IR tools to meet a large fraction of users' needs with only a small part of pages on the Web. These results may help Web search engines make better use of their limited storage and computation resources to improve search performance.
Footnote
Contribution to the special topic section "Mining Web resources for enhancing information retrieval"
Theme
Data Mining
Search engines
Object
WWW

Similar documents (author)

  1. Zhang, M.; Zhang, Y.: Professional organizations in Twittersphere : an empirical study of U.S. library and information science professional organizations-related Tweets (2020) 4.60
    4.603307 = sum of:
      4.603307 = weight(author_txt:zhang in 776) [ClassicSimilarity], result of:
        4.603307 = fieldWeight in 776, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          6.510059 = idf(docFreq=174, maxDocs=43254)
          0.5 = fieldNorm(doc=776)
    
  2. Zhang, Y.; Zhang, C.: Enhancing keyphrase extraction from microblogs using human reading time (2021) 4.60
    4.603307 = sum of:
      4.603307 = weight(author_txt:zhang in 1239) [ClassicSimilarity], result of:
        4.603307 = fieldWeight in 1239, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          6.510059 = idf(docFreq=174, maxDocs=43254)
          0.5 = fieldNorm(doc=1239)
    
  3. Zhang, J.: TOFIR: A tool of facilitating information retrieval : introduce a visual retrieval model (2001) 4.07
    4.0687866 = sum of:
      4.0687866 = weight(author_txt:zhang in 711) [ClassicSimilarity], result of:
        4.0687866 = fieldWeight in 711, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.510059 = idf(docFreq=174, maxDocs=43254)
          0.625 = fieldNorm(doc=711)
    
  4. Zhang, A.: Multimedia file formats on the Internet : a beginner's guide for PC users (1995) 4.07
    4.0687866 = sum of:
      4.0687866 = weight(author_txt:zhang in 4281) [ClassicSimilarity], result of:
        4.0687866 = fieldWeight in 4281, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.510059 = idf(docFreq=174, maxDocs=43254)
          0.625 = fieldNorm(doc=4281)
    
  5. Zhang, J.: A representational analysis of relational information displays (1996) 4.07
    4.0687866 = sum of:
      4.0687866 = weight(author_txt:zhang in 472) [ClassicSimilarity], result of:
        4.0687866 = fieldWeight in 472, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.510059 = idf(docFreq=174, maxDocs=43254)
          0.625 = fieldNorm(doc=472)
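
The author-match scores above can be reproduced from the factors printed in each explain tree. The sketch below assumes the ClassicSimilarity conventions that are consistent with the printed numbers: tf is the square root of the term frequency, and idf is 1 + ln(maxDocs / (docFreq + 1)). The function names are illustrative and are not part of Lucene's API; fieldNorm is taken as printed rather than recomputed.

```python
import math


def tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency (assumed form, matches the printed 6.510059)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as shown in the explain output."""
    return tf(freq) * idf(doc_freq, max_docs) * field_norm


# Entry 1 (doc 776): "zhang" occurs twice in author_txt, fieldNorm 0.5.
two_author_match = field_weight(2.0, 174, 43254, 0.5)

# Entry 3 (doc 711): "zhang" occurs once, fieldNorm 0.625.
one_author_match = field_weight(1.0, 174, 43254, 0.625)

print(round(two_author_match, 4), round(one_author_match, 4))
```

Under these assumptions, the two values agree with the 4.603307 and 4.0687866 listed for entries 1 and 3, which is why a record with two matching authors ranks above one with a single match despite its smaller fieldNorm.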
    

Similar documents (content)

  1. Souza, J.; Carvalho, A.; Cristo, M.; Moura, E.; Calado, P.; Chirita, P.-A.; Nejdl, W.: Using site-level connections to estimate link confidence (2012) 0.26
    0.25973952 = sum of:
      0.25973952 = product of:
        0.6493488 = sum of:
          0.013485203 = weight(abstract_txt:most in 1963) [ClassicSimilarity], result of:
            0.013485203 = score(doc=1963,freq=1.0), product of:
              0.054563057 = queryWeight, product of:
                1.0729891 = boost
                3.9543834 = idf(docFreq=2253, maxDocs=43254)
                0.012859516 = queryNorm
              0.24714896 = fieldWeight in 1963, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9543834 = idf(docFreq=2253, maxDocs=43254)
                0.0625 = fieldNorm(doc=1963)
          0.029058648 = weight(abstract_txt:features in 1963) [ClassicSimilarity], result of:
            0.029058648 = score(doc=1963,freq=2.0), product of:
              0.07224935 = queryWeight, product of:
                1.2347043 = boost
                4.550367 = idf(docFreq=1241, maxDocs=43254)
                0.012859516 = queryNorm
              0.40219942 = fieldWeight in 1963, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.550367 = idf(docFreq=1241, maxDocs=43254)
                0.0625 = fieldNorm(doc=1963)
          0.036596097 = weight(abstract_txt:engine in 1963) [ClassicSimilarity], result of:
            0.036596097 = score(doc=1963,freq=1.0), product of:
              0.10615739 = queryWeight, product of:
                1.4966528 = boost
                5.5157495 = idf(docFreq=472, maxDocs=43254)
                0.012859516 = queryNorm
              0.34473434 = fieldWeight in 1963, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5157495 = idf(docFreq=472, maxDocs=43254)
                0.0625 = fieldNorm(doc=1963)
          0.040792767 = weight(abstract_txt:algorithm in 1963) [ClassicSimilarity], result of:
            0.040792767 = score(doc=1963,freq=1.0), product of:
              0.114125445 = queryWeight, product of:
                1.5518053 = boost
                5.7190075 = idf(docFreq=385, maxDocs=43254)
                0.012859516 = queryNorm
              0.35743797 = fieldWeight in 1963, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7190075 = idf(docFreq=385, maxDocs=43254)
                0.0625 = fieldNorm(doc=1963)
          0.041414566 = weight(abstract_txt:algorithms in 1963) [ClassicSimilarity], result of:
            0.041414566 = score(doc=1963,freq=1.0), product of:
              0.11528225 = queryWeight, product of:
                1.5596502 = boost
                5.747919 = idf(docFreq=374, maxDocs=43254)
                0.012859516 = queryNorm
              0.35924494 = fieldWeight in 1963, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.747919 = idf(docFreq=374, maxDocs=43254)
                0.0625 = fieldNorm(doc=1963)
          0.02915788 = weight(abstract_txt:large in 1963) [ClassicSimilarity], result of:
            0.02915788 = score(doc=1963,freq=1.0), product of:
              0.10443869 = queryWeight, product of:
                1.818119 = boost
                4.466985 = idf(docFreq=1349, maxDocs=43254)
                0.012859516 = queryNorm
              0.27918658 = fieldWeight in 1963, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.466985 = idf(docFreq=1349, maxDocs=43254)
                0.0625 = fieldNorm(doc=1963)
          0.049221024 = weight(abstract_txt:query in 1963) [ClassicSimilarity], result of:
            0.049221024 = score(doc=1963,freq=2.0), product of:
              0.11752075 = queryWeight, product of:
                1.9286298 = boost
                4.738502 = idf(docFreq=1028, maxDocs=43254)
                0.012859516 = queryNorm
              0.41882837 = fieldWeight in 1963, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.738502 = idf(docFreq=1028, maxDocs=43254)
                0.0625 = fieldNorm(doc=1963)
          0.0475402 = weight(abstract_txt:search in 1963) [ClassicSimilarity], result of:
            0.0475402 = score(doc=1963,freq=5.0), product of:
              0.093122445 = queryWeight, product of:
                1.9823856 = boost
                3.6529322 = idf(docFreq=3046, maxDocs=43254)
                0.012859516 = queryNorm
              0.51051277 = fieldWeight in 1963, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.6529322 = idf(docFreq=3046, maxDocs=43254)
                0.0625 = fieldNorm(doc=1963)
          0.064385764 = weight(abstract_txt:independent in 1963) [ClassicSimilarity], result of:
            0.064385764 = score(doc=1963,freq=1.0), product of:
              0.17709951 = queryWeight, product of:
                2.367556 = boost
                5.8169117 = idf(docFreq=349, maxDocs=43254)
                0.012859516 = queryNorm
              0.36355698 = fieldWeight in 1963, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8169117 = idf(docFreq=349, maxDocs=43254)
                0.0625 = fieldNorm(doc=1963)
          0.29769665 = weight(abstract_txt:pages in 1963) [ClassicSimilarity], result of:
            0.29769665 = score(doc=1963,freq=3.0), product of:
              0.49151877 = queryWeight, product of:
                6.8315973 = boost
                5.5949116 = idf(docFreq=436, maxDocs=43254)
                0.012859516 = queryNorm
              0.60566694 = fieldWeight in 1963, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.5949116 = idf(docFreq=436, maxDocs=43254)
                0.0625 = fieldNorm(doc=1963)
        0.4 = coord(10/25)
    
  2. Wang, F.L.; Yang, C.C.: Mining Web data for Chinese segmentation (2007) 0.19
    0.18659225 = sum of:
      0.18659225 = product of:
        0.5183118 = sum of:
          0.013485203 = weight(abstract_txt:most in 2605) [ClassicSimilarity], result of:
            0.013485203 = score(doc=2605,freq=1.0), product of:
              0.054563057 = queryWeight, product of:
                1.0729891 = boost
                3.9543834 = idf(docFreq=2253, maxDocs=43254)
                0.012859516 = queryNorm
              0.24714896 = fieldWeight in 2605, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9543834 = idf(docFreq=2253, maxDocs=43254)
                0.0625 = fieldNorm(doc=2605)
          0.01753358 = weight(abstract_txt:data in 2605) [ClassicSimilarity], result of:
            0.01753358 = score(doc=2605,freq=2.0), product of:
              0.05905562 = queryWeight, product of:
                1.367169 = boost
                3.3590338 = idf(docFreq=4087, maxDocs=43254)
                0.012859516 = queryNorm
              0.29689944 = fieldWeight in 2605, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3590338 = idf(docFreq=4087, maxDocs=43254)
                0.0625 = fieldNorm(doc=2605)
          0.09992147 = weight(abstract_txt:algorithm in 2605) [ClassicSimilarity], result of:
            0.09992147 = score(doc=2605,freq=6.0), product of:
              0.114125445 = queryWeight, product of:
                1.5518053 = boost
                5.7190075 = idf(docFreq=385, maxDocs=43254)
                0.012859516 = queryNorm
              0.8755407 = fieldWeight in 2605, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.7190075 = idf(docFreq=385, maxDocs=43254)
                0.0625 = fieldNorm(doc=2605)
          0.07173213 = weight(abstract_txt:algorithms in 2605) [ClassicSimilarity], result of:
            0.07173213 = score(doc=2605,freq=3.0), product of:
              0.11528225 = queryWeight, product of:
                1.5596502 = boost
                5.747919 = idf(docFreq=374, maxDocs=43254)
                0.012859516 = queryNorm
              0.62223047 = fieldWeight in 2605, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.747919 = idf(docFreq=374, maxDocs=43254)
                0.0625 = fieldNorm(doc=2605)
          0.1458603 = weight(abstract_txt:chinese in 2605) [ClassicSimilarity], result of:
            0.1458603 = score(doc=2605,freq=7.0), product of:
              0.13950372 = queryWeight, product of:
                1.7156901 = boost
                6.322987 = idf(docFreq=210, maxDocs=43254)
                0.012859516 = queryNorm
              1.0455657 = fieldWeight in 2605, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.322987 = idf(docFreq=210, maxDocs=43254)
                0.0625 = fieldNorm(doc=2605)
          0.04123547 = weight(abstract_txt:large in 2605) [ClassicSimilarity], result of:
            0.04123547 = score(doc=2605,freq=2.0), product of:
              0.10443869 = queryWeight, product of:
                1.818119 = boost
                4.466985 = idf(docFreq=1349, maxDocs=43254)
                0.012859516 = queryNorm
              0.39482942 = fieldWeight in 2605, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.466985 = idf(docFreq=1349, maxDocs=43254)
                0.0625 = fieldNorm(doc=2605)
          0.03682448 = weight(abstract_txt:search in 2605) [ClassicSimilarity], result of:
            0.03682448 = score(doc=2605,freq=3.0), product of:
              0.093122445 = queryWeight, product of:
                1.9823856 = boost
                3.6529322 = idf(docFreq=3046, maxDocs=43254)
                0.012859516 = queryNorm
              0.3954415 = fieldWeight in 2605, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6529322 = idf(docFreq=3046, maxDocs=43254)
                0.0625 = fieldNorm(doc=2605)
          0.064385764 = weight(abstract_txt:independent in 2605) [ClassicSimilarity], result of:
            0.064385764 = score(doc=2605,freq=1.0), product of:
              0.17709951 = queryWeight, product of:
                2.367556 = boost
                5.8169117 = idf(docFreq=349, maxDocs=43254)
                0.012859516 = queryNorm
              0.36355698 = fieldWeight in 2605, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8169117 = idf(docFreq=349, maxDocs=43254)
                0.0625 = fieldNorm(doc=2605)
          0.027333377 = weight(abstract_txt:retrieval in 2605) [ClassicSimilarity], result of:
            0.027333377 = score(doc=2605,freq=1.0), product of:
              0.1260365 = queryWeight, product of:
                2.8245857 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.012859516 = queryNorm
              0.21686874 = fieldWeight in 2605, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.0625 = fieldNorm(doc=2605)
        0.36 = coord(9/25)
    
  3. Thelwall, M.; Vaughan, L.: New versions of PageRank employing alternative Web document models (2004) 0.17
    0.16674699 = sum of:
      0.16674699 = product of:
        0.59552497 = sum of:
          0.013485203 = weight(abstract_txt:most in 1800) [ClassicSimilarity], result of:
            0.013485203 = score(doc=1800,freq=1.0), product of:
              0.054563057 = queryWeight, product of:
                1.0729891 = boost
                3.9543834 = idf(docFreq=2253, maxDocs=43254)
                0.012859516 = queryNorm
              0.24714896 = fieldWeight in 1800, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9543834 = idf(docFreq=2253, maxDocs=43254)
                0.0625 = fieldNorm(doc=1800)
          0.036596097 = weight(abstract_txt:engine in 1800) [ClassicSimilarity], result of:
            0.036596097 = score(doc=1800,freq=1.0), product of:
              0.10615739 = queryWeight, product of:
                1.4966528 = boost
                5.5157495 = idf(docFreq=472, maxDocs=43254)
                0.012859516 = queryNorm
              0.34473434 = fieldWeight in 1800, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5157495 = idf(docFreq=472, maxDocs=43254)
                0.0625 = fieldNorm(doc=1800)
          0.040792767 = weight(abstract_txt:algorithm in 1800) [ClassicSimilarity], result of:
            0.040792767 = score(doc=1800,freq=1.0), product of:
              0.114125445 = queryWeight, product of:
                1.5518053 = boost
                5.7190075 = idf(docFreq=385, maxDocs=43254)
                0.012859516 = queryNorm
              0.35743797 = fieldWeight in 1800, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7190075 = idf(docFreq=385, maxDocs=43254)
                0.0625 = fieldNorm(doc=1800)
          0.07173213 = weight(abstract_txt:algorithms in 1800) [ClassicSimilarity], result of:
            0.07173213 = score(doc=1800,freq=3.0), product of:
              0.11528225 = queryWeight, product of:
                1.5596502 = boost
                5.747919 = idf(docFreq=374, maxDocs=43254)
                0.012859516 = queryNorm
              0.62223047 = fieldWeight in 1800, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.747919 = idf(docFreq=374, maxDocs=43254)
                0.0625 = fieldNorm(doc=1800)
          0.021260623 = weight(abstract_txt:search in 1800) [ClassicSimilarity], result of:
            0.021260623 = score(doc=1800,freq=1.0), product of:
              0.093122445 = queryWeight, product of:
                1.9823856 = boost
                3.6529322 = idf(docFreq=3046, maxDocs=43254)
                0.012859516 = queryNorm
              0.22830826 = fieldWeight in 1800, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6529322 = idf(docFreq=3046, maxDocs=43254)
                0.0625 = fieldNorm(doc=1800)
          0.027333377 = weight(abstract_txt:retrieval in 1800) [ClassicSimilarity], result of:
            0.027333377 = score(doc=1800,freq=1.0), product of:
              0.1260365 = queryWeight, product of:
                2.8245857 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.012859516 = queryNorm
              0.21686874 = fieldWeight in 1800, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.0625 = fieldNorm(doc=1800)
          0.38432476 = weight(abstract_txt:pages in 1800) [ClassicSimilarity], result of:
            0.38432476 = score(doc=1800,freq=5.0), product of:
              0.49151877 = queryWeight, product of:
                6.8315973 = boost
                5.5949116 = idf(docFreq=436, maxDocs=43254)
                0.012859516 = queryNorm
              0.7819127 = fieldWeight in 1800, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.5949116 = idf(docFreq=436, maxDocs=43254)
                0.0625 = fieldNorm(doc=1800)
        0.28 = coord(7/25)
    
  4. Austin, D.: How Google finds your needle in the Web's haystack : as we'll see, the trick is to ask the web itself to rank the importance of pages... (2006) 0.16
    0.16470052 = sum of:
      0.16470052 = product of:
        0.5882161 = sum of:
          0.016856505 = weight(abstract_txt:most in 1219) [ClassicSimilarity], result of:
            0.016856505 = score(doc=1219,freq=4.0), product of:
              0.054563057 = queryWeight, product of:
                1.0729891 = boost
                3.9543834 = idf(docFreq=2253, maxDocs=43254)
                0.012859516 = queryNorm
              0.3089362 = fieldWeight in 1219, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.9543834 = idf(docFreq=2253, maxDocs=43254)
                0.0390625 = fieldNorm(doc=1219)
          0.015389107 = weight(abstract_txt:useful in 1219) [ClassicSimilarity], result of:
            0.015389107 = score(doc=1219,freq=1.0), product of:
              0.08151095 = queryWeight, product of:
                1.3114567 = boost
                4.8332295 = idf(docFreq=935, maxDocs=43254)
                0.012859516 = queryNorm
              0.18879803 = fieldWeight in 1219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8332295 = idf(docFreq=935, maxDocs=43254)
                0.0390625 = fieldNorm(doc=1219)
          0.03961644 = weight(abstract_txt:engine in 1219) [ClassicSimilarity], result of:
            0.03961644 = score(doc=1219,freq=3.0), product of:
              0.10615739 = queryWeight, product of:
                1.4966528 = boost
                5.5157495 = idf(docFreq=472, maxDocs=43254)
                0.012859516 = queryNorm
              0.37318587 = fieldWeight in 1219, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.5157495 = idf(docFreq=472, maxDocs=43254)
                0.0390625 = fieldNorm(doc=1219)
          0.036056053 = weight(abstract_txt:algorithm in 1219) [ClassicSimilarity], result of:
            0.036056053 = score(doc=1219,freq=2.0), product of:
              0.114125445 = queryWeight, product of:
                1.5518053 = boost
                5.7190075 = idf(docFreq=385, maxDocs=43254)
                0.012859516 = queryNorm
              0.31593353 = fieldWeight in 1219, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7190075 = idf(docFreq=385, maxDocs=43254)
                0.0390625 = fieldNorm(doc=1219)
          0.018223677 = weight(abstract_txt:large in 1219) [ClassicSimilarity], result of:
            0.018223677 = score(doc=1219,freq=1.0), product of:
              0.10443869 = queryWeight, product of:
                1.818119 = boost
                4.466985 = idf(docFreq=1349, maxDocs=43254)
                0.012859516 = queryNorm
              0.17449161 = fieldWeight in 1219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.466985 = idf(docFreq=1349, maxDocs=43254)
                0.0390625 = fieldNorm(doc=1219)
          0.0460306 = weight(abstract_txt:search in 1219) [ClassicSimilarity], result of:
            0.0460306 = score(doc=1219,freq=12.0), product of:
              0.093122445 = queryWeight, product of:
                1.9823856 = boost
                3.6529322 = idf(docFreq=3046, maxDocs=43254)
                0.012859516 = queryNorm
              0.4943019 = fieldWeight in 1219, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                3.6529322 = idf(docFreq=3046, maxDocs=43254)
                0.0390625 = fieldNorm(doc=1219)
          0.41604376 = weight(abstract_txt:pages in 1219) [ClassicSimilarity], result of:
            0.41604376 = score(doc=1219,freq=15.0), product of:
              0.49151877 = queryWeight, product of:
                6.8315973 = boost
                5.5949116 = idf(docFreq=436, maxDocs=43254)
                0.012859516 = queryNorm
              0.8464453 = fieldWeight in 1219, product of:
                3.8729835 = tf(freq=15.0), with freq of:
                  15.0 = termFreq=15.0
                5.5949116 = idf(docFreq=436, maxDocs=43254)
                0.0390625 = fieldNorm(doc=1219)
        0.28 = coord(7/25)
    
  5. Lawrence, S.; Giles, C.L.: Inquirus, the NECI meta search engine (1998) 0.16
    0.1621232 = sum of:
      0.1621232 = product of:
        0.6755134 = sum of:
          0.018208215 = weight(abstract_txt:both in 5605) [ClassicSimilarity], result of:
            0.018208215 = score(doc=5605,freq=1.0), product of:
              0.050867975 = queryWeight, product of:
                1.0360202 = boost
                3.8181381 = idf(docFreq=2582, maxDocs=43254)
                0.012859516 = queryNorm
              0.35795045 = fieldWeight in 5605, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8181381 = idf(docFreq=2582, maxDocs=43254)
                0.09375 = fieldNorm(doc=5605)
          0.036933858 = weight(abstract_txt:useful in 5605) [ClassicSimilarity], result of:
            0.036933858 = score(doc=5605,freq=1.0), product of:
              0.08151095 = queryWeight, product of:
                1.3114567 = boost
                4.8332295 = idf(docFreq=935, maxDocs=43254)
                0.012859516 = queryNorm
              0.45311528 = fieldWeight in 5605, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8332295 = idf(docFreq=935, maxDocs=43254)
                0.09375 = fieldNorm(doc=5605)
          0.05489415 = weight(abstract_txt:engine in 5605) [ClassicSimilarity], result of:
            0.05489415 = score(doc=5605,freq=1.0), product of:
              0.10615739 = queryWeight, product of:
                1.4966528 = boost
                5.5157495 = idf(docFreq=472, maxDocs=43254)
                0.012859516 = queryNorm
              0.5171015 = fieldWeight in 5605, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5157495 = idf(docFreq=472, maxDocs=43254)
                0.09375 = fieldNorm(doc=5605)
          0.073831536 = weight(abstract_txt:query in 5605) [ClassicSimilarity], result of:
            0.073831536 = score(doc=5605,freq=2.0), product of:
              0.11752075 = queryWeight, product of:
                1.9286298 = boost
                4.738502 = idf(docFreq=1028, maxDocs=43254)
                0.012859516 = queryNorm
              0.62824255 = fieldWeight in 5605, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.738502 = idf(docFreq=1028, maxDocs=43254)
                0.09375 = fieldNorm(doc=5605)
          0.045100592 = weight(abstract_txt:search in 5605) [ClassicSimilarity], result of:
            0.045100592 = score(doc=5605,freq=2.0), product of:
              0.093122445 = queryWeight, product of:
                1.9823856 = boost
                3.6529322 = idf(docFreq=3046, maxDocs=43254)
                0.012859516 = queryNorm
              0.48431495 = fieldWeight in 5605, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6529322 = idf(docFreq=3046, maxDocs=43254)
                0.09375 = fieldNorm(doc=5605)
          0.446545 = weight(abstract_txt:pages in 5605) [ClassicSimilarity], result of:
            0.446545 = score(doc=5605,freq=3.0), product of:
              0.49151877 = queryWeight, product of:
                6.8315973 = boost
                5.5949116 = idf(docFreq=436, maxDocs=43254)
                0.012859516 = queryNorm
              0.90850043 = fieldWeight in 5605, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.5949116 = idf(docFreq=436, maxDocs=43254)
                0.09375 = fieldNorm(doc=5605)
        0.24 = coord(6/25)
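
The content-similarity scores above combine several factors per matched term. As a minimal sketch, assuming the structure shown in the explain trees (per-term score = queryWeight * fieldWeight, with queryWeight = boost * idf * queryNorm, and the sum scaled by coord = matched terms / query terms), the score for entry 5 can be recomputed as follows. The idf formula and all function and variable names are assumptions for illustration; the boosts, document frequencies, and norms are copied from the output above.

```python
import math

QUERY_NORM = 0.012859516  # printed as queryNorm in the explain output
MAX_DOCS = 43254          # printed as maxDocs


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Assumed idf form: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost: float, doc_freq: int, freq: float, field_norm: float) -> float:
    """Score of one matched term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


def content_score(terms, field_norm: float, matched: int, total: int) -> float:
    """Sum of term scores, scaled by the coordination factor matched/total."""
    raw = sum(term_score(boost, df, freq, field_norm) for boost, df, freq in terms)
    return (matched / total) * raw


# (boost, docFreq, termFreq) for entry 5 (doc 5605), in the order listed above:
# both, useful, engine, query, search, pages.
ENTRY_5_TERMS = [
    (1.0360202, 2582, 1.0),
    (1.3114567, 935, 1.0),
    (1.4966528, 472, 1.0),
    (1.9286298, 1028, 2.0),
    (1.9823856, 3046, 2.0),
    (6.8315973, 436, 3.0),
]

score = content_score(ENTRY_5_TERMS, field_norm=0.09375, matched=6, total=25)
print(round(score, 4))
```

Under these assumptions the result agrees with the 0.1621232 listed for entry 5. The term "pages" carries by far the largest boost, so it dominates every content score in the list.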