Document (#32608)

Author
Liu, Y.
Zhang, M.
Cen, R.
Ru, L.
Ma, S.
Title
Data cleansing for Web information retrieval using query independent features
Source
Journal of the American Society for Information Science and Technology. 58(2007) no.12, pp.1884-1898
Year
2007
Abstract
Understanding what kinds of Web pages are the most useful for Web search engine users is a critical task in Web information retrieval (IR). Most previous work used hyperlink analysis algorithms to solve this problem. However, little research has focused on query-independent Web data cleansing for Web IR. In this paper, we first provide an analysis of the differences between retrieval target pages and ordinary ones based on more than 30 million Web pages obtained from both the Text Retrieval Conference (TREC) and a widely used Chinese search engine, SOGOU (www.sogou.com). We further propose a learning-based data cleansing algorithm for removing Web pages that are unlikely to be useful for user requests. We found that there is a large proportion of low-quality Web pages in both the English and the Chinese Web page corpora, and that retrieval target pages can be identified using query-independent features and cleansing algorithms. The experimental results showed that our algorithm is effective in removing a large portion of Web pages with only a small loss in retrieval target pages. This makes it possible for Web IR tools to meet a large fraction of users' needs with only a small part of the pages on the Web. These results may help Web search engines make better use of their limited storage and computation resources to improve search performance.
Footnote
Contribution to a special topic section, "Mining Web resources for enhancing information retrieval"
Theme
Data Mining
Search engines
Object
WWW

Similar documents (author)

  1. Zhang, M.; Zhang, Y.: Professional organizations in Twittersphere : an empirical study of U.S. library and information science professional organizations-related Tweets (2020) 4.54
    4.5423746 = sum of:
      4.5423746 = weight(author_txt:zhang in 5775) [ClassicSimilarity], result of:
        4.5423746 = fieldWeight in 5775, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          6.4238877 = idf(docFreq=194, maxDocs=44218)
          0.5 = fieldNorm(doc=5775)
    
  2. Zhang, Y.; Zhang, C.: Enhancing keyphrase extraction from microblogs using human reading time (2021) 4.54
    4.5423746 = sum of:
      4.5423746 = weight(author_txt:zhang in 237) [ClassicSimilarity], result of:
        4.5423746 = fieldWeight in 237, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          6.4238877 = idf(docFreq=194, maxDocs=44218)
          0.5 = fieldNorm(doc=237)
    
  3. Zhang, J.: TOFIR: A tool of facilitating information retrieval : introduce a visual retrieval model (2001) 4.01
    4.01493 = sum of:
      4.01493 = weight(author_txt:zhang in 7711) [ClassicSimilarity], result of:
        4.01493 = fieldWeight in 7711, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.4238877 = idf(docFreq=194, maxDocs=44218)
          0.625 = fieldNorm(doc=7711)
    
  4. Zhang, A.: Multimedia file formats on the Internet : a beginner's guide for PC users (1995) 4.01
    4.01493 = sum of:
      4.01493 = weight(author_txt:zhang in 3212) [ClassicSimilarity], result of:
        4.01493 = fieldWeight in 3212, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.4238877 = idf(docFreq=194, maxDocs=44218)
          0.625 = fieldNorm(doc=3212)
    
  5. Zhang, J.: A representational analysis of relational information displays (1996) 4.01
    4.01493 = sum of:
      4.01493 = weight(author_txt:zhang in 6403) [ClassicSimilarity], result of:
        4.01493 = fieldWeight in 6403, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.4238877 = idf(docFreq=194, maxDocs=44218)
          0.625 = fieldNorm(doc=6403)
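
The author-match scores above can be reproduced by hand. The following is a minimal sketch, assuming the standard Lucene ClassicSimilarity formulas (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))); the function names are illustrative and not part of any catalog API, and the fieldNorm values are simply copied from the explain output.

```python
import math

def tf(freq: float) -> float:
    # ClassicSimilarity term-frequency factor: square root of the raw count.
    return math.sqrt(freq)

def idf(doc_freq: int, max_docs: int) -> float:
    # ClassicSimilarity inverse document frequency.
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))

def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    # fieldWeight = tf * idf * fieldNorm, as listed in each explain tree.
    return tf(freq) * idf(doc_freq, max_docs) * field_norm

# Hits 1 and 2: "zhang" occurs twice in the author field, fieldNorm 0.5.
two_matches = field_weight(2.0, 194, 44218, 0.5)
# Hits 3 to 5: "zhang" occurs once, fieldNorm 0.625.
one_match = field_weight(1.0, 194, 44218, 0.625)

print(round(two_matches, 4), round(one_match, 4))
```

Lucene computes in single precision, so the last digits may differ slightly from the values printed in the explain trees.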
    

Similar documents (content)

  1. Souza, J.; Carvalho, A.; Cristo, M.; Moura, E.; Calado, P.; Chirita, P.-A.; Nejdl, W.: Using site-level connections to estimate link confidence (2012) 0.26
    0.25950113 = sum of:
      0.25950113 = product of:
        0.6487528 = sum of:
          0.013347855 = weight(abstract_txt:most in 498) [ClassicSimilarity], result of:
            0.013347855 = score(doc=498,freq=1.0), product of:
              0.05415373 = queryWeight, product of:
                1.0688385 = boost
                3.943693 = idf(docFreq=2328, maxDocs=44218)
                0.012847339 = queryNorm
              0.24648081 = fieldWeight in 498, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.943693 = idf(docFreq=2328, maxDocs=44218)
                0.0625 = fieldNorm(doc=498)
          0.028783346 = weight(abstract_txt:features in 498) [ClassicSimilarity], result of:
            0.028783346 = score(doc=498,freq=2.0), product of:
              0.071741685 = queryWeight, product of:
                1.2302226 = boost
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.012847339 = queryNorm
              0.4012081 = fieldWeight in 498, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.0625 = fieldNorm(doc=498)
          0.03674817 = weight(abstract_txt:engine in 498) [ClassicSimilarity], result of:
            0.03674817 = score(doc=498,freq=1.0), product of:
              0.106376216 = queryWeight, product of:
                1.4980289 = boost
                5.5272765 = idf(docFreq=477, maxDocs=44218)
                0.012847339 = queryNorm
              0.34545478 = fieldWeight in 498, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5272765 = idf(docFreq=477, maxDocs=44218)
                0.0625 = fieldNorm(doc=498)
          0.04041715 = weight(abstract_txt:algorithm in 498) [ClassicSimilarity], result of:
            0.04041715 = score(doc=498,freq=1.0), product of:
              0.11334381 = queryWeight, product of:
                1.5463109 = boost
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.012847339 = queryNorm
              0.35658893 = fieldWeight in 498, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.0625 = fieldNorm(doc=498)
          0.040470365 = weight(abstract_txt:algorithms in 498) [ClassicSimilarity], result of:
            0.040470365 = score(doc=498,freq=1.0), product of:
              0.113443285 = queryWeight, product of:
                1.5469893 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.012847339 = queryNorm
              0.35674536 = fieldWeight in 498, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=498)
          0.028844973 = weight(abstract_txt:large in 498) [ClassicSimilarity], result of:
            0.028844973 = score(doc=498,freq=1.0), product of:
              0.10361705 = queryWeight, product of:
                1.8107527 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.012847339 = queryNorm
              0.27838057 = fieldWeight in 498, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.0625 = fieldNorm(doc=498)
          0.04959312 = weight(abstract_txt:query in 498) [ClassicSimilarity], result of:
            0.04959312 = score(doc=498,freq=2.0), product of:
              0.118029006 = queryWeight, product of:
                1.9325819 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.012847339 = queryNorm
              0.4201774 = fieldWeight in 498, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.0625 = fieldNorm(doc=498)
          0.047639474 = weight(abstract_txt:search in 498) [ClassicSimilarity], result of:
            0.047639474 = score(doc=498,freq=5.0), product of:
              0.09318629 = queryWeight, product of:
                1.9828457 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.012847339 = queryNorm
              0.5112284 = fieldWeight in 498, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0625 = fieldNorm(doc=498)
          0.06413883 = weight(abstract_txt:independent in 498) [ClassicSimilarity], result of:
            0.06413883 = score(doc=498,freq=1.0), product of:
              0.17652185 = queryWeight, product of:
                2.3634303 = boost
                5.813565 = idf(docFreq=358, maxDocs=44218)
                0.012847339 = queryNorm
              0.3633478 = fieldWeight in 498, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.813565 = idf(docFreq=358, maxDocs=44218)
                0.0625 = fieldNorm(doc=498)
          0.29876956 = weight(abstract_txt:pages in 498) [ClassicSimilarity], result of:
            0.29876956 = score(doc=498,freq=3.0), product of:
              0.49235162 = queryWeight, product of:
                6.8366265 = boost
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.012847339 = queryNorm
              0.60682154 = fieldWeight in 498, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.0625 = fieldNorm(doc=498)
        0.4 = coord(10/25)
    
  2. Wang, F.L.; Yang, C.C.: Mining Web data for Chinese segmentation (2007) 0.18
    0.18468603 = sum of:
      0.18468603 = product of:
        0.51301676 = sum of:
          0.013347855 = weight(abstract_txt:most in 604) [ClassicSimilarity], result of:
            0.013347855 = score(doc=604,freq=1.0), product of:
              0.05415373 = queryWeight, product of:
                1.0688385 = boost
                3.943693 = idf(docFreq=2328, maxDocs=44218)
                0.012847339 = queryNorm
              0.24648081 = fieldWeight in 604, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.943693 = idf(docFreq=2328, maxDocs=44218)
                0.0625 = fieldNorm(doc=604)
          0.017144404 = weight(abstract_txt:data in 604) [ClassicSimilarity], result of:
            0.017144404 = score(doc=604,freq=2.0), product of:
              0.05813746 = queryWeight, product of:
                1.3563493 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.012847339 = queryNorm
              0.29489428 = fieldWeight in 604, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=604)
          0.09900139 = weight(abstract_txt:algorithm in 604) [ClassicSimilarity], result of:
            0.09900139 = score(doc=604,freq=6.0), product of:
              0.11334381 = queryWeight, product of:
                1.5463109 = boost
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.012847339 = queryNorm
              0.87346095 = fieldWeight in 604, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.0625 = fieldNorm(doc=604)
          0.07009673 = weight(abstract_txt:algorithms in 604) [ClassicSimilarity], result of:
            0.07009673 = score(doc=604,freq=3.0), product of:
              0.113443285 = queryWeight, product of:
                1.5469893 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.012847339 = queryNorm
              0.6179011 = fieldWeight in 604, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=604)
          0.14419387 = weight(abstract_txt:chinese in 604) [ClassicSimilarity], result of:
            0.14419387 = score(doc=604,freq=7.0), product of:
              0.13834153 = queryWeight, product of:
                1.7083396 = boost
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.012847339 = queryNorm
              1.0423036 = fieldWeight in 604, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.0625 = fieldNorm(doc=604)
          0.040792953 = weight(abstract_txt:large in 604) [ClassicSimilarity], result of:
            0.040792953 = score(doc=604,freq=2.0), product of:
              0.10361705 = queryWeight, product of:
                1.8107527 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.012847339 = queryNorm
              0.39368957 = fieldWeight in 604, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.0625 = fieldNorm(doc=604)
          0.036901377 = weight(abstract_txt:search in 604) [ClassicSimilarity], result of:
            0.036901377 = score(doc=604,freq=3.0), product of:
              0.09318629 = queryWeight, product of:
                1.9828457 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.012847339 = queryNorm
              0.3959958 = fieldWeight in 604, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0625 = fieldNorm(doc=604)
          0.06413883 = weight(abstract_txt:independent in 604) [ClassicSimilarity], result of:
            0.06413883 = score(doc=604,freq=1.0), product of:
              0.17652185 = queryWeight, product of:
                2.3634303 = boost
                5.813565 = idf(docFreq=358, maxDocs=44218)
                0.012847339 = queryNorm
              0.3633478 = fieldWeight in 604, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.813565 = idf(docFreq=358, maxDocs=44218)
                0.0625 = fieldNorm(doc=604)
          0.02739934 = weight(abstract_txt:retrieval in 604) [ClassicSimilarity], result of:
            0.02739934 = score(doc=604,freq=1.0), product of:
              0.12615018 = queryWeight, product of:
                2.8255465 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.012847339 = queryNorm
              0.21719621 = fieldWeight in 604, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=604)
        0.36 = coord(9/25)
    
  3. Thelwall, M.; Vaughan, L.: New versions of PageRank employing alternative Web document models (2004) 0.17
    0.16660677 = sum of:
      0.16660677 = product of:
        0.59502417 = sum of:
          0.013347855 = weight(abstract_txt:most in 674) [ClassicSimilarity], result of:
            0.013347855 = score(doc=674,freq=1.0), product of:
              0.05415373 = queryWeight, product of:
                1.0688385 = boost
                3.943693 = idf(docFreq=2328, maxDocs=44218)
                0.012847339 = queryNorm
              0.24648081 = fieldWeight in 674, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.943693 = idf(docFreq=2328, maxDocs=44218)
                0.0625 = fieldNorm(doc=674)
          0.03674817 = weight(abstract_txt:engine in 674) [ClassicSimilarity], result of:
            0.03674817 = score(doc=674,freq=1.0), product of:
              0.106376216 = queryWeight, product of:
                1.4980289 = boost
                5.5272765 = idf(docFreq=477, maxDocs=44218)
                0.012847339 = queryNorm
              0.34545478 = fieldWeight in 674, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5272765 = idf(docFreq=477, maxDocs=44218)
                0.0625 = fieldNorm(doc=674)
          0.04041715 = weight(abstract_txt:algorithm in 674) [ClassicSimilarity], result of:
            0.04041715 = score(doc=674,freq=1.0), product of:
              0.11334381 = queryWeight, product of:
                1.5463109 = boost
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.012847339 = queryNorm
              0.35658893 = fieldWeight in 674, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.0625 = fieldNorm(doc=674)
          0.07009673 = weight(abstract_txt:algorithms in 674) [ClassicSimilarity], result of:
            0.07009673 = score(doc=674,freq=3.0), product of:
              0.113443285 = queryWeight, product of:
                1.5469893 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.012847339 = queryNorm
              0.6179011 = fieldWeight in 674, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=674)
          0.02130502 = weight(abstract_txt:search in 674) [ClassicSimilarity], result of:
            0.02130502 = score(doc=674,freq=1.0), product of:
              0.09318629 = queryWeight, product of:
                1.9828457 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.012847339 = queryNorm
              0.22862828 = fieldWeight in 674, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0625 = fieldNorm(doc=674)
          0.02739934 = weight(abstract_txt:retrieval in 674) [ClassicSimilarity], result of:
            0.02739934 = score(doc=674,freq=1.0), product of:
              0.12615018 = queryWeight, product of:
                2.8255465 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.012847339 = queryNorm
              0.21719621 = fieldWeight in 674, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=674)
          0.38570988 = weight(abstract_txt:pages in 674) [ClassicSimilarity], result of:
            0.38570988 = score(doc=674,freq=5.0), product of:
              0.49235162 = queryWeight, product of:
                6.8366265 = boost
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.012847339 = queryNorm
              0.7834033 = fieldWeight in 674, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.0625 = fieldNorm(doc=674)
        0.28 = coord(7/25)
    
  4. Austin, D.: How Google finds your needle in the Web's haystack : as we'll see, the trick is to ask the web itself to rank the importance of pages... (2006) 0.17
    0.1650049 = sum of:
      0.1650049 = product of:
        0.5893032 = sum of:
          0.016684817 = weight(abstract_txt:most in 93) [ClassicSimilarity], result of:
            0.016684817 = score(doc=93,freq=4.0), product of:
              0.05415373 = queryWeight, product of:
                1.0688385 = boost
                3.943693 = idf(docFreq=2328, maxDocs=44218)
                0.012847339 = queryNorm
              0.308101 = fieldWeight in 93, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.943693 = idf(docFreq=2328, maxDocs=44218)
                0.0390625 = fieldNorm(doc=93)
          0.015415212 = weight(abstract_txt:useful in 93) [ClassicSimilarity], result of:
            0.015415212 = score(doc=93,freq=1.0), product of:
              0.08154557 = queryWeight, product of:
                1.31159 = boost
                4.839373 = idf(docFreq=950, maxDocs=44218)
                0.012847339 = queryNorm
              0.18903801 = fieldWeight in 93, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.839373 = idf(docFreq=950, maxDocs=44218)
                0.0390625 = fieldNorm(doc=93)
          0.039781064 = weight(abstract_txt:engine in 93) [ClassicSimilarity], result of:
            0.039781064 = score(doc=93,freq=3.0), product of:
              0.106376216 = queryWeight, product of:
                1.4980289 = boost
                5.5272765 = idf(docFreq=477, maxDocs=44218)
                0.012847339 = queryNorm
              0.37396577 = fieldWeight in 93, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.5272765 = idf(docFreq=477, maxDocs=44218)
                0.0390625 = fieldNorm(doc=93)
          0.03572405 = weight(abstract_txt:algorithm in 93) [ClassicSimilarity], result of:
            0.03572405 = score(doc=93,freq=2.0), product of:
              0.11334381 = queryWeight, product of:
                1.5463109 = boost
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.012847339 = queryNorm
              0.31518307 = fieldWeight in 93, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.0390625 = fieldNorm(doc=93)
          0.018028108 = weight(abstract_txt:large in 93) [ClassicSimilarity], result of:
            0.018028108 = score(doc=93,freq=1.0), product of:
              0.10361705 = queryWeight, product of:
                1.8107527 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.012847339 = queryNorm
              0.17398787 = fieldWeight in 93, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.0390625 = fieldNorm(doc=93)
          0.046126723 = weight(abstract_txt:search in 93) [ClassicSimilarity], result of:
            0.046126723 = score(doc=93,freq=12.0), product of:
              0.09318629 = queryWeight, product of:
                1.9828457 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.012847339 = queryNorm
              0.49499476 = fieldWeight in 93, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0390625 = fieldNorm(doc=93)
          0.4175432 = weight(abstract_txt:pages in 93) [ClassicSimilarity], result of:
            0.4175432 = score(doc=93,freq=15.0), product of:
              0.49235162 = queryWeight, product of:
                6.8366265 = boost
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.012847339 = queryNorm
              0.84805894 = fieldWeight in 93, product of:
                3.8729835 = tf(freq=15.0), with freq of:
                  15.0 = termFreq=15.0
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.0390625 = fieldNorm(doc=93)
        0.28 = coord(7/25)
    
  5. Lawrence, S.; Giles, C.L.: Inquirus, the NECI meta search engine (1998) 0.16
    0.16270404 = sum of:
      0.16270404 = product of:
        0.6779335 = sum of:
          0.018075945 = weight(abstract_txt:both in 3604) [ClassicSimilarity], result of:
            0.018075945 = score(doc=3604,freq=1.0), product of:
              0.050585635 = queryWeight, product of:
                1.0330266 = boost
                3.811558 = idf(docFreq=2657, maxDocs=44218)
                0.012847339 = queryNorm
              0.35733357 = fieldWeight in 3604, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.811558 = idf(docFreq=2657, maxDocs=44218)
                0.09375 = fieldNorm(doc=3604)
          0.03699651 = weight(abstract_txt:useful in 3604) [ClassicSimilarity], result of:
            0.03699651 = score(doc=3604,freq=1.0), product of:
              0.08154557 = queryWeight, product of:
                1.31159 = boost
                4.839373 = idf(docFreq=950, maxDocs=44218)
                0.012847339 = queryNorm
              0.45369124 = fieldWeight in 3604, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.839373 = idf(docFreq=950, maxDocs=44218)
                0.09375 = fieldNorm(doc=3604)
          0.055122256 = weight(abstract_txt:engine in 3604) [ClassicSimilarity], result of:
            0.055122256 = score(doc=3604,freq=1.0), product of:
              0.106376216 = queryWeight, product of:
                1.4980289 = boost
                5.5272765 = idf(docFreq=477, maxDocs=44218)
                0.012847339 = queryNorm
              0.51818216 = fieldWeight in 3604, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5272765 = idf(docFreq=477, maxDocs=44218)
                0.09375 = fieldNorm(doc=3604)
          0.07438968 = weight(abstract_txt:query in 3604) [ClassicSimilarity], result of:
            0.07438968 = score(doc=3604,freq=2.0), product of:
              0.118029006 = queryWeight, product of:
                1.9325819 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.012847339 = queryNorm
              0.6302661 = fieldWeight in 3604, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.09375 = fieldNorm(doc=3604)
          0.045194775 = weight(abstract_txt:search in 3604) [ClassicSimilarity], result of:
            0.045194775 = score(doc=3604,freq=2.0), product of:
              0.09318629 = queryWeight, product of:
                1.9828457 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.012847339 = queryNorm
              0.48499382 = fieldWeight in 3604, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.09375 = fieldNorm(doc=3604)
          0.44815436 = weight(abstract_txt:pages in 3604) [ClassicSimilarity], result of:
            0.44815436 = score(doc=3604,freq=3.0), product of:
              0.49235162 = queryWeight, product of:
                6.8366265 = boost
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.012847339 = queryNorm
              0.9102323 = fieldWeight in 3604, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.09375 = fieldNorm(doc=3604)
        0.24 = coord(6/25)
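
The content-similarity scores combine a per-term score (queryWeight times fieldWeight) with a coordination factor for the share of query terms matched. The sketch below recomputes the score of hit 1 (doc 498) under the same assumed ClassicSimilarity formulas; the boosts, frequencies, document frequencies, queryNorm, and fieldNorm are copied from the explain output, and the helper names are hypothetical.

```python
import math

MAX_DOCS = 44218
QUERY_NORM = 0.012847339  # copied from the explain output

def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    # ClassicSimilarity inverse document frequency.
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))

def term_score(boost: float, freq: float, doc_freq: int, field_norm: float) -> float:
    # queryWeight = boost * idf * queryNorm
    # fieldWeight = sqrt(freq) * idf * fieldNorm
    i = idf(doc_freq)
    query_weight = boost * i * QUERY_NORM
    field_weight = math.sqrt(freq) * i * field_norm
    return query_weight * field_weight

# (boost, freq, docFreq) for the ten query terms matched in doc 498.
terms_498 = [
    (1.0688385, 1.0, 2328),  # most
    (1.2302226, 2.0, 1283),  # features
    (1.4980289, 1.0, 477),   # engine
    (1.5463109, 1.0, 399),   # algorithm
    (1.5469893, 1.0, 398),   # algorithms
    (1.8107527, 1.0, 1397),  # large
    (1.9325819, 2.0, 1035),  # query
    (1.9828457, 5.0, 3098),  # search
    (2.3634303, 1.0, 358),   # independent
    (6.8366265, 3.0, 441),   # pages
]

raw_sum = sum(term_score(b, f, df, 0.0625) for b, f, df in terms_498)
coord = len(terms_498) / 25  # 10 of 25 query terms matched
score_498 = raw_sum * coord

print(round(raw_sum, 4), round(score_498, 4))
```

The term "pages" carries by far the largest boost, which is why it dominates the sum in every hit listed above.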