Document (#32337)

Author
Chau, M.
Fang, X.
Rittman, C.C.
Title
Web searching in Chinese : a study of a search engine in Hong Kong
Source
Journal of the American Society for Information Science and Technology. 58(2007) no.7, S.1044-1054
Year
2007
Abstract
The number of non-English resources has been increasing rapidly on the Web. Although many studies have been conducted on the query logs in search engines that are primarily English-based (e.g., Excite and AltaVista), only a few of them have studied the information-seeking behavior on the Web in non-English languages. In this article, we report the analysis of the search-query logs of a search engine that focused on Chinese. Three months of search-query logs of Timway, a search engine based in Hong Kong, were collected and analyzed. Metrics on sessions, queries, search topics, and character usage are reported. N-gram analysis also has been applied to perform character-based analysis. Our analysis suggests that some characteristics identified in the search log, such as search topics and the mean number of queries per sessions, are similar to those in English search engines; however, other characteristics, such as the use of operators in query formulation, are significantly different. The analysis also shows that only a very small number of unique Chinese characters are used in search queries. We believe the findings from this study have provided some insights into further research in non-English Web searching.
Theme
Internet
Location
Hong Kong

Similar documents (author)

  1. Chau, M.; Fang, X.; Sheng, O.R.U.: Analysis of the query logs of a Web site search engine (2005) 4.68
    4.680435 = sum of:
      4.680435 = sum of:
        2.034243 = weight(author_txt:fang in 4573) [ClassicSimilarity], result of:
          2.034243 = score(doc=4573,freq=1.0), product of:
            0.642823 = queryWeight, product of:
              8.43879 = idf(docFreq=25, maxDocs=44218)
              0.07617478 = queryNorm
            3.1645465 = fieldWeight in 4573, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.43879 = idf(docFreq=25, maxDocs=44218)
              0.375 = fieldNorm(doc=4573)
        2.6461923 = weight(author_txt:chau in 4573) [ClassicSimilarity], result of:
          2.6461923 = score(doc=4573,freq=1.0), product of:
            0.7660147 = queryWeight, product of:
              1.0916234 = boost
              9.211981 = idf(docFreq=11, maxDocs=44218)
              0.07617478 = queryNorm
            3.4544928 = fieldWeight in 4573, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.211981 = idf(docFreq=11, maxDocs=44218)
              0.375 = fieldNorm(doc=4573)
    
  2. Chau, M.; Lu, Y.; Fang, X.; Yang, C.C.: Characteristics of character usage in Chinese Web searching (2009) 3.90
    3.900363 = sum of:
      3.900363 = sum of:
        1.6952026 = weight(author_txt:fang in 2456) [ClassicSimilarity], result of:
          1.6952026 = score(doc=2456,freq=1.0), product of:
            0.642823 = queryWeight, product of:
              8.43879 = idf(docFreq=25, maxDocs=44218)
              0.07617478 = queryNorm
            2.637122 = fieldWeight in 2456, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.43879 = idf(docFreq=25, maxDocs=44218)
              0.3125 = fieldNorm(doc=2456)
        2.2051604 = weight(author_txt:chau in 2456) [ClassicSimilarity], result of:
          2.2051604 = score(doc=2456,freq=1.0), product of:
            0.7660147 = queryWeight, product of:
              1.0916234 = boost
              9.211981 = idf(docFreq=11, maxDocs=44218)
              0.07617478 = queryNorm
            2.8787441 = fieldWeight in 2456, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.211981 = idf(docFreq=11, maxDocs=44218)
              0.3125 = fieldNorm(doc=2456)
    
  3. Chau, M.Y.: Finding order in a chaotic world : a model for organized research using the World Wide Web (1997) 2.21
    2.2051604 = sum of:
      2.2051604 = product of:
        4.4103208 = sum of:
          4.4103208 = weight(author_txt:chau in 529) [ClassicSimilarity], result of:
            4.4103208 = score(doc=529,freq=1.0), product of:
              0.7660147 = queryWeight, product of:
                1.0916234 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.07617478 = queryNorm
              5.7574883 = fieldWeight in 529, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.625 = fieldNorm(doc=529)
        0.5 = coord(1/2)
    
  4. Chen, H.; Chau, M.: Web mining : machine learning for Web applications (2003) 1.76
    1.7641282 = sum of:
      1.7641282 = product of:
        3.5282564 = sum of:
          3.5282564 = weight(author_txt:chau in 4242) [ClassicSimilarity], result of:
            3.5282564 = score(doc=4242,freq=1.0), product of:
              0.7660147 = queryWeight, product of:
                1.0916234 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.07617478 = queryNorm
              4.6059904 = fieldWeight in 4242, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.5 = fieldNorm(doc=4242)
        0.5 = coord(1/2)
    
  5. Fang, L.: ¬A developing search service : heterogeneous resources integration and retrieval system (2004) 1.70
    1.6952026 = sum of:
      1.6952026 = product of:
        3.3904052 = sum of:
          3.3904052 = weight(author_txt:fang in 1193) [ClassicSimilarity], result of:
            3.3904052 = score(doc=1193,freq=1.0), product of:
              0.642823 = queryWeight, product of:
                8.43879 = idf(docFreq=25, maxDocs=44218)
                0.07617478 = queryNorm
              5.274244 = fieldWeight in 1193, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.43879 = idf(docFreq=25, maxDocs=44218)
                0.625 = fieldNorm(doc=1193)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Chau, M.; Lu, Y.; Fang, X.; Yang, C.C.: Characteristics of character usage in Chinese Web searching (2009) 1.24
    1.23909 = sum of:
      1.23909 = product of:
        1.7209582 = sum of:
          0.057520095 = weight(abstract_txt:characters in 2456) [ClassicSimilarity], result of:
            0.057520095 = score(doc=2456,freq=1.0), product of:
              0.12471496 = queryWeight, product of:
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.016900422 = queryNorm
              0.46121246 = fieldWeight in 2456, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.0625 = fieldNorm(doc=2456)
          0.031847186 = weight(abstract_txt:searching in 2456) [ClassicSimilarity], result of:
            0.031847186 = score(doc=2456,freq=2.0), product of:
              0.08409165 = queryWeight, product of:
                1.1612672 = boost
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.016900422 = queryNorm
              0.37871996 = fieldWeight in 2456, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.0625 = fieldNorm(doc=2456)
          0.015233616 = weight(abstract_txt:that in 2456) [ClassicSimilarity], result of:
            0.015233616 = score(doc=2456,freq=4.0), product of:
              0.05143288 = queryWeight, product of:
                1.2843729 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.016900422 = queryNorm
              0.2961844 = fieldWeight in 2456, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=2456)
          0.014131883 = weight(abstract_txt:have in 2456) [ClassicSimilarity], result of:
            0.014131883 = score(doc=2456,freq=1.0), product of:
              0.07055795 = queryWeight, product of:
                1.3027898 = boost
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.016900422 = queryNorm
              0.20028761 = fieldWeight in 2456, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.0625 = fieldNorm(doc=2456)
          0.033272576 = weight(abstract_txt:characteristics in 2456) [ClassicSimilarity], result of:
            0.033272576 = score(doc=2456,freq=1.0), product of:
              0.10908703 = queryWeight, product of:
                1.3226418 = boost
                4.8801513 = idf(docFreq=912, maxDocs=44218)
                0.016900422 = queryNorm
              0.30500945 = fieldWeight in 2456, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8801513 = idf(docFreq=912, maxDocs=44218)
                0.0625 = fieldNorm(doc=2456)
          0.060329363 = weight(abstract_txt:engines in 2456) [ClassicSimilarity], result of:
            0.060329363 = score(doc=2456,freq=2.0), product of:
              0.1287433 = queryWeight, product of:
                1.4368719 = boost
                5.3016257 = idf(docFreq=598, maxDocs=44218)
                0.016900422 = queryNorm
              0.46860194 = fieldWeight in 2456, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.3016257 = idf(docFreq=598, maxDocs=44218)
                0.0625 = fieldNorm(doc=2456)
          0.020329744 = weight(abstract_txt:been in 2456) [ClassicSimilarity], result of:
            0.020329744 = score(doc=2456,freq=1.0), product of:
              0.08991536 = queryWeight, product of:
                1.4706804 = boost
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.016900422 = queryNorm
              0.22609869 = fieldWeight in 2456, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.0625 = fieldNorm(doc=2456)
          0.11196069 = weight(abstract_txt:character in 2456) [ClassicSimilarity], result of:
            0.11196069 = score(doc=2456,freq=2.0), product of:
              0.1944237 = queryWeight, product of:
                1.7657545 = boost
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.016900422 = queryNorm
              0.57585925 = fieldWeight in 2456, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.0625 = fieldNorm(doc=2456)
          0.080869496 = weight(abstract_txt:queries in 2456) [ClassicSimilarity], result of:
            0.080869496 = score(doc=2456,freq=2.0), product of:
              0.17916743 = queryWeight, product of:
                2.076017 = boost
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.016900422 = queryNorm
              0.4513627 = fieldWeight in 2456, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.0625 = fieldNorm(doc=2456)
          0.07251225 = weight(abstract_txt:engine in 2456) [ClassicSimilarity], result of:
            0.07251225 = score(doc=2456,freq=1.0), product of:
              0.20990373 = queryWeight, product of:
                2.2470431 = boost
                5.5272765 = idf(docFreq=477, maxDocs=44218)
                0.016900422 = queryNorm
              0.34545478 = fieldWeight in 2456, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5272765 = idf(docFreq=477, maxDocs=44218)
                0.0625 = fieldNorm(doc=2456)
          0.17698109 = weight(abstract_txt:kong in 2456) [ClassicSimilarity], result of:
            0.17698109 = score(doc=2456,freq=1.0), product of:
              0.33240438 = queryWeight, product of:
                2.3088148 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.016900422 = queryNorm
              0.5324271 = fieldWeight in 2456, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.0625 = fieldNorm(doc=2456)
          0.17698109 = weight(abstract_txt:hong in 2456) [ClassicSimilarity], result of:
            0.17698109 = score(doc=2456,freq=1.0), product of:
              0.33240438 = queryWeight, product of:
                2.3088148 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.016900422 = queryNorm
              0.5324271 = fieldWeight in 2456, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.0625 = fieldNorm(doc=2456)
          0.03490358 = weight(abstract_txt:analysis in 2456) [ClassicSimilarity], result of:
            0.03490358 = score(doc=2456,freq=1.0), product of:
              0.15285353 = queryWeight, product of:
                2.475503 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.016900422 = queryNorm
              0.22834657 = fieldWeight in 2456, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0625 = fieldNorm(doc=2456)
          0.26342022 = weight(abstract_txt:chinese in 2456) [ClassicSimilarity], result of:
            0.26342022 = score(doc=2456,freq=6.0), product of:
              0.27297837 = queryWeight, product of:
                2.5625093 = boost
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.016900422 = queryNorm
              0.96498567 = fieldWeight in 2456, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.0625 = fieldNorm(doc=2456)
          0.061507713 = weight(abstract_txt:query in 2456) [ClassicSimilarity], result of:
            0.061507713 = score(doc=2456,freq=1.0), product of:
              0.2070198 = queryWeight, product of:
                2.576776 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.016900422 = queryNorm
              0.2971103 = fieldWeight in 2456, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.0625 = fieldNorm(doc=2456)
          0.12992187 = weight(abstract_txt:logs in 2456) [ClassicSimilarity], result of:
            0.12992187 = score(doc=2456,freq=1.0), product of:
              0.30964738 = queryWeight, product of:
                2.7291982 = boost
                6.7132807 = idf(docFreq=145, maxDocs=44218)
                0.016900422 = queryNorm
              0.41958004 = fieldWeight in 2456, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7132807 = idf(docFreq=145, maxDocs=44218)
                0.0625 = fieldNorm(doc=2456)
          0.17532125 = weight(abstract_txt:english in 2456) [ClassicSimilarity], result of:
            0.17532125 = score(doc=2456,freq=2.0), product of:
              0.35582945 = queryWeight, product of:
                3.7769973 = boost
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.016900422 = queryNorm
              0.49271148 = fieldWeight in 2456, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.0625 = fieldNorm(doc=2456)
          0.2039145 = weight(abstract_txt:search in 2456) [ClassicSimilarity], result of:
            0.2039145 = score(doc=2456,freq=7.0), product of:
              0.33710805 = queryWeight, product of:
                5.452826 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.016900422 = queryNorm
              0.60489357 = fieldWeight in 2456, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0625 = fieldNorm(doc=2456)
        0.72 = coord(18/25)
    
  2. Chung, W.; Zhang, Y.; Huang, Z.; Wang, G.; Ong, T.-H.; Chen, H.: Internet searching and browsing in a multilingual world : an experiment an the Chinese Business Intelligence Portal (CBizPort) (2004) 0.60
    0.60428303 = sum of:
      0.60428303 = product of:
        1.258923 = sum of:
          0.05035482 = weight(abstract_txt:searching in 2393) [ClassicSimilarity], result of:
            0.05035482 = score(doc=2393,freq=5.0), product of:
              0.08409165 = queryWeight, product of:
                1.1612672 = boost
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.016900422 = queryNorm
              0.5988088 = fieldWeight in 2393, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.0625 = fieldNorm(doc=2393)
          0.015233616 = weight(abstract_txt:that in 2393) [ClassicSimilarity], result of:
            0.015233616 = score(doc=2393,freq=4.0), product of:
              0.05143288 = queryWeight, product of:
                1.2843729 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.016900422 = queryNorm
              0.2961844 = fieldWeight in 2393, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=2393)
          0.0139124375 = weight(abstract_txt:based in 2393) [ClassicSimilarity], result of:
            0.0139124375 = score(doc=2393,freq=1.0), product of:
              0.06982561 = queryWeight, product of:
                1.2960111 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.016900422 = queryNorm
              0.19924548 = fieldWeight in 2393, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=2393)
          0.014131883 = weight(abstract_txt:have in 2393) [ClassicSimilarity], result of:
            0.014131883 = score(doc=2393,freq=1.0), product of:
              0.07055795 = queryWeight, product of:
                1.3027898 = boost
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.016900422 = queryNorm
              0.20028761 = fieldWeight in 2393, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.0625 = fieldNorm(doc=2393)
          0.0853186 = weight(abstract_txt:engines in 2393) [ClassicSimilarity], result of:
            0.0853186 = score(doc=2393,freq=4.0), product of:
              0.1287433 = queryWeight, product of:
                1.4368719 = boost
                5.3016257 = idf(docFreq=598, maxDocs=44218)
                0.016900422 = queryNorm
              0.6627032 = fieldWeight in 2393, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.3016257 = idf(docFreq=598, maxDocs=44218)
                0.0625 = fieldNorm(doc=2393)
          0.07251225 = weight(abstract_txt:engine in 2393) [ClassicSimilarity], result of:
            0.07251225 = score(doc=2393,freq=1.0), product of:
              0.20990373 = queryWeight, product of:
                2.2470431 = boost
                5.5272765 = idf(docFreq=477, maxDocs=44218)
                0.016900422 = queryNorm
              0.34545478 = fieldWeight in 2393, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5272765 = idf(docFreq=477, maxDocs=44218)
                0.0625 = fieldNorm(doc=2393)
          0.17698109 = weight(abstract_txt:kong in 2393) [ClassicSimilarity], result of:
            0.17698109 = score(doc=2393,freq=1.0), product of:
              0.33240438 = queryWeight, product of:
                2.3088148 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.016900422 = queryNorm
              0.5324271 = fieldWeight in 2393, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.0625 = fieldNorm(doc=2393)
          0.17698109 = weight(abstract_txt:hong in 2393) [ClassicSimilarity], result of:
            0.17698109 = score(doc=2393,freq=1.0), product of:
              0.33240438 = queryWeight, product of:
                2.3088148 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.016900422 = queryNorm
              0.5324271 = fieldWeight in 2393, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.0625 = fieldNorm(doc=2393)
          0.03490358 = weight(abstract_txt:analysis in 2393) [ClassicSimilarity], result of:
            0.03490358 = score(doc=2393,freq=1.0), product of:
              0.15285353 = queryWeight, product of:
                2.475503 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.016900422 = queryNorm
              0.22834657 = fieldWeight in 2393, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0625 = fieldNorm(doc=2393)
          0.21508169 = weight(abstract_txt:chinese in 2393) [ClassicSimilarity], result of:
            0.21508169 = score(doc=2393,freq=4.0), product of:
              0.27297837 = queryWeight, product of:
                2.5625093 = boost
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.016900422 = queryNorm
              0.7879075 = fieldWeight in 2393, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.0625 = fieldNorm(doc=2393)
          0.21472381 = weight(abstract_txt:english in 2393) [ClassicSimilarity], result of:
            0.21472381 = score(doc=2393,freq=3.0), product of:
              0.35582945 = queryWeight, product of:
                3.7769973 = boost
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.016900422 = queryNorm
              0.6034459 = fieldWeight in 2393, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.0625 = fieldNorm(doc=2393)
          0.18878815 = weight(abstract_txt:search in 2393) [ClassicSimilarity], result of:
            0.18878815 = score(doc=2393,freq=6.0), product of:
              0.33710805 = queryWeight, product of:
                5.452826 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.016900422 = queryNorm
              0.56002265 = fieldWeight in 2393, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0625 = fieldNorm(doc=2393)
        0.48 = coord(12/25)
    
  3. Ozmutlu, H.C.; Cavdur, F.; Ozmutlu, S.: Cross-validation of neural network applications for automatic new topic identification (2008) 0.47
    0.47403306 = sum of:
      0.47403306 = product of:
        1.0773479 = sum of:
          0.09337617 = weight(abstract_txt:excite in 1364) [ClassicSimilarity], result of:
            0.09337617 = score(doc=1364,freq=2.0), product of:
              0.13672656 = queryWeight, product of:
                1.0470494 = boost
                7.7265954 = idf(docFreq=52, maxDocs=44218)
                0.016900422 = queryNorm
              0.68294096 = fieldWeight in 1364, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.7265954 = idf(docFreq=52, maxDocs=44218)
                0.0625 = fieldNorm(doc=1364)
          0.018657295 = weight(abstract_txt:that in 1364) [ClassicSimilarity], result of:
            0.018657295 = score(doc=1364,freq=6.0), product of:
              0.05143288 = queryWeight, product of:
                1.2843729 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.016900422 = queryNorm
              0.36275032 = fieldWeight in 1364, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=1364)
          0.047054525 = weight(abstract_txt:characteristics in 1364) [ClassicSimilarity], result of:
            0.047054525 = score(doc=1364,freq=2.0), product of:
              0.10908703 = queryWeight, product of:
                1.3226418 = boost
                4.8801513 = idf(docFreq=912, maxDocs=44218)
                0.016900422 = queryNorm
              0.4313485 = fieldWeight in 1364, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.8801513 = idf(docFreq=912, maxDocs=44218)
                0.0625 = fieldNorm(doc=1364)
          0.037667308 = weight(abstract_txt:topics in 1364) [ClassicSimilarity], result of:
            0.037667308 = score(doc=1364,freq=1.0), product of:
              0.118492775 = queryWeight, product of:
                1.3784838 = boost
                5.086191 = idf(docFreq=742, maxDocs=44218)
                0.016900422 = queryNorm
              0.31788695 = fieldWeight in 1364, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.086191 = idf(docFreq=742, maxDocs=44218)
                0.0625 = fieldNorm(doc=1364)
          0.060329363 = weight(abstract_txt:engines in 1364) [ClassicSimilarity], result of:
            0.060329363 = score(doc=1364,freq=2.0), product of:
              0.1287433 = queryWeight, product of:
                1.4368719 = boost
                5.3016257 = idf(docFreq=598, maxDocs=44218)
                0.016900422 = queryNorm
              0.46860194 = fieldWeight in 1364, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.3016257 = idf(docFreq=598, maxDocs=44218)
                0.0625 = fieldNorm(doc=1364)
          0.042862587 = weight(abstract_txt:number in 1364) [ClassicSimilarity], result of:
            0.042862587 = score(doc=1364,freq=2.0), product of:
              0.11734237 = queryWeight, product of:
                1.6800754 = boost
                4.132649 = idf(docFreq=1927, maxDocs=44218)
                0.016900422 = queryNorm
              0.365278 = fieldWeight in 1364, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.132649 = idf(docFreq=1927, maxDocs=44218)
                0.0625 = fieldNorm(doc=1364)
          0.08635064 = weight(abstract_txt:sessions in 1364) [ClassicSimilarity], result of:
            0.08635064 = score(doc=1364,freq=1.0), product of:
              0.20601201 = queryWeight, product of:
                1.8176154 = boost
                6.7064548 = idf(docFreq=146, maxDocs=44218)
                0.016900422 = queryNorm
              0.41915342 = fieldWeight in 1364, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7064548 = idf(docFreq=146, maxDocs=44218)
                0.0625 = fieldNorm(doc=1364)
          0.05718337 = weight(abstract_txt:queries in 1364) [ClassicSimilarity], result of:
            0.05718337 = score(doc=1364,freq=1.0), product of:
              0.17916743 = queryWeight, product of:
                2.076017 = boost
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.016900422 = queryNorm
              0.31916162 = fieldWeight in 1364, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.0625 = fieldNorm(doc=1364)
          0.17761801 = weight(abstract_txt:engine in 1364) [ClassicSimilarity], result of:
            0.17761801 = score(doc=1364,freq=6.0), product of:
              0.20990373 = queryWeight, product of:
                2.2470431 = boost
                5.5272765 = idf(docFreq=477, maxDocs=44218)
                0.016900422 = queryNorm
              0.84618795 = fieldWeight in 1364, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.5272765 = idf(docFreq=477, maxDocs=44218)
                0.0625 = fieldNorm(doc=1364)
          0.22503126 = weight(abstract_txt:logs in 1364) [ClassicSimilarity], result of:
            0.22503126 = score(doc=1364,freq=3.0), product of:
              0.30964738 = queryWeight, product of:
                2.7291982 = boost
                6.7132807 = idf(docFreq=145, maxDocs=44218)
                0.016900422 = queryNorm
              0.7267339 = fieldWeight in 1364, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.7132807 = idf(docFreq=145, maxDocs=44218)
                0.0625 = fieldNorm(doc=1364)
          0.2312173 = weight(abstract_txt:search in 1364) [ClassicSimilarity], result of:
            0.2312173 = score(doc=1364,freq=9.0), product of:
              0.33710805 = queryWeight, product of:
                5.452826 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.016900422 = queryNorm
              0.68588483 = fieldWeight in 1364, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0625 = fieldNorm(doc=1364)
        0.44 = coord(11/25)
    
  4. Koshman, S.; Spink, A.; Jansen, B.J.: Web searching on the Vivisimo search engine (2006) 0.47
    0.46987364 = sum of:
      0.46987364 = product of:
        0.9036032 = sum of:
          0.039004676 = weight(abstract_txt:searching in 216) [ClassicSimilarity], result of:
            0.039004676 = score(doc=216,freq=3.0), product of:
              0.08409165 = queryWeight, product of:
                1.1612672 = boost
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.016900422 = queryNorm
              0.4638353 = fieldWeight in 216, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.0625 = fieldNorm(doc=216)
          0.015233616 = weight(abstract_txt:that in 216) [ClassicSimilarity], result of:
            0.015233616 = score(doc=216,freq=4.0), product of:
              0.05143288 = queryWeight, product of:
                1.2843729 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.016900422 = queryNorm
              0.2961844 = fieldWeight in 216, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=216)
          0.0139124375 = weight(abstract_txt:based in 216) [ClassicSimilarity], result of:
            0.0139124375 = score(doc=216,freq=1.0), product of:
              0.06982561 = queryWeight, product of:
                1.2960111 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.016900422 = queryNorm
              0.19924548 = fieldWeight in 216, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=216)
          0.014131883 = weight(abstract_txt:have in 216) [ClassicSimilarity], result of:
            0.014131883 = score(doc=216,freq=1.0), product of:
              0.07055795 = queryWeight, product of:
                1.3027898 = boost
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.016900422 = queryNorm
              0.20028761 = fieldWeight in 216, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.0625 = fieldNorm(doc=216)
          0.033272576 = weight(abstract_txt:characteristics in 216) [ClassicSimilarity], result of:
            0.033272576 = score(doc=216,freq=1.0), product of:
              0.10908703 = queryWeight, product of:
                1.3226418 = boost
                4.8801513 = idf(docFreq=912, maxDocs=44218)
                0.016900422 = queryNorm
              0.30500945 = fieldWeight in 216, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8801513 = idf(docFreq=912, maxDocs=44218)
                0.0625 = fieldNorm(doc=216)
          0.037667308 = weight(abstract_txt:topics in 216) [ClassicSimilarity], result of:
            0.037667308 = score(doc=216,freq=1.0), product of:
              0.118492775 = queryWeight, product of:
                1.3784838 = boost
                5.086191 = idf(docFreq=742, maxDocs=44218)
                0.016900422 = queryNorm
              0.31788695 = fieldWeight in 216, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.086191 = idf(docFreq=742, maxDocs=44218)
                0.0625 = fieldNorm(doc=216)
          0.020329744 = weight(abstract_txt:been in 216) [ClassicSimilarity], result of:
            0.020329744 = score(doc=216,freq=1.0), product of:
              0.08991536 = queryWeight, product of:
                1.4706804 = boost
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.016900422 = queryNorm
              0.22609869 = fieldWeight in 216, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.0625 = fieldNorm(doc=216)
          0.14956369 = weight(abstract_txt:sessions in 216) [ClassicSimilarity], result of:
            0.14956369 = score(doc=216,freq=3.0), product of:
              0.20601201 = queryWeight, product of:
                1.8176154 = boost
                6.7064548 = idf(docFreq=146, maxDocs=44218)
                0.016900422 = queryNorm
              0.725995 = fieldWeight in 216, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.7064548 = idf(docFreq=146, maxDocs=44218)
                0.0625 = fieldNorm(doc=216)
          0.080869496 = weight(abstract_txt:queries in 216) [ClassicSimilarity], result of:
            0.080869496 = score(doc=216,freq=2.0), product of:
              0.17916743 = queryWeight, product of:
                2.076017 = boost
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.016900422 = queryNorm
              0.4513627 = fieldWeight in 216, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.0625 = fieldNorm(doc=216)
          0.1450245 = weight(abstract_txt:engine in 216) [ClassicSimilarity], result of:
            0.1450245 = score(doc=216,freq=4.0), product of:
              0.20990373 = queryWeight, product of:
                2.2470431 = boost
                5.5272765 = idf(docFreq=477, maxDocs=44218)
                0.016900422 = queryNorm
              0.69090956 = fieldWeight in 216, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.5272765 = idf(docFreq=477, maxDocs=44218)
                0.0625 = fieldNorm(doc=216)
          0.049361117 = weight(abstract_txt:analysis in 216) [ClassicSimilarity], result of:
            0.049361117 = score(doc=216,freq=2.0), product of:
              0.15285353 = queryWeight, product of:
                2.475503 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.016900422 = queryNorm
              0.3229308 = fieldWeight in 216, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0625 = fieldNorm(doc=216)
          0.061507713 = weight(abstract_txt:query in 216) [ClassicSimilarity], result of:
            0.061507713 = score(doc=216,freq=1.0), product of:
              0.2070198 = queryWeight, product of:
                2.576776 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.016900422 = queryNorm
              0.2971103 = fieldWeight in 216, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.0625 = fieldNorm(doc=216)
          0.24372444 = weight(abstract_txt:search in 216) [ClassicSimilarity], result of:
            0.24372444 = score(doc=216,freq=10.0), product of:
              0.33710805 = queryWeight, product of:
                5.452826 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.016900422 = queryNorm
              0.7229861 = fieldWeight in 216, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0625 = fieldNorm(doc=216)
        0.52 = coord(13/25)
    
  5. Pu, H.-T.; Chuang, S.-L.; Yang, C.: Subject categorization of query terms for exploring Web users' search interests (2002) 0.40
    0.40435007 = sum of:
      0.40435007 = product of:
        0.842396 = sum of:
          0.031847186 = weight(abstract_txt:searching in 587) [ClassicSimilarity], result of:
            0.031847186 = score(doc=587,freq=2.0), product of:
              0.08409165 = queryWeight, product of:
                1.1612672 = boost
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.016900422 = queryNorm
              0.37871996 = fieldWeight in 587, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.0625 = fieldNorm(doc=587)
          0.010771793 = weight(abstract_txt:that in 587) [ClassicSimilarity], result of:
            0.010771793 = score(doc=587,freq=2.0), product of:
              0.05143288 = queryWeight, product of:
                1.2843729 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.016900422 = queryNorm
              0.20943399 = fieldWeight in 587, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=587)
          0.0139124375 = weight(abstract_txt:based in 587) [ClassicSimilarity], result of:
            0.0139124375 = score(doc=587,freq=1.0), product of:
              0.06982561 = queryWeight, product of:
                1.2960111 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.016900422 = queryNorm
              0.19924548 = fieldWeight in 587, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=587)
          0.037667308 = weight(abstract_txt:topics in 587) [ClassicSimilarity], result of:
            0.037667308 = score(doc=587,freq=1.0), product of:
              0.118492775 = queryWeight, product of:
                1.3784838 = boost
                5.086191 = idf(docFreq=742, maxDocs=44218)
                0.016900422 = queryNorm
              0.31788695 = fieldWeight in 587, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.086191 = idf(docFreq=742, maxDocs=44218)
                0.0625 = fieldNorm(doc=587)
          0.0426593 = weight(abstract_txt:engines in 587) [ClassicSimilarity], result of:
            0.0426593 = score(doc=587,freq=1.0), product of:
              0.1287433 = queryWeight, product of:
                1.4368719 = boost
                5.3016257 = idf(docFreq=598, maxDocs=44218)
                0.016900422 = queryNorm
              0.3313516 = fieldWeight in 587, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3016257 = idf(docFreq=598, maxDocs=44218)
                0.0625 = fieldNorm(doc=587)
          0.020329744 = weight(abstract_txt:been in 587) [ClassicSimilarity], result of:
            0.020329744 = score(doc=587,freq=1.0), product of:
              0.08991536 = queryWeight, product of:
                1.4706804 = boost
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.016900422 = queryNorm
              0.22609869 = fieldWeight in 587, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.0625 = fieldNorm(doc=587)
          0.080869496 = weight(abstract_txt:queries in 587) [ClassicSimilarity], result of:
            0.080869496 = score(doc=587,freq=2.0), product of:
              0.17916743 = queryWeight, product of:
                2.076017 = boost
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.016900422 = queryNorm
              0.4513627 = fieldWeight in 587, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.0625 = fieldNorm(doc=587)
          0.07251225 = weight(abstract_txt:engine in 587) [ClassicSimilarity], result of:
            0.07251225 = score(doc=587,freq=1.0), product of:
              0.20990373 = queryWeight, product of:
                2.2470431 = boost
                5.5272765 = idf(docFreq=477, maxDocs=44218)
                0.016900422 = queryNorm
              0.34545478 = fieldWeight in 587, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5272765 = idf(docFreq=477, maxDocs=44218)
                0.0625 = fieldNorm(doc=587)
          0.06045477 = weight(abstract_txt:analysis in 587) [ClassicSimilarity], result of:
            0.06045477 = score(doc=587,freq=3.0), product of:
              0.15285353 = queryWeight, product of:
                2.475503 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.016900422 = queryNorm
              0.39550784 = fieldWeight in 587, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0625 = fieldNorm(doc=587)
          0.13753542 = weight(abstract_txt:query in 587) [ClassicSimilarity], result of:
            0.13753542 = score(doc=587,freq=5.0), product of:
              0.2070198 = queryWeight, product of:
                2.576776 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.016900422 = queryNorm
              0.6643588 = fieldWeight in 587, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.0625 = fieldNorm(doc=587)
          0.12992187 = weight(abstract_txt:logs in 587) [ClassicSimilarity], result of:
            0.12992187 = score(doc=587,freq=1.0), product of:
              0.30964738 = queryWeight, product of:
                2.7291982 = boost
                6.7132807 = idf(docFreq=145, maxDocs=44218)
                0.016900422 = queryNorm
              0.41958004 = fieldWeight in 587, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7132807 = idf(docFreq=145, maxDocs=44218)
                0.0625 = fieldNorm(doc=587)
          0.2039145 = weight(abstract_txt:search in 587) [ClassicSimilarity], result of:
            0.2039145 = score(doc=587,freq=7.0), product of:
              0.33710805 = queryWeight, product of:
                5.452826 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.016900422 = queryNorm
              0.60489357 = fieldWeight in 587, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0625 = fieldNorm(doc=587)
        0.48 = coord(12/25)