Search (10 results, page 1 of 1)

  • author_ss:"Chau, M."
  1. Chen, H.; Chau, M.: Web mining : machine learning for Web applications (2003) 0.02
    0.021880183 = sum of:
      0.01825141 = product of:
        0.07300564 = sum of:
          0.07300564 = weight(_text_:authors in 4242) [ClassicSimilarity], result of:
            0.07300564 = score(doc=4242,freq=2.0), product of:
              0.24157293 = queryWeight, product of:
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.052990302 = queryNorm
              0.30220953 = fieldWeight in 4242, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.046875 = fieldNorm(doc=4242)
        0.25 = coord(1/4)
      0.0036287727 = product of:
        0.0072575454 = sum of:
          0.0072575454 = weight(_text_:e in 4242) [ClassicSimilarity], result of:
            0.0072575454 = score(doc=4242,freq=2.0), product of:
              0.07616667 = queryWeight, product of:
                1.43737 = idf(docFreq=28552, maxDocs=44218)
                0.052990302 = queryNorm
              0.09528506 = fieldWeight in 4242, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                1.43737 = idf(docFreq=28552, maxDocs=44218)
                0.046875 = fieldNorm(doc=4242)
        0.5 = coord(1/2)
    
    Abstract
    With more than two billion pages created by millions of Web page authors and organizations, the World Wide Web is a tremendously rich knowledge base. The knowledge comes not only from the content of the pages themselves, but also from the unique characteristics of the Web, such as its hyperlink structure and its diversity of content and languages. Analysis of these characteristics often reveals interesting patterns and new knowledge. Such knowledge can be used to improve users' efficiency and effectiveness in searching for information on the Web, and also for applications unrelated to the Web, such as support for decision making or business management. The Web's size and its unstructured and dynamic content, as well as its multilingual nature, make the extraction of useful knowledge a challenging research problem. Furthermore, the Web generates a large amount of data in other formats that contain valuable information. For example, Web server logs' information about user access patterns can be used for information personalization or improving Web page design.
    Language
    e
  2. Chau, M.; Lu, Y.; Fang, X.; Yang, C.C.: Characteristics of character usage in Chinese Web searching (2009) 0.02
    0.020972613 = product of:
      0.041945226 = sum of:
        0.041945226 = sum of:
          0.006047955 = weight(_text_:e in 2456) [ClassicSimilarity], result of:
            0.006047955 = score(doc=2456,freq=2.0), product of:
              0.07616667 = queryWeight, product of:
                1.43737 = idf(docFreq=28552, maxDocs=44218)
                0.052990302 = queryNorm
              0.07940422 = fieldWeight in 2456, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                1.43737 = idf(docFreq=28552, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2456)
          0.035897274 = weight(_text_:22 in 2456) [ClassicSimilarity], result of:
            0.035897274 = score(doc=2456,freq=2.0), product of:
              0.18556301 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.052990302 = queryNorm
              0.19345059 = fieldWeight in 2456, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2456)
      0.5 = coord(1/2)
    
    Date
    22.11.2008 17:57:22
    Language
    e
  3. Qin, J.; Zhou, Y.; Chau, M.; Chen, H.: Multilingual Web retrieval : an experiment in English-Chinese business intelligence (2006) 0.02
    0.018233486 = sum of:
      0.015209509 = product of:
        0.060838036 = sum of:
          0.060838036 = weight(_text_:authors in 5054) [ClassicSimilarity], result of:
            0.060838036 = score(doc=5054,freq=2.0), product of:
              0.24157293 = queryWeight, product of:
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.052990302 = queryNorm
              0.25184128 = fieldWeight in 5054, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.0390625 = fieldNorm(doc=5054)
        0.25 = coord(1/4)
      0.0030239774 = product of:
        0.006047955 = sum of:
          0.006047955 = weight(_text_:e in 5054) [ClassicSimilarity], result of:
            0.006047955 = score(doc=5054,freq=2.0), product of:
              0.07616667 = queryWeight, product of:
                1.43737 = idf(docFreq=28552, maxDocs=44218)
                0.052990302 = queryNorm
              0.07940422 = fieldWeight in 5054, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                1.43737 = idf(docFreq=28552, maxDocs=44218)
                0.0390625 = fieldNorm(doc=5054)
        0.5 = coord(1/2)
    
    Abstract
    As increasing numbers of non-English resources have become available on the Web, the interesting and important issue of how Web users can retrieve documents in different languages has arisen. Cross-language information retrieval (CLIR), the study of retrieving information in one language by queries expressed in another language, is a promising approach to the problem. Cross-language information retrieval has attracted much attention in recent years. Most research systems have achieved satisfactory performance on standard Text REtrieval Conference (TREC) collections such as news articles, but CLIR techniques have not been widely studied and evaluated for applications such as Web portals. In this article, the authors present their research in developing and evaluating a multilingual English-Chinese Web portal that incorporates various CLIR techniques for use in the business domain. A dictionary-based approach was adopted and combines phrasal translation, co-occurrence analysis, and pre- and posttranslation query expansion. The portal was evaluated by domain experts, using a set of queries in both English and Chinese. The experimental results showed that co-occurrence-based phrasal translation achieved a 74.6% improvement in precision over simple word-by-word translation. When used together, pre- and posttranslation query expansion improved the performance slightly, achieving a 78.0% improvement over the baseline word-by-word translation approach. In general, applying CLIR techniques in Web applications shows promise.
    Language
    e
  4. Chau, M.; Fang, X.; Sheng, O.R.U.: Analysis of the query logs of a Web site search engine (2005) 0.00
    0.002138275 = product of:
      0.00427655 = sum of:
        0.00427655 = product of:
          0.0085531 = sum of:
            0.0085531 = weight(_text_:e in 4573) [ClassicSimilarity], result of:
              0.0085531 = score(doc=4573,freq=4.0), product of:
                0.07616667 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.052990302 = queryNorm
                0.112294525 = fieldWeight in 4573, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4573)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    A large number of studies have investigated the transaction log of general-purpose search engines such as Excite and AltaVista, but few studies have reported on the analysis of search logs for search engines that are limited to particular Web sites, namely, Web site search engines. In this article, we report our research on analyzing the search logs of the search engine of the Utah state government Web site. Our results show that some statistics, such as the number of search terms per query, of Web users are the same for general-purpose search engines and Web site search engines, but others, such as the search topics and the terms used, are considerably different. Possible reasons for the differences include the focused domain of Web site search engines and users' different information needs. The findings are useful for Web site developers to improve the performance of their services provided on the Web and for researchers to conduct further research in this area. The analysis also can be applied in e-government research by investigating how information should be delivered to users in government Web sites.
    Language
    e
  5. Chau, M.; Wong, C.H.; Zhou, Y.; Qin, J.; Chen, H.: Evaluating the use of search engine development tools in IT education (2010) 0.00
    0.002138275 = product of:
      0.00427655 = sum of:
        0.00427655 = product of:
          0.0085531 = sum of:
            0.0085531 = weight(_text_:e in 3325) [ClassicSimilarity], result of:
              0.0085531 = score(doc=3325,freq=4.0), product of:
                0.07616667 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.052990302 = queryNorm
                0.112294525 = fieldWeight in 3325, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3325)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    It is important for education in computer science and information systems to keep up to date with the latest development in technology. With the rapid development of the Internet and the Web, many schools have included Internet-related technologies, such as Web search engines and e-commerce, as part of their curricula. Previous research has shown that it is effective to use search engine development tools to facilitate students' learning. However, the effectiveness of these tools in the classroom has not been evaluated. In this article, we review the design of three search engine development tools, SpidersRUs, Greenstone, and Alkaline, followed by an evaluation study that compared the three tools in the classroom. In the study, 33 students were divided into 13 groups and each group used the three tools to develop three independent search engines in a class project. Our evaluation results showed that SpidersRUs performed better than the two other tools in overall satisfaction and the level of knowledge gained in their learning experience when using the tools for a class project on Internet applications development.
    Language
    e
  6. Schroeder, J.; Xu, J.; Chen, H.; Chau, M.: Automated criminal link analysis based on domain knowledge (2007) 0.00
    0.0018143863 = product of:
      0.0036287727 = sum of:
        0.0036287727 = product of:
          0.0072575454 = sum of:
            0.0072575454 = weight(_text_:e in 275) [ClassicSimilarity], result of:
              0.0072575454 = score(doc=275,freq=2.0), product of:
                0.07616667 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.052990302 = queryNorm
                0.09528506 = fieldWeight in 275, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.046875 = fieldNorm(doc=275)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  7. Chen, H.; Fan, H.; Chau, M.; Zeng, D.: MetaSpider : meta-searching and categorization on the Web (2001) 0.00
    0.0015119887 = product of:
      0.0030239774 = sum of:
        0.0030239774 = product of:
          0.006047955 = sum of:
            0.006047955 = weight(_text_:e in 6849) [ClassicSimilarity], result of:
              0.006047955 = score(doc=6849,freq=2.0), product of:
                0.07616667 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.052990302 = queryNorm
                0.07940422 = fieldWeight in 6849, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=6849)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  8. Chen, H.; Lally, A.M.; Zhu, B.; Chau, M.: HelpfulMed : Intelligent searching for medical information over the Internet (2003) 0.00
    0.0015119887 = product of:
      0.0030239774 = sum of:
        0.0030239774 = product of:
          0.006047955 = sum of:
            0.006047955 = weight(_text_:e in 1615) [ClassicSimilarity], result of:
              0.006047955 = score(doc=1615,freq=2.0), product of:
                0.07616667 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.052990302 = queryNorm
                0.07940422 = fieldWeight in 1615, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1615)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  9. Chau, M.; Shiu, B.; Chan, M.; Chen, H.: Redips: backlink search and analysis on the Web for business intelligence analysis (2007) 0.00
    0.0015119887 = product of:
      0.0030239774 = sum of:
        0.0030239774 = product of:
          0.006047955 = sum of:
            0.006047955 = weight(_text_:e in 142) [ClassicSimilarity], result of:
              0.006047955 = score(doc=142,freq=2.0), product of:
                0.07616667 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.052990302 = queryNorm
                0.07940422 = fieldWeight in 142, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=142)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  10. Chau, M.; Fang, X.; Rittman, C.C.: Web searching in Chinese : a study of a search engine in Hong Kong (2007) 0.00
    0.0015119887 = product of:
      0.0030239774 = sum of:
        0.0030239774 = product of:
          0.006047955 = sum of:
            0.006047955 = weight(_text_:e in 336) [ClassicSimilarity], result of:
              0.006047955 = score(doc=336,freq=2.0), product of:
                0.07616667 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.052990302 = queryNorm
                0.07940422 = fieldWeight in 336, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=336)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
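
The score breakdowns in the listing above can be reproduced by hand. The following is a minimal sketch, assuming the formulas that the printed values appear to follow: tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = idf * queryNorm, fieldWeight = tf * idf * fieldNorm, with each clause scaled by its coord factor. The function names (idf, term_score) are illustrative and are not part of any search library; queryNorm and fieldNorm are taken from the listing as given rather than derived.

```python
import math

def idf(doc_freq, max_docs):
    # Inverse document frequency in the form that matches the listing:
    # idf(docFreq=1258, maxDocs=44218) comes out at about 4.558814.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, max_docs, query_norm, field_norm):
    # score = queryWeight * fieldWeight, as in the explain tree.
    term_idf = idf(doc_freq, max_docs)
    query_weight = term_idf * query_norm                    # idf * queryNorm
    field_weight = math.sqrt(freq) * term_idf * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight

# Values copied from result 1 (doc=4242) in the listing.
MAX_DOCS = 44218
QUERY_NORM = 0.052990302
FIELD_NORM = 0.046875

authors = term_score(2.0, 1258, MAX_DOCS, QUERY_NORM, FIELD_NORM)
lang_e = term_score(2.0, 28552, MAX_DOCS, QUERY_NORM, FIELD_NORM)

# Each clause is multiplied by its coord factor before summing:
# coord(1/4) for the "authors" clause, coord(1/2) for the "e" clause.
total = authors * (1 / 4) + lang_e * (1 / 2)

print(f"authors term: {authors:.8f}")
print(f"e term:       {lang_e:.8f}")
print(f"document:     {total:.9f}")
```

The printed values should agree with the listing (0.07300564, 0.0072575454 and 0.021880183 for result 1) up to small rounding differences, since the listing appears to use single-precision arithmetic while Python uses double precision.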