Search (11 results, page 1 of 1)

  • × author_ss:"Chen, H."
  • × year_i:[2000 TO 2010}
  1. Chung, W.; Chen, H.: Browsing the underdeveloped Web : an experiment on the Arabic Medical Web Directory (2009) 0.03
    0.025681175 = sum of:
      0.008640769 = product of:
        0.060485378 = sum of:
          0.060485378 = weight(_text_:better in 2733) [ClassicSimilarity], result of:
            0.060485378 = score(doc=2733,freq=2.0), product of:
              0.195582 = queryWeight, product of:
                4.665146 = idf(docFreq=1131, maxDocs=44218)
                0.041924093 = queryNorm
              0.3092584 = fieldWeight in 2733, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.665146 = idf(docFreq=1131, maxDocs=44218)
                0.046875 = fieldNorm(doc=2733)
        0.14285715 = coord(1/7)
      0.017040405 = product of:
        0.03408081 = sum of:
          0.03408081 = weight(_text_:22 in 2733) [ClassicSimilarity], result of:
            0.03408081 = score(doc=2733,freq=2.0), product of:
              0.14681102 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.041924093 = queryNorm
              0.23214069 = fieldWeight in 2733, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.046875 = fieldNorm(doc=2733)
        0.5 = coord(1/2)
    
    Abstract
    While the Web has grown significantly in recent years, some portions of the Web remain largely underdeveloped, as shown in a lack of high-quality content and functionality. An example is the Arabic Web, in which a lack of well-structured Web directories limits users' ability to browse for Arabic resources. In this research, we proposed an approach to building Web directories for the underdeveloped Web and developed a proof-of-concept prototype called the Arabic Medical Web Directory (AMedDir) that supports browsing of over 5,000 Arabic medical Web sites and pages organized in a hierarchical structure. We conducted an experiment involving Arab participants and found that the AMedDir significantly outperformed two benchmark Arabic Web directories in terms of browsing effectiveness, efficiency, information quality, and user satisfaction. Participants expressed strong preference for the AMedDir and provided many positive comments. This research thus contributes to developing a useful Web directory for organizing the information in the Arabic medical domain and to a better understanding of how to support browsing on the underdeveloped Web.
    Date
    22. 3.2009 17:57:50
  2. Leroy, G.; Chen, H.: Genescene: an ontology-enhanced integration of linguistic and co-occurrence based relations in biomedical texts (2005) 0.01
    0.0071001695 = product of:
      0.014200339 = sum of:
        0.014200339 = product of:
          0.028400678 = sum of:
            0.028400678 = weight(_text_:22 in 5259) [ClassicSimilarity], result of:
              0.028400678 = score(doc=5259,freq=2.0), product of:
                0.14681102 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.041924093 = queryNorm
                0.19345059 = fieldWeight in 5259, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5259)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 7.2006 14:26:01
  3. Zheng, R.; Li, J.; Chen, H.; Huang, Z.: ¬A framework for authorship identification of online messages : writing-style features and classification techniques (2006) 0.01
    0.0071001695 = product of:
      0.014200339 = sum of:
        0.014200339 = product of:
          0.028400678 = sum of:
            0.028400678 = weight(_text_:22 in 5276) [ClassicSimilarity], result of:
              0.028400678 = score(doc=5276,freq=2.0), product of:
                0.14681102 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.041924093 = queryNorm
                0.19345059 = fieldWeight in 5276, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5276)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 7.2006 16:14:37
  4. Hu, D.; Kaza, S.; Chen, H.: Identifying significant facilitators of dark network evolution (2009) 0.01
    0.0071001695 = product of:
      0.014200339 = sum of:
        0.014200339 = product of:
          0.028400678 = sum of:
            0.028400678 = weight(_text_:22 in 2753) [ClassicSimilarity], result of:
              0.028400678 = score(doc=2753,freq=2.0), product of:
                0.14681102 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.041924093 = queryNorm
                0.19345059 = fieldWeight in 2753, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2753)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 3.2009 18:50:30
  5. Chung, W.; Chen, H.; Reid, E.: Business stakeholder analyzer : an experiment of classifying stakeholders on the Web (2009) 0.01
    0.0050916215 = product of:
      0.010183243 = sum of:
        0.010183243 = product of:
          0.0712827 = sum of:
            0.0712827 = weight(_text_:better in 2699) [ClassicSimilarity], result of:
              0.0712827 = score(doc=2699,freq=4.0), product of:
                0.195582 = queryWeight, product of:
                  4.665146 = idf(docFreq=1131, maxDocs=44218)
                  0.041924093 = queryNorm
                0.36446452 = fieldWeight in 2699, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.665146 = idf(docFreq=1131, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2699)
          0.14285715 = coord(1/7)
      0.5 = coord(1/2)
    
    Abstract
    As the Web is used increasingly to share and disseminate information, business analysts and managers are challenged to understand stakeholder relationships. Traditional stakeholder theories and frameworks employ a manual approach to analysis and do not scale up to accommodate the rapid growth of the Web. Unfortunately, existing business intelligence (BI) tools lack analysis capability, and research on BI systems is sparse. This research proposes a framework for designing BI systems to identify and to classify stakeholders on the Web, incorporating human knowledge and machine-learned information from Web pages. Based on the framework, we have developed a prototype called Business Stakeholder Analyzer (BSA) that helps managers and analysts to identify and to classify their stakeholders on the Web. Results from our experiment involving algorithm comparison, feature comparison, and a user study showed that the system achieved better within-class accuracies in widespread stakeholder types such as partner/sponsor/supplier and media/reviewer, and was more efficient than human classification. The student and practitioner subjects in our user study strongly agreed that such a system would save analysts' time and help to identify and classify stakeholders. This research contributes to a better understanding of how to integrate information technology with stakeholder theory, and enriches the knowledge base of BI system design.
  6. Huang, Z.; Chung, Z.W.; Chen, H.: ¬A graph model for e-commerce recommender systems (2004) 0.00
    0.0043203845 = product of:
      0.008640769 = sum of:
        0.008640769 = product of:
          0.060485378 = sum of:
            0.060485378 = weight(_text_:better in 501) [ClassicSimilarity], result of:
              0.060485378 = score(doc=501,freq=2.0), product of:
                0.195582 = queryWeight, product of:
                  4.665146 = idf(docFreq=1131, maxDocs=44218)
                  0.041924093 = queryNorm
                0.3092584 = fieldWeight in 501, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.665146 = idf(docFreq=1131, maxDocs=44218)
                  0.046875 = fieldNorm(doc=501)
          0.14285715 = coord(1/7)
      0.5 = coord(1/2)
    
    Abstract
    Information overload on the Web has created enormous challenges to customers selecting products for online purchases and to online businesses attempting to identify customers' preferences efficiently. Various recommender systems employing different data representations and recommendation methods are currently used to address these challenges. In this research, we developed a graph model that provides a generic data representation and can support different recommendation methods. To demonstrate its usefulness and flexibility, we developed three recommendation methods: direct retrieval, association mining, and high-degree association retrieval. We used a data set from an online bookstore as our research test-bed. Evaluation results showed that combining product content information and historical customer transaction information achieved more accurate predictions and relevant recommendations than using only collaborative information. However, comparisons among different methods showed that high-degree association retrieval did not perform significantly better than the association mining method or the direct retrieval method in our test-bed.
  7. Chen, H.; Fan, H.; Chau, M.; Zeng, D.: MetaSpider : meta-searching and categorization on the Web (2001) 0.00
    0.00360032 = product of:
      0.00720064 = sum of:
        0.00720064 = product of:
          0.050404478 = sum of:
            0.050404478 = weight(_text_:better in 6849) [ClassicSimilarity], result of:
              0.050404478 = score(doc=6849,freq=2.0), product of:
                0.195582 = queryWeight, product of:
                  4.665146 = idf(docFreq=1131, maxDocs=44218)
                  0.041924093 = queryNorm
                0.2577153 = fieldWeight in 6849, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.665146 = idf(docFreq=1131, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=6849)
          0.14285715 = coord(1/7)
      0.5 = coord(1/2)
    
    Abstract
    It has become increasingly difficult to locate relevant information on the Web, even with the help of Web search engines. Two approaches to addressing the low precision and poor presentation of search results of current search tools are studied: meta-search and document categorization. Meta-search engines improve precision by selecting and integrating search results from generic or domain-specific Web search engines or other resources. Document categorization promises better organization and presentation of retrieved results. This article introduces MetaSpider, a meta-search engine that has real-time indexing and categorizing functions. We report in this paper the major components of MetaSpider and discuss related technical approaches. Initial results of a user evaluation study comparing Meta-Spider, NorthernLight, and MetaCrawler in terms of clustering performance and of time and effort expended show that MetaSpider performed best in precision rate, but disclose no statistically significant differences in recall rate and time requirements. Our experimental study also reveals that MetaSpider exhibited a higher level of automation than the other two systems and facilitated efficient searching by providing the user with an organized, comprehensive view of the retrieved documents.
  8. Chen, H.; Lally, A.M.; Zhu, B.; Chau, M.: HelpfulMed : Intelligent searching for medical information over the Internet (2003) 0.00
    0.00360032 = product of:
      0.00720064 = sum of:
        0.00720064 = product of:
          0.050404478 = sum of:
            0.050404478 = weight(_text_:better in 1615) [ClassicSimilarity], result of:
              0.050404478 = score(doc=1615,freq=2.0), product of:
                0.195582 = queryWeight, product of:
                  4.665146 = idf(docFreq=1131, maxDocs=44218)
                  0.041924093 = queryNorm
                0.2577153 = fieldWeight in 1615, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.665146 = idf(docFreq=1131, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1615)
          0.14285715 = coord(1/7)
      0.5 = coord(1/2)
    
    Abstract
    The Medical professionals and researchers need information from reputable sources to accomplish their work. Unfortunately, the Web has a large number of documents that are irrelevant to their work, even those documents that purport to be "medically-related." This paper describes an architecture designed to integrate advanced searching and indexing algorithms, an automatic thesaurus, or "concept space," and Kohonen-based Self-Organizing Map (SOM) technologies to provide searchers with finegrained results. Initial results indicate that these systems provide complementary retrieval functionalities. HelpfulMed not only allows users to search Web pages and other online databases, but also allows them to build searches through the use of an automatic thesaurus and browse a graphical display of medical-related topics. Evaluation results for each of the different components are included. Our spidering algorithm outperformed both breadth-first search and PageRank spiders an a test collection of 100,000 Web pages. The automatically generated thesaurus performed as well as both MeSH and UMLS-systems which require human mediation for currency. Lastly, a variant of the Kohonen SOM was comparable to MeSH terms in perceived cluster precision and significantly better at perceived cluster recall.
  9. Chung, W.; Zhang, Y.; Huang, Z.; Wang, G.; Ong, T.-H.; Chen, H.: Internet searching and browsing in a multilingual world : an experiment an the Chinese Business Intelligence Portal (CBizPort) (2004) 0.00
    0.00360032 = product of:
      0.00720064 = sum of:
        0.00720064 = product of:
          0.050404478 = sum of:
            0.050404478 = weight(_text_:better in 2393) [ClassicSimilarity], result of:
              0.050404478 = score(doc=2393,freq=2.0), product of:
                0.195582 = queryWeight, product of:
                  4.665146 = idf(docFreq=1131, maxDocs=44218)
                  0.041924093 = queryNorm
                0.2577153 = fieldWeight in 2393, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.665146 = idf(docFreq=1131, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2393)
          0.14285715 = coord(1/7)
      0.5 = coord(1/2)
    
    Abstract
    The rapid growth of the non-English-speaking Internet population has created a need for better searching and browsing capabilities in languages other than English. However, existing search engines may not serve the needs of many non-English-speaking Internet users. In this paper, we propose a generic and integrated approach to searching and browsing the Internet in a multilingual world. Based an this approach, we have developed the Chinese Business Intelligence Portal (CBizPort), a meta-search engine that searches for business information of mainland China, Taiwan, and Hong Kong. Additional functions provided by CBizPort include encoding conversion (between Simplified Chinese and Traditional Chinese), summarization, and categorization. Experimental results of our user evaluation study show that the searching and browsing performance of CBizPort was comparable to that of regional Chinese search engines, and CBizPort could significantly augment these search engines. Subjects' verbal comments indicate that CBizPort performed best in terms of analysis functions, cross-regional searching, and user-friendliness, whereas regional search engines were more efficient and more popular. Subjects especially liked CBizPort's summarizer and categorizer, which helped in understanding search results. These encouraging results suggest a promising future of our approach to Internet searching and browsing in a multilingual world.
  10. Vishwanath, A.; Chen, H.: Technology clusters : using multidimensional scaling to evaluate and structure technology clusters (2006) 0.00
    0.00360032 = product of:
      0.00720064 = sum of:
        0.00720064 = product of:
          0.050404478 = sum of:
            0.050404478 = weight(_text_:better in 6006) [ClassicSimilarity], result of:
              0.050404478 = score(doc=6006,freq=2.0), product of:
                0.195582 = queryWeight, product of:
                  4.665146 = idf(docFreq=1131, maxDocs=44218)
                  0.041924093 = queryNorm
                0.2577153 = fieldWeight in 6006, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.665146 = idf(docFreq=1131, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=6006)
          0.14285715 = coord(1/7)
      0.5 = coord(1/2)
    
    Abstract
    Empirical evidence suggests that the ownership of related products that form a technology cluster is signifIcantly better than the attributes of an innovation at predicting adoption. The treatment of technology clusters, however, has been ad hoc and study specific: Researchers often make a priori assumptions about the relationships between technologies and measure ownership using lists of functionally related technology, without any systematic reasoning. Hence, the authors set out to examine empirically the composition of technology clusters and the differences, if any, in clusters of technologies formed by adopters and nonadopters. Using the Galileo system of multidimensional scaling and the associational diffusion framework, the dissimilarities between 30 technology concepts were scored by adopters and nonadopters. Results indicate clear differences in conceptualization of clusters: Adopters tend to relate technologies based an their functional similarity; here, innovations are perceived to be complementary, and hence, adoption of one technology spurs the adoption of related technologies. On the other hand, nonadopters tend to relate technologies using a stricter ascendancy of association where the adoption of an innovation makes subsequent innovations redundant. The results question the measurement approaches and present an alternative methodology.
  11. Zhu, B.; Chen, H.: Information visualization (2004) 0.00
    0.002520224 = product of:
      0.005040448 = sum of:
        0.005040448 = product of:
          0.035283137 = sum of:
            0.035283137 = weight(_text_:better in 4276) [ClassicSimilarity], result of:
              0.035283137 = score(doc=4276,freq=2.0), product of:
                0.195582 = queryWeight, product of:
                  4.665146 = idf(docFreq=1131, maxDocs=44218)
                  0.041924093 = queryNorm
                0.18040073 = fieldWeight in 4276, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.665146 = idf(docFreq=1131, maxDocs=44218)
                  0.02734375 = fieldNorm(doc=4276)
          0.14285715 = coord(1/7)
      0.5 = coord(1/2)
    
    Abstract
    Advanced technology has resulted in the generation of about one million terabytes of information every year. Ninety-reine percent of this is available in digital format (Keim, 2001). More information will be generated in the next three years than was created during all of previous human history (Keim, 2001). Collecting information is no longer a problem, but extracting value from information collections has become progressively more difficult. Various search engines have been developed to make it easier to locate information of interest, but these work well only for a person who has a specific goal and who understands what and how information is stored. This usually is not the Gase. Visualization was commonly thought of in terms of representing human mental processes (MacEachren, 1991; Miller, 1984). The concept is now associated with the amplification of these mental processes (Card, Mackinlay, & Shneiderman, 1999). Human eyes can process visual cues rapidly, whereas advanced information analysis techniques transform the computer into a powerful means of managing digitized information. Visualization offers a link between these two potent systems, the human eye and the computer (Gershon, Eick, & Card, 1998), helping to identify patterns and to extract insights from large amounts of information. The identification of patterns is important because it may lead to a scientific discovery, an interpretation of clues to solve a crime, the prediction of catastrophic weather, a successful financial investment, or a better understanding of human behavior in a computermediated environment. Visualization technology shows considerable promise for increasing the value of large-scale collections of information, as evidenced by several commercial applications of TreeMap (e.g., http://www.smartmoney.com) and Hyperbolic tree (e.g., http://www.inxight.com) to visualize large-scale hierarchical structures. Although the proliferation of visualization technologies dates from the 1990s where sophisticated hardware and software made increasingly faster generation of graphical objects possible, the role of visual aids in facilitating the construction of mental images has a long history. Visualization has been used to communicate ideas, to monitor trends implicit in data, and to explore large volumes of data for hypothesis generation. Imagine traveling to a strange place without a map, having to memorize physical and chemical properties of an element without Mendeleyev's periodic table, trying to understand the stock market without statistical diagrams, or browsing a collection of documents without interactive visual aids. A collection of information can lose its value simply because of the effort required for exhaustive exploration. Such frustrations can be overcome by visualization.