Search (8 results, page 1 of 1)

  • × author_ss:"Chen, H."
  1. Chung, W.; Zhang, Y.; Huang, Z.; Wang, G.; Ong, T.-H.; Chen, H.: Internet searching and browsing in a multilingual world : an experiment an the Chinese Business Intelligence Portal (CBizPort) (2004) 0.09
    0.08928036 = product of:
      0.26784107 = sum of:
        0.1121507 = weight(_text_:english in 2393) [ClassicSimilarity], result of:
          0.1121507 = score(doc=2393,freq=6.0), product of:
            0.21787451 = queryWeight, product of:
              5.3797226 = idf(docFreq=553, maxDocs=44218)
              0.04049921 = queryNorm
            0.51474905 = fieldWeight in 2393, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              5.3797226 = idf(docFreq=553, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2393)
        0.15569037 = weight(_text_:speaking in 2393) [ClassicSimilarity], result of:
          0.15569037 = score(doc=2393,freq=4.0), product of:
            0.2840921 = queryWeight, product of:
              7.014756 = idf(docFreq=107, maxDocs=44218)
              0.04049921 = queryNorm
            0.5480278 = fieldWeight in 2393, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              7.014756 = idf(docFreq=107, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2393)
      0.33333334 = coord(2/6)
    
    Abstract
    The rapid growth of the non-English-speaking Internet population has created a need for better searching and browsing capabilities in languages other than English. However, existing search engines may not serve the needs of many non-English-speaking Internet users. In this paper, we propose a generic and integrated approach to searching and browsing the Internet in a multilingual world. Based an this approach, we have developed the Chinese Business Intelligence Portal (CBizPort), a meta-search engine that searches for business information of mainland China, Taiwan, and Hong Kong. Additional functions provided by CBizPort include encoding conversion (between Simplified Chinese and Traditional Chinese), summarization, and categorization. Experimental results of our user evaluation study show that the searching and browsing performance of CBizPort was comparable to that of regional Chinese search engines, and CBizPort could significantly augment these search engines. Subjects' verbal comments indicate that CBizPort performed best in terms of analysis functions, cross-regional searching, and user-friendliness, whereas regional search engines were more efficient and more popular. Subjects especially liked CBizPort's summarizer and categorizer, which helped in understanding search results. These encouraging results suggest a promising future of our approach to Internet searching and browsing in a multilingual world.
  2. Zheng, R.; Li, J.; Chen, H.; Huang, Z.: ¬A framework for authorship identification of online messages : writing-style features and classification techniques (2006) 0.04
    0.035096124 = product of:
      0.10528837 = sum of:
        0.09157066 = weight(_text_:english in 5276) [ClassicSimilarity], result of:
          0.09157066 = score(doc=5276,freq=4.0), product of:
            0.21787451 = queryWeight, product of:
              5.3797226 = idf(docFreq=553, maxDocs=44218)
              0.04049921 = queryNorm
            0.42029083 = fieldWeight in 5276, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              5.3797226 = idf(docFreq=553, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5276)
        0.013717711 = product of:
          0.027435422 = sum of:
            0.027435422 = weight(_text_:22 in 5276) [ClassicSimilarity], result of:
              0.027435422 = score(doc=5276,freq=2.0), product of:
                0.14182134 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04049921 = queryNorm
                0.19345059 = fieldWeight in 5276, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5276)
          0.5 = coord(1/2)
      0.33333334 = coord(2/6)
    
    Abstract
    With the rapid proliferation of Internet technologies and applications, misuse of online messages for inappropriate or illegal purposes has become a major concern for society. The anonymous nature of online-message distribution makes identity tracing a critical problem. We developed a framework for authorship identification of online messages to address the identity-tracing problem. In this framework, four types of writing-style features (lexical, syntactic, structural, and content-specific features) are extracted and inductive learning algorithms are used to build feature-based classification models to identify authorship of online messages. To examine this framework, we conducted experiments on English and Chinese online-newsgroup messages. We compared the discriminating power of the four types of features and of three classification techniques: decision trees, backpropagation neural networks, and support vector machines. The experimental results showed that the proposed approach was able to identify authors of online messages with satisfactory accuracy of 70 to 95%. All four types of message features contributed to discriminating authors of online messages. Support vector machines outperformed the other two classification techniques in our experiments. The high performance we achieved for both the English and Chinese datasets showed the potential of this approach in a multiple-language context.
    Date
    22. 7.2006 16:14:37
  3. Qin, J.; Zhou, Y.; Chau, M.; Chen, H.: Multilingual Web retrieval : an experiment in English-Chinese business intelligence (2006) 0.02
    0.021583412 = product of:
      0.12950046 = sum of:
        0.12950046 = weight(_text_:english in 5054) [ClassicSimilarity], result of:
          0.12950046 = score(doc=5054,freq=8.0), product of:
            0.21787451 = queryWeight, product of:
              5.3797226 = idf(docFreq=553, maxDocs=44218)
              0.04049921 = queryNorm
            0.594381 = fieldWeight in 5054, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              5.3797226 = idf(docFreq=553, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5054)
      0.16666667 = coord(1/6)
    
    Abstract
    As increasing numbers of non-English resources have become available on the Web, the interesting and important issue of how Web users can retrieve documents in different languages has arisen. Cross-language information retrieval (CLIP), the study of retrieving information in one language by queries expressed in another language, is a promising approach to the problem. Cross-language information retrieval has attracted much attention in recent years. Most research systems have achieved satisfactory performance on standard Text REtrieval Conference (TREC) collections such as news articles, but CLIR techniques have not been widely studied and evaluated for applications such as Web portals. In this article, the authors present their research in developing and evaluating a multilingual English-Chinese Web portal that incorporates various CLIP techniques for use in the business domain. A dictionary-based approach was adopted and combines phrasal translation, co-occurrence analysis, and pre- and posttranslation query expansion. The portal was evaluated by domain experts, using a set of queries in both English and Chinese. The experimental results showed that co-occurrence-based phrasal translation achieved a 74.6% improvement in precision over simple word-byword translation. When used together, pre- and posttranslation query expansion improved the performance slightly, achieving a 78.0% improvement over the baseline word-by-word translation approach. In general, applying CLIR techniques in Web applications shows promise.
  4. Liu, X.; Kaza, S.; Zhang, P.; Chen, H.: Determining inventor status and its effect on knowledge diffusion : a study on nanotechnology literature from China, Russia, and India (2011) 0.01
    0.005595384 = product of:
      0.033572305 = sum of:
        0.033572305 = product of:
          0.06714461 = sum of:
            0.06714461 = weight(_text_:countries in 4468) [ClassicSimilarity], result of:
              0.06714461 = score(doc=4468,freq=2.0), product of:
                0.22186631 = queryWeight, product of:
                  5.478287 = idf(docFreq=501, maxDocs=44218)
                  0.04049921 = queryNorm
                0.30263546 = fieldWeight in 4468, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.478287 = idf(docFreq=501, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4468)
          0.5 = coord(1/2)
      0.16666667 = coord(1/6)
    
    Abstract
    In an increasingly global research landscape, it is important to identify the most prolific researchers in various institutions and their influence on the diffusion of knowledge. Knowledge diffusion within institutions is influenced by not just the status of individual researchers but also the collaborative culture that determines status. There are various methods to measure individual status, but few studies have compared them or explored the possible effects of different cultures on the status measures. In this article, we examine knowledge diffusion within science and technology-oriented research organizations. Using social network analysis metrics to measure individual status in large-scale coauthorship networks, we studied an individual's impact on the recombination of knowledge to produce innovation in nanotechnology. Data from the most productive and high-impact institutions in China (Chinese Academy of Sciences), Russia (Russian Academy of Sciences), and India (Indian Institutes of Technology) were used. We found that boundary-spanning individuals influenced knowledge diffusion in all countries. However, our results also indicate that cultural and institutional differences may influence knowledge diffusion.
  5. Chung, W.; Chen, H.: Browsing the underdeveloped Web : an experiment on the Arabic Medical Web Directory (2009) 0.00
    0.002743542 = product of:
      0.016461251 = sum of:
        0.016461251 = product of:
          0.032922503 = sum of:
            0.032922503 = weight(_text_:22 in 2733) [ClassicSimilarity], result of:
              0.032922503 = score(doc=2733,freq=2.0), product of:
                0.14182134 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04049921 = queryNorm
                0.23214069 = fieldWeight in 2733, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2733)
          0.5 = coord(1/2)
      0.16666667 = coord(1/6)
    
    Date
    22. 3.2009 17:57:50
  6. Carmel, E.; Crawford, S.; Chen, H.: Browsing in hypertext : a cognitive study (1992) 0.00
    0.0022862852 = product of:
      0.013717711 = sum of:
        0.013717711 = product of:
          0.027435422 = sum of:
            0.027435422 = weight(_text_:22 in 7469) [ClassicSimilarity], result of:
              0.027435422 = score(doc=7469,freq=2.0), product of:
                0.14182134 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04049921 = queryNorm
                0.19345059 = fieldWeight in 7469, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=7469)
          0.5 = coord(1/2)
      0.16666667 = coord(1/6)
    
    Source
    IEEE transactions on systems, man and cybernetics. 22(1992) no.5, S.865-884
  7. Leroy, G.; Chen, H.: Genescene: an ontology-enhanced integration of linguistic and co-occurrence based relations in biomedical texts (2005) 0.00
    0.0022862852 = product of:
      0.013717711 = sum of:
        0.013717711 = product of:
          0.027435422 = sum of:
            0.027435422 = weight(_text_:22 in 5259) [ClassicSimilarity], result of:
              0.027435422 = score(doc=5259,freq=2.0), product of:
                0.14182134 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04049921 = queryNorm
                0.19345059 = fieldWeight in 5259, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5259)
          0.5 = coord(1/2)
      0.16666667 = coord(1/6)
    
    Date
    22. 7.2006 14:26:01
  8. Hu, D.; Kaza, S.; Chen, H.: Identifying significant facilitators of dark network evolution (2009) 0.00
    0.0022862852 = product of:
      0.013717711 = sum of:
        0.013717711 = product of:
          0.027435422 = sum of:
            0.027435422 = weight(_text_:22 in 2753) [ClassicSimilarity], result of:
              0.027435422 = score(doc=2753,freq=2.0), product of:
                0.14182134 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04049921 = queryNorm
                0.19345059 = fieldWeight in 2753, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2753)
          0.5 = coord(1/2)
      0.16666667 = coord(1/6)
    
    Date
    22. 3.2009 18:50:30