Search (44 results, page 1 of 3)

  • × author_ss:"Zhang, Y."
  1. Zhang, Y.: ¬The impact of Internet-based electronic resources on formal scholarly communication in the area of library and information science : a citation analysis (1998) 0.03
    0.025962407 = product of:
      0.064906016 = sum of:
        0.0068111527 = weight(_text_:a in 2808) [ClassicSimilarity], result of:
          0.0068111527 = score(doc=2808,freq=8.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.12739488 = fieldWeight in 2808, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2808)
        0.058094867 = sum of:
          0.013672504 = weight(_text_:information in 2808) [ClassicSimilarity], result of:
            0.013672504 = score(doc=2808,freq=6.0), product of:
              0.08139861 = queryWeight, product of:
                1.7554779 = idf(docFreq=20772, maxDocs=44218)
                0.046368346 = queryNorm
              0.16796975 = fieldWeight in 2808, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                1.7554779 = idf(docFreq=20772, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2808)
          0.044422362 = weight(_text_:22 in 2808) [ClassicSimilarity], result of:
            0.044422362 = score(doc=2808,freq=4.0), product of:
              0.16237405 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.046368346 = queryNorm
              0.27358043 = fieldWeight in 2808, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2808)
      0.4 = coord(2/5)
    
    Abstract
    Internet based electronic resources are growing dramatically but there have been no empirical studies evaluating the impact of e-sources, as a whole, on formal scholarly communication. reports results of an investigation into how much e-sources have been used in formal scholarly communication, using a case study in the area of Library and Information Science (LIS) during the period 1994 to 1996. 4 citation based indicators were used in the study of the impact measurement. Concludes that, compared with the impact of print sources, the impact of e-sources on formal scholarly communication in LIS is small, as measured by e-sources cited, and does not increase significantly by year even though there is observable growth of these impact across the years. It is found that periodical format is related to the rate of citing e-sources, articles are more likely to cite e-sources than are print priodical articles. However, once authors cite electronic resource, there is no significant difference in the number of references per article by periodical format or by year. Suggests that, at this stage, citing e-sources may depend on authors rather than the periodical format in which authors choose to publish
    Date
    30. 1.1999 17:22:22
    Source
    Journal of information science. 24(1998) no.4, S.241-254
    Type
    a
  2. Zhang, Y.: Developing a holistic model for digital library evaluation (2010) 0.02
    0.024035787 = product of:
      0.060089465 = sum of:
        0.012923255 = weight(_text_:a in 2360) [ClassicSimilarity], result of:
          0.012923255 = score(doc=2360,freq=20.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.24171482 = fieldWeight in 2360, product of:
              4.472136 = tf(freq=20.0), with freq of:
                20.0 = termFreq=20.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=2360)
        0.04716621 = sum of:
          0.009472587 = weight(_text_:information in 2360) [ClassicSimilarity], result of:
            0.009472587 = score(doc=2360,freq=2.0), product of:
              0.08139861 = queryWeight, product of:
                1.7554779 = idf(docFreq=20772, maxDocs=44218)
                0.046368346 = queryNorm
              0.116372846 = fieldWeight in 2360, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                1.7554779 = idf(docFreq=20772, maxDocs=44218)
                0.046875 = fieldNorm(doc=2360)
          0.037693623 = weight(_text_:22 in 2360) [ClassicSimilarity], result of:
            0.037693623 = score(doc=2360,freq=2.0), product of:
              0.16237405 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.046368346 = queryNorm
              0.23214069 = fieldWeight in 2360, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.046875 = fieldNorm(doc=2360)
      0.4 = coord(2/5)
    
    Abstract
    This article reports the author's recent research in developing a holistic model for various levels of digital library (DL) evaluation in which perceived important criteria from heterogeneous stakeholder groups are organized and presented. To develop such a model, the author applied a three-stage research approach: exploration, confirmation, and verification. During the exploration stage, a literature review was conducted followed by an interview, along with a card sorting technique, to collect important criteria perceived by DL experts. Then the criteria identified were used for developing an online survey during the confirmation stage. Survey respondents (431 in total) from 22 countries rated the importance of the criteria. A holistic DL evaluation model was constructed using statistical techniques. Eventually, the verification stage was devised to test the reliability of the model in the context of searching and evaluating an operational DL. The proposed model fills two lacunae in the DL domain: (a) the lack of a comprehensive and flexible framework to guide and benchmark evaluations, and (b) the uncertainty about what divergence exists among heterogeneous DL stakeholders, including general users.
    Source
    Journal of the American Society for Information Science and Technology. 61(2010) no.1, S.88-110
    Type
    a
  3. Zhang, Y.; Liu, J.; Song, S.: ¬The design and evaluation of a nudge-based interface to facilitate consumers' evaluation of online health information credibility (2023) 0.02
    0.02347796 = product of:
      0.0586949 = sum of:
        0.009632425 = weight(_text_:a in 993) [ClassicSimilarity], result of:
          0.009632425 = score(doc=993,freq=16.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.18016359 = fieldWeight in 993, product of:
              4.0 = tf(freq=16.0), with freq of:
                16.0 = termFreq=16.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=993)
        0.049062476 = sum of:
          0.017651124 = weight(_text_:information in 993) [ClassicSimilarity], result of:
            0.017651124 = score(doc=993,freq=10.0), product of:
              0.08139861 = queryWeight, product of:
                1.7554779 = idf(docFreq=20772, maxDocs=44218)
                0.046368346 = queryNorm
              0.21684799 = fieldWeight in 993, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                1.7554779 = idf(docFreq=20772, maxDocs=44218)
                0.0390625 = fieldNorm(doc=993)
          0.031411353 = weight(_text_:22 in 993) [ClassicSimilarity], result of:
            0.031411353 = score(doc=993,freq=2.0), product of:
              0.16237405 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.046368346 = queryNorm
              0.19345059 = fieldWeight in 993, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0390625 = fieldNorm(doc=993)
      0.4 = coord(2/5)
    
    Abstract
    Evaluating the quality of online health information (OHI) is a major challenge facing consumers. We designed PageGraph, an interface that displays quality indicators and associated values for a webpage, based on credibility evaluation models, the nudge theory, and existing empirical research concerning professionals' and consumers' evaluation of OHI quality. A qualitative evaluation of the interface with 16 participants revealed that PageGraph rendered the information and presentation nudges as intended. It provided the participants with easier access to quality indicators, encouraged fresh angles to assess information credibility, provided an evaluation framework, and encouraged validation of initial judgments. We then conducted a quantitative evaluation of the interface involving 60 participants using a between-subject experimental design. The control group used a regular web browser and evaluated the credibility of 12 preselected webpages, whereas the experimental group evaluated the same webpages with the assistance of PageGraph. PageGraph did not significantly influence participants' evaluation results. The results may be attributed to the insufficiency of the saliency and structure of the nudges implemented and the webpage stimuli's lack of sensitivity to the intervention. Future directions for applying nudges to support OHI evaluation were discussed.
    Date
    22. 6.2023 18:18:34
    Source
    Journal of the Association for Information Science and Technology. 74(2023) no.7, S.828-845
    Type
    a
  4. Zhang, Y.; Jansen, B.J.; Spink, A.: Identification of factors predicting clickthrough in Web searching using neural network analysis (2009) 0.02
    0.022521732 = product of:
      0.056304332 = sum of:
        0.009138121 = weight(_text_:a in 2742) [ClassicSimilarity], result of:
          0.009138121 = score(doc=2742,freq=10.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.1709182 = fieldWeight in 2742, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=2742)
        0.04716621 = sum of:
          0.009472587 = weight(_text_:information in 2742) [ClassicSimilarity], result of:
            0.009472587 = score(doc=2742,freq=2.0), product of:
              0.08139861 = queryWeight, product of:
                1.7554779 = idf(docFreq=20772, maxDocs=44218)
                0.046368346 = queryNorm
              0.116372846 = fieldWeight in 2742, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                1.7554779 = idf(docFreq=20772, maxDocs=44218)
                0.046875 = fieldNorm(doc=2742)
          0.037693623 = weight(_text_:22 in 2742) [ClassicSimilarity], result of:
            0.037693623 = score(doc=2742,freq=2.0), product of:
              0.16237405 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.046368346 = queryNorm
              0.23214069 = fieldWeight in 2742, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.046875 = fieldNorm(doc=2742)
      0.4 = coord(2/5)
    
    Abstract
    In this research, we aim to identify factors that significantly affect the clickthrough of Web searchers. Our underlying goal is determine more efficient methods to optimize the clickthrough rate. We devise a clickthrough metric for measuring customer satisfaction of search engine results using the number of links visited, number of queries a user submits, and rank of clicked links. We use a neural network to detect the significant influence of searching characteristics on future user clickthrough. Our results show that high occurrences of query reformulation, lengthy searching duration, longer query length, and the higher ranking of prior clicked links correlate positively with future clickthrough. We provide recommendations for leveraging these findings for improving the performance of search engine retrieval and result ranking, along with implications for search engine marketing.
    Date
    22. 3.2009 17:49:11
    Source
    Journal of the American Society for Information Science and Technology. 60(2009) no.3, S.557-570
    Type
    a
  5. Zhang, Y.; Wu, M.; Zhang, G.; Lu, J.: Stepping beyond your comfort zone : diffusion-based network analytics for knowledge trajectory recommendation (2023) 0.02
    0.021547971 = product of:
      0.053869925 = sum of:
        0.01129502 = weight(_text_:a in 994) [ClassicSimilarity], result of:
          0.01129502 = score(doc=994,freq=22.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.21126054 = fieldWeight in 994, product of:
              4.690416 = tf(freq=22.0), with freq of:
                22.0 = termFreq=22.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=994)
        0.042574905 = sum of:
          0.011163551 = weight(_text_:information in 994) [ClassicSimilarity], result of:
            0.011163551 = score(doc=994,freq=4.0), product of:
              0.08139861 = queryWeight, product of:
                1.7554779 = idf(docFreq=20772, maxDocs=44218)
                0.046368346 = queryNorm
              0.13714671 = fieldWeight in 994, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                1.7554779 = idf(docFreq=20772, maxDocs=44218)
                0.0390625 = fieldNorm(doc=994)
          0.031411353 = weight(_text_:22 in 994) [ClassicSimilarity], result of:
            0.031411353 = score(doc=994,freq=2.0), product of:
              0.16237405 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.046368346 = queryNorm
              0.19345059 = fieldWeight in 994, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0390625 = fieldNorm(doc=994)
      0.4 = coord(2/5)
    
    Abstract
    Predicting a researcher's knowledge trajectories beyond their current foci can leverage potential inter-/cross-/multi-disciplinary interactions to achieve exploratory innovation. In this study, we present a method of diffusion-based network analytics for knowledge trajectory recommendation. The method begins by constructing a heterogeneous bibliometric network consisting of a co-topic layer and a co-authorship layer. A novel link prediction approach with a diffusion strategy is then used to capture the interactions between social elements (e.g., collaboration) and knowledge elements (e.g., technological similarity) in the process of exploratory innovation. This diffusion strategy differentiates the interactions occurring among homogeneous and heterogeneous nodes in the heterogeneous bibliometric network and weights the strengths of these interactions. Two sets of experiments-one with a local dataset and the other with a global dataset-demonstrate that the proposed method is prior to 10 selected baselines in link prediction, recommender systems, and upstream graph representation learning. A case study recommending knowledge trajectories of information scientists with topical hierarchy and explainable mediators reveals the proposed method's reliability and potential practical uses in broad scenarios.
    Date
    22. 6.2023 18:07:12
    Source
    Journal of the Association for Information Science and Technology. 74(2023) no.7, S.775-790
    Type
    a
  6. Xie, B.; He, D.; Mercer, T.; Wang, Y.; Wu, D.; Fleischmann, K.R.; Zhang, Y.; Yoder, L.H.; Stephens, K.K.; Mackert, M.; Lee, M.K.: Global health crises are also information crises : a call to action (2020) 0.01
    0.009228281 = product of:
      0.0230707 = sum of:
        0.009535614 = weight(_text_:a in 32) [ClassicSimilarity], result of:
          0.009535614 = score(doc=32,freq=8.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.17835285 = fieldWeight in 32, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0546875 = fieldNorm(doc=32)
        0.013535086 = product of:
          0.027070172 = sum of:
            0.027070172 = weight(_text_:information in 32) [ClassicSimilarity], result of:
              0.027070172 = score(doc=32,freq=12.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.3325631 = fieldWeight in 32, product of:
                  3.4641016 = tf(freq=12.0), with freq of:
                    12.0 = termFreq=12.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=32)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    In this opinion paper, we argue that global health crises are also information crises. Using as an example the coronavirus disease 2019 (COVID-19) epidemic, we (a) examine challenges associated with what we term "global information crises"; (b) recommend changes needed for the field of information science to play a leading role in such crises; and (c) propose actionable items for short- and long-term research, education, and practice in information science.
    Source
    Journal of the Association for Information Science and Technology. 71(2020) no.12, S.1419-1423
    Type
    a
  7. Zhang, Y.: Beyond quality and accessibility : source selection in consumer health information searching (2014) 0.01
    0.008840293 = product of:
      0.022100732 = sum of:
        0.009010308 = weight(_text_:a in 1252) [ClassicSimilarity], result of:
          0.009010308 = score(doc=1252,freq=14.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.1685276 = fieldWeight in 1252, product of:
              3.7416575 = tf(freq=14.0), with freq of:
                14.0 = termFreq=14.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1252)
        0.013090424 = product of:
          0.026180848 = sum of:
            0.026180848 = weight(_text_:information in 1252) [ClassicSimilarity], result of:
              0.026180848 = score(doc=1252,freq=22.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.32163754 = fieldWeight in 1252, product of:
                  4.690416 = tf(freq=22.0), with freq of:
                    22.0 = termFreq=22.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1252)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    A systematic understanding of factors and criteria that affect consumers' selection of sources for health information is necessary for the design of effective health information services and information systems. However, current studies have overly focused on source attributes as indicators for 2 criteria, source quality and accessibility, and overlooked the role of other factors and criteria that help determine source selection. To fill this gap, guided by decision-making theories and the cognitive perspective to information search, we interviewed 30 participants about their reasons for using a wide range of sources for health information. Additionally, we asked each of them to report a critical incident in which sources were selected to fulfill a specific information need. Based on the analysis of the transcripts, 5 categories of factors were identified as influential to source selection: source-related factors, user-related factors, user-source relationships, characteristics of the problematic situation, and social influences. In addition, about a dozen criteria that mediate the influence of the factors on source-selection decisions were identified, including accessibility, quality, usability, interactivity, relevance, usefulness, familiarity, affection, anonymity, and appropriateness. These results significantly expanded the current understanding of the nature of costs and benefits involved in source-selection decisions, and strongly indicated that a personalized approach is needed for information services and information systems to provide effective access to health information sources for consumers.
    Source
    Journal of the Association for Information Science and Technology. 65(2014) no.5, S.911-927
    Type
    a
  8. Trace, C.B.; Zhang, Y.; Yi, S.; Williams-Brown, M.Y.: Information practices around genetic testing for ovarian cancer patients (2023) 0.01
    0.008243905 = product of:
      0.020609763 = sum of:
        0.0034055763 = weight(_text_:a in 1071) [ClassicSimilarity], result of:
          0.0034055763 = score(doc=1071,freq=2.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.06369744 = fieldWeight in 1071, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1071)
        0.017204186 = product of:
          0.034408372 = sum of:
            0.034408372 = weight(_text_:information in 1071) [ClassicSimilarity], result of:
              0.034408372 = score(doc=1071,freq=38.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.42271453 = fieldWeight in 1071, product of:
                  6.164414 = tf(freq=38.0), with freq of:
                    38.0 = termFreq=38.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1071)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    Knowledge of ovarian cancer patients' information practices around cancer genetic testing (GT) is needed to inform interventions that promote patient access to GT-related information. We interviewed 21 ovarian cancer patients and survivors who had GT as part of the treatment process and analyzed the transcripts using the qualitative content analysis method. We found that patients' information practices, manifested in their information-seeking mode, information sources utilized, information assessment, and information use, showed three distinct styles: passive, semi-active, and active. Patients with the passive style primarily received information from clinical sources, encountered information, or delegated information-seeking to family members; they were not inclined to assess information themselves and seldom used it to learn or influence others. Women with semi-active and active styles adopted more active information-seeking modes to approach information, utilized information sources beyond clinical settings, attempted to assess the information found, and actively used it to learn, educate others, or advocate GT to family and friends. Guided by the social ecological model, we found multiple levels of influences, including personal, interpersonal, organizational, community, and societal, acting as motivators or barriers to patients' information practice. Based on these findings, we discussed strategies to promote patient access to GT-related information.
    Source
    Journal of the Association for Information Science and Technology. 74(2023) no.11, S.1265-1281
    Type
    a
  9. Zhang, Y.; Broussard, R.; Ke, W.; Gong, X.: Evaluation of a scatter/gather interface for supporting distinct health information search tasks (2014) 0.01
    0.008174919 = product of:
      0.020437296 = sum of:
        0.010769378 = weight(_text_:a in 1261) [ClassicSimilarity], result of:
          0.010769378 = score(doc=1261,freq=20.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.20142901 = fieldWeight in 1261, product of:
              4.472136 = tf(freq=20.0), with freq of:
                20.0 = termFreq=20.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1261)
        0.009667919 = product of:
          0.019335838 = sum of:
            0.019335838 = weight(_text_:information in 1261) [ClassicSimilarity], result of:
              0.019335838 = score(doc=1261,freq=12.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.23754507 = fieldWeight in 1261, product of:
                  3.4641016 = tf(freq=12.0), with freq of:
                    12.0 = termFreq=12.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1261)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    Web search engines are important gateways for users to access health information. This study explored whether a search interface based on the Bing API and enabled by Scatter/Gather, a well-known document-clustering technique, can improve health information searches. Forty participants without medical backgrounds were randomly assigned to two interfaces: a baseline interface that resembles typical web search engines and a Scatter/Gather interface. Both groups performed two lookup and two exploratory health-related tasks. It was found that the baseline group was more likely to rephrase queries and less likely to access general-purpose sites than the Scatter/Gather group when completing exploratory tasks. Otherwise, the two groups did not differ in behavior and task performance, with participants in the Scatter/Gather group largely overlooking the features (key words, clusters, and the recluster function) designed to facilitate the exploration of semantic relationships between information objects, a potentially useful means for users in the rather unfamiliar domain of health. The results suggest a strong effect of users' mental models of search on their use of search interfaces and a high cognitive cost associated with using the Scatter/Gather features. It follows that novel features of a search interface should not only be compatible with users' mental models but also provide sufficient affordance to inform users of how they can be used. Compared with the interface, tasks showed more significant impacts on search behavior. In future studies, more effort should be devoted to identify salient features of health-related information needs.
    Source
    Journal of the Association for Information Science and Technology. 65(2014) no.5, S.1028-1041
    Type
    a
  10. Zhang, Y.: Searching for specific health-related information in MedlinePlus : behavioral patterns and user experience (2014) 0.01
    0.007781142 = product of:
      0.019452855 = sum of:
        0.009010308 = weight(_text_:a in 1180) [ClassicSimilarity], result of:
          0.009010308 = score(doc=1180,freq=14.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.1685276 = fieldWeight in 1180, product of:
              3.7416575 = tf(freq=14.0), with freq of:
                14.0 = termFreq=14.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1180)
        0.010442546 = product of:
          0.020885091 = sum of:
            0.020885091 = weight(_text_:information in 1180) [ClassicSimilarity], result of:
              0.020885091 = score(doc=1180,freq=14.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.256578 = fieldWeight in 1180, product of:
                  3.7416575 = tf(freq=14.0), with freq of:
                    14.0 = termFreq=14.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1180)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    Searches for specific factual health information constitute a significant part of consumer health information requests, but little is known about how users search for such information. This study attempts to fill this gap by observing users' behavior while using MedlinePlus to search for specific health information. Nineteen students participated in the study, and each performed 12 specific tasks. During the search process, they submitted short queries or complete questions, and they examined less than 1 result per search. Participants rarely reformulated queries; when they did, they tended to make a query more specific or more general, or iterate in different ways. Participants also browsed, primarily relying on the alphabetical list and the anatomical classification, to navigate to specific health topics. Participants overall had a positive experience with MedlinePlus, and the experience was significantly correlated with task difficulty and participants' spatial abilities. The results suggest that, to better support specific item search in the health domain, systems could provide a more "natural" interface to encourage users to ask questions; effective conceptual hierarchies could be implemented to help users reformulate queries; and the search results page should be reconceptualized as a place for accessing answers rather than documents. Moreover, multiple schemas should be provided to help users navigate to a health topic. The results also suggest that users' experience with information systems in general and health-related systems in particular should be evaluated in relation to contextual factors, such as task features and individual differences.
    Source
    Journal of the Association for Information Science and Technology. 65(2014) no.1, S.53-68
    Type
    a
  11. Zhang, Y.; Sun, Y.; Xie, B.: Quality of health information for consumers on the web : a systematic review of indicators, criteria, tools, and evaluation results (2015) 0.01
    0.007781142 = product of:
      0.019452855 = sum of:
        0.009010308 = weight(_text_:a in 2218) [ClassicSimilarity], result of:
          0.009010308 = score(doc=2218,freq=14.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.1685276 = fieldWeight in 2218, product of:
              3.7416575 = tf(freq=14.0), with freq of:
                14.0 = termFreq=14.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2218)
        0.010442546 = product of:
          0.020885091 = sum of:
            0.020885091 = weight(_text_:information in 2218) [ClassicSimilarity], result of:
              0.020885091 = score(doc=2218,freq=14.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.256578 = fieldWeight in 2218, product of:
                  3.7416575 = tf(freq=14.0), with freq of:
                    14.0 = termFreq=14.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2218)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    The quality of online health information for consumers has been a critical issue that concerns all stakeholders in healthcare. To gain an understanding of how quality is evaluated, this systematic review examined 165 articles in which researchers evaluated the quality of consumer-oriented health information on the web against predefined criteria. It was found that studies typically evaluated quality in relation to the substance and formality of content, as well as to the design of technological platforms. Attention to design, particularly interactivity, privacy, and social and cultural appropriateness is on the rise, which suggests the permeation of a user-centered perspective into the evaluation of health information systems, and a growing recognition of the need to study these systems from a social-technical perspective. Researchers used many preexisting instruments to facilitate evaluation of the formality of content; however, only a few were used in multiple studies, and their validity was questioned. The quality of content (i.e., accuracy and completeness) was always evaluated using proprietary instruments constructed based on medical guidelines or textbooks. The evaluation results revealed that the quality of health information varied across medical domains and across websites, and that the overall quality remained problematic. Future research is needed to examine the quality of user-generated content and to explore opportunities offered by emerging new media that can facilitate the consumer evaluation of health information.
    Source
    Journal of the Association for Information Science and Technology. 66(2015) no.10, S.2071-2084
    Type
    a
  12. Zhang, Y.: Complex adaptive filtering user profile using graphical models (2008) 0.01
    0.0074719447 = product of:
      0.018679861 = sum of:
        0.007078358 = weight(_text_:a in 2445) [ClassicSimilarity], result of:
          0.007078358 = score(doc=2445,freq=6.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.13239266 = fieldWeight in 2445, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=2445)
        0.011601503 = product of:
          0.023203006 = sum of:
            0.023203006 = weight(_text_:information in 2445) [ClassicSimilarity], result of:
              0.023203006 = score(doc=2445,freq=12.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.2850541 = fieldWeight in 2445, product of:
                  3.4641016 = tf(freq=12.0), with freq of:
                    12.0 = termFreq=12.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2445)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    This article explores how to develop complex data driven user models that go beyond the bag of words model and topical relevance. We propose to learn from rich user specific information and to satisfy complex user criteria under the graphical modelling framework. We carried out a user study with a web based personal news filtering system, and collected extensive user information, including explicit user feedback, implicit user feedback and some contextual information. Experimental results on the data set collected demonstrate that the graphical modelling approach helps us to better understand the complex domain. The results also show that the complex data driven user modelling approach can improve the adaptive information filtering performance. We also discuss some practical issues while learning complex user models, including how to handle data noise and the missing data problem.
    Footnote
    Beitrag in einem Themenheft "Adaptive information retrieval"
    Source
    Information processing and management. 44(2008) no.6, S.1886-1900
    Type
    a
  13. Zhang, Y.: Toward a layered model of context for health information searching : an analysis of consumer-generated questions (2013) 0.01
    0.0072230585 = product of:
      0.018057646 = sum of:
        0.0076151006 = weight(_text_:a in 953) [ClassicSimilarity], result of:
          0.0076151006 = score(doc=953,freq=10.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.14243183 = fieldWeight in 953, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=953)
        0.010442546 = product of:
          0.020885091 = sum of:
            0.020885091 = weight(_text_:information in 953) [ClassicSimilarity], result of:
              0.020885091 = score(doc=953,freq=14.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.256578 = fieldWeight in 953, product of:
                  3.7416575 = tf(freq=14.0), with freq of:
                    14.0 = termFreq=14.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=953)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    Designing effective consumer health information systems requires deep understanding of the context in which the systems are being used. However, due to the elusive nature of the concept of context, few studies have made it a focus of examination. To fill this gap, we studied the context of consumer health information searching by analyzing questions posted on a social question and answer site: Yahoo! Answers. Based on the analysis, a model of context was developed. The model consists of 5 layers: demographic, cognitive, affective, situational, and social and environmental. The demographic layer contains demographic factors of the person of concern; the cognitive layer contains factors related to the current search task (specifically, topics of interest and information goals) and users' cognitive ability to articulate their needs. The affective layer contains different affective motivations and intentions behind the search. The situational layer contains users' perceptions of the current health condition and where the person is in the illness trajectory. The social and environmental layer contains users' social roles, social norms, and various information channels. Several novel system functions, including faceted search and layered presentation of results, are proposed based on the model to help contextualize and improve users' interactions with health information systems.
    Source
    Journal of the American Society for Information Science and Technology. 64(2013) no.6, S.1158-1172
    Type
    a
  14. Zhang, Y.: Understanding the sustained use of online health communities from a self-determination perspective (2016) 0.01
    0.0072039375 = product of:
      0.018009843 = sum of:
        0.008341924 = weight(_text_:a in 3216) [ClassicSimilarity], result of:
          0.008341924 = score(doc=3216,freq=12.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.15602624 = fieldWeight in 3216, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3216)
        0.009667919 = product of:
          0.019335838 = sum of:
            0.019335838 = weight(_text_:information in 3216) [ClassicSimilarity], result of:
              0.019335838 = score(doc=3216,freq=12.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.23754507 = fieldWeight in 3216, product of:
                  3.4641016 = tf(freq=12.0), with freq of:
                    12.0 = termFreq=12.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3216)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    Sustained use of an information source is sometimes important for achieving an individual's long-term goals, such as learning and self-development. It is even more important for users of online health communities because health benefits usually come with sustained use. However, little is known about what retains a user. We interviewed 21 participants who had been using online diabetes communities in a sustained manner. Guided by self-determination theory, which posits that behaviors are sustained when they can satisfy basic human needs for autonomy, competence, and relatedness, we identified mechanisms that help satisfy these needs, and thus sustain users in online health communities. Autonomy-supportive mechanisms include being respected and supported as a unique individual, feeling free in making choices, and receiving meaningful rationales about others' decisions. Competence-cultivating mechanisms include seeking information, providing information, and exchanging information with others to construct knowledge. Mechanisms that cultivate relatedness include seeing similarities between oneself and peers, receiving responses from others, providing emotional support, and forming small underground groups for closer interactions. The results suggest that, like emotions, information and small group interactions also play a key role in retaining users. System design and community management strategies are discussed based on these mechanisms.
    Source
    Journal of the Association for Information Science and Technology. 67(2016) no.12, S.22842-2857
    Type
    a
  15. Zhang, M.; Zhang, Y.: Professional organizations in Twittersphere : an empirical study of U.S. library and information science professional organizations-related Tweets (2020) 0.01
    0.007131535 = product of:
      0.017828837 = sum of:
        0.008258085 = weight(_text_:a in 5775) [ClassicSimilarity], result of:
          0.008258085 = score(doc=5775,freq=6.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.1544581 = fieldWeight in 5775, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5775)
        0.009570752 = product of:
          0.019141505 = sum of:
            0.019141505 = weight(_text_:information in 5775) [ClassicSimilarity], result of:
              0.019141505 = score(doc=5775,freq=6.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.23515764 = fieldWeight in 5775, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=5775)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    Twitter is utilized by many, including professional businesses and organizations; however, there are very few studies on how other entities interact with these organizations in the Twittersphere. This article presents a study that investigates tweets related to 5 major library and information science (LIS) professional organizations in the United States. This study applies a systematic tweets analysis framework, including descriptive analytics, network analytics, and co-word analysis of hashtags. The findings shed light on user engagement with LIS professional organizations and the trending discussion topics on Twitter, which is valuable for enabling more successful social media use and greater influence.
    Source
    Journal of the Association for Information Science and Technology. 71(2020) no.4, S.491-496
    Type
    a
  16. Zhang, Y.; Ren, P.; Rijke, M. de: ¬A taxonomy, data set, and benchmark for detecting and classifying malevolent dialogue responses (2021) 0.01
    0.007004201 = product of:
      0.017510502 = sum of:
        0.010812371 = weight(_text_:a in 356) [ClassicSimilarity], result of:
          0.010812371 = score(doc=356,freq=14.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.20223314 = fieldWeight in 356, product of:
              3.7416575 = tf(freq=14.0), with freq of:
                14.0 = termFreq=14.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=356)
        0.0066981306 = product of:
          0.013396261 = sum of:
            0.013396261 = weight(_text_:information in 356) [ClassicSimilarity], result of:
              0.013396261 = score(doc=356,freq=4.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.16457605 = fieldWeight in 356, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046875 = fieldNorm(doc=356)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    Conversational interfaces are increasingly popular as a way of connecting people to information. With the increased generative capacity of corpus-based conversational agents comes the need to classify and filter out malevolent responses that are inappropriate in terms of content and dialogue acts. Previous studies on the topic of detecting and classifying inappropriate content are mostly focused on a specific category of malevolence or on single sentences instead of an entire dialogue. We make three contributions to advance research on the malevolent dialogue response detection and classification (MDRDC) task. First, we define the task and present a hierarchical malevolent dialogue taxonomy. Second, we create a labeled multiturn dialogue data set and formulate the MDRDC task as a hierarchical classification task. Last, we apply state-of-the-art text classification methods to the MDRDC task, and report on experiments aimed at assessing the performance of these approaches.
    Source
    Journal of the Association for Information Science and Technology. 72(2021) no.12, S.1477-1497
    Type
    a
  17. Zhang, Y.; Zhang, G.; Zhu, D.; Lu, J.: Scientific evolutionary pathways : identifying and visualizing relationships for scientific topics (2017) 0.01
    0.006951616 = product of:
      0.01737904 = sum of:
        0.011797264 = weight(_text_:a in 3758) [ClassicSimilarity], result of:
          0.011797264 = score(doc=3758,freq=24.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.22065444 = fieldWeight in 3758, product of:
              4.8989797 = tf(freq=24.0), with freq of:
                24.0 = termFreq=24.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3758)
        0.0055817757 = product of:
          0.011163551 = sum of:
            0.011163551 = weight(_text_:information in 3758) [ClassicSimilarity], result of:
              0.011163551 = score(doc=3758,freq=4.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.13714671 = fieldWeight in 3758, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3758)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    Whereas traditional science maps emphasize citation statistics and static relationships, this paper presents a term-based method to identify and visualize the evolutionary pathways of scientific topics in a series of time slices. First, we create a data preprocessing model for accurate term cleaning, consolidating, and clustering. Then we construct a simulated data streaming function and introduce a learning process to train a relationship identification function to adapt to changing environments in real time, where relationships of topic evolution, fusion, death, and novelty are identified. The main result of the method is a map of scientific evolutionary pathways. The visual routines provide a way to indicate the interactions among scientific subjects and a version in a series of time slices helps further illustrate such evolutionary pathways in detail. The detailed outline offers sufficient statistical information to delve into scientific topics and routines and then helps address meaningful insights with the assistance of expert knowledge. This empirical study focuses on scientific proposals granted by the United States National Science Foundation, and demonstrates the feasibility and reliability. Our method could be widely applied to a range of science, technology, and innovation policy research, and offer insight into the evolutionary pathways of scientific activities.
    Source
    Journal of the Association for Information Science and Technology. 68(2017) no.8, S.1925-1939
    Type
    a
  18. Zhang, Y.: Dimensions and elements of people's mental models of an information-rich Web space (2010) 0.01
    0.006866995 = product of:
      0.017167486 = sum of:
        0.008341924 = weight(_text_:a in 4098) [ClassicSimilarity], result of:
          0.008341924 = score(doc=4098,freq=12.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.15602624 = fieldWeight in 4098, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4098)
        0.008825562 = product of:
          0.017651124 = sum of:
            0.017651124 = weight(_text_:information in 4098) [ClassicSimilarity], result of:
              0.017651124 = score(doc=4098,freq=10.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.21684799 = fieldWeight in 4098, product of:
                  3.1622777 = tf(freq=10.0), with freq of:
                    10.0 = termFreq=10.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4098)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    Although considered proxies for people to interact with a system, mental models have produced limited practical implications for system design. This might be due to the lack of exploration of the elements of mental models resulting from the methodological challenge of measuring mental models. This study employed a new method, concept listing, to elicit people's mental models of an information-rich space, MedlinePlus, after they interacted with the system for 5 minutes. Thirty-eight undergraduate students participated in the study. The results showed that, in this short period of time, participants perceived MedlinePlus from many different aspects in relation to four components: the system as a whole, its content, information organization, and interface. Meanwhile, participants expressed evaluations of or emotions about the four components. In terms of the procedural knowledge, an integral part of people's mental models, only one participant identified a strategy more aligned to the capabilities of MedlinePlus to solve a hypothetical task; the rest planned to use general search and browse strategies. The composition of participants' mental models of MedlinePlus was consistent with that of their models of information-rich Web spaces in general.
    Source
    Journal of the American Society for Information Science and Technology. 61(2010) no.11, S.2206-2218
    Type
    a
  19. Zhang, Y.; Xu, W.: Fast exact maximum likelihood estimation for mixture of language model (2008) 0.01
    0.0066833766 = product of:
      0.016708441 = sum of:
        0.0100103095 = weight(_text_:a in 2082) [ClassicSimilarity], result of:
          0.0100103095 = score(doc=2082,freq=12.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.18723148 = fieldWeight in 2082, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=2082)
        0.0066981306 = product of:
          0.013396261 = sum of:
            0.013396261 = weight(_text_:information in 2082) [ClassicSimilarity], result of:
              0.013396261 = score(doc=2082,freq=4.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.16457605 = fieldWeight in 2082, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2082)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    Language modeling is an effective and theoretically attractive probabilistic framework for text information retrieval. The basic idea of this approach is to estimate a language model of a given document (or document set), and then do retrieval or classification based on this model. A common language modeling approach assumes the data D is generated from a mixture of several language models. The core problem is to find the maximum likelihood estimation of one language model mixture, given the fixed mixture weights and the other language model mixture. The EM algorithm is usually used to find the solution. In this paper, we proof that an exact maximum likelihood estimation of the unknown mixture component exists and can be calculated using the new algorithm we proposed. We further improve the algorithm and provide an efficient algorithm of O(k) complexity to find the exact solution, where k is the number of words occurring at least once in data D. Furthermore, we proof the probabilities of many words are exactly zeros, and the MLE estimation is implemented as a feature selection technique explicitly.
    Source
    Information processing and management. 44(2008) no.3, S.1076-1085
    Type
    a
  20. Chung, W.; Zhang, Y.; Huang, Z.; Wang, G.; Ong, T.-H.; Chen, H.: Internet searching and browsing in a multilingual world : an experiment an the Chinese Business Intelligence Portal (CBizPort) (2004) 0.01
    0.0065874713 = product of:
      0.016468678 = sum of:
        0.009632425 = weight(_text_:a in 2393) [ClassicSimilarity], result of:
          0.009632425 = score(doc=2393,freq=16.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.18016359 = fieldWeight in 2393, product of:
              4.0 = tf(freq=16.0), with freq of:
                16.0 = termFreq=16.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2393)
        0.006836252 = product of:
          0.013672504 = sum of:
            0.013672504 = weight(_text_:information in 2393) [ClassicSimilarity], result of:
              0.013672504 = score(doc=2393,freq=6.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.16796975 = fieldWeight in 2393, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2393)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    The rapid growth of the non-English-speaking Internet population has created a need for better searching and browsing capabilities in languages other than English. However, existing search engines may not serve the needs of many non-English-speaking Internet users. In this paper, we propose a generic and integrated approach to searching and browsing the Internet in a multilingual world. Based an this approach, we have developed the Chinese Business Intelligence Portal (CBizPort), a meta-search engine that searches for business information of mainland China, Taiwan, and Hong Kong. Additional functions provided by CBizPort include encoding conversion (between Simplified Chinese and Traditional Chinese), summarization, and categorization. Experimental results of our user evaluation study show that the searching and browsing performance of CBizPort was comparable to that of regional Chinese search engines, and CBizPort could significantly augment these search engines. Subjects' verbal comments indicate that CBizPort performed best in terms of analysis functions, cross-regional searching, and user-friendliness, whereas regional search engines were more efficient and more popular. Subjects especially liked CBizPort's summarizer and categorizer, which helped in understanding search results. These encouraging results suggest a promising future of our approach to Internet searching and browsing in a multilingual world.
    Footnote
    Teil eines Themenheftes zu: Information seeking research
    Source
    Journal of the American Society for Information Science and Technology. 55(2004) no.9, S.818-831
    Type
    a

Years

Types

  • a 43
  • m 1
  • More… Less…

Classifications