Search (42 results, page 1 of 3)

Zhang, Y.: Undergraduate students' mental models of the Web as an information retrieval system (2008) 0.02
```
0.023858808 = product of:
  0.09543523 = sum of:
    0.053399518 = weight(_text_:web in 2385) [ClassicSimilarity], result of:
      0.053399518 = score(doc=2385,freq=14.0), product of:
        0.111951075 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03430388 = queryNorm
        0.47698978 = fieldWeight in 2385, product of:
          3.7416575 = tf(freq=14.0), with freq of:
            14.0 = termFreq=14.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2385)
    0.015451051 = weight(_text_:information in 2385) [ClassicSimilarity], result of:
      0.015451051 = score(doc=2385,freq=14.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.256578 = fieldWeight in 2385, product of:
          3.7416575 = tf(freq=14.0), with freq of:
            14.0 = termFreq=14.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2385)
    0.026584659 = weight(_text_:system in 2385) [ClassicSimilarity], result of:
      0.026584659 = score(doc=2385,freq=4.0), product of:
        0.10804188 = queryWeight, product of:
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.03430388 = queryNorm
        0.24605882 = fieldWeight in 2385, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2385)
  0.25 = coord(3/12)
```
Abstract

This study explored undergraduate students' mental models of the Web as an information retrieval system. Mental models play an important role in people's interaction with information systems. Better understanding of people's mental models could inspire better interface design and user instruction. Multiple data-collection methods, including questionnaire, semistructured interview, drawing, and participant observation, were used to elicit students' mental models of the Web from different perspectives, though only data from interviews and drawing descriptions are reported in this article. Content analysis of the transcripts showed that students had utilitarian rather than structural mental models of the Web. The majority of participants saw the Web as a huge information resource where everything can be found rather than an infrastructure consisting of hardware and computer applications. Students had different mental models of how information is organized on the Web, and the models varied in correctness and complexity. Students' mental models of search on the Web were illustrated from three points of view: avenues of getting information, understanding of search engines' working mechanisms, and search tactics. The research results suggest that there are mainly three sources contributing to the construction of mental models: personal observation, communication with others, and class instruction. In addition to structural and functional aspects, mental models have an emotional dimension.

Source

Journal of the American Society for Information Science and Technology. 59(2008) no.13, S.2087-2098
Zhang, Y.; Zhang, C.; Li, J.: Joint modeling of characters, words, and conversation contexts for microblog keyphrase extraction (2020) 0.02
```
0.023269854 = product of:
  0.13961913 = sum of:
    0.0058399485 = weight(_text_:information in 5816) [ClassicSimilarity], result of:
      0.0058399485 = score(doc=5816,freq=2.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.09697737 = fieldWeight in 5816, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5816)
    0.13377918 = weight(_text_:extraction in 5816) [ClassicSimilarity], result of:
      0.13377918 = score(doc=5816,freq=8.0), product of:
        0.20380433 = queryWeight, product of:
          5.941145 = idf(docFreq=315, maxDocs=44218)
          0.03430388 = queryNorm
        0.6564099 = fieldWeight in 5816, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          5.941145 = idf(docFreq=315, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5816)
  0.16666667 = coord(2/12)
```
Abstract

Millions of messages are produced on microblog platforms every day, leading to the pressing need for automatic identification of key points from the massive texts. To absorb salient content from the vast bulk of microblog posts, this article focuses on the task of microblog keyphrase extraction. In previous work, most efforts treat messages as independent documents and might suffer from the data sparsity problem exhibited in short and informal microblog posts. On the contrary, we propose to enrich contexts via exploiting conversations initialized by target posts and formed by their replies, which are generally centered around relevant topics to the target posts and therefore helpful for keyphrase identification. Concretely, we present a neural keyphrase extraction framework, which has 2 modules: a conversation context encoder and a keyphrase tagger. The conversation context encoder captures indicative representation from their conversation contexts and feeds the representation into the keyphrase tagger, and the keyphrase tagger extracts salient words from target posts. The 2 modules were trained jointly to optimize the conversation context encoding and keyphrase extraction processes. In the conversation context encoder, we leverage hierarchical structures to capture the word-level indicative representation and message-level indicative representation hierarchically. In both of the modules, we apply character-level representations, which enables the model to explore morphological features and deal with the out-of-vocabulary problem caused by the informal language style of microblog messages. Extensive comparison results on real-life data sets indicate that our model outperforms state-of-the-art models from previous studies.

Source

Journal of the Association for Information Science and Technology. 71(2020) no.5, S.553-567
Zhang, Y.; Zhang, C.: Enhancing keyphrase extraction from microblogs using human reading time (2021) 0.02
```
0.023269854 = product of:
  0.13961913 = sum of:
    0.0058399485 = weight(_text_:information in 237) [ClassicSimilarity], result of:
      0.0058399485 = score(doc=237,freq=2.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.09697737 = fieldWeight in 237, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0390625 = fieldNorm(doc=237)
    0.13377918 = weight(_text_:extraction in 237) [ClassicSimilarity], result of:
      0.13377918 = score(doc=237,freq=8.0), product of:
        0.20380433 = queryWeight, product of:
          5.941145 = idf(docFreq=315, maxDocs=44218)
          0.03430388 = queryNorm
        0.6564099 = fieldWeight in 237, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          5.941145 = idf(docFreq=315, maxDocs=44218)
          0.0390625 = fieldNorm(doc=237)
  0.16666667 = coord(2/12)
```
Abstract

The premise of manual keyphrase annotation is to read the corresponding content of an annotated object. Intuitively, when we read, more important words will occupy a longer reading time. Hence, by leveraging human reading time, we can find the salient words in the corresponding content. However, previous studies on keyphrase extraction ignore human reading features. In this article, we aim to leverage human reading time to extract keyphrases from microblog posts. There are two main tasks in this study. One is to determine how to measure the time spent by a human on reading a word. We use eye fixation durations (FDs) extracted from an open source eye-tracking corpus. Moreover, we propose strategies to make eye FD more effective on keyphrase extraction. The other task is to determine how to integrate human reading time into keyphrase extraction models. We propose two novel neural network models. The first is a model in which the human reading time is used as the ground truth of the attention mechanism. In the second model, we use human reading time as the external feature. Quantitative and qualitative experiments show that our proposed models yield better performance than the baseline models on two microblog datasets.

Source

Journal of the Association for Information Science and Technology. 72(2021) no.5, S.611-626

Zhang, Y.: Dimensions and elements of people's mental models of an information-rich Web space (2010) 0.02

0.019799538 = product of:
  0.07919815 = sum of:
    0.028543243 = weight(_text_:web in 4098) [ClassicSimilarity], result of:
      0.028543243 = score(doc=4098,freq=4.0), product of:
        0.111951075 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03430388 = queryNorm
        0.25496176 = fieldWeight in 4098, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=4098)
    0.013058522 = weight(_text_:information in 4098) [ClassicSimilarity], result of:
      0.013058522 = score(doc=4098,freq=10.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.21684799 = fieldWeight in 4098, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0390625 = fieldNorm(doc=4098)
    0.037596382 = weight(_text_:system in 4098) [ClassicSimilarity], result of:
      0.037596382 = score(doc=4098,freq=8.0), product of:
        0.10804188 = queryWeight, product of:
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.03430388 = queryNorm
        0.3479797 = fieldWeight in 4098, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.0390625 = fieldNorm(doc=4098)
  0.25 = coord(3/12)

Abstract: Although considered proxies for people to interact with a system, mental models have produced limited practical implications for system design. This might be due to the lack of exploration of the elements of mental models resulting from the methodological challenge of measuring mental models. This study employed a new method, concept listing, to elicit people's mental models of an information-rich space, MedlinePlus, after they interacted with the system for 5 minutes. Thirty-eight undergraduate students participated in the study. The results showed that, in this short period of time, participants perceived MedlinePlus from many different aspects in relation to four components: the system as a whole, its content, information organization, and interface. Meanwhile, participants expressed evaluations of or emotions about the four components. In terms of the procedural knowledge, an integral part of people's mental models, only one participant identified a strategy more aligned to the capabilities of MedlinePlus to solve a hypothetical task; the rest planned to use general search and browse strategies. The composition of participants' mental models of MedlinePlus was consistent with that of their models of information-rich Web spaces in general.
Source: Journal of the American Society for Information Science and Technology. 61(2010) no.11, S.2206-2218

Zhang, Y.: Complex adaptive filtering user profile using graphical models (2008) 0.02

0.015985861 = product of:
  0.063943446 = sum of:
    0.024219744 = weight(_text_:web in 2445) [ClassicSimilarity], result of:
      0.024219744 = score(doc=2445,freq=2.0), product of:
        0.111951075 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03430388 = queryNorm
        0.21634221 = fieldWeight in 2445, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=2445)
    0.017165873 = weight(_text_:information in 2445) [ClassicSimilarity], result of:
      0.017165873 = score(doc=2445,freq=12.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.2850541 = fieldWeight in 2445, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.046875 = fieldNorm(doc=2445)
    0.02255783 = weight(_text_:system in 2445) [ClassicSimilarity], result of:
      0.02255783 = score(doc=2445,freq=2.0), product of:
        0.10804188 = queryWeight, product of:
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.03430388 = queryNorm
        0.20878783 = fieldWeight in 2445, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.046875 = fieldNorm(doc=2445)
  0.25 = coord(3/12)

Abstract: This article explores how to develop complex data driven user models that go beyond the bag of words model and topical relevance. We propose to learn from rich user specific information and to satisfy complex user criteria under the graphical modelling framework. We carried out a user study with a web based personal news filtering system, and collected extensive user information, including explicit user feedback, implicit user feedback and some contextual information. Experimental results on the data set collected demonstrate that the graphical modelling approach helps us to better understand the complex domain. The results also show that the complex data driven user modelling approach can improve the adaptive information filtering performance. We also discuss some practical issues while learning complex user models, including how to handle data noise and the missing data problem.
Footnote: Beitrag in einem Themenheft "Adaptive information retrieval"
Source: Information processing and management. 44(2008) no.6, S.1886-1900

Zhang, Y.; Jansen, B.J.; Spink, A.: Identification of factors predicting clickthrough in Web searching using neural network analysis (2009) 0.01

0.013800734 = product of:
  0.055202935 = sum of:
    0.03425189 = weight(_text_:web in 2742) [ClassicSimilarity], result of:
      0.03425189 = score(doc=2742,freq=4.0), product of:
        0.111951075 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03430388 = queryNorm
        0.3059541 = fieldWeight in 2742, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=2742)
    0.0070079383 = weight(_text_:information in 2742) [ClassicSimilarity], result of:
      0.0070079383 = score(doc=2742,freq=2.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.116372846 = fieldWeight in 2742, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.046875 = fieldNorm(doc=2742)
    0.013943106 = product of:
      0.027886212 = sum of:
        0.027886212 = weight(_text_:22 in 2742) [ClassicSimilarity], result of:
          0.027886212 = score(doc=2742,freq=2.0), product of:
            0.120126344 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03430388 = queryNorm
            0.23214069 = fieldWeight in 2742, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=2742)
      0.5 = coord(1/2)
  0.25 = coord(3/12)

Abstract: In this research, we aim to identify factors that significantly affect the clickthrough of Web searchers. Our underlying goal is determine more efficient methods to optimize the clickthrough rate. We devise a clickthrough metric for measuring customer satisfaction of search engine results using the number of links visited, number of queries a user submits, and rank of clicked links. We use a neural network to detect the significant influence of searching characteristics on future user clickthrough. Our results show that high occurrences of query reformulation, lengthy searching duration, longer query length, and the higher ranking of prior clicked links correlate positively with future clickthrough. We provide recommendations for leveraging these findings for improving the performance of search engine retrieval and result ranking, along with implications for search engine marketing.
Date: 22. 3.2009 17:49:11
Source: Journal of the American Society for Information Science and Technology. 60(2009) no.3, S.557-570

Dang, Y.; Zhang, Y.; Chen, H.; Hu, P.J.-H.; Brown, S.A.; Larson, C.: Arizona Literature Mapper : an integrated approach to monitor and analyze global bioterrorism research literature (2009) 0.01
```
0.0118100615 = product of:
  0.047240246 = sum of:
    0.02018312 = weight(_text_:web in 2943) [ClassicSimilarity], result of:
      0.02018312 = score(doc=2943,freq=2.0), product of:
        0.111951075 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03430388 = queryNorm
        0.18028519 = fieldWeight in 2943, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2943)
    0.008258934 = weight(_text_:information in 2943) [ClassicSimilarity], result of:
      0.008258934 = score(doc=2943,freq=4.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.13714671 = fieldWeight in 2943, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2943)
    0.018798191 = weight(_text_:system in 2943) [ClassicSimilarity], result of:
      0.018798191 = score(doc=2943,freq=2.0), product of:
        0.10804188 = queryWeight, product of:
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.03430388 = queryNorm
        0.17398985 = fieldWeight in 2943, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2943)
  0.25 = coord(3/12)
```
Abstract

Biomedical research is critical to biodefense, which is drawing increasing attention from governments globally as well as from various research communities. The U.S. government has been closely monitoring and regulating biomedical research activities, particularly those studying or involving bioterrorism agents or diseases. Effective surveillance requires comprehensive understanding of extant biomedical research and timely detection of new developments or emerging trends. The rapid knowledge expansion, technical breakthroughs, and spiraling collaboration networks demand greater support for literature search and sharing, which cannot be effectively supported by conventional literature search mechanisms or systems. In this study, we propose an integrated approach that integrates advanced techniques for content analysis, network analysis, and information visualization. We design and implement Arizona Literature Mapper, a Web-based portal that allows users to gain timely, comprehensive understanding of bioterrorism research, including leading scientists, research groups, institutions as well as insights about current mainstream interests or emerging trends. We conduct two user studies to evaluate Arizona Literature Mapper and include a well-known system for benchmarking purposes. According to our results, Arizona Literature Mapper is significantly more effective for supporting users' search of bioterrorism publications than PubMed. Users consider Arizona Literature Mapper more useful and easier to use than PubMed. Users are also more satisfied with Arizona Literature Mapper and show stronger intentions to use it in the future. Assessments of Arizona Literature Mapper's analysis functions are also positive, as our subjects consider them useful, easy to use, and satisfactory. Our results have important implications that are also discussed in the article.

Source

Journal of the American Society for Information Science and Technology. 60(2009) no.7, S.1466-1485

Zhang, Y.; Liu, J.; Song, S.: ¬The design and evaluation of a nudge-based interface to facilitate consumers' evaluation of online health information credibility (2023) 0.01

0.011215224 = product of:
  0.044860896 = sum of:
    0.02018312 = weight(_text_:web in 993) [ClassicSimilarity], result of:
      0.02018312 = score(doc=993,freq=2.0), product of:
        0.111951075 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03430388 = queryNorm
        0.18028519 = fieldWeight in 993, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=993)
    0.013058522 = weight(_text_:information in 993) [ClassicSimilarity], result of:
      0.013058522 = score(doc=993,freq=10.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.21684799 = fieldWeight in 993, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0390625 = fieldNorm(doc=993)
    0.011619256 = product of:
      0.023238512 = sum of:
        0.023238512 = weight(_text_:22 in 993) [ClassicSimilarity], result of:
          0.023238512 = score(doc=993,freq=2.0), product of:
            0.120126344 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03430388 = queryNorm
            0.19345059 = fieldWeight in 993, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=993)
      0.5 = coord(1/2)
  0.25 = coord(3/12)

Abstract: Evaluating the quality of online health information (OHI) is a major challenge facing consumers. We designed PageGraph, an interface that displays quality indicators and associated values for a webpage, based on credibility evaluation models, the nudge theory, and existing empirical research concerning professionals' and consumers' evaluation of OHI quality. A qualitative evaluation of the interface with 16 participants revealed that PageGraph rendered the information and presentation nudges as intended. It provided the participants with easier access to quality indicators, encouraged fresh angles to assess information credibility, provided an evaluation framework, and encouraged validation of initial judgments. We then conducted a quantitative evaluation of the interface involving 60 participants using a between-subject experimental design. The control group used a regular web browser and evaluated the credibility of 12 preselected webpages, whereas the experimental group evaluated the same webpages with the assistance of PageGraph. PageGraph did not significantly influence participants' evaluation results. The results may be attributed to the insufficiency of the saliency and structure of the nudges implemented and the webpage stimuli's lack of sensitivity to the intervention. Future directions for applying nudges to support OHI evaluation were discussed.
Date: 22. 6.2023 18:18:34
Source: Journal of the Association for Information Science and Technology. 74(2023) no.7, S.828-845

Zhang, Y.: ¬The influence of mental models on undergraduate students' searching behavior on the Web (2008) 0.01
```
0.010194156 = product of:
  0.061164938 = sum of:
    0.054157 = weight(_text_:web in 2097) [ClassicSimilarity], result of:
      0.054157 = score(doc=2097,freq=10.0), product of:
        0.111951075 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03430388 = queryNorm
        0.48375595 = fieldWeight in 2097, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=2097)
    0.0070079383 = weight(_text_:information in 2097) [ClassicSimilarity], result of:
      0.0070079383 = score(doc=2097,freq=2.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.116372846 = fieldWeight in 2097, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.046875 = fieldNorm(doc=2097)
  0.16666667 = coord(2/12)
```
Abstract

This article explores the effects of undergraduate students' mental models of the Web on their online searching behavior. Forty-four undergraduate students, mainly freshmen and sophomores, participated in the study. Subjects' mental models of the Web were treated as equally good styles and operationalized as drawings of their perceptions about the Web. Four types of mental models of the Web were identified based on the drawings and the associated descriptions: technical view, functional view, process view, and connection view. In the study, subjects were required to finish two search tasks. Searching behavior was measured from four aspects: navigation and performance, subjects' feelings about tasks and their own performances, query construction, and search patterns. The four mental model groups showed different navigation and querying behaviors, but the differences were not significant. Subjects' satisfaction with their own performances was found to be significantly correlated with the time to complete the task. The results also showed that the familiarity of the task to subjects had a major effect on their ways to start interaction, query construction, and search patterns.

Source

Information processing and management. 44(2008) no.3, S.1330-1345
Ku, Y.; Chiu, C.; Zhang, Y.; Chen, H.; Su, H.: Text mining self-disclosing health information for public health service (2014) 0.01
```
0.008320353 = product of:
  0.049922116 = sum of:
    0.03425189 = weight(_text_:web in 1262) [ClassicSimilarity], result of:
      0.03425189 = score(doc=1262,freq=4.0), product of:
        0.111951075 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03430388 = queryNorm
        0.3059541 = fieldWeight in 1262, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=1262)
    0.015670227 = weight(_text_:information in 1262) [ClassicSimilarity], result of:
      0.015670227 = score(doc=1262,freq=10.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.2602176 = fieldWeight in 1262, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.046875 = fieldNorm(doc=1262)
  0.16666667 = coord(2/12)
```
Abstract

Understanding specific patterns or knowledge of self-disclosing health information could support public health surveillance and healthcare. This study aimed to develop an analytical framework to identify self-disclosing health information with unusual messages on web forums by leveraging advanced text-mining techniques. To demonstrate the performance of the proposed analytical framework, we conducted an experimental study on 2 major human immunodeficiency virus (HIV)/acquired immune deficiency syndrome (AIDS) forums in Taiwan. The experimental results show that the classification accuracy increased significantly (up to 83.83%) when using features selected by the information gain technique. The results also show the importance of adopting domain-specific features in analyzing unusual messages on web forums. This study has practical implications for the prevention and support of HIV/AIDS healthcare. For example, public health agencies can re-allocate resources and deliver services to people who need help via social media sites. In addition, individuals can also join a social media site to get better suggestions and support from each other.

Source

Journal of the Association for Information Science and Technology. 65(2014) no.5, S.928-947
Tenopir, C.; Wang, P.; Zhang, Y.; Simmons, B.; Pollard, R.: Academic users' interactions with ScienceDirect in search tasks : affective and cognitive behaviors (2008) 0.01
```
0.0073732203 = product of:
  0.04423932 = sum of:
    0.011679897 = weight(_text_:information in 2027) [ClassicSimilarity], result of:
      0.011679897 = score(doc=2027,freq=8.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.19395474 = fieldWeight in 2027, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2027)
    0.03255942 = weight(_text_:system in 2027) [ClassicSimilarity], result of:
      0.03255942 = score(doc=2027,freq=6.0), product of:
        0.10804188 = queryWeight, product of:
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.03430388 = queryNorm
        0.30135927 = fieldWeight in 2027, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2027)
  0.16666667 = coord(2/12)
```
Abstract

This article presents part of phase 2 of a research project funded by the NSF-National Science Digital Library Project, which observed how academic users interact with the ScienceDirect information retrieval system for simulated class-related assignments. The ultimate goal of the project is twofold: (1) to find ways to improve science and engineering students' use of science e-journal systems; (2) to develop methods to measure user interaction behaviors. Process-tracing technique recorded participants' processes and interaction behaviors that are measurable; think-aloud protocol captured participants' affective and cognitive verbalizations; pre- and post-search questionnaires solicited demographic information, prior experience with the system, and comments. We explored possible relationships between affective feelings and cognitive behaviors. During search interactions both feelings and thoughts occurred frequently. Positive feelings were more common and were associated more often with thoughts about results. Negative feelings were associated more often with thoughts related to the system, search strategy, and task. Learning styles are also examined as a factor influencing behavior. Engineering graduate students with an assimilating learning style searched longer and paused less than those with a converging learning style. Further exploration of learning styles is suggested.

Footnote

Beitrag eines Themenbereichs: Evaluation of Interactive Information Retrieval Systems

Source

Information processing and management. 44(2008) no.1, S.105-121
Zhang, Y.; Sun, Y.; Xie, B.: Quality of health information for consumers on the web : a systematic review of indicators, criteria, tools, and evaluation results (2015) 0.01
```
0.0073323823 = product of:
  0.043994293 = sum of:
    0.028543243 = weight(_text_:web in 2218) [ClassicSimilarity], result of:
      0.028543243 = score(doc=2218,freq=4.0), product of:
        0.111951075 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03430388 = queryNorm
        0.25496176 = fieldWeight in 2218, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2218)
    0.015451051 = weight(_text_:information in 2218) [ClassicSimilarity], result of:
      0.015451051 = score(doc=2218,freq=14.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.256578 = fieldWeight in 2218, product of:
          3.7416575 = tf(freq=14.0), with freq of:
            14.0 = termFreq=14.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2218)
  0.16666667 = coord(2/12)
```
Abstract

The quality of online health information for consumers has been a critical issue that concerns all stakeholders in healthcare. To gain an understanding of how quality is evaluated, this systematic review examined 165 articles in which researchers evaluated the quality of consumer-oriented health information on the web against predefined criteria. It was found that studies typically evaluated quality in relation to the substance and formality of content, as well as to the design of technological platforms. Attention to design, particularly interactivity, privacy, and social and cultural appropriateness is on the rise, which suggests the permeation of a user-centered perspective into the evaluation of health information systems, and a growing recognition of the need to study these systems from a social-technical perspective. Researchers used many preexisting instruments to facilitate evaluation of the formality of content; however, only a few were used in multiple studies, and their validity was questioned. The quality of content (i.e., accuracy and completeness) was always evaluated using proprietary instruments constructed based on medical guidelines or textbooks. The evaluation results revealed that the quality of health information varied across medical domains and across websites, and that the overall quality remained problematic. Future research is needed to examine the quality of user-generated content and to explore opportunities offered by emerging new media that can facilitate the consumer evaluation of health information.

Source

Journal of the Association for Information Science and Technology. 66(2015) no.10, S.2071-2084
Zhang, Y.; Broussard, R.; Ke, W.; Gong, X.: Evaluation of a scatter/gather interface for supporting distinct health information search tasks (2014) 0.01
```
0.0071413564 = product of:
  0.042848136 = sum of:
    0.028543243 = weight(_text_:web in 1261) [ClassicSimilarity], result of:
      0.028543243 = score(doc=1261,freq=4.0), product of:
        0.111951075 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03430388 = queryNorm
        0.25496176 = fieldWeight in 1261, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1261)
    0.014304894 = weight(_text_:information in 1261) [ClassicSimilarity], result of:
      0.014304894 = score(doc=1261,freq=12.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.23754507 = fieldWeight in 1261, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1261)
  0.16666667 = coord(2/12)
```
Abstract

Web search engines are important gateways for users to access health information. This study explored whether a search interface based on the Bing API and enabled by Scatter/Gather, a well-known document-clustering technique, can improve health information searches. Forty participants without medical backgrounds were randomly assigned to two interfaces: a baseline interface that resembles typical web search engines and a Scatter/Gather interface. Both groups performed two lookup and two exploratory health-related tasks. It was found that the baseline group was more likely to rephrase queries and less likely to access general-purpose sites than the Scatter/Gather group when completing exploratory tasks. Otherwise, the two groups did not differ in behavior and task performance, with participants in the Scatter/Gather group largely overlooking the features (key words, clusters, and the recluster function) designed to facilitate the exploration of semantic relationships between information objects, a potentially useful means for users in the rather unfamiliar domain of health. The results suggest a strong effect of users' mental models of search on their use of search interfaces and a high cognitive cost associated with using the Scatter/Gather features. It follows that novel features of a search interface should not only be compatible with users' mental models but also provide sufficient affordance to inform users of how they can be used. Compared with the interface, tasks showed more significant impacts on search behavior. In future studies, more effort should be devoted to identify salient features of health-related information needs.

Source

Journal of the Association for Information Science and Technology. 65(2014) no.5, S.1028-1041
Zhang, Y.: Toward a layered model of context for health information searching : an analysis of consumer-generated questions (2013) 0.01
```
0.0057082074 = product of:
  0.034249242 = sum of:
    0.015451051 = weight(_text_:information in 953) [ClassicSimilarity], result of:
      0.015451051 = score(doc=953,freq=14.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.256578 = fieldWeight in 953, product of:
          3.7416575 = tf(freq=14.0), with freq of:
            14.0 = termFreq=14.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0390625 = fieldNorm(doc=953)
    0.018798191 = weight(_text_:system in 953) [ClassicSimilarity], result of:
      0.018798191 = score(doc=953,freq=2.0), product of:
        0.10804188 = queryWeight, product of:
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.03430388 = queryNorm
        0.17398985 = fieldWeight in 953, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.0390625 = fieldNorm(doc=953)
  0.16666667 = coord(2/12)
```
Abstract

Designing effective consumer health information systems requires deep understanding of the context in which the systems are being used. However, due to the elusive nature of the concept of context, few studies have made it a focus of examination. To fill this gap, we studied the context of consumer health information searching by analyzing questions posted on a social question and answer site: Yahoo! Answers. Based on the analysis, a model of context was developed. The model consists of 5 layers: demographic, cognitive, affective, situational, and social and environmental. The demographic layer contains demographic factors of the person of concern; the cognitive layer contains factors related to the current search task (specifically, topics of interest and information goals) and users' cognitive ability to articulate their needs. The affective layer contains different affective motivations and intentions behind the search. The situational layer contains users' perceptions of the current health condition and where the person is in the illness trajectory. The social and environmental layer contains users' social roles, social norms, and various information channels. Several novel system functions, including faceted search and layered presentation of results, are proposed based on the model to help contextualize and improve users' interactions with health information systems.

Source

Journal of the American Society for Information Science and Technology. 64(2013) no.6, S.1158-1172
Zhang, Y.: Understanding the sustained use of online health communities from a self-determination perspective (2016) 0.01
```
0.005517181 = product of:
  0.033103086 = sum of:
    0.014304894 = weight(_text_:information in 3216) [ClassicSimilarity], result of:
      0.014304894 = score(doc=3216,freq=12.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.23754507 = fieldWeight in 3216, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3216)
    0.018798191 = weight(_text_:system in 3216) [ClassicSimilarity], result of:
      0.018798191 = score(doc=3216,freq=2.0), product of:
        0.10804188 = queryWeight, product of:
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.03430388 = queryNorm
        0.17398985 = fieldWeight in 3216, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3216)
  0.16666667 = coord(2/12)
```
Abstract

Sustained use of an information source is sometimes important for achieving an individual's long-term goals, such as learning and self-development. It is even more important for users of online health communities because health benefits usually come with sustained use. However, little is known about what retains a user. We interviewed 21 participants who had been using online diabetes communities in a sustained manner. Guided by self-determination theory, which posits that behaviors are sustained when they can satisfy basic human needs for autonomy, competence, and relatedness, we identified mechanisms that help satisfy these needs, and thus sustain users in online health communities. Autonomy-supportive mechanisms include being respected and supported as a unique individual, feeling free in making choices, and receiving meaningful rationales about others' decisions. Competence-cultivating mechanisms include seeking information, providing information, and exchanging information with others to construct knowledge. Mechanisms that cultivate relatedness include seeing similarities between oneself and peers, receiving responses from others, providing emotional support, and forming small underground groups for closer interactions. The results suggest that, like emotions, information and small group interactions also play a key role in retaining users. System design and community management strategies are discussed based on these mechanisms.

Source

Journal of the Association for Information Science and Technology. 67(2016) no.12, S.22842-2857
Zhang, Y.: ¬The effect of open access on citation impact : a comparison study based on Web citation analysis (2006) 0.01
```
0.0053187194 = product of:
  0.06382463 = sum of:
    0.06382463 = weight(_text_:web in 5071) [ClassicSimilarity], result of:
      0.06382463 = score(doc=5071,freq=20.0), product of:
        0.111951075 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03430388 = queryNorm
        0.5701118 = fieldWeight in 5071, product of:
          4.472136 = tf(freq=20.0), with freq of:
            20.0 = termFreq=20.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5071)
  0.083333336 = coord(1/12)
```
Abstract

The academic impact advantage of Open Access (OA) is a prominent topic of debate in the library and publishing communities. Web citations have been proposed as comparable to, even replacements for, bibliographic citations in assessing the academic impact of journals. In our study, we compare Web citations to articles in an OA journal, the Journal of Computer-Mediated Communication (JCMC), and a traditional access journal, New Media & Society (NMS), in the communication discipline. Web citation counts for JCMC are significantly higher than those for NMS. Furthermore, JCMC receives significantly higher Web citations from the formal scholarly publications posted on the Web than NMS does. The types of Web citations for journal articles were also examined. In the Web context, the impact of a journal can be assessed using more than one type of source: citations from scholarly articles, teaching materials and non-authoritative documents. The OA journal has higher percentages of citations from the third type, which suggests that, in addition to the research community, the impact advantage of open access is also detectable among ordinary users participating in Web-based academic communication. Moreover, our study also proves that the OA journal has impact advantage in developing countries. Compared with NMS, JCMC has more Web citations from developing countries.
Zhang, Y.; Trace, C.B.: ¬The quality of health and wellness self-tracking data : a consumer perspective (2022) 0.00
```
0.00481888 = product of:
  0.02891328 = sum of:
    0.010115089 = weight(_text_:information in 459) [ClassicSimilarity], result of:
      0.010115089 = score(doc=459,freq=6.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.16796975 = fieldWeight in 459, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0390625 = fieldNorm(doc=459)
    0.018798191 = weight(_text_:system in 459) [ClassicSimilarity], result of:
      0.018798191 = score(doc=459,freq=2.0), product of:
        0.10804188 = queryWeight, product of:
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.03430388 = queryNorm
        0.17398985 = fieldWeight in 459, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.0390625 = fieldNorm(doc=459)
  0.16666667 = coord(2/12)
```
Abstract

Information quality (IQ) is key to users' satisfaction with information systems. Understanding what IQ means to users can effectively inform system improvement. Existing inquiries into self-tracking data quality primarily focus on accuracy. Interviewing 20 consumers who had self-tracked health indicators for at least 6 months, we identified eight dimensions that consumers apply to evaluate self-tracking data quality: value-added, accuracy, completeness, accessibility, ease of understanding, trustworthiness, aesthetics, and invasiveness. These dimensions fell into four categories-intrinsic, contextual, representational, and accessibility-suggesting that consumers judge self-tracking data quality not only based on the data's inherent quality but also considering tasks at hand, the clarity of data representation, and data accessibility. We also found that consumers' self-tracking data quality judgments are shaped primarily by their goals or motivations, subjective experience with tracked activities, mental models of how systems work, self-tracking tools' reputation, cost, and design, and domain knowledge and intuition, but less by more objective criteria such as scientific research results, validated devices, or consultation with experts. Future studies should develop and validate a scale for measuring consumers' perceptions of self-tracking data quality and commit efforts to develop technologies and training materials to enhance consumers' ability to evaluate data quality.

Source

Journal of the Association for Information Science and Technology. 73(2022) no.6, S.879-891
Zhang, Y.: ¬The impact of Internet-based electronic resources on formal scholarly communication in the area of library and information science : a citation analysis (1998) 0.00
```
0.004424533 = product of:
  0.026547197 = sum of:
    0.010115089 = weight(_text_:information in 2808) [ClassicSimilarity], result of:
      0.010115089 = score(doc=2808,freq=6.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.16796975 = fieldWeight in 2808, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2808)
    0.016432108 = product of:
      0.032864217 = sum of:
        0.032864217 = weight(_text_:22 in 2808) [ClassicSimilarity], result of:
          0.032864217 = score(doc=2808,freq=4.0), product of:
            0.120126344 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03430388 = queryNorm
            0.27358043 = fieldWeight in 2808, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2808)
      0.5 = coord(1/2)
  0.16666667 = coord(2/12)
```
Abstract

Internet based electronic resources are growing dramatically but there have been no empirical studies evaluating the impact of e-sources, as a whole, on formal scholarly communication. reports results of an investigation into how much e-sources have been used in formal scholarly communication, using a case study in the area of Library and Information Science (LIS) during the period 1994 to 1996. 4 citation based indicators were used in the study of the impact measurement. Concludes that, compared with the impact of print sources, the impact of e-sources on formal scholarly communication in LIS is small, as measured by e-sources cited, and does not increase significantly by year even though there is observable growth of these impact across the years. It is found that periodical format is related to the rate of citing e-sources, articles are more likely to cite e-sources than are print priodical articles. However, once authors cite electronic resource, there is no significant difference in the number of references per article by periodical format or by year. Suggests that, at this stage, citing e-sources may depend on authors rather than the periodical format in which authors choose to publish

Date

30. 1.1999 17:22:22

Source

Journal of information science. 24(1998) no.4, S.241-254
Zhang, Y.: Developing a holistic model for digital library evaluation (2010) 0.00
```
0.0034918408 = product of:
  0.020951044 = sum of:
    0.0070079383 = weight(_text_:information in 2360) [ClassicSimilarity], result of:
      0.0070079383 = score(doc=2360,freq=2.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.116372846 = fieldWeight in 2360, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.046875 = fieldNorm(doc=2360)
    0.013943106 = product of:
      0.027886212 = sum of:
        0.027886212 = weight(_text_:22 in 2360) [ClassicSimilarity], result of:
          0.027886212 = score(doc=2360,freq=2.0), product of:
            0.120126344 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03430388 = queryNorm
            0.23214069 = fieldWeight in 2360, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=2360)
      0.5 = coord(1/2)
  0.16666667 = coord(2/12)
```
Abstract

This article reports the author's recent research in developing a holistic model for various levels of digital library (DL) evaluation in which perceived important criteria from heterogeneous stakeholder groups are organized and presented. To develop such a model, the author applied a three-stage research approach: exploration, confirmation, and verification. During the exploration stage, a literature review was conducted followed by an interview, along with a card sorting technique, to collect important criteria perceived by DL experts. Then the criteria identified were used for developing an online survey during the confirmation stage. Survey respondents (431 in total) from 22 countries rated the importance of the criteria. A holistic DL evaluation model was constructed using statistical techniques. Eventually, the verification stage was devised to test the reliability of the model in the context of searching and evaluating an operational DL. The proposed model fills two lacunae in the DL domain: (a) the lack of a comprehensive and flexible framework to guide and benchmark evaluations, and (b) the uncertainty about what divergence exists among heterogeneous DL stakeholders, including general users.

Source

Journal of the American Society for Information Science and Technology. 61(2010) no.1, S.88-110
Zhang, Y.; Wu, M.; Zhang, G.; Lu, J.: Stepping beyond your comfort zone : diffusion-based network analytics for knowledge trajectory recommendation (2023) 0.00
```
0.0033130317 = product of:
  0.01987819 = sum of:
    0.008258934 = weight(_text_:information in 994) [ClassicSimilarity], result of:
      0.008258934 = score(doc=994,freq=4.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.13714671 = fieldWeight in 994, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0390625 = fieldNorm(doc=994)
    0.011619256 = product of:
      0.023238512 = sum of:
        0.023238512 = weight(_text_:22 in 994) [ClassicSimilarity], result of:
          0.023238512 = score(doc=994,freq=2.0), product of:
            0.120126344 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03430388 = queryNorm
            0.19345059 = fieldWeight in 994, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=994)
      0.5 = coord(1/2)
  0.16666667 = coord(2/12)
```
Abstract

Predicting a researcher's knowledge trajectories beyond their current foci can leverage potential inter-/cross-/multi-disciplinary interactions to achieve exploratory innovation. In this study, we present a method of diffusion-based network analytics for knowledge trajectory recommendation. The method begins by constructing a heterogeneous bibliometric network consisting of a co-topic layer and a co-authorship layer. A novel link prediction approach with a diffusion strategy is then used to capture the interactions between social elements (e.g., collaboration) and knowledge elements (e.g., technological similarity) in the process of exploratory innovation. This diffusion strategy differentiates the interactions occurring among homogeneous and heterogeneous nodes in the heterogeneous bibliometric network and weights the strengths of these interactions. Two sets of experiments-one with a local dataset and the other with a global dataset-demonstrate that the proposed method is prior to 10 selected baselines in link prediction, recommender systems, and upstream graph representation learning. A case study recommending knowledge trajectories of information scientists with topical hierarchy and explainable mediators reveals the proposed method's reliability and potential practical uses in broad scenarios.

Date

22. 6.2023 18:07:12

Source

Journal of the Association for Information Science and Technology. 74(2023) no.7, S.775-790

Search (42 results, page 1 of 3)

Authors

Years

Themes