Search (13 results, page 1 of 1)

  • author_ss:"Wang, Y."
  1. Wang, Y.; Lee, J.-S.; Choi, I.-C.: Indexing by Latent Dirichlet Allocation and an Ensemble Model (2016) 0.02
    0.019982103 = product of:
      0.039964207 = sum of:
        0.039964207 = sum of:
          0.006318258 = weight(_text_:a in 3019) [ClassicSimilarity], result of:
            0.006318258 = score(doc=3019,freq=6.0), product of:
              0.04772363 = queryWeight, product of:
                1.153047 = idf(docFreq=37942, maxDocs=44218)
                0.041389145 = queryNorm
              0.13239266 = fieldWeight in 3019, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                1.153047 = idf(docFreq=37942, maxDocs=44218)
                0.046875 = fieldNorm(doc=3019)
          0.033645947 = weight(_text_:22 in 3019) [ClassicSimilarity], result of:
            0.033645947 = score(doc=3019,freq=2.0), product of:
              0.14493774 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.041389145 = queryNorm
              0.23214069 = fieldWeight in 3019, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.046875 = fieldNorm(doc=3019)
      0.5 = coord(1/2)
    
    Abstract
    The contribution of this article is twofold. First, we present Indexing by latent Dirichlet allocation (LDI), an automatic document indexing method. Many ad hoc applications, or their variants with smoothing techniques suggested in LDA-based language modeling, can result in unsatisfactory performance as the document representations do not accurately reflect concept space. To improve document retrieval performance, we introduce a new definition of document probability vectors in the context of LDA and present a novel scheme for automatic document indexing based on LDA. Second, we propose an Ensemble Model (EnM) for document retrieval. EnM combines basic indexing models by assigning different weights and attempts to uncover the optimal weights to maximize the mean average precision. To solve the optimization problem, we propose an algorithm, which is derived based on the boosting method. The results of our computational experiments on benchmark data sets indicate that both the proposed approaches are viable options for document retrieval.
    Date
    12. 6.2016 21:39:22
    Type
    a
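
The explain tree for this record can be reproduced by hand. The sketch below is a minimal reconstruction, assuming the classic Lucene TF-IDF form that the printed numbers appear to follow: tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = idf * queryNorm, fieldWeight = tf * idf * fieldNorm. The idf formula is inferred from the values shown rather than confirmed by the source, and the function name `classic_term_score` is hypothetical.

```python
import math

# Values copied from the explain tree of doc 3019.
MAX_DOCS = 44218
QUERY_NORM = 0.041389145
FIELD_NORM = 0.046875


def classic_term_score(freq, doc_freq, max_docs, query_norm, field_norm):
    """Score of one query term in one document (assumed TF-IDF form)."""
    tf = math.sqrt(freq)                             # tf(freq)
    idf = 1.0 + math.log(max_docs / (doc_freq + 1))  # idf(docFreq, maxDocs), inferred
    query_weight = idf * query_norm                  # queryWeight
    field_weight = tf * idf * field_norm             # fieldWeight
    return query_weight * field_weight


# Term "a": freq=6, docFreq=37942. Term "22": freq=2, docFreq=3622.
score_a = classic_term_score(6.0, 37942, MAX_DOCS, QUERY_NORM, FIELD_NORM)
score_22 = classic_term_score(2.0, 3622, MAX_DOCS, QUERY_NORM, FIELD_NORM)

# The two term scores are summed, then scaled by coord(1/2).
total = 0.5 * (score_a + score_22)

print(f"a: {score_a:.9f}  22: {score_22:.9f}  total: {total:.9f}")
```

The rare term "22" contributes most of the score even though it occurs only twice, because its idf is roughly three times that of the very common term "a" and idf enters the product twice. Small differences in the last digits are expected, since the listing appears to use single-precision arithmetic.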
  2. Wang, Y.; Shah, C.: Investigating failures in information seeking episodes (2017) 0.02
    0.017742215 = product of:
      0.03548443 = sum of:
        0.03548443 = sum of:
          0.0074461387 = weight(_text_:a in 2922) [ClassicSimilarity], result of:
            0.0074461387 = score(doc=2922,freq=12.0), product of:
              0.04772363 = queryWeight, product of:
                1.153047 = idf(docFreq=37942, maxDocs=44218)
                0.041389145 = queryNorm
              0.15602624 = fieldWeight in 2922, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                1.153047 = idf(docFreq=37942, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2922)
          0.028038291 = weight(_text_:22 in 2922) [ClassicSimilarity], result of:
            0.028038291 = score(doc=2922,freq=2.0), product of:
              0.14493774 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.041389145 = queryNorm
              0.19345059 = fieldWeight in 2922, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2922)
      0.5 = coord(1/2)
    
    Abstract
    Purpose - People face barriers and failures in various kinds of information seeking experiences. These are often attributed to either the information seeker or the system/service they use. The purpose of this paper is to investigate how and why individuals fail to fulfill their information needs in all contexts and situations. It addresses the limitations of existing studies in examining the context of the task and information seeker's strategy and seeks to gain a holistic understanding of information seeking barriers and failures. Design/methodology/approach - The primary method used for this investigation is a qualitative survey, in which 63 participants provided 208 real life examples of failures in information seeking. After analyzing the survey data, ten semi-structured interviews with another group of participants were conducted to further examine the survey findings. Data were analyzed using various theoretical frameworks of tasks, strategies, and barriers. Findings - A careful examination of aspects of tasks, barriers, and strategies identified from the examples revealed that a wide range of external and internal factors caused people's failures. These factors were also caused or affected by multiple aspects of information seekers' tasks and strategies. People's information needs were often too contextual and specific to be fulfilled by the information retrieved. Other barriers, such as time constraint and institutional restrictions, also intensified the problem. Originality/value - This paper highlights the importance of considering the information seeking episodes in which individuals fail to fulfill their needs in a holistic approach by analyzing their tasks, information needs, strategies, and obstacles. The modified theoretical frameworks and the coding methods used could also be instrumental for future research.
    Date
    20. 1.2015 18:30:22
    Type
    a
  3. Evens, M.; Wang, Y.; Vandendorpe, J.: Relational thesauri in information retrieval (1985) 0.00
    0.0024318986 = product of:
      0.004863797 = sum of:
        0.004863797 = product of:
          0.009727594 = sum of:
            0.009727594 = weight(_text_:a in 3911) [ClassicSimilarity], result of:
              0.009727594 = score(doc=3911,freq=2.0), product of:
                0.04772363 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.041389145 = queryNorm
                0.20383182 = fieldWeight in 3911, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.125 = fieldNorm(doc=3911)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Type
    a
  4. Zhang, C.; Liu, X.; Xu, Y.(C.); Wang, Y.: Quality-structure index : a new metric to measure scientific journal influence (2011) 0.00
    0.002279905 = product of:
      0.00455981 = sum of:
        0.00455981 = product of:
          0.00911962 = sum of:
            0.00911962 = weight(_text_:a in 4366) [ClassicSimilarity], result of:
              0.00911962 = score(doc=4366,freq=18.0), product of:
                0.04772363 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.041389145 = queryNorm
                0.19109234 = fieldWeight in 4366, product of:
                  4.2426405 = tf(freq=18.0), with freq of:
                    18.0 = termFreq=18.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4366)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    An innovative model to measure the influence among scientific journals is developed in this study. This model is based on the path analysis of a journal citation network, and its output is a journal influence matrix that describes the directed influence among all journals. Based on this model, an index of journals' overall influence, the quality-structure index (QSI), is derived. Journal ranking based on QSI has the advantage of accounting for both intrinsic journal quality and the structural position of a journal in a citation network. The QSI also integrates the characteristics of two prevailing streams of journal-assessment measures: those based on bibliometric statistics to approximate intrinsic journal quality, such as the Journal Impact Factor, and those using a journal's structural position based on the PageRank-type of algorithm, such as the Eigenfactor score. Empirical results support our finding that the new index is significantly closer to scholars' subjective perception of journal influence than are the two aforementioned measures. In addition, the journal influence matrix offers a new way to measure two-way influences between any two academic journals, hence establishing a theoretical basis for future scientometrics studies to investigate the knowledge flow within and across research disciplines.
    Type
    a
  5. Li, D.; Wang, Y.; Madden, A.; Ding, Y.; Sun, G.G.; Zhang, N.; Zhou, E.: Analyzing stock market trends using social media user moods and social influence (2019) 0.00
    0.002149515 = product of:
      0.00429903 = sum of:
        0.00429903 = product of:
          0.00859806 = sum of:
            0.00859806 = weight(_text_:a in 5362) [ClassicSimilarity], result of:
              0.00859806 = score(doc=5362,freq=16.0), product of:
                0.04772363 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.041389145 = queryNorm
                0.18016359 = fieldWeight in 5362, product of:
                  4.0 = tf(freq=16.0), with freq of:
                    16.0 = termFreq=16.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5362)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Information from microblogs is gaining increasing attention from researchers interested in analyzing fluctuations in stock markets. Behavioral financial theory draws on social psychology to explain some of the irrational behaviors associated with financial decisions to help explain some of the fluctuations. In this study we argue that social media users who demonstrate an interest in finance can offer insights into ways in which irrational behaviors may affect a stock market. To test this, we analyzed all the data collected over a 3-month period in 2011 from Tencent Weibo (one of the largest microblogging websites in China). We designed a social influence (SI)-based Tencent finance-related moods model to simulate investors' irrational behaviors, and designed a Tencent Moods-based Stock Trend Analysis (TM_STA) model to detect correlations between Tencent moods and the Hushen-300 index (one of the most important financial indexes in China). Experimental results show that the proposed method can help explain the data fluctuation. The findings support the existing behavioral financial theory, and can help to understand short-term rises and falls in a stock market. We use behavioral financial theory to further explain our findings, and to propose a trading model to verify the proposed model.
    Type
    a
  6. Xie, B.; He, D.; Mercer, T.; Wang, Y.; Wu, D.; Fleischmann, K.R.; Zhang, Y.; Yoder, L.H.; Stephens, K.K.; Mackert, M.; Lee, M.K.: Global health crises are also information crises : a call to action (2020) 0.00
    0.0021279112 = product of:
      0.0042558224 = sum of:
        0.0042558224 = product of:
          0.008511645 = sum of:
            0.008511645 = weight(_text_:a in 32) [ClassicSimilarity], result of:
              0.008511645 = score(doc=32,freq=8.0), product of:
                0.04772363 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.041389145 = queryNorm
                0.17835285 = fieldWeight in 32, product of:
                  2.828427 = tf(freq=8.0), with freq of:
                    8.0 = termFreq=8.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=32)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    In this opinion paper, we argue that global health crises are also information crises. Using as an example the coronavirus disease 2019 (COVID-19) epidemic, we (a) examine challenges associated with what we term "global information crises"; (b) recommend changes needed for the field of information science to play a leading role in such crises; and (c) propose actionable items for short- and long-term research, education, and practice in information science.
    Type
    a
  7. Yu, L.; Hong, Q.; Gu, S.; Wang, Y.: An epistemological critique of gap theory based library assessment : the case of SERVQUAL (2008) 0.00
    0.0019225847 = product of:
      0.0038451694 = sum of:
        0.0038451694 = product of:
          0.007690339 = sum of:
            0.007690339 = weight(_text_:a in 2212) [ClassicSimilarity], result of:
              0.007690339 = score(doc=2212,freq=20.0), product of:
                0.04772363 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.041389145 = queryNorm
                0.16114321 = fieldWeight in 2212, product of:
                  4.472136 = tf(freq=20.0), with freq of:
                    20.0 = termFreq=20.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.03125 = fieldNorm(doc=2212)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Purpose - The purpose of this paper is twofold: first, to investigate the epistemological underpinning of SERVQUAL and its limitations; and second, to propose ways to enhance the utility of SERVQUAL as a library assessment tool. Design/methodology/approach - The study first conceptualises quality judgment as a knowing process and locates the epistemological stance of SERVQUAL within the general framework of epistemology demarcation; it then examines related SERVQUAL assumptions and their implications for library assessment in general and for service quality assessment in particular based on two empirical investigations: a questionnaire survey and an interview survey. The questionnaire survey applies the SERVQUAL instrument to three Chinese university libraries, with a view to examining the SERVQUAL score in light of epistemological considerations; the interview survey interviews 50 faculty users in one of the three universities with a view to illuminating the naturalistic process through which users develop their judgement of the library's service quality and through which the SERVQUAL score is formed. Findings - The study shows that the actual SERVQUAL score is distributed in a very scattered manner in all three libraries, and that it is formed through a very complex process rooted primarily in the user's personal experiences with the library, which are in turn shaped by factors from both the library world and the user's life-world. Based on these findings, this research questions a number of SERVQUAL assumptions and proposes three concepts which may help to contextualise the SERVQUAL score and enhance its utility in actual library assessment: library planning based variance of user perception, perception-dependent user expectation and library-sophistication based user differentiation. 
    Originality/value - The research presented in the paper questions a number of SERVQUAL assumptions and proposes three concepts that may help to contextualise the SERVQUAL score and enhance its utility in actual library assessment.
    Type
    a
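
Record 7 matches the term "a" twenty times yet ranks below record 3, which matches it only twice. The sketch below works through the arithmetic for the single-term records, where two coord(1/2) factors are applied. It assumes the same TF-IDF form inferred from the explain trees (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))); the function name `single_term_score` is hypothetical. In ClassicSimilarity the fieldNorm mainly reflects field length (shorter fields get larger norms), which is what appears to decide the order here.

```python
import math

# Collection-level values copied from the explain trees for the term "a".
MAX_DOCS = 44218
DOC_FREQ_A = 37942
QUERY_NORM = 0.041389145


def single_term_score(freq, field_norm):
    """Final score of a record that matches only the term "a"."""
    idf = 1.0 + math.log(MAX_DOCS / (DOC_FREQ_A + 1))  # inferred idf form
    query_weight = idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * field_norm
    # Inner coord(1/2) and outer coord(1/2), as printed in the trees.
    return 0.5 * 0.5 * query_weight * field_weight


# doc id -> (termFreq, fieldNorm), copied from records 3 and 7.
records = {3911: (2.0, 0.125), 2212: (20.0, 0.03125)}
scores = {doc: single_term_score(freq, norm) for doc, (freq, norm) in records.items()}

for doc, score in sorted(scores.items(), key=lambda item: -item[1]):
    freq, norm = records[doc]
    print(f"doc={doc} tf*fieldNorm={math.sqrt(freq) * norm:.5f} score={score:.10f}")
```

Because idf and queryNorm are identical for both records, the ranking reduces to comparing sqrt(freq) * fieldNorm: the square root dampens the twentyfold frequency advantage, while the fourfold larger fieldNorm of doc 3911 is applied in full.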
  8. Wang, Y.: A look into Chinese persons' names in bibliography practice (2000) 0.00
    0.0018428253 = product of:
      0.0036856506 = sum of:
        0.0036856506 = product of:
          0.0073713013 = sum of:
            0.0073713013 = weight(_text_:a in 5401) [ClassicSimilarity], result of:
              0.0073713013 = score(doc=5401,freq=6.0), product of:
                0.04772363 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.041389145 = queryNorm
                0.1544581 = fieldWeight in 5401, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=5401)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Many Chinese persons active in different languages have redundant or inappropriate name headings in databases. This paper invents a "Sheep-Fox Method" visually describing various forms of Chinese persons' names in different languages and in transliteration, conceptually and factually clarifying complicated relations between the names, name forms, and gives typical examples to indicate appropriate choices in bibliography practice. It also suggests improvements for the practice. The paper discusses matters in Chinese persons' names with the understanding that its method could be universally applied to persons' names in other languages of scripts in general as well.
    Type
    a
  9. Cui, Y.; Wang, Y.; Liu, X.; Wang, X.; Zhang, X.: Multidimensional scholarly citations : characterizing and understanding scholars' citation behaviors (2023) 0.00
    0.0016993409 = product of:
      0.0033986818 = sum of:
        0.0033986818 = product of:
          0.0067973635 = sum of:
            0.0067973635 = weight(_text_:a in 847) [ClassicSimilarity], result of:
              0.0067973635 = score(doc=847,freq=10.0), product of:
                0.04772363 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.041389145 = queryNorm
                0.14243183 = fieldWeight in 847, product of:
                  3.1622777 = tf(freq=10.0), with freq of:
                    10.0 = termFreq=10.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=847)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    This study investigates scholars' citation behaviors from a fine-grained perspective. Specifically, each scholarly citation is considered multidimensional rather than logically unidimensional (i.e., present or absent). Thirty million articles from PubMed were accessed for use in empirical research, in which a total of 15 interpretable features of scholarly citations were constructed and grouped into three main categories. Each category corresponds to one aspect of the reasons and motivations behind scholars' citation decision-making during academic writing. Using about 500,000 pairs of actual and randomly generated scholarly citations, a series of Random Forest-based classification experiments were conducted to quantitatively evaluate the correlation between each constructed citation feature and citation decisions made by scholars. Our experimental results indicate that citation proximity is the category most relevant to scholars' citation decision-making, followed by citation authority and citation inertia. However, big-name scholars whose h-indexes rank among the top 1% exhibit a unique pattern of citation behaviors: their citation decision-making correlates most closely with citation inertia, with the correlation nearly three times as strong as that of their ordinary counterparts. Hopefully, the empirical findings presented in this paper can bring us closer to characterizing and understanding the complex process of generating scholarly citations in academia.
    Type
    a
  10. Wang, Y.; Tai, Y.; Yang, Y.: Determination of semantic types of tags in social tagging systems (2018) 0.00
    0.001289709 = product of:
      0.002579418 = sum of:
        0.002579418 = product of:
          0.005158836 = sum of:
            0.005158836 = weight(_text_:a in 4648) [ClassicSimilarity], result of:
              0.005158836 = score(doc=4648,freq=4.0), product of:
                0.04772363 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.041389145 = queryNorm
                0.10809815 = fieldWeight in 4648, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.046875 = fieldNorm(doc=4648)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    The purpose of this paper is to determine semantic types for tags in social tagging systems. In social tagging systems, the determination of the semantic type of tags plays an important role in tag classification, increasing the semantic information of tags and establishing mapping relations between tagged resources and a normed ontology. The research reported in this paper constructs the semantic type library that is needed based on the Unified Medical Language System (UMLS) and FrameNet and determines the semantic type of selected tags that have been pretreated via direct matching using the Semantic Navigator tool, the Semantic Type Word Sense Disambiguation (STWSD) tools in UMLS, and artificial matching. And finally, we verify the feasibility of the determination of semantic type for tags by empirical analysis.
    Type
    a
  11. Wang, Y.; Shah, C.: Authentic versus synthetic : an investigation of the influences of study settings and task configurations on search behaviors (2022) 0.00
    0.0010747575 = product of:
      0.002149515 = sum of:
        0.002149515 = product of:
          0.00429903 = sum of:
            0.00429903 = weight(_text_:a in 495) [ClassicSimilarity], result of:
              0.00429903 = score(doc=495,freq=4.0), product of:
                0.04772363 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.041389145 = queryNorm
                0.090081796 = fieldWeight in 495, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=495)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    In information seeking and retrieval research, researchers often collect data about users' behaviors to predict task characteristics and personalize information for users. The reliability of user behavior may be directly influenced by data collection methods. This article reports on a mixed-methods study examining the impact of study setting (laboratory setting vs. remote setting) and task authenticity (authentic task vs. simulated task) on users' online browsing and searching behaviors. Thirty-six undergraduate participants finished one lab session and one remote session in which they completed one authentic and one simulated task. Using log data collected from 144 task sessions, this study demonstrates that the synthetic lab study setting and simulated tasks had significant influences mostly on behaviors related to content pages (e.g., page dwell time, number of pages visited per task). Meanwhile, first-query behaviors were less affected by study settings or task authenticity than whole-session behaviors, indicating the reliability of using first-query behaviors in task prediction. Qualitative interviews reveal why users were influenced. This study addresses methodological limitations in existing research and provides new insights and implications for researchers who collect online user search behavioral data.
    Type
    a
  12. Wu, S.; Liu, S.; Wang, Y.; Timmons, T.; Uppili, H.; Bedrick, S.; Hersh, W.; Liu, H.: Intrainstitutional EHR collections for patient-level information retrieval (2017) 0.00
    0.0010639556 = product of:
      0.0021279112 = sum of:
        0.0021279112 = product of:
          0.0042558224 = sum of:
            0.0042558224 = weight(_text_:a in 3925) [ClassicSimilarity], result of:
              0.0042558224 = score(doc=3925,freq=2.0), product of:
                0.04772363 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.041389145 = queryNorm
                0.089176424 = fieldWeight in 3925, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=3925)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Type
    a
  13. Huang, C.; Zha, X.; Yan, Y.; Wang, Y.: Understanding the social structure of academic social networking sites : the case of ResearchGate (2019) 0.00
    7.5996824E-4 = product of:
      0.0015199365 = sum of:
        0.0015199365 = product of:
          0.003039873 = sum of:
            0.003039873 = weight(_text_:a in 5781) [ClassicSimilarity], result of:
              0.003039873 = score(doc=5781,freq=2.0), product of:
                0.04772363 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.041389145 = queryNorm
                0.06369744 = fieldWeight in 5781, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5781)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Type
    a