Search (22 results, page 1 of 2)

  • × author_ss:"Liu, X."
  1. Jiang, Z.; Liu, X.; Chen, Y.: Recovering uncaptured citations in a scholarly network : a two-step citation analysis to estimate publication importance (2016) 0.07
    0.066146925 = product of:
      0.115757115 = sum of:
        0.020927707 = weight(_text_:systems in 3018) [ClassicSimilarity], result of:
          0.020927707 = score(doc=3018,freq=2.0), product of:
            0.12327058 = queryWeight, product of:
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.04011181 = queryNorm
            0.1697705 = fieldWeight in 3018, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3018)
        0.04279475 = sum of:
          0.0153751075 = weight(_text_:science in 3018) [ClassicSimilarity], result of:
            0.0153751075 = score(doc=3018,freq=2.0), product of:
              0.10565929 = queryWeight, product of:
                2.6341193 = idf(docFreq=8627, maxDocs=44218)
                0.04011181 = queryNorm
              0.1455159 = fieldWeight in 3018, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.6341193 = idf(docFreq=8627, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3018)
          0.027419642 = weight(_text_:29 in 3018) [ClassicSimilarity], result of:
            0.027419642 = score(doc=3018,freq=2.0), product of:
              0.14110081 = queryWeight, product of:
                3.5176873 = idf(docFreq=3565, maxDocs=44218)
                0.04011181 = queryNorm
              0.19432661 = fieldWeight in 3018, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5176873 = idf(docFreq=3565, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3018)
        0.02166549 = weight(_text_:library in 3018) [ClassicSimilarity], result of:
          0.02166549 = score(doc=3018,freq=4.0), product of:
            0.10546913 = queryWeight, product of:
              2.6293786 = idf(docFreq=8668, maxDocs=44218)
              0.04011181 = queryNorm
            0.2054202 = fieldWeight in 3018, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              2.6293786 = idf(docFreq=8668, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3018)
        0.030369172 = product of:
          0.060738344 = sum of:
            0.060738344 = weight(_text_:applications in 3018) [ClassicSimilarity], result of:
              0.060738344 = score(doc=3018,freq=4.0), product of:
                0.17659263 = queryWeight, product of:
                  4.4025097 = idf(docFreq=1471, maxDocs=44218)
                  0.04011181 = queryNorm
                0.34394607 = fieldWeight in 3018, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.4025097 = idf(docFreq=1471, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3018)
          0.5 = coord(1/2)
      0.5714286 = coord(4/7)
    
    Abstract
    The citation relationships between publications, which are significant for assessing the importance of scholarly components within a network, have been used for various scientific applications. Missing citation metadata in scholarly databases, however, create problems for classical citation-based ranking algorithms and challenge the performance of citation-based retrieval systems. In this research, we utilize a two-step citation analysis method to investigate the importance of publications for which citation information is partially missing. First, we calculate the importance of the author and then use his importance to estimate the publication importance for some selected articles. To evaluate this method, we designed a simulation experiment-"random citation-missing"-to test the two-step citation analysis that we carried out with the Association for Computing Machinery (ACM) Digital Library (DL). In this experiment, we simulated different scenarios in a large-scale scientific digital library, from high-quality citation data, to very poor quality data, The results show that a two-step citation analysis can effectively uncover the importance of publications in different situations. More importantly, we found that the optimized impact from the importance of an author (first step) is exponentially increased when the quality of citation decreases. The findings from this study can further enhance citation-based publication-ranking algorithms for real-world applications.
    Date
    12. 6.2016 20:31:29
    Source
    Journal of the Association for Information Science and Technology. 67(2016) no.7, S.1722-1735
  2. Liu, X.: ¬The standardization of Chinese library classification (1993) 0.05
    0.0520705 = product of:
      0.121497825 = sum of:
        0.02511325 = weight(_text_:systems in 5588) [ClassicSimilarity], result of:
          0.02511325 = score(doc=5588,freq=2.0), product of:
            0.12327058 = queryWeight, product of:
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.04011181 = queryNorm
            0.2037246 = fieldWeight in 5588, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.046875 = fieldNorm(doc=5588)
        0.0513537 = sum of:
          0.018450128 = weight(_text_:science in 5588) [ClassicSimilarity], result of:
            0.018450128 = score(doc=5588,freq=2.0), product of:
              0.10565929 = queryWeight, product of:
                2.6341193 = idf(docFreq=8627, maxDocs=44218)
                0.04011181 = queryNorm
              0.17461908 = fieldWeight in 5588, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.6341193 = idf(docFreq=8627, maxDocs=44218)
                0.046875 = fieldNorm(doc=5588)
          0.03290357 = weight(_text_:29 in 5588) [ClassicSimilarity], result of:
            0.03290357 = score(doc=5588,freq=2.0), product of:
              0.14110081 = queryWeight, product of:
                3.5176873 = idf(docFreq=3565, maxDocs=44218)
                0.04011181 = queryNorm
              0.23319192 = fieldWeight in 5588, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5176873 = idf(docFreq=3565, maxDocs=44218)
                0.046875 = fieldNorm(doc=5588)
        0.045030877 = weight(_text_:library in 5588) [ClassicSimilarity], result of:
          0.045030877 = score(doc=5588,freq=12.0), product of:
            0.10546913 = queryWeight, product of:
              2.6293786 = idf(docFreq=8668, maxDocs=44218)
              0.04011181 = queryNorm
            0.42695788 = fieldWeight in 5588, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              2.6293786 = idf(docFreq=8668, maxDocs=44218)
              0.046875 = fieldNorm(doc=5588)
      0.42857143 = coord(3/7)
    
    Abstract
    The standardization of Chinese materials classification was first proposed in the late-1970s in China. In December 1980, the CCDST, the Chinese Library Association and the Chinese Society for Information Science proposed that the Chinese Library Classification system be adopted as national standard. This marked the beginning of the standardization of Chinese materials classification. Later on, there were many conferences and workshops held and four draft national standards were discussed, those for the Chinese Library Classification systems, the Materials Classification System, the Rules for Thesaurus and Subject Headings, and the rules for Materials Classifying Color Recognition. This article gives a brief review on the historical development of the standardization on Chinese Library Classification. It also discusses its effects on automation, networking and resources sharing and the feasibility of adopting Chinese Library Classification as a National Standard. In addition, the main content of the standardization of materials classification, use of the national standard classification system and variations under the standard system are covered in this article
    Date
    8.10.2000 14:29:26
  3. Frias-Martinez, E.; Chen, S.Y.; Liu, X.: Automatic cognitive style identification of digital library users for personalization (2007) 0.03
    0.03282094 = product of:
      0.07658219 = sum of:
        0.035515495 = weight(_text_:systems in 74) [ClassicSimilarity], result of:
          0.035515495 = score(doc=74,freq=4.0), product of:
            0.12327058 = queryWeight, product of:
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.04011181 = queryNorm
            0.28811008 = fieldWeight in 74, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.046875 = fieldNorm(doc=74)
        0.009225064 = product of:
          0.018450128 = sum of:
            0.018450128 = weight(_text_:science in 74) [ClassicSimilarity], result of:
              0.018450128 = score(doc=74,freq=2.0), product of:
                0.10565929 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.04011181 = queryNorm
                0.17461908 = fieldWeight in 74, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.046875 = fieldNorm(doc=74)
          0.5 = coord(1/2)
        0.03184164 = weight(_text_:library in 74) [ClassicSimilarity], result of:
          0.03184164 = score(doc=74,freq=6.0), product of:
            0.10546913 = queryWeight, product of:
              2.6293786 = idf(docFreq=8668, maxDocs=44218)
              0.04011181 = queryNorm
            0.30190483 = fieldWeight in 74, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              2.6293786 = idf(docFreq=8668, maxDocs=44218)
              0.046875 = fieldNorm(doc=74)
      0.42857143 = coord(3/7)
    
    Abstract
    Digital libraries have become one of the most important Web services for information seeking. One of their main drawbacks is their global approach: In general, there is just one interface for all users. One of the key elements in improving user satisfaction in digital libraries is personalization. When considering personalizing factors, cognitive styles have been proved to be one of the relevant parameters that affect information seeking. This justifies the introduction of cognitive style as one of the parameters of a Web personalized service. Nevertheless, this approach has one major drawback: Each user has to run a time-consuming test that determines his or her cognitive style. In this article, we present a study of how different classification systems can be used to automatically identify the cognitive style of a user using the set of interactions with a digital library. These classification systems can be used to automatically personalize, from a cognitive-style point of view, the interaction of the digital library and each of its users.
    Source
    Journal of the American Society for Information Science and Technology. 58(2007) no.2, S.237-251
  4. Chen, M.; Liu, X.; Qin, J.: Semantic relation extraction from socially-generated tags : a methodology for metadata generation (2008) 0.03
    0.029034615 = product of:
      0.10162114 = sum of:
        0.013709821 = product of:
          0.027419642 = sum of:
            0.027419642 = weight(_text_:29 in 2648) [ClassicSimilarity], result of:
              0.027419642 = score(doc=2648,freq=2.0), product of:
                0.14110081 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.04011181 = queryNorm
                0.19432661 = fieldWeight in 2648, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2648)
          0.5 = coord(1/2)
        0.08791132 = sum of:
          0.060738344 = weight(_text_:applications in 2648) [ClassicSimilarity], result of:
            0.060738344 = score(doc=2648,freq=4.0), product of:
              0.17659263 = queryWeight, product of:
                4.4025097 = idf(docFreq=1471, maxDocs=44218)
                0.04011181 = queryNorm
              0.34394607 = fieldWeight in 2648, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4025097 = idf(docFreq=1471, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2648)
          0.027172983 = weight(_text_:22 in 2648) [ClassicSimilarity], result of:
            0.027172983 = score(doc=2648,freq=2.0), product of:
              0.14046472 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.04011181 = queryNorm
              0.19345059 = fieldWeight in 2648, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2648)
      0.2857143 = coord(2/7)
    
    Date
    20. 2.2009 10:29:07
    Source
    Metadata for semantic and social applications : proceedings of the International Conference on Dublin Core and Metadata Applications, Berlin, 22 - 26 September 2008, DC 2008: Berlin, Germany / ed. by Jane Greenberg and Wolfgang Klas
  5. Liu, X.; Croft, W.B.: Statistical language modeling for information retrieval (2004) 0.01
    0.01087335 = product of:
      0.038056724 = sum of:
        0.0076875538 = product of:
          0.0153751075 = sum of:
            0.0153751075 = weight(_text_:science in 4277) [ClassicSimilarity], result of:
              0.0153751075 = score(doc=4277,freq=2.0), product of:
                0.10565929 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.04011181 = queryNorm
                0.1455159 = fieldWeight in 4277, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4277)
          0.5 = coord(1/2)
        0.030369172 = product of:
          0.060738344 = sum of:
            0.060738344 = weight(_text_:applications in 4277) [ClassicSimilarity], result of:
              0.060738344 = score(doc=4277,freq=4.0), product of:
                0.17659263 = queryWeight, product of:
                  4.4025097 = idf(docFreq=1471, maxDocs=44218)
                  0.04011181 = queryNorm
                0.34394607 = fieldWeight in 4277, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.4025097 = idf(docFreq=1471, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4277)
          0.5 = coord(1/2)
      0.2857143 = coord(2/7)
    
    Abstract
    This chapter reviews research and applications in statistical language modeling for information retrieval (IR), which has emerged within the past several years as a new probabilistic framework for describing information retrieval processes. Generally speaking, statistical language modeling, or more simply language modeling (LM), involves estimating a probability distribution that captures statistical regularities of natural language use. Applied to information retrieval, language modeling refers to the problem of estimating the likelihood that a query and a document could have been generated by the same language model, given the language model of the document either with or without a language model of the query. The roots of statistical language modeling date to the beginning of the twentieth century when Markov tried to model letter sequences in works of Russian literature (Manning & Schütze, 1999). Zipf (1929, 1932, 1949, 1965) studied the statistical properties of text and discovered that the frequency of works decays as a Power function of each works rank. However, it was Shannon's (1951) work that inspired later research in this area. In 1951, eager to explore the applications of his newly founded information theory to human language, Shannon used a prediction game involving n-grams to investigate the information content of English text. He evaluated n-gram models' performance by comparing their crossentropy an texts with the true entropy estimated using predictions made by human subjects. For many years, statistical language models have been used primarily for automatic speech recognition. Since 1980, when the first significant language model was proposed (Rosenfeld, 2000), statistical language modeling has become a fundamental component of speech recognition, machine translation, and spelling correction.
    Source
    Annual review of information science and technology. 39(2005), S.3-32
  6. Liu, X.; Turtle, H.: Real-time user interest modeling for real-time ranking (2013) 0.01
    0.009810947 = product of:
      0.034338314 = sum of:
        0.02511325 = weight(_text_:systems in 1035) [ClassicSimilarity], result of:
          0.02511325 = score(doc=1035,freq=2.0), product of:
            0.12327058 = queryWeight, product of:
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.04011181 = queryNorm
            0.2037246 = fieldWeight in 1035, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.046875 = fieldNorm(doc=1035)
        0.009225064 = product of:
          0.018450128 = sum of:
            0.018450128 = weight(_text_:science in 1035) [ClassicSimilarity], result of:
              0.018450128 = score(doc=1035,freq=2.0), product of:
                0.10565929 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.04011181 = queryNorm
                0.17461908 = fieldWeight in 1035, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1035)
          0.5 = coord(1/2)
      0.2857143 = coord(2/7)
    
    Abstract
    User interest as a very dynamic information need is often ignored in most existing information retrieval systems. In this research, we present the results of experiments designed to evaluate the performance of a real-time interest model (RIM) that attempts to identify the dynamic and changing query level interests regarding social media outputs. Unlike most existing ranking methods, our ranking approach targets calculation of the probability that user interest in the content of the document is subject to very dynamic user interest change. We describe 2 formulations of the model (real-time interest vector space and real-time interest language model) stemming from classical relevance ranking methods and develop a novel methodology for evaluating the performance of RIM using Amazon Mechanical Turk to collect (interest-based) relevance judgments on a daily basis. Our results show that the model usually, although not always, performs better than baseline results obtained from commercial web search engines. We identify factors that affect RIM performance and outline plans for future research.
    Source
    Journal of the American Society for Information Science and Technology. 64(2013) no.8, S.1557-1576
  7. Liu, X.; Guo, C.; Zhang, L.: Scholar metadata and knowledge generation with human and artificial intelligence (2014) 0.01
    0.009810947 = product of:
      0.034338314 = sum of:
        0.02511325 = weight(_text_:systems in 1287) [ClassicSimilarity], result of:
          0.02511325 = score(doc=1287,freq=2.0), product of:
            0.12327058 = queryWeight, product of:
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.04011181 = queryNorm
            0.2037246 = fieldWeight in 1287, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.046875 = fieldNorm(doc=1287)
        0.009225064 = product of:
          0.018450128 = sum of:
            0.018450128 = weight(_text_:science in 1287) [ClassicSimilarity], result of:
              0.018450128 = score(doc=1287,freq=2.0), product of:
                0.10565929 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.04011181 = queryNorm
                0.17461908 = fieldWeight in 1287, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1287)
          0.5 = coord(1/2)
      0.2857143 = coord(2/7)
    
    Abstract
    Scholar metadata have traditionally centered on descriptive representations, which have been used as a foundation for scholarly publication repositories and academic information retrieval systems. In this article, we propose innovative and economic methods of generating knowledge-based structural metadata (structural keywords) using a combination of natural language processing-based machine-learning techniques and human intelligence. By allowing low-barrier participation through a social media system, scholars (both as authors and users) can participate in the metadata editing and enhancing process and benefit from more accurate and effective information retrieval. Our experimental web system ScholarWiki uses machine learning techniques, which automatically produce increasingly refined metadata by learning from the structural metadata contributed by scholars. The cumulated structural metadata add intelligence and automatically enhance and update recursively the quality of metadata, wiki pages, and the machine-learning model.
    Source
    Journal of the Association for Information Science and Technology. 65(2014) no.6, S.1187-1201
  8. Liu, X.; Yu, S.; Janssens, F.; Glänzel, W.; Moreau, Y.; Moor, B.de: Weighted hybrid clustering by combining text mining and bibliometrics on a large-scale journal database (2010) 0.01
    0.008427999 = product of:
      0.058995992 = sum of:
        0.058995992 = sum of:
          0.026092423 = weight(_text_:science in 3464) [ClassicSimilarity], result of:
            0.026092423 = score(doc=3464,freq=4.0), product of:
              0.10565929 = queryWeight, product of:
                2.6341193 = idf(docFreq=8627, maxDocs=44218)
                0.04011181 = queryNorm
              0.24694869 = fieldWeight in 3464, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.6341193 = idf(docFreq=8627, maxDocs=44218)
                0.046875 = fieldNorm(doc=3464)
          0.03290357 = weight(_text_:29 in 3464) [ClassicSimilarity], result of:
            0.03290357 = score(doc=3464,freq=2.0), product of:
              0.14110081 = queryWeight, product of:
                3.5176873 = idf(docFreq=3565, maxDocs=44218)
                0.04011181 = queryNorm
              0.23319192 = fieldWeight in 3464, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5176873 = idf(docFreq=3565, maxDocs=44218)
                0.046875 = fieldNorm(doc=3464)
      0.14285715 = coord(1/7)
    
    Abstract
    We propose a new hybrid clustering framework to incorporate text mining with bibliometrics in journal set analysis. The framework integrates two different approaches: clustering ensemble and kernel-fusion clustering. To improve the flexibility and the efficiency of processing large-scale data, we propose an information-based weighting scheme to leverage the effect of multiple data sources in hybrid clustering. Three different algorithms are extended by the proposed weighting scheme and they are employed on a large journal set retrieved from the Web of Science (WoS) database. The clustering performance of the proposed algorithms is systematically evaluated using multiple evaluation methods, and they were cross-compared with alternative methods. Experimental results demonstrate that the proposed weighted hybrid clustering strategy is superior to other methods in clustering performance and efficiency. The proposed approach also provides a more refined structural mapping of journal sets, which is useful for monitoring and detecting new trends in different scientific fields.
    Date
    1. 6.2010 9:29:57
    Source
    Journal of the American Society for Information Science and Technology. 61(2010) no.6, S.1105-1119
  9. Liu, X.; Zhang, J.; Guo, C.: Full-text citation analysis : a new method to enhance scholarly networks (2013) 0.01
    0.008175789 = product of:
      0.02861526 = sum of:
        0.020927707 = weight(_text_:systems in 1044) [ClassicSimilarity], result of:
          0.020927707 = score(doc=1044,freq=2.0), product of:
            0.12327058 = queryWeight, product of:
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.04011181 = queryNorm
            0.1697705 = fieldWeight in 1044, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1044)
        0.0076875538 = product of:
          0.0153751075 = sum of:
            0.0153751075 = weight(_text_:science in 1044) [ClassicSimilarity], result of:
              0.0153751075 = score(doc=1044,freq=2.0), product of:
                0.10565929 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.04011181 = queryNorm
                0.1455159 = fieldWeight in 1044, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1044)
          0.5 = coord(1/2)
      0.2857143 = coord(2/7)
    
    Abstract
    In this article, we use innovative full-text citation analysis along with supervised topic modeling and network-analysis algorithms to enhance classical bibliometric analysis and publication/author/venue ranking. By utilizing citation contexts extracted from a large number of full-text publications, each citation or publication is represented by a probability distribution over a set of predefined topics, where each topic is labeled by an author-contributed keyword. We then used publication/citation topic distribution to generate a citation graph with vertex prior and edge transitioning probability distributions. The publication importance score for each given topic is calculated by PageRank with edge and vertex prior distributions. To evaluate this work, we sampled 104 topics (labeled with keywords) in review papers. The cited publications of each review paper are assumed to be "important publications" for the target topic (keyword), and we use these cited publications to validate our topic-ranking result and to compare different publication-ranking lists. Evaluation results show that full-text citation and publication content prior topic distribution, along with the classical PageRank algorithm can significantly enhance bibliometric analysis and scientific publication ranking performance, comparing with term frequency-inverted document frequency (tf-idf), language model, BM25, PageRank, and PageRank + language model (p < .001), for academic information retrieval (IR) systems.
    Source
    Journal of the American Society for Information Science and Technology. 64(2013) no.9, S.1852-1863
  10. Chen, S.Y.; Liu, X.: ¬The contribution of data mining to information science : making sense of it all (2005) 0.00
    0.0037274892 = product of:
      0.026092423 = sum of:
        0.026092423 = product of:
          0.052184846 = sum of:
            0.052184846 = weight(_text_:science in 4655) [ClassicSimilarity], result of:
              0.052184846 = score(doc=4655,freq=4.0), product of:
                0.10565929 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.04011181 = queryNorm
                0.49389738 = fieldWeight in 4655, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.09375 = fieldNorm(doc=4655)
          0.5 = coord(1/2)
      0.14285715 = coord(1/7)
    
    Source
    Journal of information science. 30(2005) no.6, S.550-
  11. Liu, X.; Bu, Y.; Li, M.; Li, J.: Monodisciplinary collaboration disrupts science more than multidisciplinary collaboration (2024) 0.00
    0.0022826118 = product of:
      0.015978282 = sum of:
        0.015978282 = product of:
          0.031956565 = sum of:
            0.031956565 = weight(_text_:science in 1202) [ClassicSimilarity], result of:
              0.031956565 = score(doc=1202,freq=6.0), product of:
                0.10565929 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.04011181 = queryNorm
                0.30244917 = fieldWeight in 1202, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1202)
          0.5 = coord(1/2)
      0.14285715 = coord(1/7)
    
    Abstract
    Collaboration across disciplines is a critical form of scientific collaboration to solve complex problems and make innovative contributions. This study focuses on the association between multidisciplinary collaboration measured by coauthorship in publications and the disruption of publications measured by the Disruption (D) index. We used authors' affiliations as a proxy of the disciplines to which they belong and categorized an article into multidisciplinary collaboration or monodisciplinary collaboration. The D index quantifies the extent to which a study disrupts its predecessors. We selected 13 journals that publish articles in six disciplines from the Microsoft Academic Graph (MAG) database and then constructed regression models with fixed effects and estimated the relationship between the variables. The findings show that articles with monodisciplinary collaboration are more disruptive than those with multidisciplinary collaboration. Furthermore, we uncovered the mechanism of how monodisciplinary collaboration disrupts science more than multidisciplinary collaboration by exploring the references of the sampled publications.
    Source
    Journal of the Association for Information Science and Technology. 75(2023) no.1, S.59-78
  12. Clewley, N.; Chen, S.Y.; Liu, X.: Cognitive styles and search engine preferences : field dependence/independence vs holism/serialism (2010) 0.00
    0.001958546 = product of:
      0.013709821 = sum of:
        0.013709821 = product of:
          0.027419642 = sum of:
            0.027419642 = weight(_text_:29 in 3961) [ClassicSimilarity], result of:
              0.027419642 = score(doc=3961,freq=2.0), product of:
                0.14110081 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.04011181 = queryNorm
                0.19432661 = fieldWeight in 3961, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3961)
          0.5 = coord(1/2)
      0.14285715 = coord(1/7)
    
    Date
    29. 8.2010 13:11:47
  13. Liu, X.; Chen, X.: Authors' noninstitutional emails and their correlation with retraction (2021) 0.00
    0.0017571552 = product of:
      0.012300085 = sum of:
        0.012300085 = product of:
          0.02460017 = sum of:
            0.02460017 = weight(_text_:science in 152) [ClassicSimilarity], result of:
              0.02460017 = score(doc=152,freq=2.0), product of:
                0.10565929 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.04011181 = queryNorm
                0.23282544 = fieldWeight in 152, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.0625 = fieldNorm(doc=152)
          0.5 = coord(1/2)
      0.14285715 = coord(1/7)
    
    Source
    Journal of the Association for Information Science and Technology. 72(2021) no.4, S.449-4473-477
  14. Liu, X.; Kaza, S.; Zhang, P.; Chen, H.: Determining inventor status and its effect on knowledge diffusion : a study on nanotechnology literature from China, Russia, and India (2011) 0.00
    0.0015531204 = product of:
      0.0108718425 = sum of:
        0.0108718425 = product of:
          0.021743685 = sum of:
            0.021743685 = weight(_text_:science in 4468) [ClassicSimilarity], result of:
              0.021743685 = score(doc=4468,freq=4.0), product of:
                0.10565929 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.04011181 = queryNorm
                0.20579056 = fieldWeight in 4468, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4468)
          0.5 = coord(1/2)
      0.14285715 = coord(1/7)
    
    Abstract
    In an increasingly global research landscape, it is important to identify the most prolific researchers in various institutions and their influence on the diffusion of knowledge. Knowledge diffusion within institutions is influenced by not just the status of individual researchers but also the collaborative culture that determines status. There are various methods to measure individual status, but few studies have compared them or explored the possible effects of different cultures on the status measures. In this article, we examine knowledge diffusion within science and technology-oriented research organizations. Using social network analysis metrics to measure individual status in large-scale coauthorship networks, we studied an individual's impact on the recombination of knowledge to produce innovation in nanotechnology. Data from the most productive and high-impact institutions in China (Chinese Academy of Sciences), Russia (Russian Academy of Sciences), and India (Indian Institutes of Technology) were used. We found that boundary-spanning individuals influenced knowledge diffusion in all countries. However, our results also indicate that cultural and institutional differences may influence knowledge diffusion.
    Source
    Journal of the American Society for Information Science and Technology. 62(2011) no.6, S.1166-1176
  15. Liu, X.: Generating metadata for cyberlearning resources through information retrieval and meta-search (2013) 0.00
    0.0013178664 = product of:
      0.009225064 = sum of:
        0.009225064 = product of:
          0.018450128 = sum of:
            0.018450128 = weight(_text_:science in 676) [ClassicSimilarity], result of:
              0.018450128 = score(doc=676,freq=2.0), product of:
                0.10565929 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.04011181 = queryNorm
                0.17461908 = fieldWeight in 676, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.046875 = fieldNorm(doc=676)
          0.5 = coord(1/2)
      0.14285715 = coord(1/7)
    
    Source
    Journal of the American Society for Information Science and Technology. 64(2013) no.4, S.771-786
  16. Liu, X.; Jia, H.: Answering academic questions for education by recommending cyberlearning resources (2013) 0.00
    0.0013178664 = product of:
      0.009225064 = sum of:
        0.009225064 = product of:
          0.018450128 = sum of:
            0.018450128 = weight(_text_:science in 1012) [ClassicSimilarity], result of:
              0.018450128 = score(doc=1012,freq=2.0), product of:
                0.10565929 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.04011181 = queryNorm
                0.17461908 = fieldWeight in 1012, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1012)
          0.5 = coord(1/2)
      0.14285715 = coord(1/7)
    
    Source
    Journal of the American Society for Information Science and Technology. 64(2013) no.8, S.1707-1722
  17. Zhang, X.; Fang, Y.; He, W.; Zhang, Y.; Liu, X.: Epistemic motivation, task reflexivity, and knowledge contribution behavior on team wikis : a cross-level moderation model (2019) 0.00
    0.0013178664 = product of:
      0.009225064 = sum of:
        0.009225064 = product of:
          0.018450128 = sum of:
            0.018450128 = weight(_text_:science in 5245) [ClassicSimilarity], result of:
              0.018450128 = score(doc=5245,freq=2.0), product of:
                0.10565929 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.04011181 = queryNorm
                0.17461908 = fieldWeight in 5245, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.046875 = fieldNorm(doc=5245)
          0.5 = coord(1/2)
      0.14285715 = coord(1/7)
    
    Source
    Journal of the Association for Information Science and Technology. 70(2019) no.5, S.448-461
  18. Zhang, C.; Liu, X.; Xu, Y.(C.); Wang, Y.: Quality-structure index : a new metric to measure scientific journal influence (2011) 0.00
    0.001098222 = product of:
      0.0076875538 = sum of:
        0.0076875538 = product of:
          0.0153751075 = sum of:
            0.0153751075 = weight(_text_:science in 4366) [ClassicSimilarity], result of:
              0.0153751075 = score(doc=4366,freq=2.0), product of:
                0.10565929 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.04011181 = queryNorm
                0.1455159 = fieldWeight in 4366, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4366)
          0.5 = coord(1/2)
      0.14285715 = coord(1/7)
    
    Source
    Journal of the American Society for Information Science and Technology. 62(2011) no.4, S.643-653
  19. Liu, X.; Qin, J.: ¬An interactive metadata model for structural, descriptive, and referential representation of scholarly output (2014) 0.00
    0.001098222 = product of:
      0.0076875538 = sum of:
        0.0076875538 = product of:
          0.0153751075 = sum of:
            0.0153751075 = weight(_text_:science in 1253) [ClassicSimilarity], result of:
              0.0153751075 = score(doc=1253,freq=2.0), product of:
                0.10565929 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.04011181 = queryNorm
                0.1455159 = fieldWeight in 1253, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1253)
          0.5 = coord(1/2)
      0.14285715 = coord(1/7)
    
    Source
    Journal of the Association for Information Science and Technology. 65(2014) no.5, S.964-983
  20. Chen, Z.; Huang, Y.; Tian, J.; Liu, X.; Fu, K.; Huang, T.: Joint model for subsentence-level sentiment analysis with Markov logic (2015) 0.00
    0.001098222 = product of:
      0.0076875538 = sum of:
        0.0076875538 = product of:
          0.0153751075 = sum of:
            0.0153751075 = weight(_text_:science in 2210) [ClassicSimilarity], result of:
              0.0153751075 = score(doc=2210,freq=2.0), product of:
                0.10565929 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.04011181 = queryNorm
                0.1455159 = fieldWeight in 2210, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2210)
          0.5 = coord(1/2)
      0.14285715 = coord(1/7)
    
    Source
    Journal of the Association for Information Science and Technology. 66(2015) no.9, S.1913-1922