Search (6 results, page 1 of 1)

  • × author_ss:"He, B."
  1. He, B.; Ding, Y.; Ni, C.: Mining enriched contextual information of scientific collaboration : a meso perspective (2011) 0.00
    4.1280582E-4 = product of:
      0.006192087 = sum of:
        0.006192087 = product of:
          0.012384174 = sum of:
            0.012384174 = weight(_text_:information in 4444) [ClassicSimilarity], result of:
              0.012384174 = score(doc=4444,freq=12.0), product of:
                0.052134 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.029697895 = queryNorm
                0.23754507 = fieldWeight in 4444, product of:
                  3.4641016 = tf(freq=12.0), with freq of:
                    12.0 = termFreq=12.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4444)
          0.5 = coord(1/2)
      0.06666667 = coord(1/15)
    
    Abstract
    Studying scientific collaboration using coauthorship networks has attracted much attention in recent years. How and in what context two authors collaborate remain among the major questions. Previous studies, however, have focused on either exploring the global topology of coauthorship networks (macro perspective) or ranking the impact of individual authors (micro perspective). Neither of them has provided information on the context of the collaboration between two specific authors, which may potentially imply rich socioeconomic, disciplinary, and institutional information on collaboration. Different from the macro perspective and micro perspective, this article proposes a novel method (meso perspective) to analyze scientific collaboration, in which a contextual subgraph is extracted as the unit of analysis. A contextual subgraph is defined as a small subgraph of a large-scale coauthorship network that captures relationship and context between two coauthors. This method is applied to the field of library and information science. Topological properties of all the subgraphs in four time spans are investigated, including size, average degree, clustering coefficient, and network centralization. Results show that contextual subgprahs capture useful contextual information on two authors' collaboration.
    Source
    Journal of the American Society for Information Science and Technology. 62(2011) no.5, S.831-845
  2. Ye, Z.; He, B.; Wang, L.; Luo, T.: Utilizing term proximity for blog post retrieval (2013) 0.00
    3.3705457E-4 = product of:
      0.0050558182 = sum of:
        0.0050558182 = product of:
          0.0101116365 = sum of:
            0.0101116365 = weight(_text_:information in 1126) [ClassicSimilarity], result of:
              0.0101116365 = score(doc=1126,freq=8.0), product of:
                0.052134 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.029697895 = queryNorm
                0.19395474 = fieldWeight in 1126, product of:
                  2.828427 = tf(freq=8.0), with freq of:
                    8.0 = termFreq=8.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1126)
          0.5 = coord(1/2)
      0.06666667 = coord(1/15)
    
    Abstract
    Term proximity is effective for many information retrieval (IR) research fields yet remains unexplored in blogosphere IR. The blogosphere is characterized by large amounts of noise, including incohesive, off-topic content and spam. Consequently, the classical bag-of-words unigram IR models are not reliable enough to provide robust and effective retrieval performance. In this article, we propose to boost the blog postretrieval performance by employing term proximity information. We investigate a variety of popular and state-of-the-art proximity-based statistical IR models, including a proximity-based counting model, the Markov random field (MRF) model, and the divergence from randomness (DFR) multinomial model. Extensive experimentation on the standard TREC Blog06 test dataset demonstrates that the introduction of term proximity information is indeed beneficial to retrieval from the blogosphere. Results also indicate the superiority of the unordered bi-gram model with the sequential-dependence phrases over other variants of the proximity-based models. Finally, inspired by the effectiveness of proximity models, we extend our study by exploring the proximity evidence between query terms and opinionated terms. The consequent opinionated proximity model shows promising performance in the experiments.
    Source
    Journal of the American Society for Information Science and Technology. 64(2013) no.11, S.2278-2298
  3. Ye, Z.; Huang, J.X.; He, B.; Lin, H.: Mining a multilingual association dictionary from Wikipedia for cross-language information retrieval (2012) 0.00
    2.9189783E-4 = product of:
      0.0043784673 = sum of:
        0.0043784673 = product of:
          0.008756935 = sum of:
            0.008756935 = weight(_text_:information in 513) [ClassicSimilarity], result of:
              0.008756935 = score(doc=513,freq=6.0), product of:
                0.052134 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.029697895 = queryNorm
                0.16796975 = fieldWeight in 513, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=513)
          0.5 = coord(1/2)
      0.06666667 = coord(1/15)
    
    Abstract
    Wikipedia is characterized by its dense link structure and a large number of articles in different languages, which make it a notable Web corpus for knowledge extraction and mining, in particular for mining the multilingual associations. In this paper, motivated by a psychological theory of word meaning, we propose a graph-based approach to constructing a cross-language association dictionary (CLAD) from Wikipedia, which can be used in a variety of cross-language accessing and processing applications. In order to evaluate the quality of the mined CLAD, and to demonstrate how the mined CLAD can be used in practice, we explore two different applications of the mined CLAD to cross-language information retrieval (CLIR). First, we use the mined CLAD to conduct cross-language query expansion; and, second, we use it to filter out translation candidates with low translation probabilities. Experimental results on a variety of standard CLIR test collections show that the CLIR retrieval performance can be substantially improved with the above two applications of CLAD, which indicates that the mined CLAD is of sound quality.
    Source
    Journal of the American Society for Information Science and Technology. 63(2012) no.12, S.2474-2487
  4. He, B.; Ounis, I.: Combining fields for query expansion and adaptive query expansion (2007) 0.00
    2.0223274E-4 = product of:
      0.0030334909 = sum of:
        0.0030334909 = product of:
          0.0060669817 = sum of:
            0.0060669817 = weight(_text_:information in 926) [ClassicSimilarity], result of:
              0.0060669817 = score(doc=926,freq=2.0), product of:
                0.052134 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.029697895 = queryNorm
                0.116372846 = fieldWeight in 926, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046875 = fieldNorm(doc=926)
          0.5 = coord(1/2)
      0.06666667 = coord(1/15)
    
    Source
    Information processing and management. 43(2007) no.5, S.1294-1307
  5. Li, D.; Ding, Y.; Sugimoto, C.; He, B.; Tang, J.; Yan, E.; Lin, N.; Qin, Z.; Dong, T.: Modeling topic and community structure in social tagging : the TTR-LDA-Community model (2011) 0.00
    1.6852729E-4 = product of:
      0.0025279091 = sum of:
        0.0025279091 = product of:
          0.0050558182 = sum of:
            0.0050558182 = weight(_text_:information in 4759) [ClassicSimilarity], result of:
              0.0050558182 = score(doc=4759,freq=2.0), product of:
                0.052134 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.029697895 = queryNorm
                0.09697737 = fieldWeight in 4759, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4759)
          0.5 = coord(1/2)
      0.06666667 = coord(1/15)
    
    Source
    Journal of the American Society for Information Science and Technology. 62(2011) no.9, S.1849-1866
  6. Lin, N.; Li, D.; Ding, Y.; He, B.; Qin, Z.; Tang, J.; Li, J.; Dong, T.: ¬The dynamic features of Delicious, Flickr, and YouTube (2012) 0.00
    1.6852729E-4 = product of:
      0.0025279091 = sum of:
        0.0025279091 = product of:
          0.0050558182 = sum of:
            0.0050558182 = weight(_text_:information in 4970) [ClassicSimilarity], result of:
              0.0050558182 = score(doc=4970,freq=2.0), product of:
                0.052134 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.029697895 = queryNorm
                0.09697737 = fieldWeight in 4970, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4970)
          0.5 = coord(1/2)
      0.06666667 = coord(1/15)
    
    Source
    Journal of the American Society for Information Science and Technology. 63(2012) no.1, S.139-162