Search (13 results, page 1 of 1)

  • × year_i:[2010 TO 2020}
  • × author_ss:"Chen, H."
  1. Qu, B.; Cong, G.; Li, C.; Sun, A.; Chen, H.: ¬An evaluation of classification models for question topic categorization (2012) 0.01
    0.00822272 = product of:
      0.01644544 = sum of:
        0.01644544 = product of:
          0.024668159 = sum of:
            0.015565722 = weight(_text_:h in 237) [ClassicSimilarity], result of:
              0.015565722 = score(doc=237,freq=2.0), product of:
                0.113413334 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.045649286 = queryNorm
                0.13724773 = fieldWeight in 237, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=237)
            0.009102437 = weight(_text_:d in 237) [ClassicSimilarity], result of:
              0.009102437 = score(doc=237,freq=2.0), product of:
                0.0867278 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045649286 = queryNorm
                0.104954086 = fieldWeight in 237, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=237)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Abstract
    We study the problem of question topic classification using a very large real-world Community Question Answering (CQA) dataset from Yahoo! Answers. The dataset comprises 3.9 million questions and these questions are organized into more than 1,000 categories in a hierarchy. To the best knowledge, this is the first systematic evaluation of the performance of different classification methods on question topic classification as well as short texts. Specifically, we empirically evaluate the following in classifying questions into CQA categories: (a) the usefulness of n-gram features and bag-of-word features; (b) the performance of three standard classification algorithms (naive Bayes, maximum entropy, and support vector machines); (c) the performance of the state-of-the-art hierarchical classification algorithms; (d) the effect of training data size on performance; and (e) the effectiveness of the different components of CQA data, including subject, content, asker, and the best answer. The experimental results show what aspects are important for question topic classification in terms of both effectiveness and efficiency. We believe that the experimental findings from this study will be useful in real-world classification problems.
  2. Benjamin, V.; Chen, H.; Zimbra, D.: Bridging the virtual and real : the relationship between web content, linkage, and geographical proximity of social movements (2014) 0.01
    0.00822272 = product of:
      0.01644544 = sum of:
        0.01644544 = product of:
          0.024668159 = sum of:
            0.015565722 = weight(_text_:h in 1527) [ClassicSimilarity], result of:
              0.015565722 = score(doc=1527,freq=2.0), product of:
                0.113413334 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.045649286 = queryNorm
                0.13724773 = fieldWeight in 1527, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1527)
            0.009102437 = weight(_text_:d in 1527) [ClassicSimilarity], result of:
              0.009102437 = score(doc=1527,freq=2.0), product of:
                0.0867278 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045649286 = queryNorm
                0.104954086 = fieldWeight in 1527, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1527)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
  3. Hu, P.J.-H.; Hsu, F.-M.; Hu, H.-f.; Chen, H.: Agency satisfaction with electronic record management systems : a large-scale survey (2010) 0.00
    0.004493437 = product of:
      0.008986874 = sum of:
        0.008986874 = product of:
          0.02696062 = sum of:
            0.02696062 = weight(_text_:h in 4115) [ClassicSimilarity], result of:
              0.02696062 = score(doc=4115,freq=6.0), product of:
                0.113413334 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.045649286 = queryNorm
                0.23772003 = fieldWeight in 4115, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4115)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  4. Ku, Y.; Chiu, C.; Zhang, Y.; Chen, H.; Su, H.: Text mining self-disclosing health information for public health service (2014) 0.00
    0.004402651 = product of:
      0.008805302 = sum of:
        0.008805302 = product of:
          0.026415905 = sum of:
            0.026415905 = weight(_text_:h in 1262) [ClassicSimilarity], result of:
              0.026415905 = score(doc=1262,freq=4.0), product of:
                0.113413334 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.045649286 = queryNorm
                0.2329171 = fieldWeight in 1262, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1262)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  5. Chen, H.; Beaudoin, C.E.; Hong, H.: Teen online information disclosure : empirical testing of a protection motivation and social capital model (2016) 0.00
    0.004402651 = product of:
      0.008805302 = sum of:
        0.008805302 = product of:
          0.026415905 = sum of:
            0.026415905 = weight(_text_:h in 3203) [ClassicSimilarity], result of:
              0.026415905 = score(doc=3203,freq=4.0), product of:
                0.113413334 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.045649286 = queryNorm
                0.2329171 = fieldWeight in 3203, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3203)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  6. Jiang, S.; Gao, Q.; Chen, H.; Roco, M.C.: ¬The roles of sharing, transfer, and public funding in nanotechnology knowledge-diffusion networks (2015) 0.00
    0.003113144 = product of:
      0.006226288 = sum of:
        0.006226288 = product of:
          0.018678864 = sum of:
            0.018678864 = weight(_text_:h in 1823) [ClassicSimilarity], result of:
              0.018678864 = score(doc=1823,freq=2.0), product of:
                0.113413334 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.045649286 = queryNorm
                0.16469726 = fieldWeight in 1823, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1823)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  7. Chau, M.; Wong, C.H.; Zhou, Y.; Qin, J.; Chen, H.: Evaluating the use of search engine development tools in IT education (2010) 0.00
    0.002594287 = product of:
      0.005188574 = sum of:
        0.005188574 = product of:
          0.015565722 = sum of:
            0.015565722 = weight(_text_:h in 3325) [ClassicSimilarity], result of:
              0.015565722 = score(doc=3325,freq=2.0), product of:
                0.113413334 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.045649286 = queryNorm
                0.13724773 = fieldWeight in 3325, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3325)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  8. Huang, C.; Fu, T.; Chen, H.: Text-based video content classification for online video-sharing sites (2010) 0.00
    0.002594287 = product of:
      0.005188574 = sum of:
        0.005188574 = product of:
          0.015565722 = sum of:
            0.015565722 = weight(_text_:h in 3452) [ClassicSimilarity], result of:
              0.015565722 = score(doc=3452,freq=2.0), product of:
                0.113413334 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.045649286 = queryNorm
                0.13724773 = fieldWeight in 3452, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3452)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  9. Fu, T.; Abbasi, A.; Chen, H.: ¬A focused crawler for Dark Web forums (2010) 0.00
    0.002594287 = product of:
      0.005188574 = sum of:
        0.005188574 = product of:
          0.015565722 = sum of:
            0.015565722 = weight(_text_:h in 3471) [ClassicSimilarity], result of:
              0.015565722 = score(doc=3471,freq=2.0), product of:
                0.113413334 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.045649286 = queryNorm
                0.13724773 = fieldWeight in 3471, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3471)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  10. Suakkaphong, N.; Zhang, Z.; Chen, H.: Disease named entity recognition using semisupervised learning and conditional random fields (2011) 0.00
    0.002594287 = product of:
      0.005188574 = sum of:
        0.005188574 = product of:
          0.015565722 = sum of:
            0.015565722 = weight(_text_:h in 4367) [ClassicSimilarity], result of:
              0.015565722 = score(doc=4367,freq=2.0), product of:
                0.113413334 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.045649286 = queryNorm
                0.13724773 = fieldWeight in 4367, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4367)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  11. Liu, X.; Kaza, S.; Zhang, P.; Chen, H.: Determining inventor status and its effect on knowledge diffusion : a study on nanotechnology literature from China, Russia, and India (2011) 0.00
    0.002594287 = product of:
      0.005188574 = sum of:
        0.005188574 = product of:
          0.015565722 = sum of:
            0.015565722 = weight(_text_:h in 4468) [ClassicSimilarity], result of:
              0.015565722 = score(doc=4468,freq=2.0), product of:
                0.113413334 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.045649286 = queryNorm
                0.13724773 = fieldWeight in 4468, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4468)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  12. Yang, M.; Kiang, M.; Chen, H.; Li, Y.: Artificial immune system for illicit content identification in social media (2012) 0.00
    0.002594287 = product of:
      0.005188574 = sum of:
        0.005188574 = product of:
          0.015565722 = sum of:
            0.015565722 = weight(_text_:h in 4980) [ClassicSimilarity], result of:
              0.015565722 = score(doc=4980,freq=2.0), product of:
                0.113413334 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.045649286 = queryNorm
                0.13724773 = fieldWeight in 4980, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4980)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  13. Chen, H.; Baptista Nunes, J.M.; Ragsdell, G.; An, X.: Somatic and cultural knowledge : drivers of a habitus-driven model of tacit knowledge acquisition (2019) 0.00
    0.0018160008 = product of:
      0.0036320016 = sum of:
        0.0036320016 = product of:
          0.010896005 = sum of:
            0.010896005 = weight(_text_:h in 5460) [ClassicSimilarity], result of:
              0.010896005 = score(doc=5460,freq=2.0), product of:
                0.113413334 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.045649286 = queryNorm
                0.096073404 = fieldWeight in 5460, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.02734375 = fieldNorm(doc=5460)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)