Search (55 results, page 1 of 3)

  • × theme_ss:"Data Mining"
  1. Chen, Y.-L.; Liu, Y.-H.; Ho, W.-L.: ¬A text mining approach to assist the general public in the retrieval of legal documents (2013) 0.04
    0.038487673 = product of:
      0.076975346 = sum of:
        0.076975346 = product of:
          0.11546301 = sum of:
            0.09715338 = weight(_text_:y in 521) [ClassicSimilarity], result of:
              0.09715338 = score(doc=521,freq=4.0), product of:
                0.2153401 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.04474692 = queryNorm
                0.45116252 = fieldWeight in 521, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.046875 = fieldNorm(doc=521)
            0.018309632 = weight(_text_:h in 521) [ClassicSimilarity], result of:
              0.018309632 = score(doc=521,freq=2.0), product of:
                0.11117145 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.04474692 = queryNorm
                0.16469726 = fieldWeight in 521, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.046875 = fieldNorm(doc=521)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
  2. Wei, C.-P.; Lee, Y.-H.; Chiang, Y.-S.; Chen, C.-T.; Yang, C.C.C.: Exploiting temporal characteristics of features for effectively discovering event episodes from news corpora (2014) 0.03
    0.032073062 = product of:
      0.064146124 = sum of:
        0.064146124 = product of:
          0.09621918 = sum of:
            0.08096115 = weight(_text_:y in 1225) [ClassicSimilarity], result of:
              0.08096115 = score(doc=1225,freq=4.0), product of:
                0.2153401 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.04474692 = queryNorm
                0.37596878 = fieldWeight in 1225, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1225)
            0.015258028 = weight(_text_:h in 1225) [ClassicSimilarity], result of:
              0.015258028 = score(doc=1225,freq=2.0), product of:
                0.11117145 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.04474692 = queryNorm
                0.13724773 = fieldWeight in 1225, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1225)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
  3. Vaughan, L.; Chen, Y.: Data mining from web search queries : a comparison of Google trends and Baidu index (2015) 0.03
    0.029187044 = product of:
      0.05837409 = sum of:
        0.05837409 = product of:
          0.08756113 = sum of:
            0.05724818 = weight(_text_:y in 1605) [ClassicSimilarity], result of:
              0.05724818 = score(doc=1605,freq=2.0), product of:
                0.2153401 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.04474692 = queryNorm
                0.26585007 = fieldWeight in 1605, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1605)
            0.030312952 = weight(_text_:22 in 1605) [ClassicSimilarity], result of:
              0.030312952 = score(doc=1605,freq=2.0), product of:
                0.1566961 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04474692 = queryNorm
                0.19345059 = fieldWeight in 1605, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1605)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Source
    Journal of the Association for Information Science and Technology. 66(2015) no.1, S.13-22
  4. Tu, Y.-N.; Hsu, S.-L.: Constructing conceptual trajectory maps to trace the development of research fields (2016) 0.03
    0.026275432 = product of:
      0.052550863 = sum of:
        0.052550863 = product of:
          0.07882629 = sum of:
            0.05724818 = weight(_text_:y in 3059) [ClassicSimilarity], result of:
              0.05724818 = score(doc=3059,freq=2.0), product of:
                0.2153401 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.04474692 = queryNorm
                0.26585007 = fieldWeight in 3059, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3059)
            0.02157811 = weight(_text_:h in 3059) [ClassicSimilarity], result of:
              0.02157811 = score(doc=3059,freq=4.0), product of:
                0.11117145 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.04474692 = queryNorm
                0.1940976 = fieldWeight in 3059, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3059)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Abstract
    This study proposes a new method to construct and trace the trajectory of conceptual development of a research field by combining main path analysis, citation analysis, and text-mining techniques. Main path analysis, a method used commonly to trace the most critical path in a citation network, helps describe the developmental trajectory of a research field. This study extends the main path analysis method and applies text-mining techniques in the new method, which reflects the trajectory of conceptual development in an academic research field more accurately than citation frequency, which represents only the articles examined. Articles can be merged based on similarity of concepts, and by merging concepts the history of a research field can be described more precisely. The new method was applied to the "h-index" and "text mining" fields. The precision, recall, and F-measures of the h-index were 0.738, 0.652, and 0.658 and those of text-mining were 0.501, 0.653, and 0.551, respectively. Last, this study not only establishes the conceptual trajectory map of a research field, but also recommends keywords that are more precise than those used currently by researchers. These precise keywords could enable researchers to gather related works more quickly than before.
  5. Saz, J.T.: Perspectivas en recuperacion y explotacion de informacion electronica : el 'data mining' (1997) 0.02
    0.019082727 = product of:
      0.038165454 = sum of:
        0.038165454 = product of:
          0.11449636 = sum of:
            0.11449636 = weight(_text_:y in 3723) [ClassicSimilarity], result of:
              0.11449636 = score(doc=3723,freq=2.0), product of:
                0.2153401 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.04474692 = queryNorm
                0.53170013 = fieldWeight in 3723, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.078125 = fieldNorm(doc=3723)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  6. Perugini, S.; Ramakrishnan, N.: Mining Web functional dependencies for flexible information access (2007) 0.02
    0.016192231 = product of:
      0.032384463 = sum of:
        0.032384463 = product of:
          0.09715338 = sum of:
            0.09715338 = weight(_text_:y in 602) [ClassicSimilarity], result of:
              0.09715338 = score(doc=602,freq=4.0), product of:
                0.2153401 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.04474692 = queryNorm
                0.45116252 = fieldWeight in 602, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.046875 = fieldNorm(doc=602)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Abstract
    We present an approach to enhancing information access through Web structure mining in contrast to traditional approaches involving usage mining. Specifically, we mine the hardwired hierarchical hyperlink structure of Web sites to identify patterns of term-term co-occurrences we call Web functional dependencies (FDs). Intuitively, a Web FD x -> y declares that all paths through a site involving a hyperlink labeled x also contain a hyperlink labeled y. The complete set of FDs satisfied by a site help characterize (flexible and expressive) interaction paradigms supported by a site, where a paradigm is the set of explorable sequences therein. We describe algorithms for mining FDs and results from mining several hierarchical Web sites and present several interface designs that can exploit such FDs to provide compelling user experiences.
  7. Song, J.; Huang, Y.; Qi, X.; Li, Y.; Li, F.; Fu, K.; Huang, T.: Discovering hierarchical topic evolution in time-stamped documents (2016) 0.02
    0.016192231 = product of:
      0.032384463 = sum of:
        0.032384463 = product of:
          0.09715338 = sum of:
            0.09715338 = weight(_text_:y in 2853) [ClassicSimilarity], result of:
              0.09715338 = score(doc=2853,freq=4.0), product of:
                0.2153401 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.04474692 = queryNorm
                0.45116252 = fieldWeight in 2853, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2853)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  8. Chowdhury, G.G.: Template mining for information extraction from digital documents (1999) 0.01
    0.014146044 = product of:
      0.028292088 = sum of:
        0.028292088 = product of:
          0.08487626 = sum of:
            0.08487626 = weight(_text_:22 in 4577) [ClassicSimilarity], result of:
              0.08487626 = score(doc=4577,freq=2.0), product of:
                0.1566961 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04474692 = queryNorm
                0.5416616 = fieldWeight in 4577, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=4577)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Date
    2. 4.2000 18:01:22
  9. KDD : techniques and applications (1998) 0.01
    0.012125181 = product of:
      0.024250362 = sum of:
        0.024250362 = product of:
          0.07275108 = sum of:
            0.07275108 = weight(_text_:22 in 6783) [ClassicSimilarity], result of:
              0.07275108 = score(doc=6783,freq=2.0), product of:
                0.1566961 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04474692 = queryNorm
                0.46428138 = fieldWeight in 6783, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=6783)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Footnote
    A special issue of selected papers from the Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD'97), held Singapore, 22-23 Feb 1997
  10. Wu, K.J.; Chen, M.-C.; Sun, Y.: Automatic topics discovery from hyperlinked documents (2004) 0.01
    0.011449637 = product of:
      0.022899274 = sum of:
        0.022899274 = product of:
          0.06869782 = sum of:
            0.06869782 = weight(_text_:y in 2563) [ClassicSimilarity], result of:
              0.06869782 = score(doc=2563,freq=2.0), product of:
                0.2153401 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.04474692 = queryNorm
                0.3190201 = fieldWeight in 2563, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2563)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  11. Gaizauskas, R.; Wilks, Y.: Information extraction : beyond document retrieval (1998) 0.01
    0.011449637 = product of:
      0.022899274 = sum of:
        0.022899274 = product of:
          0.06869782 = sum of:
            0.06869782 = weight(_text_:y in 4716) [ClassicSimilarity], result of:
              0.06869782 = score(doc=4716,freq=2.0), product of:
                0.2153401 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.04474692 = queryNorm
                0.3190201 = fieldWeight in 4716, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.046875 = fieldNorm(doc=4716)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  12. Liu, X.; Yu, S.; Janssens, F.; Glänzel, W.; Moreau, Y.; Moor, B.de: Weighted hybrid clustering by combining text mining and bibliometrics on a large-scale journal database (2010) 0.01
    0.011449637 = product of:
      0.022899274 = sum of:
        0.022899274 = product of:
          0.06869782 = sum of:
            0.06869782 = weight(_text_:y in 3464) [ClassicSimilarity], result of:
              0.06869782 = score(doc=3464,freq=2.0), product of:
                0.2153401 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.04474692 = queryNorm
                0.3190201 = fieldWeight in 3464, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3464)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  13. Qiu, X.Y.; Srinivasan, P.; Hu, Y.: Supervised learning models to predict firm performance with annual reports : an empirical study (2014) 0.01
    0.011449637 = product of:
      0.022899274 = sum of:
        0.022899274 = product of:
          0.06869782 = sum of:
            0.06869782 = weight(_text_:y in 1205) [ClassicSimilarity], result of:
              0.06869782 = score(doc=1205,freq=2.0), product of:
                0.2153401 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.04474692 = queryNorm
                0.3190201 = fieldWeight in 1205, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1205)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  14. Loonus, Y.: Einsatzbereiche der KI und ihre Relevanz für Information Professionals (2017) 0.01
    0.011449637 = product of:
      0.022899274 = sum of:
        0.022899274 = product of:
          0.06869782 = sum of:
            0.06869782 = weight(_text_:y in 5668) [ClassicSimilarity], result of:
              0.06869782 = score(doc=5668,freq=2.0), product of:
                0.2153401 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.04474692 = queryNorm
                0.3190201 = fieldWeight in 5668, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.046875 = fieldNorm(doc=5668)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  15. Liu, Y.; Huang, X.; An, A.: Personalized recommendation with adaptive mixture of markov models (2007) 0.01
    0.009541363 = product of:
      0.019082727 = sum of:
        0.019082727 = product of:
          0.05724818 = sum of:
            0.05724818 = weight(_text_:y in 606) [ClassicSimilarity], result of:
              0.05724818 = score(doc=606,freq=2.0), product of:
                0.2153401 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.04474692 = queryNorm
                0.26585007 = fieldWeight in 606, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=606)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  16. Liu, Y.; Zhang, M.; Cen, R.; Ru, L.; Ma, S.: Data cleansing for Web information retrieval using query independent features (2007) 0.01
    0.009541363 = product of:
      0.019082727 = sum of:
        0.019082727 = product of:
          0.05724818 = sum of:
            0.05724818 = weight(_text_:y in 607) [ClassicSimilarity], result of:
              0.05724818 = score(doc=607,freq=2.0), product of:
                0.2153401 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.04474692 = queryNorm
                0.26585007 = fieldWeight in 607, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=607)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  17. Li, D.; Tang, J.; Ding, Y.; Shuai, X.; Chambers, T.; Sun, G.; Luo, Z.; Zhang, J.: Topic-level opinion influence model (TOIM) : an investigation using tencent microblogging (2015) 0.01
    0.009541363 = product of:
      0.019082727 = sum of:
        0.019082727 = product of:
          0.05724818 = sum of:
            0.05724818 = weight(_text_:y in 2345) [ClassicSimilarity], result of:
              0.05724818 = score(doc=2345,freq=2.0), product of:
                0.2153401 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.04474692 = queryNorm
                0.26585007 = fieldWeight in 2345, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2345)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  18. Varathan, K.D.; Giachanou, A.; Crestani, F.: Comparative opinion mining : a review (2017) 0.01
    0.009541363 = product of:
      0.019082727 = sum of:
        0.019082727 = product of:
          0.05724818 = sum of:
            0.05724818 = weight(_text_:y in 3540) [ClassicSimilarity], result of:
              0.05724818 = score(doc=3540,freq=2.0), product of:
                0.2153401 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.04474692 = queryNorm
                0.26585007 = fieldWeight in 3540, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3540)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Abstract
    Opinion mining refers to the use of natural language processing, text analysis, and computational linguistics to identify and extract subjective information in textual material. Opinion mining, also known as sentiment analysis, has received a lot of attention in recent times, as it provides a number of tools to analyze public opinion on a number of different topics. Comparative opinion mining is a subfield of opinion mining which deals with identifying and extracting information that is expressed in a comparative form (e.g., "paper X is better than the Y"). Comparative opinion mining plays a very important role when one tries to evaluate something because it provides a reference point for the comparison. This paper provides a review of the area of comparative opinion mining. It is the first review that cover specifically this topic as all previous reviews dealt mostly with general opinion mining. This survey covers comparative opinion mining from two different angles. One from the perspective of techniques and the other from the perspective of comparative opinion elements. It also incorporates preprocessing tools as well as data set that were used by past researchers that can be useful to future researchers in the field of comparative opinion mining.
  19. Matson, L.D.; Bonski, D.J.: Do digital libraries need librarians? (1997) 0.01
    0.008083453 = product of:
      0.016166907 = sum of:
        0.016166907 = product of:
          0.04850072 = sum of:
            0.04850072 = weight(_text_:22 in 1737) [ClassicSimilarity], result of:
              0.04850072 = score(doc=1737,freq=2.0), product of:
                0.1566961 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04474692 = queryNorm
                0.30952093 = fieldWeight in 1737, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1737)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Date
    22.11.1998 18:57:22
  20. Lusti, M.: Data Warehousing and Data Mining : Eine Einführung in entscheidungsunterstützende Systeme (1999) 0.01
    0.008083453 = product of:
      0.016166907 = sum of:
        0.016166907 = product of:
          0.04850072 = sum of:
            0.04850072 = weight(_text_:22 in 4261) [ClassicSimilarity], result of:
              0.04850072 = score(doc=4261,freq=2.0), product of:
                0.1566961 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04474692 = queryNorm
                0.30952093 = fieldWeight in 4261, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=4261)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Date
    17. 7.2002 19:22:06

Languages

  • e 33
  • d 21
  • sp 1
  • More… Less…

Types