Search (42 results, page 1 of 3)

  • × theme_ss:"Data Mining"
  1. Perugini, S.; Ramakrishnan, N.: Mining Web functional dependencies for flexible information access (2007) 0.05
    0.05014659 = product of:
      0.10029318 = sum of:
        0.10029318 = product of:
          0.15043977 = sum of:
            0.09596762 = weight(_text_:y in 602) [ClassicSimilarity], result of:
              0.09596762 = score(doc=602,freq=4.0), product of:
                0.21271187 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.044200785 = queryNorm
                0.45116252 = fieldWeight in 602, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.046875 = fieldNorm(doc=602)
            0.054472156 = weight(_text_:n in 602) [ClassicSimilarity], result of:
              0.054472156 = score(doc=602,freq=2.0), product of:
                0.19057861 = queryWeight, product of:
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.044200785 = queryNorm
                0.28582513 = fieldWeight in 602, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.046875 = fieldNorm(doc=602)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Abstract
    We present an approach to enhancing information access through Web structure mining in contrast to traditional approaches involving usage mining. Specifically, we mine the hardwired hierarchical hyperlink structure of Web sites to identify patterns of term-term co-occurrences we call Web functional dependencies (FDs). Intuitively, a Web FD x -> y declares that all paths through a site involving a hyperlink labeled x also contain a hyperlink labeled y. The complete set of FDs satisfied by a site help characterize (flexible and expressive) interaction paradigms supported by a site, where a paradigm is the set of explorable sequences therein. We describe algorithms for mining FDs and results from mining several hierarchical Web sites and present several interface designs that can exploit such FDs to provide compelling user experiences.
  2. Tu, Y.-N.; Hsu, S.-L.: Constructing conceptual trajectory maps to trace the development of research fields (2016) 0.03
    0.033980977 = product of:
      0.06796195 = sum of:
        0.06796195 = product of:
          0.10194293 = sum of:
            0.056549463 = weight(_text_:y in 3059) [ClassicSimilarity], result of:
              0.056549463 = score(doc=3059,freq=2.0), product of:
                0.21271187 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.044200785 = queryNorm
                0.26585007 = fieldWeight in 3059, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3059)
            0.045393463 = weight(_text_:n in 3059) [ClassicSimilarity], result of:
              0.045393463 = score(doc=3059,freq=2.0), product of:
                0.19057861 = queryWeight, product of:
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.044200785 = queryNorm
                0.23818761 = fieldWeight in 3059, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3059)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
  3. Vaughan, L.; Chen, Y.: Data mining from web search queries : a comparison of Google trends and Baidu index (2015) 0.03
    0.028830817 = product of:
      0.057661634 = sum of:
        0.057661634 = product of:
          0.08649245 = sum of:
            0.056549463 = weight(_text_:y in 1605) [ClassicSimilarity], result of:
              0.056549463 = score(doc=1605,freq=2.0), product of:
                0.21271187 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.044200785 = queryNorm
                0.26585007 = fieldWeight in 1605, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1605)
            0.029942982 = weight(_text_:22 in 1605) [ClassicSimilarity], result of:
              0.029942982 = score(doc=1605,freq=2.0), product of:
                0.15478362 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044200785 = queryNorm
                0.19345059 = fieldWeight in 1605, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1605)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Source
    Journal of the Association for Information Science and Technology. 66(2015) no.1, S.13-22
  4. Tunbridge, N.: Semiology put to data mining (1999) 0.02
    0.024209848 = product of:
      0.048419695 = sum of:
        0.048419695 = product of:
          0.14525908 = sum of:
            0.14525908 = weight(_text_:n in 6782) [ClassicSimilarity], result of:
              0.14525908 = score(doc=6782,freq=2.0), product of:
                0.19057861 = queryWeight, product of:
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.044200785 = queryNorm
                0.76220036 = fieldWeight in 6782, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.125 = fieldNorm(doc=6782)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  5. Saz, J.T.: Perspectivas en recuperacion y explotacion de informacion electronica : el 'data mining' (1997) 0.02
    0.018849822 = product of:
      0.037699644 = sum of:
        0.037699644 = product of:
          0.11309893 = sum of:
            0.11309893 = weight(_text_:y in 3723) [ClassicSimilarity], result of:
              0.11309893 = score(doc=3723,freq=2.0), product of:
                0.21271187 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.044200785 = queryNorm
                0.53170013 = fieldWeight in 3723, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.078125 = fieldNorm(doc=3723)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  6. Chen, Y.-L.; Liu, Y.-H.; Ho, W.-L.: ¬A text mining approach to assist the general public in the retrieval of legal documents (2013) 0.02
    0.015994605 = product of:
      0.03198921 = sum of:
        0.03198921 = product of:
          0.09596762 = sum of:
            0.09596762 = weight(_text_:y in 521) [ClassicSimilarity], result of:
              0.09596762 = score(doc=521,freq=4.0), product of:
                0.21271187 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.044200785 = queryNorm
                0.45116252 = fieldWeight in 521, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.046875 = fieldNorm(doc=521)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  7. Song, J.; Huang, Y.; Qi, X.; Li, Y.; Li, F.; Fu, K.; Huang, T.: Discovering hierarchical topic evolution in time-stamped documents (2016) 0.02
    0.015994605 = product of:
      0.03198921 = sum of:
        0.03198921 = product of:
          0.09596762 = sum of:
            0.09596762 = weight(_text_:y in 2853) [ClassicSimilarity], result of:
              0.09596762 = score(doc=2853,freq=4.0), product of:
                0.21271187 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.044200785 = queryNorm
                0.45116252 = fieldWeight in 2853, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2853)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  8. Chowdhury, G.G.: Template mining for information extraction from digital documents (1999) 0.01
    0.013973392 = product of:
      0.027946783 = sum of:
        0.027946783 = product of:
          0.08384035 = sum of:
            0.08384035 = weight(_text_:22 in 4577) [ClassicSimilarity], result of:
              0.08384035 = score(doc=4577,freq=2.0), product of:
                0.15478362 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044200785 = queryNorm
                0.5416616 = fieldWeight in 4577, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=4577)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Date
    2. 4.2000 18:01:22
  9. Wei, C.-P.; Lee, Y.-H.; Chiang, Y.-S.; Chen, C.-T.; Yang, C.C.C.: Exploiting temporal characteristics of features for effectively discovering event episodes from news corpora (2014) 0.01
    0.013328837 = product of:
      0.026657674 = sum of:
        0.026657674 = product of:
          0.07997302 = sum of:
            0.07997302 = weight(_text_:y in 1225) [ClassicSimilarity], result of:
              0.07997302 = score(doc=1225,freq=4.0), product of:
                0.21271187 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.044200785 = queryNorm
                0.37596878 = fieldWeight in 1225, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1225)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  10. Fayyad, U.M.; Djorgovski, S.G.; Weir, N.: From digitized images to online catalogs : data ming a sky server (1996) 0.01
    0.012104924 = product of:
      0.024209848 = sum of:
        0.024209848 = product of:
          0.07262954 = sum of:
            0.07262954 = weight(_text_:n in 6625) [ClassicSimilarity], result of:
              0.07262954 = score(doc=6625,freq=2.0), product of:
                0.19057861 = queryWeight, product of:
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.044200785 = queryNorm
                0.38110018 = fieldWeight in 6625, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.0625 = fieldNorm(doc=6625)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  11. KDD : techniques and applications (1998) 0.01
    0.011977192 = product of:
      0.023954384 = sum of:
        0.023954384 = product of:
          0.07186315 = sum of:
            0.07186315 = weight(_text_:22 in 6783) [ClassicSimilarity], result of:
              0.07186315 = score(doc=6783,freq=2.0), product of:
                0.15478362 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044200785 = queryNorm
                0.46428138 = fieldWeight in 6783, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=6783)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Footnote
    A special issue of selected papers from the Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD'97), held Singapore, 22-23 Feb 1997
  12. Wu, K.J.; Chen, M.-C.; Sun, Y.: Automatic topics discovery from hyperlinked documents (2004) 0.01
    0.011309894 = product of:
      0.022619788 = sum of:
        0.022619788 = product of:
          0.06785936 = sum of:
            0.06785936 = weight(_text_:y in 2563) [ClassicSimilarity], result of:
              0.06785936 = score(doc=2563,freq=2.0), product of:
                0.21271187 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.044200785 = queryNorm
                0.3190201 = fieldWeight in 2563, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2563)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  13. Gaizauskas, R.; Wilks, Y.: Information extraction : beyond document retrieval (1998) 0.01
    0.011309894 = product of:
      0.022619788 = sum of:
        0.022619788 = product of:
          0.06785936 = sum of:
            0.06785936 = weight(_text_:y in 4716) [ClassicSimilarity], result of:
              0.06785936 = score(doc=4716,freq=2.0), product of:
                0.21271187 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.044200785 = queryNorm
                0.3190201 = fieldWeight in 4716, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.046875 = fieldNorm(doc=4716)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  14. Liu, X.; Yu, S.; Janssens, F.; Glänzel, W.; Moreau, Y.; Moor, B.de: Weighted hybrid clustering by combining text mining and bibliometrics on a large-scale journal database (2010) 0.01
    0.011309894 = product of:
      0.022619788 = sum of:
        0.022619788 = product of:
          0.06785936 = sum of:
            0.06785936 = weight(_text_:y in 3464) [ClassicSimilarity], result of:
              0.06785936 = score(doc=3464,freq=2.0), product of:
                0.21271187 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.044200785 = queryNorm
                0.3190201 = fieldWeight in 3464, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3464)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  15. Qiu, X.Y.; Srinivasan, P.; Hu, Y.: Supervised learning models to predict firm performance with annual reports : an empirical study (2014) 0.01
    0.011309894 = product of:
      0.022619788 = sum of:
        0.022619788 = product of:
          0.06785936 = sum of:
            0.06785936 = weight(_text_:y in 1205) [ClassicSimilarity], result of:
              0.06785936 = score(doc=1205,freq=2.0), product of:
                0.21271187 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.044200785 = queryNorm
                0.3190201 = fieldWeight in 1205, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1205)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  16. Loonus, Y.: Einsatzbereiche der KI und ihre Relevanz für Information Professionals (2017) 0.01
    0.011309894 = product of:
      0.022619788 = sum of:
        0.022619788 = product of:
          0.06785936 = sum of:
            0.06785936 = weight(_text_:y in 5668) [ClassicSimilarity], result of:
              0.06785936 = score(doc=5668,freq=2.0), product of:
                0.21271187 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.044200785 = queryNorm
                0.3190201 = fieldWeight in 5668, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.046875 = fieldNorm(doc=5668)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  17. Methodologies for knowledge discovery and data mining : Third Pacific-Asia Conference, PAKDD'99, Beijing, China, April 26-28, 1999, Proceedings (1999) 0.01
    0.010591809 = product of:
      0.021183617 = sum of:
        0.021183617 = product of:
          0.06355085 = sum of:
            0.06355085 = weight(_text_:n in 3821) [ClassicSimilarity], result of:
              0.06355085 = score(doc=3821,freq=2.0), product of:
                0.19057861 = queryWeight, product of:
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.044200785 = queryNorm
                0.33346266 = fieldWeight in 3821, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=3821)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Editor
    Zhong, N.
  18. Maaten, L. van den: Accelerating t-SNE using Tree-Based Algorithms (2014) 0.01
    0.010591809 = product of:
      0.021183617 = sum of:
        0.021183617 = product of:
          0.06355085 = sum of:
            0.06355085 = weight(_text_:n in 3886) [ClassicSimilarity], result of:
              0.06355085 = score(doc=3886,freq=2.0), product of:
                0.19057861 = queryWeight, product of:
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.044200785 = queryNorm
                0.33346266 = fieldWeight in 3886, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=3886)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Abstract
    The paper investigates the acceleration of t-SNE-an embedding technique that is commonly used for the visualization of high-dimensional data in scatter plots-using two tree-based algorithms. In particular, the paper develops variants of the Barnes-Hut algorithm and of the dual-tree algorithm that approximate the gradient used for learning t-SNE embeddings in O(N*logN). Our experiments show that the resulting algorithms substantially accelerate t-SNE, and that they make it possible to learn embeddings of data sets with millions of objects. Somewhat counterintuitively, the Barnes-Hut variant of t-SNE appears to outperform the dual-tree variant.
  19. Liu, Y.; Huang, X.; An, A.: Personalized recommendation with adaptive mixture of markov models (2007) 0.01
    0.009424911 = product of:
      0.018849822 = sum of:
        0.018849822 = product of:
          0.056549463 = sum of:
            0.056549463 = weight(_text_:y in 606) [ClassicSimilarity], result of:
              0.056549463 = score(doc=606,freq=2.0), product of:
                0.21271187 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.044200785 = queryNorm
                0.26585007 = fieldWeight in 606, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=606)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  20. Liu, Y.; Zhang, M.; Cen, R.; Ru, L.; Ma, S.: Data cleansing for Web information retrieval using query independent features (2007) 0.01
    0.009424911 = product of:
      0.018849822 = sum of:
        0.018849822 = product of:
          0.056549463 = sum of:
            0.056549463 = weight(_text_:y in 607) [ClassicSimilarity], result of:
              0.056549463 = score(doc=607,freq=2.0), product of:
                0.21271187 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.044200785 = queryNorm
                0.26585007 = fieldWeight in 607, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=607)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    

Years

Languages

  • e 33
  • d 8
  • sp 1
  • More… Less…

Types