Search (25 results, page 1 of 2)

  • theme_ss:"Data Mining"
  • language_ss:"e"
  1. Amir, A.; Feldman, R.; Kashi, R.: A new and versatile method for association generation (1997) 0.02
    0.021227032 = product of:
      0.063681096 = sum of:
        0.063681096 = product of:
          0.095521644 = sum of:
            0.047976613 = weight(_text_:29 in 1270) [ClassicSimilarity], result of:
              0.047976613 = score(doc=1270,freq=2.0), product of:
                0.15430406 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0438652 = queryNorm
                0.31092256 = fieldWeight in 1270, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1270)
            0.04754503 = weight(_text_:22 in 1270) [ClassicSimilarity], result of:
              0.04754503 = score(doc=1270,freq=2.0), product of:
                0.15360846 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0438652 = queryNorm
                0.30952093 = fieldWeight in 1270, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1270)
          0.6666667 = coord(2/3)
      0.33333334 = coord(1/3)
    
    Date
    5. 4.1996 15:29:15
    Source
    Information systems. 22(1997) nos.5/6, S.333-347
  2. Hofstede, A.H.M. ter; Proper, H.A.; Van der Weide, T.P.: Exploiting fact verbalisation in conceptual information modelling (1997) 0.02
    0.018573655 = product of:
      0.055720963 = sum of:
        0.055720963 = product of:
          0.08358144 = sum of:
            0.041979536 = weight(_text_:29 in 2908) [ClassicSimilarity], result of:
              0.041979536 = score(doc=2908,freq=2.0), product of:
                0.15430406 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0438652 = queryNorm
                0.27205724 = fieldWeight in 2908, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2908)
            0.041601904 = weight(_text_:22 in 2908) [ClassicSimilarity], result of:
              0.041601904 = score(doc=2908,freq=2.0), product of:
                0.15360846 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0438652 = queryNorm
                0.2708308 = fieldWeight in 2908, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2908)
          0.6666667 = coord(2/3)
      0.33333334 = coord(1/3)
    
    Date
    5. 4.1996 15:29:15
    Source
    Information systems. 22(1997) nos.5/6, S.349-385
  3. Ma, Z.; Sun, A.; Cong, G.: On predicting the popularity of newly emerging hashtags in Twitter (2013) 0.01
    0.013525645 = product of:
      0.040576935 = sum of:
        0.040576935 = product of:
          0.060865402 = sum of:
            0.03088002 = weight(_text_:k in 967) [ClassicSimilarity], result of:
              0.03088002 = score(doc=967,freq=2.0), product of:
                0.15658903 = queryWeight, product of:
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.0438652 = queryNorm
                0.19720423 = fieldWeight in 967, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=967)
            0.029985385 = weight(_text_:29 in 967) [ClassicSimilarity], result of:
              0.029985385 = score(doc=967,freq=2.0), product of:
                0.15430406 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0438652 = queryNorm
                0.19432661 = fieldWeight in 967, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=967)
          0.6666667 = coord(2/3)
      0.33333334 = coord(1/3)
    
    Abstract
    Because of Twitter's popularity and the viral nature of information dissemination on Twitter, predicting which Twitter topics will become popular in the near future becomes a task of considerable economic importance. Many Twitter topics are annotated by hashtags. In this article, we propose methods to predict the popularity of new hashtags on Twitter by formulating the problem as a classification task. We use five standard classification models (i.e., naïve Bayes, k-nearest neighbors, decision trees, support vector machines, and logistic regression) for prediction. The main challenge is the identification of effective features for describing new hashtags. We extract 7 content features from a hashtag string and the collection of tweets containing the hashtag and 11 contextual features from the social graph formed by users who have adopted the hashtag. We conducted experiments on a Twitter data set consisting of 31 million tweets from 2 million Singapore-based users. The experimental results show that the standard classifiers using the extracted features significantly outperform the baseline methods that do not use these features. Among the five classifiers, the logistic regression model performs the best in terms of the Micro-F1 measure. We also observe that contextual features are more effective than content features.
    Date
    25. 6.2013 19:05:29
  4. Budzik, J.; Hammond, K.J.; Birnbaum, L.: Information access in context (2001) 0.01
    0.009328786 = product of:
      0.027986359 = sum of:
        0.027986359 = product of:
          0.08395907 = sum of:
            0.08395907 = weight(_text_:29 in 3835) [ClassicSimilarity], result of:
              0.08395907 = score(doc=3835,freq=2.0), product of:
                0.15430406 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0438652 = queryNorm
                0.5441145 = fieldWeight in 3835, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.109375 = fieldNorm(doc=3835)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Date
    29. 3.2002 17:31:17
  5. Chowdhury, G.G.: Template mining for information extraction from digital documents (1999) 0.01
    0.0092448685 = product of:
      0.027734604 = sum of:
        0.027734604 = product of:
          0.08320381 = sum of:
            0.08320381 = weight(_text_:22 in 4577) [ClassicSimilarity], result of:
              0.08320381 = score(doc=4577,freq=2.0), product of:
                0.15360846 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0438652 = queryNorm
                0.5416616 = fieldWeight in 4577, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=4577)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Date
    2. 4.2000 18:01:22
  6. KDD : techniques and applications (1998) 0.01
    0.007924172 = product of:
      0.023772515 = sum of:
        0.023772515 = product of:
          0.071317546 = sum of:
            0.071317546 = weight(_text_:22 in 6783) [ClassicSimilarity], result of:
              0.071317546 = score(doc=6783,freq=2.0), product of:
                0.15360846 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0438652 = queryNorm
                0.46428138 = fieldWeight in 6783, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=6783)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Footnote
    A special issue of selected papers from the Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD'97), held in Singapore, 22-23 Feb 1997
  7. Cardie, C.: Empirical methods in information extraction (1997) 0.01
    0.0053307354 = product of:
      0.015992206 = sum of:
        0.015992206 = product of:
          0.047976613 = sum of:
            0.047976613 = weight(_text_:29 in 3246) [ClassicSimilarity], result of:
              0.047976613 = score(doc=3246,freq=2.0), product of:
                0.15430406 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0438652 = queryNorm
                0.31092256 = fieldWeight in 3246, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0625 = fieldNorm(doc=3246)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Date
    6. 3.1999 13:50:29
  8. Bath, P.A.: Data mining in health and medical information (2003) 0.01
    0.0053307354 = product of:
      0.015992206 = sum of:
        0.015992206 = product of:
          0.047976613 = sum of:
            0.047976613 = weight(_text_:29 in 4263) [ClassicSimilarity], result of:
              0.047976613 = score(doc=4263,freq=2.0), product of:
                0.15430406 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0438652 = queryNorm
                0.31092256 = fieldWeight in 4263, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0625 = fieldNorm(doc=4263)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Date
    23.10.2005 18:29:03
  9. Matson, L.D.; Bonski, D.J.: Do digital libraries need librarians? (1997) 0.01
    0.0052827816 = product of:
      0.015848344 = sum of:
        0.015848344 = product of:
          0.04754503 = sum of:
            0.04754503 = weight(_text_:22 in 1737) [ClassicSimilarity], result of:
              0.04754503 = score(doc=1737,freq=2.0), product of:
                0.15360846 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0438652 = queryNorm
                0.30952093 = fieldWeight in 1737, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1737)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Date
    22.11.1998 18:57:22
  10. Methodologies for knowledge discovery and data mining : Third Pacific-Asia Conference, PAKDD'99, Beijing, China, April 26-28, 1999, Proceedings (1999) 0.00
    0.004664393 = product of:
      0.013993179 = sum of:
        0.013993179 = product of:
          0.041979536 = sum of:
            0.041979536 = weight(_text_:29 in 3821) [ClassicSimilarity], result of:
              0.041979536 = score(doc=3821,freq=2.0), product of:
                0.15430406 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0438652 = queryNorm
                0.27205724 = fieldWeight in 3821, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=3821)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Abstract
    The 29 revised full papers presented together with 37 short papers were carefully selected from a total of 158 submissions. The book is divided into sections on emerging KDD technology; association rules; feature selection and generation; mining in semi-unstructured data; interestingness, surprisingness, and exceptions; rough sets, fuzzy logic, and neural networks; induction, classification, and clustering; visualization, causal models and graph-based methods; agent-based and distributed data mining; and advanced topics and new methodologies
  11. Ohly, H.P.: Bibliometric mining : added value from document analysis and retrieval (2008) 0.00
    0.004117336 = product of:
      0.012352008 = sum of:
        0.012352008 = product of:
          0.037056025 = sum of:
            0.037056025 = weight(_text_:k in 2386) [ClassicSimilarity], result of:
              0.037056025 = score(doc=2386,freq=2.0), product of:
                0.15658903 = queryWeight, product of:
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.0438652 = queryNorm
                0.23664509 = fieldWeight in 2386, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2386)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Source
    Kompatibilität, Medien und Ethik in der Wissensorganisation - Compatibility, Media and Ethics in Knowledge Organization: Proceedings der 10. Tagung der Deutschen Sektion der Internationalen Gesellschaft für Wissensorganisation Wien, 3.-5. Juli 2006 - Proceedings of the 10th Conference of the German Section of the International Society of Knowledge Organization Vienna, 3-5 July 2006. Ed.: H.P. Ohly, S. Netscher and K. Mitgutsch
  12. Dang, X.H.; Ong, K.-L.: Knowledge discovery in data streams (2009) 0.00
    0.004117336 = product of:
      0.012352008 = sum of:
        0.012352008 = product of:
          0.037056025 = sum of:
            0.037056025 = weight(_text_:k in 3829) [ClassicSimilarity], result of:
              0.037056025 = score(doc=3829,freq=2.0), product of:
                0.15658903 = queryWeight, product of:
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.0438652 = queryNorm
                0.23664509 = fieldWeight in 3829, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3829)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
  13. Song, J.; Huang, Y.; Qi, X.; Li, Y.; Li, F.; Fu, K.; Huang, T.: Discovering hierarchical topic evolution in time-stamped documents (2016) 0.00
    0.004117336 = product of:
      0.012352008 = sum of:
        0.012352008 = product of:
          0.037056025 = sum of:
            0.037056025 = weight(_text_:k in 2853) [ClassicSimilarity], result of:
              0.037056025 = score(doc=2853,freq=2.0), product of:
                0.15658903 = queryWeight, product of:
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.0438652 = queryNorm
                0.23664509 = fieldWeight in 2853, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2853)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
  14. Srinivasan, P.: Text mining in biomedicine : challenges and opportunities (2006) 0.00
    0.0039980514 = product of:
      0.011994153 = sum of:
        0.011994153 = product of:
          0.03598246 = sum of:
            0.03598246 = weight(_text_:29 in 1497) [ClassicSimilarity], result of:
              0.03598246 = score(doc=1497,freq=2.0), product of:
                0.15430406 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0438652 = queryNorm
                0.23319192 = fieldWeight in 1497, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1497)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Date
    29. 2.2008 17:14:09
  15. Liu, X.; Yu, S.; Janssens, F.; Glänzel, W.; Moreau, Y.; Moor, B.de: Weighted hybrid clustering by combining text mining and bibliometrics on a large-scale journal database (2010) 0.00
    0.0039980514 = product of:
      0.011994153 = sum of:
        0.011994153 = product of:
          0.03598246 = sum of:
            0.03598246 = weight(_text_:29 in 3464) [ClassicSimilarity], result of:
              0.03598246 = score(doc=3464,freq=2.0), product of:
                0.15430406 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0438652 = queryNorm
                0.23319192 = fieldWeight in 3464, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3464)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Date
    1. 6.2010 9:29:57
  16. Qiu, X.Y.; Srinivasan, P.; Hu, Y.: Supervised learning models to predict firm performance with annual reports : an empirical study (2014) 0.00
    0.0039980514 = product of:
      0.011994153 = sum of:
        0.011994153 = product of:
          0.03598246 = sum of:
            0.03598246 = weight(_text_:29 in 1205) [ClassicSimilarity], result of:
              0.03598246 = score(doc=1205,freq=2.0), product of:
                0.15430406 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0438652 = queryNorm
                0.23319192 = fieldWeight in 1205, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1205)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Date
    29. 1.2014 16:46:40
  17. Kantardzic, M.: Data mining : concepts, models, methods, and algorithms (2003) 0.00
    0.003881862 = product of:
      0.011645585 = sum of:
        0.011645585 = product of:
          0.034936756 = sum of:
            0.034936756 = weight(_text_:k in 2291) [ClassicSimilarity], result of:
              0.034936756 = score(doc=2291,freq=4.0), product of:
                0.15658903 = queryWeight, product of:
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.0438652 = queryNorm
                0.22311112 = fieldWeight in 2291, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.03125 = fieldNorm(doc=2291)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Classification
    PZY (FH K)
    GHBS
    PZY (FH K)
  18. Liu, B.: Web data mining : exploring hyperlinks, contents, and usage data (2011) 0.00
    0.003881862 = product of:
      0.011645585 = sum of:
        0.011645585 = product of:
          0.034936756 = sum of:
            0.034936756 = weight(_text_:k in 354) [ClassicSimilarity], result of:
              0.034936756 = score(doc=354,freq=4.0), product of:
                0.15658903 = queryWeight, product of:
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.0438652 = queryNorm
                0.22311112 = fieldWeight in 354, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.03125 = fieldNorm(doc=354)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Classification
    TZG (FH K)
    GHBS
    TZG (FH K)
  19. Raan, A.F.J. van; Noyons, E.C.M.: Discovery of patterns of scientific and technological development and knowledge transfer (2002) 0.00
    0.0033317097 = product of:
      0.009995129 = sum of:
        0.009995129 = product of:
          0.029985385 = sum of:
            0.029985385 = weight(_text_:29 in 3603) [ClassicSimilarity], result of:
              0.029985385 = score(doc=3603,freq=2.0), product of:
                0.15430406 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0438652 = queryNorm
                0.19432661 = fieldWeight in 3603, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3603)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Source
    Gaining insight from research information (CRIS2002): Proceedings of the 6th International Conference on Current Research Information Systems, University of Kassel, August 29 - 31, 2002. Eds.: W. Adamczak and A. Nase
  20. Tu, Y.-N.; Hsu, S.-L.: Constructing conceptual trajectory maps to trace the development of research fields (2016) 0.00
    0.0033317097 = product of:
      0.009995129 = sum of:
        0.009995129 = product of:
          0.029985385 = sum of:
            0.029985385 = weight(_text_:29 in 3059) [ClassicSimilarity], result of:
              0.029985385 = score(doc=3059,freq=2.0), product of:
                0.15430406 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0438652 = queryNorm
                0.19432661 = fieldWeight in 3059, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3059)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Date
    21. 7.2016 19:29:19
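
The score trees above all follow the same pattern, so a short sketch may help in reading them. The Python below is an illustration only, not code from the catalog or from Lucene itself. It assumes the usual ClassicSimilarity formulas, tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), and takes queryNorm, fieldNorm and the coord factors directly from the listing. The function names `idf`, `term_score` and `doc_score` are hypothetical helpers introduced here.

```python
import math

MAX_DOCS = 44218          # maxDocs, as reported in every idf(...) line
QUERY_NORM = 0.0438652    # queryNorm, as reported in every queryWeight block


def idf(doc_freq, max_docs=MAX_DOCS):
    """Assumed ClassicSimilarity idf: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, field_norm):
    """Score of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = term_idf * QUERY_NORM                    # idf * queryNorm
    field_weight = math.sqrt(freq) * term_idf * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight


def doc_score(term_scores, inner_coord, outer_coord):
    """Sum the term scores, then apply the two coord factors from the tree."""
    return sum(term_scores) * inner_coord * outer_coord


# Result 1 (doc 1270): terms "29" and "22" both match, so the inner coord is 2/3.
score_1 = doc_score(
    [term_score(2.0, 3565, 0.0625), term_score(2.0, 3622, 0.0625)],
    inner_coord=2 / 3,
    outer_coord=1 / 3,
)

# Result 11 (doc 2386) and result 17 (doc 2291) both match only the term "k".
# Doc 2291 has the higher term frequency (4 vs 2) but the smaller fieldNorm.
score_11 = doc_score([term_score(2.0, 3384, 0.046875)], 1 / 3, 1 / 3)
score_17 = doc_score([term_score(4.0, 3384, 0.03125)], 1 / 3, 1 / 3)

print(f"result 1:  {score_1:.9f}")
print(f"result 11: {score_11:.9f}")
print(f"result 17: {score_17:.9f}")
```

The comparison of results 11 and 17 shows why a higher term frequency does not guarantee a higher rank here: the smaller fieldNorm of doc 2291 outweighs its larger tf. Small differences in the last digits against the listing are expected, since the listed values appear to be single-precision floats.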
