Search (27 results, page 1 of 2)

  • × theme_ss:"Data Mining"
  1. Fonseca, F.; Marcinkowski, M.; Davis, C.: Cyber-human systems of thought and understanding (2019) 0.01
    0.0051844902 = product of:
      0.012961226 = sum of:
        0.008208886 = product of:
          0.024626656 = sum of:
            0.024626656 = weight(_text_:f in 5011) [ClassicSimilarity], result of:
              0.024626656 = score(doc=5011,freq=2.0), product of:
                0.11184496 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.028060954 = queryNorm
                0.22018565 = fieldWeight in 5011, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5011)
          0.33333334 = coord(1/3)
        0.00475234 = product of:
          0.01900936 = sum of:
            0.01900936 = weight(_text_:22 in 5011) [ClassicSimilarity], result of:
              0.01900936 = score(doc=5011,freq=2.0), product of:
                0.09826468 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.028060954 = queryNorm
                0.19345059 = fieldWeight in 5011, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5011)
          0.25 = coord(1/4)
      0.4 = coord(2/5)
    
    Date
    7. 3.2019 16:32:22
  2. Advances in knowledge discovery and data mining (1996) 0.00
    0.003940265 = product of:
      0.019701324 = sum of:
        0.019701324 = product of:
          0.05910397 = sum of:
            0.05910397 = weight(_text_:f in 413) [ClassicSimilarity], result of:
              0.05910397 = score(doc=413,freq=2.0), product of:
                0.11184496 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.028060954 = queryNorm
                0.52844554 = fieldWeight in 413, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.09375 = fieldNorm(doc=413)
          0.33333334 = coord(1/3)
      0.2 = coord(1/5)
    
    Footnote
    Rez. in: JASIS 49(1998) no.4, S.386-387 (F. Exner)
  3. Chowdhury, G.G.: Template mining for information extraction from digital documents (1999) 0.00
    0.0026613104 = product of:
      0.013306552 = sum of:
        0.013306552 = product of:
          0.053226206 = sum of:
            0.053226206 = weight(_text_:22 in 4577) [ClassicSimilarity], result of:
              0.053226206 = score(doc=4577,freq=2.0), product of:
                0.09826468 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.028060954 = queryNorm
                0.5416616 = fieldWeight in 4577, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=4577)
          0.25 = coord(1/4)
      0.2 = coord(1/5)
    
    Date
    2. 4.2000 18:01:22
  4. Wattenberg, M.; Viégas, F.; Johnson, I.: How to use t-SNE effectively (2016) 0.00
    0.0026268435 = product of:
      0.013134217 = sum of:
        0.013134217 = product of:
          0.03940265 = sum of:
            0.03940265 = weight(_text_:f in 3887) [ClassicSimilarity], result of:
              0.03940265 = score(doc=3887,freq=2.0), product of:
                0.11184496 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.028060954 = queryNorm
                0.35229704 = fieldWeight in 3887, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.0625 = fieldNorm(doc=3887)
          0.33333334 = coord(1/3)
      0.2 = coord(1/5)
    
  5. Ku, L.-W.; Chen, H.-H.: Mining opinions from the Web : beyond relevance retrieval (2007) 0.00
    0.0023218233 = product of:
      0.011609117 = sum of:
        0.011609117 = product of:
          0.034827348 = sum of:
            0.034827348 = weight(_text_:f in 605) [ClassicSimilarity], result of:
              0.034827348 = score(doc=605,freq=4.0), product of:
                0.11184496 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.028060954 = queryNorm
                0.31138954 = fieldWeight in 605, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=605)
          0.33333334 = coord(1/3)
      0.2 = coord(1/5)
    
    Abstract
    Documents discussing public affairs, common themes, interesting products, and so on, are reported and distributed on the Web. Positive and negative opinions embedded in documents are useful references and feedbacks for governments to improve their services, for companies to market their products, and for customers to purchase their objects. Web opinion mining aims to extract, summarize, and track various aspects of subjective information on the Web. Mining subjective information enables traditional information retrieval (IR) systems to retrieve more data from human viewpoints and provide information with finer granularity. Opinion extraction identifies opinion holders, extracts the relevant opinion sentences, and decides their polarities. Opinion summarization recognizes the major events embedded in documents and summarizes the supportive and the nonsupportive evidence. Opinion tracking captures subjective information from various genres and monitors the developments of opinions from spatial and temporal dimensions. To demonstrate and evaluate the proposed opinion mining algorithms, news and bloggers' articles are adopted. Documents in the evaluation corpora are tagged in different granularities from words, sentences to documents. In the experiments, positive and negative sentiment words and their weights are mined on the basis of Chinese word structures. The f-measure is 73.18% and 63.75% for verbs and nouns, respectively. Utilizing the sentiment words mined together with topical words, we achieve f-measure 62.16% at the sentence level and 74.37% at the document level.
  6. Lam, W.; Yang, C.C.; Menczer, F.: Introduction to the special topic section on mining Web resources for enhancing information retrieval (2007) 0.00
    0.0022984878 = product of:
      0.011492439 = sum of:
        0.011492439 = product of:
          0.034477316 = sum of:
            0.034477316 = weight(_text_:f in 600) [ClassicSimilarity], result of:
              0.034477316 = score(doc=600,freq=2.0), product of:
                0.11184496 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.028060954 = queryNorm
                0.3082599 = fieldWeight in 600, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=600)
          0.33333334 = coord(1/3)
      0.2 = coord(1/5)
    
  7. Kong, S.; Ye, F.; Feng, L.; Zhao, Z.: Towards the prediction problems of bursting hashtags on Twitter (2015) 0.00
    0.0022984878 = product of:
      0.011492439 = sum of:
        0.011492439 = product of:
          0.034477316 = sum of:
            0.034477316 = weight(_text_:f in 2338) [ClassicSimilarity], result of:
              0.034477316 = score(doc=2338,freq=2.0), product of:
                0.11184496 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.028060954 = queryNorm
                0.3082599 = fieldWeight in 2338, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2338)
          0.33333334 = coord(1/3)
      0.2 = coord(1/5)
    
  8. KDD : techniques and applications (1998) 0.00
    0.002281123 = product of:
      0.011405615 = sum of:
        0.011405615 = product of:
          0.04562246 = sum of:
            0.04562246 = weight(_text_:22 in 6783) [ClassicSimilarity], result of:
              0.04562246 = score(doc=6783,freq=2.0), product of:
                0.09826468 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.028060954 = queryNorm
                0.46428138 = fieldWeight in 6783, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=6783)
          0.25 = coord(1/4)
      0.2 = coord(1/5)
    
    Footnote
    A special issue of selected papers from the Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD'97), held Singapore, 22-23 Feb 1997
  9. Liu, X.; Yu, S.; Janssens, F.; Glänzel, W.; Moreau, Y.; Moor, B.de: Weighted hybrid clustering by combining text mining and bibliometrics on a large-scale journal database (2010) 0.00
    0.0019701326 = product of:
      0.009850662 = sum of:
        0.009850662 = product of:
          0.029551985 = sum of:
            0.029551985 = weight(_text_:f in 3464) [ClassicSimilarity], result of:
              0.029551985 = score(doc=3464,freq=2.0), product of:
                0.11184496 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.028060954 = queryNorm
                0.26422277 = fieldWeight in 3464, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3464)
          0.33333334 = coord(1/3)
      0.2 = coord(1/5)
    
  10. Song, J.; Huang, Y.; Qi, X.; Li, Y.; Li, F.; Fu, K.; Huang, T.: Discovering hierarchical topic evolution in time-stamped documents (2016) 0.00
    0.0019701326 = product of:
      0.009850662 = sum of:
        0.009850662 = product of:
          0.029551985 = sum of:
            0.029551985 = weight(_text_:f in 2853) [ClassicSimilarity], result of:
              0.029551985 = score(doc=2853,freq=2.0), product of:
                0.11184496 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.028060954 = queryNorm
                0.26422277 = fieldWeight in 2853, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2853)
          0.33333334 = coord(1/3)
      0.2 = coord(1/5)
    
  11. Ebrahimi, M.; ShafieiBavani, E.; Wong, R.; Chen, F.: Twitter user geolocation by filtering of highly mentioned users (2018) 0.00
    0.0019701326 = product of:
      0.009850662 = sum of:
        0.009850662 = product of:
          0.029551985 = sum of:
            0.029551985 = weight(_text_:f in 4286) [ClassicSimilarity], result of:
              0.029551985 = score(doc=4286,freq=2.0), product of:
                0.11184496 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.028060954 = queryNorm
                0.26422277 = fieldWeight in 4286, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.046875 = fieldNorm(doc=4286)
          0.33333334 = coord(1/3)
      0.2 = coord(1/5)
    
  12. Tu, Y.-N.; Hsu, S.-L.: Constructing conceptual trajectory maps to trace the development of research fields (2016) 0.00
    0.0016417772 = product of:
      0.008208886 = sum of:
        0.008208886 = product of:
          0.024626656 = sum of:
            0.024626656 = weight(_text_:f in 3059) [ClassicSimilarity], result of:
              0.024626656 = score(doc=3059,freq=2.0), product of:
                0.11184496 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.028060954 = queryNorm
                0.22018565 = fieldWeight in 3059, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3059)
          0.33333334 = coord(1/3)
      0.2 = coord(1/5)
    
    Abstract
    This study proposes a new method to construct and trace the trajectory of conceptual development of a research field by combining main path analysis, citation analysis, and text-mining techniques. Main path analysis, a method used commonly to trace the most critical path in a citation network, helps describe the developmental trajectory of a research field. This study extends the main path analysis method and applies text-mining techniques in the new method, which reflects the trajectory of conceptual development in an academic research field more accurately than citation frequency, which represents only the articles examined. Articles can be merged based on similarity of concepts, and by merging concepts the history of a research field can be described more precisely. The new method was applied to the "h-index" and "text mining" fields. The precision, recall, and F-measures of the h-index were 0.738, 0.652, and 0.658 and those of text-mining were 0.501, 0.653, and 0.551, respectively. Last, this study not only establishes the conceptual trajectory map of a research field, but also recommends keywords that are more precise than those used currently by researchers. These precise keywords could enable researchers to gather related works more quickly than before.
  13. Varathan, K.D.; Giachanou, A.; Crestani, F.: Comparative opinion mining : a review (2017) 0.00
    0.0016417772 = product of:
      0.008208886 = sum of:
        0.008208886 = product of:
          0.024626656 = sum of:
            0.024626656 = weight(_text_:f in 3540) [ClassicSimilarity], result of:
              0.024626656 = score(doc=3540,freq=2.0), product of:
                0.11184496 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.028060954 = queryNorm
                0.22018565 = fieldWeight in 3540, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3540)
          0.33333334 = coord(1/3)
      0.2 = coord(1/5)
    
  14. Matson, L.D.; Bonski, D.J.: Do digital libraries need librarians? (1997) 0.00
    0.0015207487 = product of:
      0.0076037436 = sum of:
        0.0076037436 = product of:
          0.030414974 = sum of:
            0.030414974 = weight(_text_:22 in 1737) [ClassicSimilarity], result of:
              0.030414974 = score(doc=1737,freq=2.0), product of:
                0.09826468 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.028060954 = queryNorm
                0.30952093 = fieldWeight in 1737, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1737)
          0.25 = coord(1/4)
      0.2 = coord(1/5)
    
    Date
    22.11.1998 18:57:22
  15. Lusti, M.: Data Warehousing and Data Mining : Eine Einführung in entscheidungsunterstützende Systeme (1999) 0.00
    0.0015207487 = product of:
      0.0076037436 = sum of:
        0.0076037436 = product of:
          0.030414974 = sum of:
            0.030414974 = weight(_text_:22 in 4261) [ClassicSimilarity], result of:
              0.030414974 = score(doc=4261,freq=2.0), product of:
                0.09826468 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.028060954 = queryNorm
                0.30952093 = fieldWeight in 4261, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=4261)
          0.25 = coord(1/4)
      0.2 = coord(1/5)
    
    Date
    17. 7.2002 19:22:06
  16. Amir, A.; Feldman, R.; Kashi, R.: ¬A new and versatile method for association generation (1997) 0.00
    0.0015207487 = product of:
      0.0076037436 = sum of:
        0.0076037436 = product of:
          0.030414974 = sum of:
            0.030414974 = weight(_text_:22 in 1270) [ClassicSimilarity], result of:
              0.030414974 = score(doc=1270,freq=2.0), product of:
                0.09826468 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.028060954 = queryNorm
                0.30952093 = fieldWeight in 1270, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1270)
          0.25 = coord(1/4)
      0.2 = coord(1/5)
    
    Source
    Information systems. 22(1997) nos.5/6, S.333-347
  17. Hofstede, A.H.M. ter; Proper, H.A.; Van der Weide, T.P.: Exploiting fact verbalisation in conceptual information modelling (1997) 0.00
    0.0013306552 = product of:
      0.006653276 = sum of:
        0.006653276 = product of:
          0.026613103 = sum of:
            0.026613103 = weight(_text_:22 in 2908) [ClassicSimilarity], result of:
              0.026613103 = score(doc=2908,freq=2.0), product of:
                0.09826468 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.028060954 = queryNorm
                0.2708308 = fieldWeight in 2908, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2908)
          0.25 = coord(1/4)
      0.2 = coord(1/5)
    
    Source
    Information systems. 22(1997) nos.5/6, S.349-385
  18. Schwartz, F.; Fang, Y.C.: Citation data analysis on hydrogeology (2007) 0.00
    0.0013134218 = product of:
      0.0065671084 = sum of:
        0.0065671084 = product of:
          0.019701324 = sum of:
            0.019701324 = weight(_text_:f in 433) [ClassicSimilarity], result of:
              0.019701324 = score(doc=433,freq=2.0), product of:
                0.11184496 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.028060954 = queryNorm
                0.17614852 = fieldWeight in 433, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.03125 = fieldNorm(doc=433)
          0.33333334 = coord(1/3)
      0.2 = coord(1/5)
    
  19. Lackes, R.; Tillmanns, C.: Data Mining für die Unternehmenspraxis : Entscheidungshilfen und Fallstudien mit führenden Softwarelösungen (2006) 0.00
    0.0011405615 = product of:
      0.0057028076 = sum of:
        0.0057028076 = product of:
          0.02281123 = sum of:
            0.02281123 = weight(_text_:22 in 1383) [ClassicSimilarity], result of:
              0.02281123 = score(doc=1383,freq=2.0), product of:
                0.09826468 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.028060954 = queryNorm
                0.23214069 = fieldWeight in 1383, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1383)
          0.25 = coord(1/4)
      0.2 = coord(1/5)
    
    Date
    22. 3.2008 14:46:06
  20. Hallonsten, O.; Holmberg, D.: Analyzing structural stratification in the Swedish higher education system : data contextualization with policy-history analysis (2013) 0.00
    9.5046795E-4 = product of:
      0.00475234 = sum of:
        0.00475234 = product of:
          0.01900936 = sum of:
            0.01900936 = weight(_text_:22 in 668) [ClassicSimilarity], result of:
              0.01900936 = score(doc=668,freq=2.0), product of:
                0.09826468 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.028060954 = queryNorm
                0.19345059 = fieldWeight in 668, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=668)
          0.25 = coord(1/4)
      0.2 = coord(1/5)
    
    Date
    22. 3.2013 19:43:01

Languages

  • e 20
  • d 7

Types