Search (14 results, page 1 of 1)

  • × theme_ss:"Data Mining"
  • × language_ss:"e"
  1. Miao, Q.; Li, Q.; Zeng, D.: Fine-grained opinion mining by integrating multiple review sources (2010) 0.03
    0.032873254 = product of:
      0.06574651 = sum of:
        0.06574651 = product of:
          0.13149302 = sum of:
            0.13149302 = weight(_text_:2.0 in 4104) [ClassicSimilarity], result of:
              0.13149302 = score(doc=4104,freq=2.0), product of:
                0.29315117 = queryWeight, product of:
                  5.799733 = idf(docFreq=363, maxDocs=44218)
                  0.050545633 = queryNorm
                0.4485502 = fieldWeight in 4104, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.799733 = idf(docFreq=363, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=4104)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    With the rapid development of Web 2.0, online reviews have become extremely valuable sources for mining customers' opinions. Fine-grained opinion mining has attracted more and more attention of both applied and theoretical research. In this article, the authors study how to automatically mine product features and opinions from multiple review sources. Specifically, they propose an integration strategy to solve the issue. Within the integration strategy, the authors mine domain knowledge from semistructured reviews and then exploit the domain knowledge to assist product feature extraction and sentiment orientation identification from unstructured reviews. Finally, feature-opinion tuples are generated. Experimental results on real-world datasets show that the proposed approach is effective.
  2. Wu, X.: Rule induction with extension matrices (1998) 0.03
    0.028177075 = product of:
      0.05635415 = sum of:
        0.05635415 = product of:
          0.1127083 = sum of:
            0.1127083 = weight(_text_:2.0 in 2912) [ClassicSimilarity], result of:
              0.1127083 = score(doc=2912,freq=2.0), product of:
                0.29315117 = queryWeight, product of:
                  5.799733 = idf(docFreq=363, maxDocs=44218)
                  0.050545633 = queryNorm
                0.3844716 = fieldWeight in 2912, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.799733 = idf(docFreq=363, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2912)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Presents a heuristic, attribute-based, noise-tolerant data mining program, HCV (Version 2.0), absed on the newly-developed extension matrix approach. Gives a simple example of attribute-based induction to show the difference between the rules in variable-valued logic produced by HCV, the decision tree generated by C4.5 and the decision tree's decompiled rules by C4.5 rules. Outlines the extension matrix approach for data mining. Describes the HCV algorithm in detail. Outlines techniques developed and implemented in the HCV program for noise handling and discretization of continuous domains respectively. Follows these with a performance comparison of HCV with famous ID3-like algorithms including C4.5 and C4.5 rules on a collection of standard databases including the famous MONK's problems
  3. Kulathuramaiyer, N.; Maurer, H.: Implications of emerging data mining (2009) 0.03
    0.028177075 = product of:
      0.05635415 = sum of:
        0.05635415 = product of:
          0.1127083 = sum of:
            0.1127083 = weight(_text_:2.0 in 3144) [ClassicSimilarity], result of:
              0.1127083 = score(doc=3144,freq=2.0), product of:
                0.29315117 = queryWeight, product of:
                  5.799733 = idf(docFreq=363, maxDocs=44218)
                  0.050545633 = queryNorm
                0.3844716 = fieldWeight in 3144, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.799733 = idf(docFreq=363, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3144)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Social Semantic Web: Web 2.0, was nun? Hrsg.: A. Blumauer u. T. Pellegrini
  4. Chowdhury, G.G.: Template mining for information extraction from digital documents (1999) 0.02
    0.023968823 = product of:
      0.047937647 = sum of:
        0.047937647 = product of:
          0.09587529 = sum of:
            0.09587529 = weight(_text_:22 in 4577) [ClassicSimilarity], result of:
              0.09587529 = score(doc=4577,freq=2.0), product of:
                0.17700219 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050545633 = queryNorm
                0.5416616 = fieldWeight in 4577, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=4577)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    2. 4.2000 18:01:22
  5. Zhang, Z.; Li, Q.; Zeng, D.; Ga, H.: Extracting evolutionary communities in community question answering (2014) 0.02
    0.023480896 = product of:
      0.04696179 = sum of:
        0.04696179 = product of:
          0.09392358 = sum of:
            0.09392358 = weight(_text_:2.0 in 1286) [ClassicSimilarity], result of:
              0.09392358 = score(doc=1286,freq=2.0), product of:
                0.29315117 = queryWeight, product of:
                  5.799733 = idf(docFreq=363, maxDocs=44218)
                  0.050545633 = queryNorm
                0.320393 = fieldWeight in 1286, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.799733 = idf(docFreq=363, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1286)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    With the rapid growth of Web 2.0, community question answering (CQA) has become a prevalent information seeking channel, in which users form interactive communities by posting questions and providing answers. Communities may evolve over time, because of changes in users' interests, activities, and new users joining the network. To better understand user interactions in CQA communities, it is necessary to analyze the community structures and track community evolution over time. Existing work in CQA focuses on question searching or content quality detection, and the important problems of community extraction and evolutionary pattern detection have not been studied. In this article, we propose a probabilistic community model (PCM) to extract overlapping community structures and capture their evolution patterns in CQA. The empirical results show that our algorithm appears to improve the community extraction quality. We show empirically, using the iPhone data set, that interesting community evolution patterns can be discovered, with each evolution pattern reflecting the variation of users' interests over time. Our analysis suggests that individual users could benefit to gain comprehensive information from tracking the transition of products. We also show that the communities provide a decision-making basis for business.
  6. KDD : techniques and applications (1998) 0.02
    0.020544706 = product of:
      0.04108941 = sum of:
        0.04108941 = product of:
          0.08217882 = sum of:
            0.08217882 = weight(_text_:22 in 6783) [ClassicSimilarity], result of:
              0.08217882 = score(doc=6783,freq=2.0), product of:
                0.17700219 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050545633 = queryNorm
                0.46428138 = fieldWeight in 6783, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=6783)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Footnote
    A special issue of selected papers from the Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD'97), held Singapore, 22-23 Feb 1997
  7. Mining text data (2012) 0.02
    0.018784717 = product of:
      0.037569433 = sum of:
        0.037569433 = product of:
          0.07513887 = sum of:
            0.07513887 = weight(_text_:2.0 in 362) [ClassicSimilarity], result of:
              0.07513887 = score(doc=362,freq=2.0), product of:
                0.29315117 = queryWeight, product of:
                  5.799733 = idf(docFreq=363, maxDocs=44218)
                  0.050545633 = queryNorm
                0.2563144 = fieldWeight in 362, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.799733 = idf(docFreq=363, maxDocs=44218)
                  0.03125 = fieldNorm(doc=362)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Text mining applications have experienced tremendous advances because of web 2.0 and social networking applications. Recent advances in hardware and software technology have lead to a number of unique scenarios where text mining algorithms are learned. Mining Text Data introduces an important niche in the text analytics field, and is an edited volume contributed by leading international researchers and practitioners focused on social networks & data mining. This book contains a wide swath in topics across social networks & data mining. Each chapter contains a comprehensive survey including the key research content on the topic, and the future directions of research in the field. There is a special focus on Text Embedded with Heterogeneous and Multimedia Data which makes the mining process much more challenging. A number of methods have been designed such as transfer learning and cross-lingual mining for such cases. Mining Text Data simplifies the content, so that advanced-level students, practitioners and researchers in computer science can benefit from this book. Academic and corporate libraries, as well as ACM, IEEE, and Management Science focused on information security, electronic commerce, databases, data mining, machine learning, and statistics are the primary buyers for this reference book.
  8. Matson, L.D.; Bonski, D.J.: Do digital libraries need librarians? (1997) 0.01
    0.01369647 = product of:
      0.02739294 = sum of:
        0.02739294 = product of:
          0.05478588 = sum of:
            0.05478588 = weight(_text_:22 in 1737) [ClassicSimilarity], result of:
              0.05478588 = score(doc=1737,freq=2.0), product of:
                0.17700219 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050545633 = queryNorm
                0.30952093 = fieldWeight in 1737, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1737)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22.11.1998 18:57:22
  9. Amir, A.; Feldman, R.; Kashi, R.: ¬A new and versatile method for association generation (1997) 0.01
    0.01369647 = product of:
      0.02739294 = sum of:
        0.02739294 = product of:
          0.05478588 = sum of:
            0.05478588 = weight(_text_:22 in 1270) [ClassicSimilarity], result of:
              0.05478588 = score(doc=1270,freq=2.0), product of:
                0.17700219 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050545633 = queryNorm
                0.30952093 = fieldWeight in 1270, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1270)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Information systems. 22(1997) nos.5/6, S.333-347
  10. Hofstede, A.H.M. ter; Proper, H.A.; Van der Weide, T.P.: Exploiting fact verbalisation in conceptual information modelling (1997) 0.01
    0.011984412 = product of:
      0.023968823 = sum of:
        0.023968823 = product of:
          0.047937647 = sum of:
            0.047937647 = weight(_text_:22 in 2908) [ClassicSimilarity], result of:
              0.047937647 = score(doc=2908,freq=2.0), product of:
                0.17700219 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050545633 = queryNorm
                0.2708308 = fieldWeight in 2908, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2908)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Information systems. 22(1997) nos.5/6, S.349-385
  11. Hallonsten, O.; Holmberg, D.: Analyzing structural stratification in the Swedish higher education system : data contextualization with policy-history analysis (2013) 0.01
    0.008560294 = product of:
      0.017120589 = sum of:
        0.017120589 = product of:
          0.034241177 = sum of:
            0.034241177 = weight(_text_:22 in 668) [ClassicSimilarity], result of:
              0.034241177 = score(doc=668,freq=2.0), product of:
                0.17700219 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050545633 = queryNorm
                0.19345059 = fieldWeight in 668, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=668)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 3.2013 19:43:01
  12. Vaughan, L.; Chen, Y.: Data mining from web search queries : a comparison of Google trends and Baidu index (2015) 0.01
    0.008560294 = product of:
      0.017120589 = sum of:
        0.017120589 = product of:
          0.034241177 = sum of:
            0.034241177 = weight(_text_:22 in 1605) [ClassicSimilarity], result of:
              0.034241177 = score(doc=1605,freq=2.0), product of:
                0.17700219 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050545633 = queryNorm
                0.19345059 = fieldWeight in 1605, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1605)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Journal of the Association for Information Science and Technology. 66(2015) no.1, S.13-22
  13. Fonseca, F.; Marcinkowski, M.; Davis, C.: Cyber-human systems of thought and understanding (2019) 0.01
    0.008560294 = product of:
      0.017120589 = sum of:
        0.017120589 = product of:
          0.034241177 = sum of:
            0.034241177 = weight(_text_:22 in 5011) [ClassicSimilarity], result of:
              0.034241177 = score(doc=5011,freq=2.0), product of:
                0.17700219 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050545633 = queryNorm
                0.19345059 = fieldWeight in 5011, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5011)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    7. 3.2019 16:32:22
  14. Information visualization in data mining and knowledge discovery (2002) 0.00
    0.0034241176 = product of:
      0.006848235 = sum of:
        0.006848235 = product of:
          0.01369647 = sum of:
            0.01369647 = weight(_text_:22 in 1789) [ClassicSimilarity], result of:
              0.01369647 = score(doc=1789,freq=2.0), product of:
                0.17700219 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050545633 = queryNorm
                0.07738023 = fieldWeight in 1789, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.015625 = fieldNorm(doc=1789)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    23. 3.2008 19:10:22