Search (120 results, page 2 of 6)

  • × theme_ss:"Data Mining"
  • × type_ss:"a"
  1. Liu, W.; Weichselbraun, A.; Scharl, A.; Chang, E.: Semi-automatic ontology extension using spreading activation (2005) 0.00
    0.0025998545 = product of:
      0.005199709 = sum of:
        0.005199709 = product of:
          0.010399418 = sum of:
            0.010399418 = weight(_text_:e in 3028) [ClassicSimilarity], result of:
              0.010399418 = score(doc=3028,freq=4.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.15721233 = fieldWeight in 3028, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=3028)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  2. Wu, K.J.; Chen, M.-C.; Sun, Y.: Automatic topics discovery from hyperlinked documents (2004) 0.00
    0.0022284468 = product of:
      0.0044568935 = sum of:
        0.0044568935 = product of:
          0.008913787 = sum of:
            0.008913787 = weight(_text_:e in 2563) [ClassicSimilarity], result of:
              0.008913787 = score(doc=2563,freq=4.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.13475344 = fieldWeight in 2563, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2563)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Topic discovery is an important means for marketing, e-Business and social science studies. As well, it can be applied to various purposes, such as identifying a group with certain properties and observing the emergence and diminishment of a certain cyber community. Previous topic discovery work (J.M. Kleinberg, Proceedings of the 9th Annual ACM-SIAM Symposium on Discrete Algorithms, San Francisco, California, p. 668) requires manual judgment of usefulness of outcomes and is thus incapable of handling the explosive growth of the Internet. In this paper, we propose the Automatic Topic Discovery (ATD) method, which combines a method of base set construction, a clustering algorithm and an iterative principal eigenvector computation method to discover the topics relevant to a given query without using manual examination. Given a query, ATD returns with topics associated with the query and top representative pages for each topic. Our experiments show that the ATD method performs better than the traditional eigenvector method in terms of computation time and topic discovery quality.
    Language
    e
  3. Dang, X.H.; Ong. K.-L.: Knowledge discovery in data streams (2009) 0.00
    0.0022284468 = product of:
      0.0044568935 = sum of:
        0.0044568935 = product of:
          0.008913787 = sum of:
            0.008913787 = weight(_text_:e in 3829) [ClassicSimilarity], result of:
              0.008913787 = score(doc=3829,freq=4.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.13475344 = fieldWeight in 3829, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3829)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Footnote
    Vgl.: http://www.tandfonline.com/doi/book/10.1081/E-ELIS3.
    Language
    e
  4. Sarnikar, S.; Zhang, Z.; Zhao, J.L.: Query-performance prediction for effective query routing in domain-specific repositories (2014) 0.00
    0.0022284468 = product of:
      0.0044568935 = sum of:
        0.0044568935 = product of:
          0.008913787 = sum of:
            0.008913787 = weight(_text_:e in 1326) [ClassicSimilarity], result of:
              0.008913787 = score(doc=1326,freq=4.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.13475344 = fieldWeight in 1326, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1326)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    The effective use of corporate memory is becoming increasingly important because every aspect of e-business requires access to information repositories. Unfortunately, less-than-satisfying effectiveness in state-of-the-art information-retrieval techniques is well known, even for some of the best search engines such as Google. In this study, the authors resolve this retrieval ineffectiveness problem by developing a new framework for predicting query performance, which is the first step toward better retrieval effectiveness. Specifically, they examine the relationship between query performance and query context. A query context consists of the query itself, the document collection, and the interaction between the two. The authors first analyze the characteristics of query context and develop various features for predicting query performance. Then, they propose a context-sensitive model for predicting query performance based on the characteristics of the query and the document collection. Finally, they validate this model with respect to five real-world collections of documents and demonstrate its utility in routing queries to the correct repository with high accuracy.
    Language
    e
  5. Bella, A. La; Fronzetti Colladon, A.; Battistoni, E.; Castellan, S.; Francucci, M.: Assessing perceived organizational leadership styles through twitter text mining (2018) 0.00
    0.0022284468 = product of:
      0.0044568935 = sum of:
        0.0044568935 = product of:
          0.008913787 = sum of:
            0.008913787 = weight(_text_:e in 2400) [ClassicSimilarity], result of:
              0.008913787 = score(doc=2400,freq=4.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.13475344 = fieldWeight in 2400, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2400)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  6. Ebrahimi, M.; ShafieiBavani, E.; Wong, R.; Chen, F.: Twitter user geolocation by filtering of highly mentioned users (2018) 0.00
    0.0022284468 = product of:
      0.0044568935 = sum of:
        0.0044568935 = product of:
          0.008913787 = sum of:
            0.008913787 = weight(_text_:e in 4286) [ClassicSimilarity], result of:
              0.008913787 = score(doc=4286,freq=4.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.13475344 = fieldWeight in 4286, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.046875 = fieldNorm(doc=4286)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  7. Fayyad, U.M.; Djorgovski, S.G.; Weir, N.: From digitized images to online catalogs : data ming a sky server (1996) 0.00
    0.0021009997 = product of:
      0.0042019994 = sum of:
        0.0042019994 = product of:
          0.008403999 = sum of:
            0.008403999 = weight(_text_:e in 6625) [ClassicSimilarity], result of:
              0.008403999 = score(doc=6625,freq=2.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.12704675 = fieldWeight in 6625, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.0625 = fieldNorm(doc=6625)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  8. Chen, Z.: Knowledge discovery and system-user partnership : on a production 'adversarial partnership' approach (1994) 0.00
    0.0021009997 = product of:
      0.0042019994 = sum of:
        0.0042019994 = product of:
          0.008403999 = sum of:
            0.008403999 = weight(_text_:e in 6759) [ClassicSimilarity], result of:
              0.008403999 = score(doc=6759,freq=2.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.12704675 = fieldWeight in 6759, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.0625 = fieldNorm(doc=6759)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  9. Cardie, C.: Empirical methods in information extraction (1997) 0.00
    0.0021009997 = product of:
      0.0042019994 = sum of:
        0.0042019994 = product of:
          0.008403999 = sum of:
            0.008403999 = weight(_text_:e in 3246) [ClassicSimilarity], result of:
              0.008403999 = score(doc=3246,freq=2.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.12704675 = fieldWeight in 3246, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.0625 = fieldNorm(doc=3246)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  10. Brückner, T.; Dambeck, H.: Sortierautomaten : Grundlagen der Textklassifizierung (2003) 0.00
    0.0021009997 = product of:
      0.0042019994 = sum of:
        0.0042019994 = product of:
          0.008403999 = sum of:
            0.008403999 = weight(_text_:e in 2398) [ClassicSimilarity], result of:
              0.008403999 = score(doc=2398,freq=2.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.12704675 = fieldWeight in 2398, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.0625 = fieldNorm(doc=2398)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Rechnung, Kündigung oder Adressänderung? Eingehende Briefe und E-Mails werden immer häufiger von Software statt aufwändig von Menschenhand sortiert. Die Textklassifizierer arbeiten erstaunlich genau. Sie fahnden auch nach ähnlichen Texten und sorgen so für einen schnellen Überblick. Ihre Werkzeuge sind Linguistik, Statistik und Logik
  11. Bath, P.A.: Data mining in health and medical information (2003) 0.00
    0.0021009997 = product of:
      0.0042019994 = sum of:
        0.0042019994 = product of:
          0.008403999 = sum of:
            0.008403999 = weight(_text_:e in 4263) [ClassicSimilarity], result of:
              0.008403999 = score(doc=4263,freq=2.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.12704675 = fieldWeight in 4263, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.0625 = fieldNorm(doc=4263)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  12. Derek Doran, D.; Gokhale, S.S.: ¬A classification framework for web robots (2012) 0.00
    0.0021009997 = product of:
      0.0042019994 = sum of:
        0.0042019994 = product of:
          0.008403999 = sum of:
            0.008403999 = weight(_text_:e in 505) [ClassicSimilarity], result of:
              0.008403999 = score(doc=505,freq=2.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.12704675 = fieldWeight in 505, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.0625 = fieldNorm(doc=505)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  13. Chardonnens, A.; Hengchen, S.: Text mining for cultural heritage institutions : a 5-step method for cultural heritage institutions (2017) 0.00
    0.0021009997 = product of:
      0.0042019994 = sum of:
        0.0042019994 = product of:
          0.008403999 = sum of:
            0.008403999 = weight(_text_:e in 646) [ClassicSimilarity], result of:
              0.008403999 = score(doc=646,freq=2.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.12704675 = fieldWeight in 646, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.0625 = fieldNorm(doc=646)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  14. Wattenberg, M.; Viégas, F.; Johnson, I.: How to use t-SNE effectively (2016) 0.00
    0.0021009997 = product of:
      0.0042019994 = sum of:
        0.0042019994 = product of:
          0.008403999 = sum of:
            0.008403999 = weight(_text_:e in 3887) [ClassicSimilarity], result of:
              0.008403999 = score(doc=3887,freq=2.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.12704675 = fieldWeight in 3887, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.0625 = fieldNorm(doc=3887)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  15. Jones, K.M.L.; Rubel, A.; LeClere, E.: ¬A matter of trust : higher education institutions as information fiduciaries in an age of educational data mining and learning analytics (2020) 0.00
    0.0018570389 = product of:
      0.0037140779 = sum of:
        0.0037140779 = product of:
          0.0074281557 = sum of:
            0.0074281557 = weight(_text_:e in 5968) [ClassicSimilarity], result of:
              0.0074281557 = score(doc=5968,freq=4.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.112294525 = fieldWeight in 5968, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5968)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  16. Trybula, W.J.: Data mining and knowledge discovery (1997) 0.00
    0.0018383748 = product of:
      0.0036767495 = sum of:
        0.0036767495 = product of:
          0.007353499 = sum of:
            0.007353499 = weight(_text_:e in 2300) [ClassicSimilarity], result of:
              0.007353499 = score(doc=2300,freq=2.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.1111659 = fieldWeight in 2300, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2300)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  17. Wong, S.K.M.; Butz, C.J.; Xiang, X.: Automated database schema design using mined data dependencies (1998) 0.00
    0.0018383748 = product of:
      0.0036767495 = sum of:
        0.0036767495 = product of:
          0.007353499 = sum of:
            0.007353499 = weight(_text_:e in 2897) [ClassicSimilarity], result of:
              0.007353499 = score(doc=2897,freq=2.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.1111659 = fieldWeight in 2897, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2897)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  18. Raghavan, V.V.; Deogun, J.S.; Sever, H.: Knowledge discovery and data mining : introduction (1998) 0.00
    0.0018383748 = product of:
      0.0036767495 = sum of:
        0.0036767495 = product of:
          0.007353499 = sum of:
            0.007353499 = weight(_text_:e in 2899) [ClassicSimilarity], result of:
              0.007353499 = score(doc=2899,freq=2.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.1111659 = fieldWeight in 2899, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2899)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  19. Bell, D.A.; Guan, J.W.: Computational methods for rough classification and discovery (1998) 0.00
    0.0018383748 = product of:
      0.0036767495 = sum of:
        0.0036767495 = product of:
          0.007353499 = sum of:
            0.007353499 = weight(_text_:e in 2909) [ClassicSimilarity], result of:
              0.007353499 = score(doc=2909,freq=2.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.1111659 = fieldWeight in 2909, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2909)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  20. Lingras, P.J.; Yao, Y.Y.: Data mining using extensions of the rough set model (1998) 0.00
    0.0018383748 = product of:
      0.0036767495 = sum of:
        0.0036767495 = product of:
          0.007353499 = sum of:
            0.007353499 = weight(_text_:e in 2910) [ClassicSimilarity], result of:
              0.007353499 = score(doc=2910,freq=2.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.1111659 = fieldWeight in 2910, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2910)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e

Years

Languages

  • e 114
  • d 6
  • More… Less…

Classifications