Search (32 results, page 1 of 2)

  • × type_ss:"a"
  • × theme_ss:"Data Mining"
  1. Tunbridge, N.: Semiology put to data mining (1999) 0.02
    0.024524815 = product of:
      0.04904963 = sum of:
        0.04904963 = product of:
          0.14714889 = sum of:
            0.14714889 = weight(_text_:n in 6782) [ClassicSimilarity], result of:
              0.14714889 = score(doc=6782,freq=2.0), product of:
                0.19305801 = queryWeight, product of:
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.044775832 = queryNorm
                0.76220036 = fieldWeight in 6782, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.125 = fieldNorm(doc=6782)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  2. Peters, G.; Gaese, V.: ¬Das DocCat-System in der Textdokumentation von G+J (2003) 0.01
    0.014748422 = product of:
      0.029496845 = sum of:
        0.029496845 = product of:
          0.044245265 = sum of:
            0.019979235 = weight(_text_:j in 1507) [ClassicSimilarity], result of:
              0.019979235 = score(doc=1507,freq=2.0), product of:
                0.14227505 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.044775832 = queryNorm
                0.14042683 = fieldWeight in 1507, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.03125 = fieldNorm(doc=1507)
            0.024266029 = weight(_text_:22 in 1507) [ClassicSimilarity], result of:
              0.024266029 = score(doc=1507,freq=2.0), product of:
                0.15679733 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044775832 = queryNorm
                0.15476047 = fieldWeight in 1507, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03125 = fieldNorm(doc=1507)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Date
    22. 4.2003 11:45:36
  3. Chowdhury, G.G.: Template mining for information extraction from digital documents (1999) 0.01
    0.014155183 = product of:
      0.028310366 = sum of:
        0.028310366 = product of:
          0.0849311 = sum of:
            0.0849311 = weight(_text_:22 in 4577) [ClassicSimilarity], result of:
              0.0849311 = score(doc=4577,freq=2.0), product of:
                0.15679733 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044775832 = queryNorm
                0.5416616 = fieldWeight in 4577, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=4577)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Date
    2. 4.2000 18:01:22
  4. Fayyad, U.M.; Djorgovski, S.G.; Weir, N.: From digitized images to online catalogs : data ming a sky server (1996) 0.01
    0.012262408 = product of:
      0.024524815 = sum of:
        0.024524815 = product of:
          0.073574446 = sum of:
            0.073574446 = weight(_text_:n in 6625) [ClassicSimilarity], result of:
              0.073574446 = score(doc=6625,freq=2.0), product of:
                0.19305801 = queryWeight, product of:
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.044775832 = queryNorm
                0.38110018 = fieldWeight in 6625, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.0625 = fieldNorm(doc=6625)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  5. Budzik, J.; Hammond, K.J.; Birnbaum, L.: Information access in context (2001) 0.01
    0.011654554 = product of:
      0.023309108 = sum of:
        0.023309108 = product of:
          0.06992732 = sum of:
            0.06992732 = weight(_text_:j in 3835) [ClassicSimilarity], result of:
              0.06992732 = score(doc=3835,freq=2.0), product of:
                0.14227505 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.044775832 = queryNorm
                0.4914939 = fieldWeight in 3835, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.109375 = fieldNorm(doc=3835)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  6. Schmid, J.: Data mining : wie finde ich in Datensammlungen entscheidungsrelevante Muster? (1999) 0.01
    0.011654554 = product of:
      0.023309108 = sum of:
        0.023309108 = product of:
          0.06992732 = sum of:
            0.06992732 = weight(_text_:j in 4540) [ClassicSimilarity], result of:
              0.06992732 = score(doc=4540,freq=2.0), product of:
                0.14227505 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.044775832 = queryNorm
                0.4914939 = fieldWeight in 4540, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.109375 = fieldNorm(doc=4540)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  7. Maaten, L. van den: Accelerating t-SNE using Tree-Based Algorithms (2014) 0.01
    0.010729606 = product of:
      0.021459213 = sum of:
        0.021459213 = product of:
          0.064377636 = sum of:
            0.064377636 = weight(_text_:n in 3886) [ClassicSimilarity], result of:
              0.064377636 = score(doc=3886,freq=2.0), product of:
                0.19305801 = queryWeight, product of:
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.044775832 = queryNorm
                0.33346266 = fieldWeight in 3886, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=3886)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Abstract
    The paper investigates the acceleration of t-SNE-an embedding technique that is commonly used for the visualization of high-dimensional data in scatter plots-using two tree-based algorithms. In particular, the paper develops variants of the Barnes-Hut algorithm and of the dual-tree algorithm that approximate the gradient used for learning t-SNE embeddings in O(N*logN). Our experiments show that the resulting algorithms substantially accelerate t-SNE, and that they make it possible to learn embeddings of data sets with millions of objects. Somewhat counterintuitively, the Barnes-Hut variant of t-SNE appears to outperform the dual-tree variant.
  8. Perugini, S.; Ramakrishnan, N.: Mining Web functional dependencies for flexible information access (2007) 0.01
    0.009196806 = product of:
      0.018393612 = sum of:
        0.018393612 = product of:
          0.055180833 = sum of:
            0.055180833 = weight(_text_:n in 602) [ClassicSimilarity], result of:
              0.055180833 = score(doc=602,freq=2.0), product of:
                0.19305801 = queryWeight, product of:
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.044775832 = queryNorm
                0.28582513 = fieldWeight in 602, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.046875 = fieldNorm(doc=602)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  9. Whittle, M.; Eaglestone, B.; Ford, N.; Gillet, V.J.; Madden, A.: Data mining of search engine logs (2007) 0.01
    0.009196806 = product of:
      0.018393612 = sum of:
        0.018393612 = product of:
          0.055180833 = sum of:
            0.055180833 = weight(_text_:n in 1330) [ClassicSimilarity], result of:
              0.055180833 = score(doc=1330,freq=2.0), product of:
                0.19305801 = queryWeight, product of:
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.044775832 = queryNorm
                0.28582513 = fieldWeight in 1330, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1330)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  10. Kulathuramaiyer, N.; Maurer, H.: Implications of emerging data mining (2009) 0.01
    0.009196806 = product of:
      0.018393612 = sum of:
        0.018393612 = product of:
          0.055180833 = sum of:
            0.055180833 = weight(_text_:n in 3144) [ClassicSimilarity], result of:
              0.055180833 = score(doc=3144,freq=2.0), product of:
                0.19305801 = queryWeight, product of:
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.044775832 = queryNorm
                0.28582513 = fieldWeight in 3144, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3144)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  11. Teich, E.; Degaetano-Ortlieb, S.; Fankhauser, P.; Kermes, H.; Lapshinova-Koltunski, E.: ¬The linguistic construal of disciplinarity : a data-mining approach using register features (2016) 0.01
    0.009196806 = product of:
      0.018393612 = sum of:
        0.018393612 = product of:
          0.055180833 = sum of:
            0.055180833 = weight(_text_:n in 3015) [ClassicSimilarity], result of:
              0.055180833 = score(doc=3015,freq=2.0), product of:
                0.19305801 = queryWeight, product of:
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.044775832 = queryNorm
                0.28582513 = fieldWeight in 3015, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3015)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Abstract
    We analyze the linguistic evolution of selected scientific disciplines over a 30-year time span (1970s to 2000s). Our focus is on four highly specialized disciplines at the boundaries of computer science that emerged during that time: computational linguistics, bioinformatics, digital construction, and microelectronics. Our analysis is driven by the question whether these disciplines develop a distinctive language use-both individually and collectively-over the given time period. The data set is the English Scientific Text Corpus (scitex), which includes texts from the 1970s/1980s and early 2000s. Our theoretical basis is register theory. In terms of methods, we combine corpus-based methods of feature extraction (various aggregated features [part-of-speech based], n-grams, lexico-grammatical patterns) and automatic text classification. The results of our research are directly relevant to the study of linguistic variation and languages for specific purposes (LSP) and have implications for various natural language processing (NLP) tasks, for example, authorship attribution, text mining, or training NLP tools.
  12. Matson, L.D.; Bonski, D.J.: Do digital libraries need librarians? (1997) 0.01
    0.008088676 = product of:
      0.016177353 = sum of:
        0.016177353 = product of:
          0.048532058 = sum of:
            0.048532058 = weight(_text_:22 in 1737) [ClassicSimilarity], result of:
              0.048532058 = score(doc=1737,freq=2.0), product of:
                0.15679733 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044775832 = queryNorm
                0.30952093 = fieldWeight in 1737, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1737)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Date
    22.11.1998 18:57:22
  13. Amir, A.; Feldman, R.; Kashi, R.: ¬A new and versatile method for association generation (1997) 0.01
    0.008088676 = product of:
      0.016177353 = sum of:
        0.016177353 = product of:
          0.048532058 = sum of:
            0.048532058 = weight(_text_:22 in 1270) [ClassicSimilarity], result of:
              0.048532058 = score(doc=1270,freq=2.0), product of:
                0.15679733 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044775832 = queryNorm
                0.30952093 = fieldWeight in 1270, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1270)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Source
    Information systems. 22(1997) nos.5/6, S.333-347
  14. Suakkaphong, N.; Zhang, Z.; Chen, H.: Disease named entity recognition using semisupervised learning and conditional random fields (2011) 0.01
    0.0076640043 = product of:
      0.015328009 = sum of:
        0.015328009 = product of:
          0.045984026 = sum of:
            0.045984026 = weight(_text_:n in 4367) [ClassicSimilarity], result of:
              0.045984026 = score(doc=4367,freq=2.0), product of:
                0.19305801 = queryWeight, product of:
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.044775832 = queryNorm
                0.23818761 = fieldWeight in 4367, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4367)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  15. Tu, Y.-N.; Hsu, S.-L.: Constructing conceptual trajectory maps to trace the development of research fields (2016) 0.01
    0.0076640043 = product of:
      0.015328009 = sum of:
        0.015328009 = product of:
          0.045984026 = sum of:
            0.045984026 = weight(_text_:n in 3059) [ClassicSimilarity], result of:
              0.045984026 = score(doc=3059,freq=2.0), product of:
                0.19305801 = queryWeight, product of:
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.044775832 = queryNorm
                0.23818761 = fieldWeight in 3059, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3059)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  16. Borgman, C.L.; Wofford, M.F.; Golshan, M.S.; Darch, P.T.: Collaborative qualitative research at scale : reflections on 20 years of acquiring global data and making data global (2021) 0.01
    0.0076640043 = product of:
      0.015328009 = sum of:
        0.015328009 = product of:
          0.045984026 = sum of:
            0.045984026 = weight(_text_:n in 239) [ClassicSimilarity], result of:
              0.045984026 = score(doc=239,freq=2.0), product of:
                0.19305801 = queryWeight, product of:
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.044775832 = queryNorm
                0.23818761 = fieldWeight in 239, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=239)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Abstract
    A 5-year project to study scientific data uses in geography, starting in 1999, evolved into 20 years of research on data practices in sensor networks, environmental sciences, biology, seismology, undersea science, biomedicine, astronomy, and other fields. By emulating the "team science" approaches of the scientists studied, the UCLA Center for Knowledge Infrastructures accumulated a comprehensive collection of qualitative data about how scientists generate, manage, use, and reuse data across domains. Building upon Paul N. Edwards's model of "making global data"-collecting signals via consistent methods, technologies, and policies-to "make data global"-comparing and integrating those data, the research team has managed and exploited these data as a collaborative resource. This article reflects on the social, technical, organizational, economic, and policy challenges the team has encountered in creating new knowledge from data old and new. We reflect on continuity over generations of students and staff, transitions between grants, transfer of legacy data between software tools, research methods, and the role of professional data managers in the social sciences.
  17. Goldberg, D.M.; Zaman, N.; Brahma, A.; Aloiso, M.: Are mortgage loan closing delay risks predictable? : A predictive analysis using text mining on discussion threads (2022) 0.01
    0.0076640043 = product of:
      0.015328009 = sum of:
        0.015328009 = product of:
          0.045984026 = sum of:
            0.045984026 = weight(_text_:n in 501) [ClassicSimilarity], result of:
              0.045984026 = score(doc=501,freq=2.0), product of:
                0.19305801 = queryWeight, product of:
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.044775832 = queryNorm
                0.23818761 = fieldWeight in 501, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=501)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  18. Hofstede, A.H.M. ter; Proper, H.A.; Van der Weide, T.P.: Exploiting fact verbalisation in conceptual information modelling (1997) 0.01
    0.0070775915 = product of:
      0.014155183 = sum of:
        0.014155183 = product of:
          0.04246555 = sum of:
            0.04246555 = weight(_text_:22 in 2908) [ClassicSimilarity], result of:
              0.04246555 = score(doc=2908,freq=2.0), product of:
                0.15679733 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044775832 = queryNorm
                0.2708308 = fieldWeight in 2908, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2908)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Source
    Information systems. 22(1997) nos.5/6, S.349-385
  19. Li, J.; Zhang, P.; Cao, J.: External concept support for group support systems through Web mining (2009) 0.01
    0.007063727 = product of:
      0.014127454 = sum of:
        0.014127454 = product of:
          0.04238236 = sum of:
            0.04238236 = weight(_text_:j in 2806) [ClassicSimilarity], result of:
              0.04238236 = score(doc=2806,freq=4.0), product of:
                0.14227505 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.044775832 = queryNorm
                0.2978903 = fieldWeight in 2806, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2806)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  20. Hereth, J.; Stumme, G.; Wille, R.; Wille, U.: Conceptual knowledge discovery and data analysis (2000) 0.01
    0.0058864383 = product of:
      0.011772877 = sum of:
        0.011772877 = product of:
          0.035318628 = sum of:
            0.035318628 = weight(_text_:j in 5083) [ClassicSimilarity], result of:
              0.035318628 = score(doc=5083,freq=4.0), product of:
                0.14227505 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.044775832 = queryNorm
                0.2482419 = fieldWeight in 5083, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5083)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Abstract
    In this paper, we discuss Conceptual Knowledge Discovery in Databases (CKDD) in its connection with Data Analysis. Our approach is based on Formal Concept Analysis, a mathematical theory which has been developed and proven useful during the last 20 years. Formal Concept Analysis has led to a theory of conceptual information systems which has been applied by using the management system TOSCANA in a wide range of domains. In this paper, we use such an application in database marketing to demonstrate how methods and procedures of CKDD can be applied in Data Analysis. In particular, we show the interplay and integration of data mining and data analysis techniques based on Formal Concept Analysis. The main concern of this paper is to explain how the transition from data to knowledge can be supported by a TOSCANA system. To clarify the transition steps we discuss their correspondence to the five levels of knowledge representation established by R. Brachman and to the steps of empirically grounded theory building proposed by A. Strauss and J. Corbin

Years

Languages

  • e 27
  • d 5