Search (114 results, page 1 of 6)

  • × language_ss:"e"
  • × theme_ss:"Data Mining"
  • × type_ss:"a"
  1. Chowdhury, G.G.: Template mining for information extraction from digital documents (1999) 0.05
    0.050999753 = product of:
      0.10199951 = sum of:
        0.10199951 = sum of:
          0.014706998 = weight(_text_:e in 4577) [ClassicSimilarity], result of:
            0.014706998 = score(doc=4577,freq=2.0), product of:
              0.06614887 = queryWeight, product of:
                1.43737 = idf(docFreq=28552, maxDocs=44218)
                0.04602077 = queryNorm
              0.2223318 = fieldWeight in 4577, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                1.43737 = idf(docFreq=28552, maxDocs=44218)
                0.109375 = fieldNorm(doc=4577)
          0.08729251 = weight(_text_:22 in 4577) [ClassicSimilarity], result of:
            0.08729251 = score(doc=4577,freq=2.0), product of:
              0.1611569 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.04602077 = queryNorm
              0.5416616 = fieldWeight in 4577, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.109375 = fieldNorm(doc=4577)
      0.5 = coord(1/2)
    
    Date
    2. 4.2000 18:01:22
    Language
    e
  2. Matson, L.D.; Bonski, D.J.: Do digital libraries need librarians? (1997) 0.03
    0.029142715 = product of:
      0.05828543 = sum of:
        0.05828543 = sum of:
          0.008403999 = weight(_text_:e in 1737) [ClassicSimilarity], result of:
            0.008403999 = score(doc=1737,freq=2.0), product of:
              0.06614887 = queryWeight, product of:
                1.43737 = idf(docFreq=28552, maxDocs=44218)
                0.04602077 = queryNorm
              0.12704675 = fieldWeight in 1737, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                1.43737 = idf(docFreq=28552, maxDocs=44218)
                0.0625 = fieldNorm(doc=1737)
          0.049881432 = weight(_text_:22 in 1737) [ClassicSimilarity], result of:
            0.049881432 = score(doc=1737,freq=2.0), product of:
              0.1611569 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.04602077 = queryNorm
              0.30952093 = fieldWeight in 1737, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0625 = fieldNorm(doc=1737)
      0.5 = coord(1/2)
    
    Date
    22.11.1998 18:57:22
    Language
    e
  3. Amir, A.; Feldman, R.; Kashi, R.: ¬A new and versatile method for association generation (1997) 0.03
    0.029142715 = product of:
      0.05828543 = sum of:
        0.05828543 = sum of:
          0.008403999 = weight(_text_:e in 1270) [ClassicSimilarity], result of:
            0.008403999 = score(doc=1270,freq=2.0), product of:
              0.06614887 = queryWeight, product of:
                1.43737 = idf(docFreq=28552, maxDocs=44218)
                0.04602077 = queryNorm
              0.12704675 = fieldWeight in 1270, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                1.43737 = idf(docFreq=28552, maxDocs=44218)
                0.0625 = fieldNorm(doc=1270)
          0.049881432 = weight(_text_:22 in 1270) [ClassicSimilarity], result of:
            0.049881432 = score(doc=1270,freq=2.0), product of:
              0.1611569 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.04602077 = queryNorm
              0.30952093 = fieldWeight in 1270, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0625 = fieldNorm(doc=1270)
      0.5 = coord(1/2)
    
    Language
    e
    Source
    Information systems. 22(1997) nos.5/6, S.333-347
  4. Hofstede, A.H.M. ter; Proper, H.A.; Van der Weide, T.P.: Exploiting fact verbalisation in conceptual information modelling (1997) 0.03
    0.025499877 = product of:
      0.050999753 = sum of:
        0.050999753 = sum of:
          0.007353499 = weight(_text_:e in 2908) [ClassicSimilarity], result of:
            0.007353499 = score(doc=2908,freq=2.0), product of:
              0.06614887 = queryWeight, product of:
                1.43737 = idf(docFreq=28552, maxDocs=44218)
                0.04602077 = queryNorm
              0.1111659 = fieldWeight in 2908, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                1.43737 = idf(docFreq=28552, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2908)
          0.043646254 = weight(_text_:22 in 2908) [ClassicSimilarity], result of:
            0.043646254 = score(doc=2908,freq=2.0), product of:
              0.1611569 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.04602077 = queryNorm
              0.2708308 = fieldWeight in 2908, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2908)
      0.5 = coord(1/2)
    
    Language
    e
    Source
    Information systems. 22(1997) nos.5/6, S.349-385
  5. Hallonsten, O.; Holmberg, D.: Analyzing structural stratification in the Swedish higher education system : data contextualization with policy-history analysis (2013) 0.02
    0.018214198 = product of:
      0.036428396 = sum of:
        0.036428396 = sum of:
          0.0052524996 = weight(_text_:e in 668) [ClassicSimilarity], result of:
            0.0052524996 = score(doc=668,freq=2.0), product of:
              0.06614887 = queryWeight, product of:
                1.43737 = idf(docFreq=28552, maxDocs=44218)
                0.04602077 = queryNorm
              0.07940422 = fieldWeight in 668, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                1.43737 = idf(docFreq=28552, maxDocs=44218)
                0.0390625 = fieldNorm(doc=668)
          0.031175895 = weight(_text_:22 in 668) [ClassicSimilarity], result of:
            0.031175895 = score(doc=668,freq=2.0), product of:
              0.1611569 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.04602077 = queryNorm
              0.19345059 = fieldWeight in 668, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0390625 = fieldNorm(doc=668)
      0.5 = coord(1/2)
    
    Date
    22. 3.2013 19:43:01
    Language
    e
  6. Vaughan, L.; Chen, Y.: Data mining from web search queries : a comparison of Google trends and Baidu index (2015) 0.02
    0.018214198 = product of:
      0.036428396 = sum of:
        0.036428396 = sum of:
          0.0052524996 = weight(_text_:e in 1605) [ClassicSimilarity], result of:
            0.0052524996 = score(doc=1605,freq=2.0), product of:
              0.06614887 = queryWeight, product of:
                1.43737 = idf(docFreq=28552, maxDocs=44218)
                0.04602077 = queryNorm
              0.07940422 = fieldWeight in 1605, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                1.43737 = idf(docFreq=28552, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1605)
          0.031175895 = weight(_text_:22 in 1605) [ClassicSimilarity], result of:
            0.031175895 = score(doc=1605,freq=2.0), product of:
              0.1611569 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.04602077 = queryNorm
              0.19345059 = fieldWeight in 1605, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1605)
      0.5 = coord(1/2)
    
    Language
    e
    Source
    Journal of the Association for Information Science and Technology. 66(2015) no.1, S.13-22
  7. Fonseca, F.; Marcinkowski, M.; Davis, C.: Cyber-human systems of thought and understanding (2019) 0.02
    0.018214198 = product of:
      0.036428396 = sum of:
        0.036428396 = sum of:
          0.0052524996 = weight(_text_:e in 5011) [ClassicSimilarity], result of:
            0.0052524996 = score(doc=5011,freq=2.0), product of:
              0.06614887 = queryWeight, product of:
                1.43737 = idf(docFreq=28552, maxDocs=44218)
                0.04602077 = queryNorm
              0.07940422 = fieldWeight in 5011, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                1.43737 = idf(docFreq=28552, maxDocs=44218)
                0.0390625 = fieldNorm(doc=5011)
          0.031175895 = weight(_text_:22 in 5011) [ClassicSimilarity], result of:
            0.031175895 = score(doc=5011,freq=2.0), product of:
              0.1611569 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.04602077 = queryNorm
              0.19345059 = fieldWeight in 5011, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0390625 = fieldNorm(doc=5011)
      0.5 = coord(1/2)
    
    Date
    7. 3.2019 16:32:22
    Language
    e
  8. Howlett, D.: Digging deep for treasure (1998) 0.00
    0.0042019994 = product of:
      0.008403999 = sum of:
        0.008403999 = product of:
          0.016807998 = sum of:
            0.016807998 = weight(_text_:e in 4544) [ClassicSimilarity], result of:
              0.016807998 = score(doc=4544,freq=2.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.2540935 = fieldWeight in 4544, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.125 = fieldNorm(doc=4544)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  9. Tunbridge, N.: Semiology put to data mining (1999) 0.00
    0.0042019994 = product of:
      0.008403999 = sum of:
        0.008403999 = product of:
          0.016807998 = sum of:
            0.016807998 = weight(_text_:e in 6782) [ClassicSimilarity], result of:
              0.016807998 = score(doc=6782,freq=2.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.2540935 = fieldWeight in 6782, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.125 = fieldNorm(doc=6782)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  10. Budzik, J.; Hammond, K.J.; Birnbaum, L.: Information access in context (2001) 0.00
    0.0036767495 = product of:
      0.007353499 = sum of:
        0.007353499 = product of:
          0.014706998 = sum of:
            0.014706998 = weight(_text_:e in 3835) [ClassicSimilarity], result of:
              0.014706998 = score(doc=3835,freq=2.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.2223318 = fieldWeight in 3835, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.109375 = fieldNorm(doc=3835)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  11. Fong, A.C.M.: Mining a Web citation database for document clustering (2002) 0.00
    0.0036767495 = product of:
      0.007353499 = sum of:
        0.007353499 = product of:
          0.014706998 = sum of:
            0.014706998 = weight(_text_:e in 3940) [ClassicSimilarity], result of:
              0.014706998 = score(doc=3940,freq=2.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.2223318 = fieldWeight in 3940, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.109375 = fieldNorm(doc=3940)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  12. Blake, C.: Text mining (2011) 0.00
    0.0036767495 = product of:
      0.007353499 = sum of:
        0.007353499 = product of:
          0.014706998 = sum of:
            0.014706998 = weight(_text_:e in 1599) [ClassicSimilarity], result of:
              0.014706998 = score(doc=1599,freq=2.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.2223318 = fieldWeight in 1599, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.109375 = fieldNorm(doc=1599)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  13. Chen, S.Y.; Liu, X.: ¬The contribution of data mining to information science : making sense of it all (2005) 0.00
    0.0031514994 = product of:
      0.006302999 = sum of:
        0.006302999 = product of:
          0.012605998 = sum of:
            0.012605998 = weight(_text_:e in 4655) [ClassicSimilarity], result of:
              0.012605998 = score(doc=4655,freq=2.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.19057012 = fieldWeight in 4655, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.09375 = fieldNorm(doc=4655)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  14. Teich, E.; Degaetano-Ortlieb, S.; Fankhauser, P.; Kermes, H.; Lapshinova-Koltunski, E.: ¬The linguistic construal of disciplinarity : a data-mining approach using register features (2016) 0.00
    0.002729279 = product of:
      0.005458558 = sum of:
        0.005458558 = product of:
          0.010917116 = sum of:
            0.010917116 = weight(_text_:e in 3015) [ClassicSimilarity], result of:
              0.010917116 = score(doc=3015,freq=6.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.16503859 = fieldWeight in 3015, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3015)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  15. Fayyad, U.M.: Data mining and knowledge dicovery : making sense out of data (1996) 0.00
    0.0026262498 = product of:
      0.0052524996 = sum of:
        0.0052524996 = product of:
          0.010504999 = sum of:
            0.010504999 = weight(_text_:e in 7007) [ClassicSimilarity], result of:
              0.010504999 = score(doc=7007,freq=2.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.15880844 = fieldWeight in 7007, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.078125 = fieldNorm(doc=7007)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  16. Fayyad, U.; Piatetsky-Shapiro, G.; Smyth, P.: From data mining to knowledge discovery in databases (1996) 0.00
    0.0026262498 = product of:
      0.0052524996 = sum of:
        0.0052524996 = product of:
          0.010504999 = sum of:
            0.010504999 = weight(_text_:e in 7458) [ClassicSimilarity], result of:
              0.010504999 = score(doc=7458,freq=2.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.15880844 = fieldWeight in 7458, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.078125 = fieldNorm(doc=7458)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  17. Liu, W.; Weichselbraun, A.; Scharl, A.; Chang, E.: Semi-automatic ontology extension using spreading activation (2005) 0.00
    0.0025998545 = product of:
      0.005199709 = sum of:
        0.005199709 = product of:
          0.010399418 = sum of:
            0.010399418 = weight(_text_:e in 3028) [ClassicSimilarity], result of:
              0.010399418 = score(doc=3028,freq=4.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.15721233 = fieldWeight in 3028, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=3028)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  18. Wu, K.J.; Chen, M.-C.; Sun, Y.: Automatic topics discovery from hyperlinked documents (2004) 0.00
    0.0022284468 = product of:
      0.0044568935 = sum of:
        0.0044568935 = product of:
          0.008913787 = sum of:
            0.008913787 = weight(_text_:e in 2563) [ClassicSimilarity], result of:
              0.008913787 = score(doc=2563,freq=4.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.13475344 = fieldWeight in 2563, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2563)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Topic discovery is an important means for marketing, e-Business and social science studies. As well, it can be applied to various purposes, such as identifying a group with certain properties and observing the emergence and diminishment of a certain cyber community. Previous topic discovery work (J.M. Kleinberg, Proceedings of the 9th Annual ACM-SIAM Symposium on Discrete Algorithms, San Francisco, California, p. 668) requires manual judgment of usefulness of outcomes and is thus incapable of handling the explosive growth of the Internet. In this paper, we propose the Automatic Topic Discovery (ATD) method, which combines a method of base set construction, a clustering algorithm and an iterative principal eigenvector computation method to discover the topics relevant to a given query without using manual examination. Given a query, ATD returns with topics associated with the query and top representative pages for each topic. Our experiments show that the ATD method performs better than the traditional eigenvector method in terms of computation time and topic discovery quality.
    Language
    e
  19. Dang, X.H.; Ong. K.-L.: Knowledge discovery in data streams (2009) 0.00
    0.0022284468 = product of:
      0.0044568935 = sum of:
        0.0044568935 = product of:
          0.008913787 = sum of:
            0.008913787 = weight(_text_:e in 3829) [ClassicSimilarity], result of:
              0.008913787 = score(doc=3829,freq=4.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.13475344 = fieldWeight in 3829, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3829)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Footnote
    Vgl.: http://www.tandfonline.com/doi/book/10.1081/E-ELIS3.
    Language
    e
  20. Sarnikar, S.; Zhang, Z.; Zhao, J.L.: Query-performance prediction for effective query routing in domain-specific repositories (2014) 0.00
    0.0022284468 = product of:
      0.0044568935 = sum of:
        0.0044568935 = product of:
          0.008913787 = sum of:
            0.008913787 = weight(_text_:e in 1326) [ClassicSimilarity], result of:
              0.008913787 = score(doc=1326,freq=4.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.13475344 = fieldWeight in 1326, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1326)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    The effective use of corporate memory is becoming increasingly important because every aspect of e-business requires access to information repositories. Unfortunately, less-than-satisfying effectiveness in state-of-the-art information-retrieval techniques is well known, even for some of the best search engines such as Google. In this study, the authors resolve this retrieval ineffectiveness problem by developing a new framework for predicting query performance, which is the first step toward better retrieval effectiveness. Specifically, they examine the relationship between query performance and query context. A query context consists of the query itself, the document collection, and the interaction between the two. The authors first analyze the characteristics of query context and develop various features for predicting query performance. Then, they propose a context-sensitive model for predicting query performance based on the characteristics of the query and the document collection. Finally, they validate this model with respect to five real-world collections of documents and demonstrate its utility in routing queries to the correct repository with high accuracy.
    Language
    e

Years

Classifications