Document (#35830)

Author
Dang, X.H.
Ong. K.-L.
Title
Knowledge discovery in data streams
Source
Encyclopedia of library and information sciences. 3rd ed. Ed.: M.J. Bates
Imprint
London : Taylor & Francis
Year
2009
Pages
S.xx-xx
Abstract
Knowing what to do with the massive amount of data collected has always been an ongoing issue for many organizations. While data mining has been touted to be the solution, it has failed to deliver the impact despite its successes in many areas. One reason is that data mining algorithms were not designed for the real world, i.e., they usually assume a static view of the data and a stable execution environment where resourcesare abundant. The reality however is that data are constantly changing and the execution environment is dynamic. Hence, it becomes difficult for data mining to truly deliver timely and relevant results. Recently, the processing of stream data has received many attention. What is interesting is that the methodology to design stream-based algorithms may well be the solution to the above problem. In this entry, we discuss this issue and present an overview of recent works.
Footnote
Vgl.: http://www.tandfonline.com/doi/book/10.1081/E-ELIS3.
Theme
Data Mining

Similar documents (author)

  1. Over, P.; Dang, H.; Harman, D.: DUC in context (2007) 3.56
    3.5623734 = sum of:
      3.5623734 = weight(author_txt:dang in 934) [ClassicSimilarity], result of:
        3.5623734 = fieldWeight in 934, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.375 = fieldNorm(doc=934)
    
  2. Dang, E.K.F.; Luk, R.W.P.; Allan, J.: Beyond bag-of-words : bigram-enhanced context-dependent term weights (2014) 3.56
    3.5623734 = sum of:
      3.5623734 = weight(author_txt:dang in 1283) [ClassicSimilarity], result of:
        3.5623734 = fieldWeight in 1283, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.375 = fieldNorm(doc=1283)
    
  3. Dang, E.K.F.; Luk, R.W.P.; Allan, J.: ¬A context-dependent relevance model (2016) 3.56
    3.5623734 = sum of:
      3.5623734 = weight(author_txt:dang in 2778) [ClassicSimilarity], result of:
        3.5623734 = fieldWeight in 2778, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.375 = fieldNorm(doc=2778)
    
  4. Dang, E.K.F.; Luk, R.W.P.; Allan, J.: ¬A retrieval model family based on the probability ranking principle for ad hoc retrieval (2022) 3.56
    3.5623734 = sum of:
      3.5623734 = weight(author_txt:dang in 638) [ClassicSimilarity], result of:
        3.5623734 = fieldWeight in 638, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.375 = fieldNorm(doc=638)
    
  5. Dang, E.K.F.; Luk, R.W.P.; Ho, K.S.; Chan, S.C.F.; Lee, D.L.: ¬A new measure of clustering effectiveness : algorithms and experimental studies (2008) 2.97
    2.9686446 = sum of:
      2.9686446 = weight(author_txt:dang in 1367) [ClassicSimilarity], result of:
        2.9686446 = fieldWeight in 1367, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.3125 = fieldNorm(doc=1367)
    

Similar documents (content)

  1. Calvanese, D.; Kalayci, T.E.; Montali, M.; Santoso, A.: OBDA for log extraction in process mining (2017) 0.22
    0.22357921 = sum of:
      0.22357921 = product of:
        0.7984972 = sum of:
          0.016035194 = weight(abstract_txt:been in 3931) [ClassicSimilarity], result of:
            0.016035194 = score(doc=3931,freq=1.0), product of:
              0.07092121 = queryWeight, product of:
                1.0158634 = boost
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.019298468 = queryNorm
              0.22609869 = fieldWeight in 3931, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.0625 = fieldNorm(doc=3931)
          0.046539173 = weight(abstract_txt:issue in 3931) [ClassicSimilarity], result of:
            0.046539173 = score(doc=3931,freq=1.0), product of:
              0.14430204 = queryWeight, product of:
                1.4490503 = boost
                5.160196 = idf(docFreq=689, maxDocs=44218)
                0.019298468 = queryNorm
              0.32251224 = fieldWeight in 3931, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.160196 = idf(docFreq=689, maxDocs=44218)
                0.0625 = fieldNorm(doc=3931)
          0.071717925 = weight(abstract_txt:solution in 3931) [ClassicSimilarity], result of:
            0.071717925 = score(doc=3931,freq=1.0), product of:
              0.19252118 = queryWeight, product of:
                1.6737342 = boost
                5.9603148 = idf(docFreq=309, maxDocs=44218)
                0.019298468 = queryNorm
              0.37251967 = fieldWeight in 3931, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9603148 = idf(docFreq=309, maxDocs=44218)
                0.0625 = fieldNorm(doc=3931)
          0.0345336 = weight(abstract_txt:many in 3931) [ClassicSimilarity], result of:
            0.0345336 = score(doc=3931,freq=1.0), product of:
              0.1353895 = queryWeight, product of:
                1.7190375 = boost
                4.081096 = idf(docFreq=2029, maxDocs=44218)
                0.019298468 = queryNorm
              0.2550685 = fieldWeight in 3931, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.081096 = idf(docFreq=2029, maxDocs=44218)
                0.0625 = fieldNorm(doc=3931)
          0.26712644 = weight(abstract_txt:execution in 3931) [ClassicSimilarity], result of:
            0.26712644 = score(doc=3931,freq=2.0), product of:
              0.367165 = queryWeight, product of:
                2.3114147 = boost
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.019298468 = queryNorm
              0.7275379 = fieldWeight in 3931, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.0625 = fieldNorm(doc=3931)
          0.23929977 = weight(abstract_txt:mining in 3931) [ClassicSimilarity], result of:
            0.23929977 = score(doc=3931,freq=4.0), product of:
              0.3100026 = queryWeight, product of:
                2.6012106 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.019298468 = queryNorm
              0.7719283 = fieldWeight in 3931, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.0625 = fieldNorm(doc=3931)
          0.12324511 = weight(abstract_txt:data in 3931) [ClassicSimilarity], result of:
            0.12324511 = score(doc=3931,freq=6.0), product of:
              0.2412919 = queryWeight, product of:
                3.74756 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.019298468 = queryNorm
              0.5107719 = fieldWeight in 3931, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=3931)
        0.28 = coord(7/25)
    
  2. Liu, B.: Web data mining : exploring hyperlinks, contents, and usage data (2011) 0.11
    0.11288559 = sum of:
      0.11288559 = product of:
        0.70553493 = sum of:
          0.10909774 = weight(abstract_txt:algorithms in 354) [ClassicSimilarity], result of:
            0.10909774 = score(doc=354,freq=3.0), product of:
              0.17656182 = queryWeight, product of:
                1.6028601 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.019298468 = queryNorm
              0.6179011 = fieldWeight in 354, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=354)
          0.048837885 = weight(abstract_txt:many in 354) [ClassicSimilarity], result of:
            0.048837885 = score(doc=354,freq=2.0), product of:
              0.1353895 = queryWeight, product of:
                1.7190375 = boost
                4.081096 = idf(docFreq=2029, maxDocs=44218)
                0.019298468 = queryNorm
              0.36072135 = fieldWeight in 354, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.081096 = idf(docFreq=2029, maxDocs=44218)
                0.0625 = fieldNorm(doc=354)
          0.41447937 = weight(abstract_txt:mining in 354) [ClassicSimilarity], result of:
            0.41447937 = score(doc=354,freq=12.0), product of:
              0.3100026 = queryWeight, product of:
                2.6012106 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.019298468 = queryNorm
              1.3370191 = fieldWeight in 354, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.0625 = fieldNorm(doc=354)
          0.13311993 = weight(abstract_txt:data in 354) [ClassicSimilarity], result of:
            0.13311993 = score(doc=354,freq=7.0), product of:
              0.2412919 = queryWeight, product of:
                3.74756 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.019298468 = queryNorm
              0.55169666 = fieldWeight in 354, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=354)
        0.16 = coord(4/25)
    
  3. Haslhofer, B.: ¬A Web-based mapping technique for establishing metadata interoperability (2008) 0.10
    0.099901184 = sum of:
      0.099901184 = product of:
        0.41625494 = sum of:
          0.036522914 = weight(abstract_txt:environment in 3173) [ClassicSimilarity], result of:
            0.036522914 = score(doc=3173,freq=3.0), product of:
              0.116450995 = queryWeight, product of:
                1.3017237 = boost
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.019298468 = queryNorm
              0.31363332 = fieldWeight in 3173, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3173)
          0.039367255 = weight(abstract_txt:algorithms in 3173) [ClassicSimilarity], result of:
            0.039367255 = score(doc=3173,freq=1.0), product of:
              0.17656182 = queryWeight, product of:
                1.6028601 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.019298468 = queryNorm
              0.22296585 = fieldWeight in 3173, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3173)
          0.08964741 = weight(abstract_txt:solution in 3173) [ClassicSimilarity], result of:
            0.08964741 = score(doc=3173,freq=4.0), product of:
              0.19252118 = queryWeight, product of:
                1.6737342 = boost
                5.9603148 = idf(docFreq=309, maxDocs=44218)
                0.019298468 = queryNorm
              0.4656496 = fieldWeight in 3173, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.9603148 = idf(docFreq=309, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3173)
          0.078195855 = weight(abstract_txt:deliver in 3173) [ClassicSimilarity], result of:
            0.078195855 = score(doc=3173,freq=1.0), product of:
              0.27899462 = queryWeight, product of:
                2.0148613 = boost
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.019298468 = queryNorm
              0.28027728 = fieldWeight in 3173, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3173)
          0.11805433 = weight(abstract_txt:execution in 3173) [ClassicSimilarity], result of:
            0.11805433 = score(doc=3173,freq=1.0), product of:
              0.367165 = queryWeight, product of:
                2.3114147 = boost
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.019298468 = queryNorm
              0.32152936 = fieldWeight in 3173, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3173)
          0.054467157 = weight(abstract_txt:data in 3173) [ClassicSimilarity], result of:
            0.054467157 = score(doc=3173,freq=3.0), product of:
              0.2412919 = queryWeight, product of:
                3.74756 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.019298468 = queryNorm
              0.2257314 = fieldWeight in 3173, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3173)
        0.24 = coord(6/25)
    
  4. Maron, M.E.: Theory and foundation of information retrieval : some introductory remarks (1978) 0.09
    0.09485601 = sum of:
      0.09485601 = product of:
        0.3952334 = sum of:
          0.024052791 = weight(abstract_txt:been in 7407) [ClassicSimilarity], result of:
            0.024052791 = score(doc=7407,freq=1.0), product of:
              0.07092121 = queryWeight, product of:
                1.0158634 = boost
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.019298468 = queryNorm
              0.33914804 = fieldWeight in 7407, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.09375 = fieldNorm(doc=7407)
          0.10437234 = weight(abstract_txt:truly in 7407) [ClassicSimilarity], result of:
            0.10437234 = score(doc=7407,freq=1.0), product of:
              0.14975436 = queryWeight, product of:
                1.0438112 = boost
                7.4342074 = idf(docFreq=70, maxDocs=44218)
                0.019298468 = queryNorm
              0.69695693 = fieldWeight in 7407, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4342074 = idf(docFreq=70, maxDocs=44218)
                0.09375 = fieldNorm(doc=7407)
          0.04081147 = weight(abstract_txt:what in 7407) [ClassicSimilarity], result of:
            0.04081147 = score(doc=7407,freq=1.0), product of:
              0.100891374 = queryWeight, product of:
                1.2116418 = boost
                4.314763 = idf(docFreq=1606, maxDocs=44218)
                0.019298468 = queryNorm
              0.40450904 = fieldWeight in 7407, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.314763 = idf(docFreq=1606, maxDocs=44218)
                0.09375 = fieldNorm(doc=7407)
          0.0987245 = weight(abstract_txt:issue in 7407) [ClassicSimilarity], result of:
            0.0987245 = score(doc=7407,freq=2.0), product of:
              0.14430204 = queryWeight, product of:
                1.4490503 = boost
                5.160196 = idf(docFreq=689, maxDocs=44218)
                0.019298468 = queryNorm
              0.68415177 = fieldWeight in 7407, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.160196 = idf(docFreq=689, maxDocs=44218)
                0.09375 = fieldNorm(doc=7407)
          0.051800396 = weight(abstract_txt:many in 7407) [ClassicSimilarity], result of:
            0.051800396 = score(doc=7407,freq=1.0), product of:
              0.1353895 = queryWeight, product of:
                1.7190375 = boost
                4.081096 = idf(docFreq=2029, maxDocs=44218)
                0.019298468 = queryNorm
              0.38260275 = fieldWeight in 7407, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.081096 = idf(docFreq=2029, maxDocs=44218)
                0.09375 = fieldNorm(doc=7407)
          0.07547191 = weight(abstract_txt:data in 7407) [ClassicSimilarity], result of:
            0.07547191 = score(doc=7407,freq=1.0), product of:
              0.2412919 = queryWeight, product of:
                3.74756 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.019298468 = queryNorm
              0.31278262 = fieldWeight in 7407, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.09375 = fieldNorm(doc=7407)
        0.24 = coord(6/25)
    
  5. Mining text data (2012) 0.09
    0.08979481 = sum of:
      0.08979481 = product of:
        0.56121755 = sum of:
          0.016035194 = weight(abstract_txt:been in 362) [ClassicSimilarity], result of:
            0.016035194 = score(doc=362,freq=1.0), product of:
              0.07092121 = queryWeight, product of:
                1.0158634 = boost
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.019298468 = queryNorm
              0.22609869 = fieldWeight in 362, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.0625 = fieldNorm(doc=362)
          0.06298761 = weight(abstract_txt:algorithms in 362) [ClassicSimilarity], result of:
            0.06298761 = score(doc=362,freq=1.0), product of:
              0.17656182 = queryWeight, product of:
                1.6028601 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.019298468 = queryNorm
              0.35674536 = fieldWeight in 362, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=362)
          0.35894966 = weight(abstract_txt:mining in 362) [ClassicSimilarity], result of:
            0.35894966 = score(doc=362,freq=9.0), product of:
              0.3100026 = queryWeight, product of:
                2.6012106 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.019298468 = queryNorm
              1.1578925 = fieldWeight in 362, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.0625 = fieldNorm(doc=362)
          0.12324511 = weight(abstract_txt:data in 362) [ClassicSimilarity], result of:
            0.12324511 = score(doc=362,freq=6.0), product of:
              0.2412919 = queryWeight, product of:
                3.74756 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.019298468 = queryNorm
              0.5107719 = fieldWeight in 362, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=362)
        0.16 = coord(4/25)