Search (12 results, page 1 of 1)

  • × theme_ss:"Data Mining"
  • × language_ss:"e"
  1. Chowdhury, G.G.: Template mining for information extraction from digital documents (1999) 0.02
    0.024543159 = product of:
      0.049086317 = sum of:
        0.049086317 = product of:
          0.098172635 = sum of:
            0.098172635 = weight(_text_:22 in 4577) [ClassicSimilarity], result of:
              0.098172635 = score(doc=4577,freq=2.0), product of:
                0.18124348 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.051756795 = queryNorm
                0.5416616 = fieldWeight in 4577, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=4577)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    2. 4.2000 18:01:22
  2. Short, M.: Text mining and subject analysis for fiction; or, using machine learning and information extraction to assign subject headings to dime novels (2019) 0.02
    0.021981878 = product of:
      0.043963756 = sum of:
        0.043963756 = product of:
          0.08792751 = sum of:
            0.08792751 = weight(_text_:cataloging in 5481) [ClassicSimilarity], result of:
              0.08792751 = score(doc=5481,freq=4.0), product of:
                0.20397975 = queryWeight, product of:
                  3.9411201 = idf(docFreq=2334, maxDocs=44218)
                  0.051756795 = queryNorm
                0.43106002 = fieldWeight in 5481, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.9411201 = idf(docFreq=2334, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=5481)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    This article describes multiple experiments in text mining at Northern Illinois University that were undertaken to improve the efficiency and accuracy of cataloging. It focuses narrowly on subject analysis of dime novels, a format of inexpensive fiction that was popular in the United States between 1860 and 1915. NIU holds more than 55,000 dime novels in its collections, which it is in the process of comprehensively digitizing. Classification, keyword extraction, named-entity recognition, clustering, and topic modeling are discussed as means of assigning subject headings to improve their discoverability by researchers and to increase the productivity of digitization workflows.
    Source
    Cataloging and classification quarterly. 57(2019) no.5, S.315-336
  3. KDD : techniques and applications (1998) 0.02
    0.021036994 = product of:
      0.042073987 = sum of:
        0.042073987 = product of:
          0.084147975 = sum of:
            0.084147975 = weight(_text_:22 in 6783) [ClassicSimilarity], result of:
              0.084147975 = score(doc=6783,freq=2.0), product of:
                0.18124348 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.051756795 = queryNorm
                0.46428138 = fieldWeight in 6783, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=6783)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Footnote
    A special issue of selected papers from the Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD'97), held Singapore, 22-23 Feb 1997
  4. Matson, L.D.; Bonski, D.J.: Do digital libraries need librarians? (1997) 0.01
    0.014024663 = product of:
      0.028049326 = sum of:
        0.028049326 = product of:
          0.05609865 = sum of:
            0.05609865 = weight(_text_:22 in 1737) [ClassicSimilarity], result of:
              0.05609865 = score(doc=1737,freq=2.0), product of:
                0.18124348 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.051756795 = queryNorm
                0.30952093 = fieldWeight in 1737, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1737)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22.11.1998 18:57:22
  5. Amir, A.; Feldman, R.; Kashi, R.: ¬A new and versatile method for association generation (1997) 0.01
    0.014024663 = product of:
      0.028049326 = sum of:
        0.028049326 = product of:
          0.05609865 = sum of:
            0.05609865 = weight(_text_:22 in 1270) [ClassicSimilarity], result of:
              0.05609865 = score(doc=1270,freq=2.0), product of:
                0.18124348 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.051756795 = queryNorm
                0.30952093 = fieldWeight in 1270, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1270)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Information systems. 22(1997) nos.5/6, S.333-347
  6. Lowe, D.B.; Dollinger, I.; Koster, T.; Herbert, B.E.: Text mining for type of research classification (2021) 0.01
    0.0133230295 = product of:
      0.026646059 = sum of:
        0.026646059 = product of:
          0.053292118 = sum of:
            0.053292118 = weight(_text_:cataloging in 720) [ClassicSimilarity], result of:
              0.053292118 = score(doc=720,freq=2.0), product of:
                0.20397975 = queryWeight, product of:
                  3.9411201 = idf(docFreq=2334, maxDocs=44218)
                  0.051756795 = queryNorm
                0.26126182 = fieldWeight in 720, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.9411201 = idf(docFreq=2334, maxDocs=44218)
                  0.046875 = fieldNorm(doc=720)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Cataloging and classification quarterly. 59(2021) no.8, p.815-834
  7. Hofstede, A.H.M. ter; Proper, H.A.; Van der Weide, T.P.: Exploiting fact verbalisation in conceptual information modelling (1997) 0.01
    0.012271579 = product of:
      0.024543159 = sum of:
        0.024543159 = product of:
          0.049086317 = sum of:
            0.049086317 = weight(_text_:22 in 2908) [ClassicSimilarity], result of:
              0.049086317 = score(doc=2908,freq=2.0), product of:
                0.18124348 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.051756795 = queryNorm
                0.2708308 = fieldWeight in 2908, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2908)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Information systems. 22(1997) nos.5/6, S.349-385
  8. Haravu, L.J.; Neelameghan, A.: Text mining and data mining in knowledge organization and discovery : the making of knowledge-based products (2003) 0.01
    0.011102525 = product of:
      0.02220505 = sum of:
        0.02220505 = product of:
          0.0444101 = sum of:
            0.0444101 = weight(_text_:cataloging in 5653) [ClassicSimilarity], result of:
              0.0444101 = score(doc=5653,freq=2.0), product of:
                0.20397975 = queryWeight, product of:
                  3.9411201 = idf(docFreq=2334, maxDocs=44218)
                  0.051756795 = queryNorm
                0.21771818 = fieldWeight in 5653, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.9411201 = idf(docFreq=2334, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5653)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Cataloging and classification quarterly. 37(2003) nos.1/2, S.96-114
  9. Hallonsten, O.; Holmberg, D.: Analyzing structural stratification in the Swedish higher education system : data contextualization with policy-history analysis (2013) 0.01
    0.008765414 = product of:
      0.017530829 = sum of:
        0.017530829 = product of:
          0.035061657 = sum of:
            0.035061657 = weight(_text_:22 in 668) [ClassicSimilarity], result of:
              0.035061657 = score(doc=668,freq=2.0), product of:
                0.18124348 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.051756795 = queryNorm
                0.19345059 = fieldWeight in 668, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=668)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 3.2013 19:43:01
  10. Vaughan, L.; Chen, Y.: Data mining from web search queries : a comparison of Google trends and Baidu index (2015) 0.01
    0.008765414 = product of:
      0.017530829 = sum of:
        0.017530829 = product of:
          0.035061657 = sum of:
            0.035061657 = weight(_text_:22 in 1605) [ClassicSimilarity], result of:
              0.035061657 = score(doc=1605,freq=2.0), product of:
                0.18124348 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.051756795 = queryNorm
                0.19345059 = fieldWeight in 1605, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1605)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Journal of the Association for Information Science and Technology. 66(2015) no.1, S.13-22
  11. Fonseca, F.; Marcinkowski, M.; Davis, C.: Cyber-human systems of thought and understanding (2019) 0.01
    0.008765414 = product of:
      0.017530829 = sum of:
        0.017530829 = product of:
          0.035061657 = sum of:
            0.035061657 = weight(_text_:22 in 5011) [ClassicSimilarity], result of:
              0.035061657 = score(doc=5011,freq=2.0), product of:
                0.18124348 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.051756795 = queryNorm
                0.19345059 = fieldWeight in 5011, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5011)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    7. 3.2019 16:32:22
  12. Information visualization in data mining and knowledge discovery (2002) 0.00
    0.0035061657 = product of:
      0.0070123314 = sum of:
        0.0070123314 = product of:
          0.014024663 = sum of:
            0.014024663 = weight(_text_:22 in 1789) [ClassicSimilarity], result of:
              0.014024663 = score(doc=1789,freq=2.0), product of:
                0.18124348 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.051756795 = queryNorm
                0.07738023 = fieldWeight in 1789, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.015625 = fieldNorm(doc=1789)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    23. 3.2008 19:10:22