Document (#20912)

Author
Deogun, J.S.
Title
Feature selection and effective classifiers
Source
Journal of the American Society for Information Science. 49(1998) no.5, S.423-434
Year
1998
Abstract
Develops and analyzes 4 algorithms for feature selection in the context of rough set methodology. Develops the notion of accuracy of classification that can be used for upper or lower classification methods and defines the feature selection problem. Presents a discussion of upper classifiers and develops 4 features selection heuristics and discusses the family of stepwise backward selection algorithms. Analyzes the worst case time complexity in all algorithms presented. Discusses details of the experiments and results of using a family of stepwise backward selection learning data sets and a duodenal ulcer data set. Includes the experimental setup and results of comparison of lower classifiers and upper classiers on the duodenal ulcer data set. Discusses exteded decision tables
Footnote
Contribution to a special issue devoted to knowledge discovery and data mining
Theme
Data Mining

Similar documents (content)

  1. Aphinyanaphongs, Y.; Fu, L.D.; Li, Z.; Peskin, E.R.; Efstathiadis, E.; Aliferis, C.F.; Statnikov, A.: ¬A comprehensive empirical comparison of modern supervised classification and feature selection methods for text categorization (2014) 0.15
    0.14980917 = sum of:
      0.14980917 = product of:
        0.74904585 = sum of:
          0.040195994 = weight(abstract_txt:classification in 1496) [ClassicSimilarity], result of:
            0.040195994 = score(doc=1496,freq=4.0), product of:
              0.064441256 = queryWeight, product of:
                1.3720472 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.011765117 = queryNorm
              0.6237618 = fieldWeight in 1496, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.078125 = fieldNorm(doc=1496)
          0.01759794 = weight(abstract_txt:data in 1496) [ClassicSimilarity], result of:
            0.01759794 = score(doc=1496,freq=1.0), product of:
              0.06751503 = queryWeight, product of:
                1.7200178 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.011765117 = queryNorm
              0.26065218 = fieldWeight in 1496, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.078125 = fieldNorm(doc=1496)
          0.19472289 = weight(abstract_txt:feature in 1496) [ClassicSimilarity], result of:
            0.19472289 = score(doc=1496,freq=4.0), product of:
              0.21119514 = queryWeight, product of:
                3.042108 = boost
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.011765117 = queryNorm
              0.9220046 = fieldWeight in 1496, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.078125 = fieldNorm(doc=1496)
          0.20171328 = weight(abstract_txt:classifiers in 1496) [ClassicSimilarity], result of:
            0.20171328 = score(doc=1496,freq=1.0), product of:
              0.34322765 = queryWeight, product of:
                3.878143 = boost
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.011765117 = queryNorm
              0.5876953 = fieldWeight in 1496, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.078125 = fieldNorm(doc=1496)
          0.29481575 = weight(abstract_txt:selection in 1496) [ClassicSimilarity], result of:
            0.29481575 = score(doc=1496,freq=4.0), product of:
              0.3508459 = queryWeight, product of:
                5.5450554 = boost
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.011765117 = queryNorm
              0.84029984 = fieldWeight in 1496, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.078125 = fieldNorm(doc=1496)
        0.2 = coord(5/25)
    
  2. Dietterich, T.G.: Machine-learning research : four current directions (1997) 0.15
    0.14908062 = sum of:
      0.14908062 = product of:
        0.74540305 = sum of:
          0.0672184 = weight(abstract_txt:accuracy in 3321) [ClassicSimilarity], result of:
            0.0672184 = score(doc=3321,freq=1.0), product of:
              0.07205945 = queryWeight, product of:
                1.0259296 = boost
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.011765117 = queryNorm
              0.93281865 = fieldWeight in 3321, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.15625 = fieldNorm(doc=3321)
          0.040195994 = weight(abstract_txt:classification in 3321) [ClassicSimilarity], result of:
            0.040195994 = score(doc=3321,freq=1.0), product of:
              0.064441256 = queryWeight, product of:
                1.3720472 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.011765117 = queryNorm
              0.6237618 = fieldWeight in 3321, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.15625 = fieldNorm(doc=3321)
          0.058318716 = weight(abstract_txt:discusses in 3321) [ClassicSimilarity], result of:
            0.058318716 = score(doc=3321,freq=1.0), product of:
              0.094539054 = queryWeight, product of:
                2.0353463 = boost
                3.947996 = idf(docFreq=2318, maxDocs=44218)
                0.011765117 = queryNorm
              0.61687434 = fieldWeight in 3321, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.947996 = idf(docFreq=2318, maxDocs=44218)
                0.15625 = fieldNorm(doc=3321)
          0.17624338 = weight(abstract_txt:algorithms in 3321) [ClassicSimilarity], result of:
            0.17624338 = score(doc=3321,freq=1.0), product of:
              0.19761252 = queryWeight, product of:
                2.942659 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.011765117 = queryNorm
              0.8918634 = fieldWeight in 3321, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.15625 = fieldNorm(doc=3321)
          0.40342656 = weight(abstract_txt:classifiers in 3321) [ClassicSimilarity], result of:
            0.40342656 = score(doc=3321,freq=1.0), product of:
              0.34322765 = queryWeight, product of:
                3.878143 = boost
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.011765117 = queryNorm
              1.1753906 = fieldWeight in 3321, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.15625 = fieldNorm(doc=3321)
        0.2 = coord(5/25)
    
  3. Goller, C.; Löning, J.; Will, T.; Wolff, W.: Automatic document classification : a thourough evaluation of various methods (2000) 0.14
    0.13794073 = sum of:
      0.13794073 = product of:
        0.68970364 = sum of:
          0.013341385 = weight(abstract_txt:results in 5480) [ClassicSimilarity], result of:
            0.013341385 = score(doc=5480,freq=1.0), product of:
              0.049037628 = queryWeight, product of:
                1.1968832 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.011765117 = queryNorm
              0.27206424 = fieldWeight in 5480, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.078125 = fieldNorm(doc=5480)
          0.044940487 = weight(abstract_txt:classification in 5480) [ClassicSimilarity], result of:
            0.044940487 = score(doc=5480,freq=5.0), product of:
              0.064441256 = queryWeight, product of:
                1.3720472 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.011765117 = queryNorm
              0.69738686 = fieldWeight in 5480, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.078125 = fieldNorm(doc=5480)
          0.13768987 = weight(abstract_txt:feature in 5480) [ClassicSimilarity], result of:
            0.13768987 = score(doc=5480,freq=2.0), product of:
              0.21119514 = queryWeight, product of:
                3.042108 = boost
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.011765117 = queryNorm
              0.65195566 = fieldWeight in 5480, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.078125 = fieldNorm(doc=5480)
          0.28526565 = weight(abstract_txt:classifiers in 5480) [ClassicSimilarity], result of:
            0.28526565 = score(doc=5480,freq=2.0), product of:
              0.34322765 = queryWeight, product of:
                3.878143 = boost
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.011765117 = queryNorm
              0.83112663 = fieldWeight in 5480, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.078125 = fieldNorm(doc=5480)
          0.20846622 = weight(abstract_txt:selection in 5480) [ClassicSimilarity], result of:
            0.20846622 = score(doc=5480,freq=2.0), product of:
              0.3508459 = queryWeight, product of:
                5.5450554 = boost
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.011765117 = queryNorm
              0.5941817 = fieldWeight in 5480, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.078125 = fieldNorm(doc=5480)
        0.2 = coord(5/25)
    
  4. Yoon, Y.; Lee, G.G.: Efficient implementation of associative classifiers for document classification (2007) 0.13
    0.13330019 = sum of:
      0.13330019 = product of:
        0.5554175 = sum of:
          0.026887361 = weight(abstract_txt:accuracy in 909) [ClassicSimilarity], result of:
            0.026887361 = score(doc=909,freq=1.0), product of:
              0.07205945 = queryWeight, product of:
                1.0259296 = boost
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.011765117 = queryNorm
              0.37312746 = fieldWeight in 909, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.0625 = fieldNorm(doc=909)
          0.010673108 = weight(abstract_txt:results in 909) [ClassicSimilarity], result of:
            0.010673108 = score(doc=909,freq=1.0), product of:
              0.049037628 = queryWeight, product of:
                1.1968832 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.011765117 = queryNorm
              0.21765138 = fieldWeight in 909, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.0625 = fieldNorm(doc=909)
          0.042539436 = weight(abstract_txt:classification in 909) [ClassicSimilarity], result of:
            0.042539436 = score(doc=909,freq=7.0), product of:
              0.064441256 = queryWeight, product of:
                1.3720472 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.011765117 = queryNorm
              0.66012734 = fieldWeight in 909, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=909)
          0.07788915 = weight(abstract_txt:feature in 909) [ClassicSimilarity], result of:
            0.07788915 = score(doc=909,freq=1.0), product of:
              0.21119514 = queryWeight, product of:
                3.042108 = boost
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.011765117 = queryNorm
              0.36880183 = fieldWeight in 909, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.0625 = fieldNorm(doc=909)
          0.27950212 = weight(abstract_txt:classifiers in 909) [ClassicSimilarity], result of:
            0.27950212 = score(doc=909,freq=3.0), product of:
              0.34322765 = queryWeight, product of:
                3.878143 = boost
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.011765117 = queryNorm
              0.8143345 = fieldWeight in 909, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.0625 = fieldNorm(doc=909)
          0.11792631 = weight(abstract_txt:selection in 909) [ClassicSimilarity], result of:
            0.11792631 = score(doc=909,freq=1.0), product of:
              0.3508459 = queryWeight, product of:
                5.5450554 = boost
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.011765117 = queryNorm
              0.33611995 = fieldWeight in 909, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.0625 = fieldNorm(doc=909)
        0.24 = coord(6/25)
    
  5. Mengle, S.S.R.; Goharian, N.: Ambiguity measure feature-selection algorithm (2009) 0.13
    0.13188726 = sum of:
      0.13188726 = product of:
        0.54953027 = sum of:
          0.026124856 = weight(abstract_txt:complexity in 2804) [ClassicSimilarity], result of:
            0.026124856 = score(doc=2804,freq=1.0), product of:
              0.070690565 = queryWeight, product of:
                1.0161382 = boost
                5.913062 = idf(docFreq=324, maxDocs=44218)
                0.011765117 = queryNorm
              0.36956638 = fieldWeight in 2804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.913062 = idf(docFreq=324, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
          0.026887361 = weight(abstract_txt:accuracy in 2804) [ClassicSimilarity], result of:
            0.026887361 = score(doc=2804,freq=1.0), product of:
              0.07205945 = queryWeight, product of:
                1.0259296 = boost
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.011765117 = queryNorm
              0.37312746 = fieldWeight in 2804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
          0.010673108 = weight(abstract_txt:results in 2804) [ClassicSimilarity], result of:
            0.010673108 = score(doc=2804,freq=1.0), product of:
              0.049037628 = queryWeight, product of:
                1.1968832 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.011765117 = queryNorm
              0.21765138 = fieldWeight in 2804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
          0.016078396 = weight(abstract_txt:classification in 2804) [ClassicSimilarity], result of:
            0.016078396 = score(doc=2804,freq=1.0), product of:
              0.064441256 = queryWeight, product of:
                1.3720472 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.011765117 = queryNorm
              0.2495047 = fieldWeight in 2804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
          0.20607533 = weight(abstract_txt:feature in 2804) [ClassicSimilarity], result of:
            0.20607533 = score(doc=2804,freq=7.0), product of:
              0.21119514 = queryWeight, product of:
                3.042108 = boost
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.011765117 = queryNorm
              0.9757579 = fieldWeight in 2804, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
          0.26369125 = weight(abstract_txt:selection in 2804) [ClassicSimilarity], result of:
            0.26369125 = score(doc=2804,freq=5.0), product of:
              0.3508459 = queryWeight, product of:
                5.5450554 = boost
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.011765117 = queryNorm
              0.7515871 = fieldWeight in 2804, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
        0.24 = coord(6/25)