Search (49 results, page 1 of 3)

  • × theme_ss:"Automatisches Klassifizieren"
  • × type_ss:"a"
  1. Hotho, A.; Bloehdorn, S.: Data Mining 2004 : Text classification by boosting weak learners based on terms and concepts (2004) 0.26
    0.25833747 = product of:
      0.3875062 = sum of:
        0.054560162 = product of:
          0.16368048 = sum of:
            0.16368048 = weight(_text_:3a in 562) [ClassicSimilarity], result of:
              0.16368048 = score(doc=562,freq=2.0), product of:
                0.29123706 = queryWeight, product of:
                  8.478011 = idf(docFreq=24, maxDocs=44218)
                  0.03435205 = queryNorm
                0.56201804 = fieldWeight in 562, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  8.478011 = idf(docFreq=24, maxDocs=44218)
                  0.046875 = fieldNorm(doc=562)
          0.33333334 = coord(1/3)
        0.16368048 = weight(_text_:2f in 562) [ClassicSimilarity], result of:
          0.16368048 = score(doc=562,freq=2.0), product of:
            0.29123706 = queryWeight, product of:
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.03435205 = queryNorm
            0.56201804 = fieldWeight in 562, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.046875 = fieldNorm(doc=562)
        0.16368048 = weight(_text_:2f in 562) [ClassicSimilarity], result of:
          0.16368048 = score(doc=562,freq=2.0), product of:
            0.29123706 = queryWeight, product of:
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.03435205 = queryNorm
            0.56201804 = fieldWeight in 562, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.046875 = fieldNorm(doc=562)
        0.005585074 = product of:
          0.02792537 = sum of:
            0.02792537 = weight(_text_:22 in 562) [ClassicSimilarity], result of:
              0.02792537 = score(doc=562,freq=2.0), product of:
                0.120295025 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03435205 = queryNorm
                0.23214069 = fieldWeight in 562, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=562)
          0.2 = coord(1/5)
      0.6666667 = coord(4/6)
    
    Content
    Vgl.: http://www.google.de/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&ved=0CEAQFjAA&url=http%3A%2F%2Fciteseerx.ist.psu.edu%2Fviewdoc%2Fdownload%3Fdoi%3D10.1.1.91.4940%26rep%3Drep1%26type%3Dpdf&ei=dOXrUMeIDYHDtQahsIGACg&usg=AFQjCNHFWVh6gNPvnOrOS9R3rkrXCNVD-A&sig2=5I2F5evRfMnsttSgFF9g7Q&bvm=bv.1357316858,d.Yms.
    Date
    8. 1.2013 10:22:32
  2. Jenkins, C.: Automatic classification of Web resources using Java and Dewey Decimal Classification (1998) 0.02
    0.019849308 = product of:
      0.05954792 = sum of:
        0.053032 = weight(_text_:suchmaschinen in 1673) [ClassicSimilarity], result of:
          0.053032 = score(doc=1673,freq=2.0), product of:
            0.15347718 = queryWeight, product of:
              4.4677734 = idf(docFreq=1378, maxDocs=44218)
              0.03435205 = queryNorm
            0.3455367 = fieldWeight in 1673, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.4677734 = idf(docFreq=1378, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1673)
        0.00651592 = product of:
          0.0325796 = sum of:
            0.0325796 = weight(_text_:22 in 1673) [ClassicSimilarity], result of:
              0.0325796 = score(doc=1673,freq=2.0), product of:
                0.120295025 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03435205 = queryNorm
                0.2708308 = fieldWeight in 1673, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1673)
          0.2 = coord(1/5)
      0.33333334 = coord(2/6)
    
    Date
    1. 8.1996 22:08:06
    Theme
    Suchmaschinen
  3. Ardö, A.; Koch, T.: Automatic classification applied to full-text Internet documents in a robot-generated subject index (1999) 0.02
    0.015152 = product of:
      0.090912 = sum of:
        0.090912 = weight(_text_:suchmaschinen in 382) [ClassicSimilarity], result of:
          0.090912 = score(doc=382,freq=2.0), product of:
            0.15347718 = queryWeight, product of:
              4.4677734 = idf(docFreq=1378, maxDocs=44218)
              0.03435205 = queryNorm
            0.59234864 = fieldWeight in 382, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.4677734 = idf(docFreq=1378, maxDocs=44218)
              0.09375 = fieldNorm(doc=382)
      0.16666667 = coord(1/6)
    
    Theme
    Suchmaschinen
  4. Krellenstein, M.: Document classification at Northern Light (1999) 0.02
    0.015152 = product of:
      0.090912 = sum of:
        0.090912 = weight(_text_:suchmaschinen in 4435) [ClassicSimilarity], result of:
          0.090912 = score(doc=4435,freq=2.0), product of:
            0.15347718 = queryWeight, product of:
              4.4677734 = idf(docFreq=1378, maxDocs=44218)
              0.03435205 = queryNorm
            0.59234864 = fieldWeight in 4435, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.4677734 = idf(docFreq=1378, maxDocs=44218)
              0.09375 = fieldNorm(doc=4435)
      0.16666667 = coord(1/6)
    
    Theme
    Suchmaschinen
  5. Wätjen, H.-J.: GERHARD : Automatisches Sammeln, Klassifizieren und Indexieren von wissenschaftlich relevanten Informationsressourcen im deutschen World Wide Web (1998) 0.01
    0.008838667 = product of:
      0.053032 = sum of:
        0.053032 = weight(_text_:suchmaschinen in 3064) [ClassicSimilarity], result of:
          0.053032 = score(doc=3064,freq=2.0), product of:
            0.15347718 = queryWeight, product of:
              4.4677734 = idf(docFreq=1378, maxDocs=44218)
              0.03435205 = queryNorm
            0.3455367 = fieldWeight in 3064, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.4677734 = idf(docFreq=1378, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3064)
      0.16666667 = coord(1/6)
    
    Abstract
    Die intellektuelle Erschließung des Internet befindet sich in einer Krise. Yahoo und andere Dienste können mit dem Wachstum des Web nicht mithalten. GERHARD ist derzeit weltweit der einzige Such- und Navigationsdienst, der die mit einem Roboter gesammelten Internetressourcen mit computerlinguistischen und statistischen Verfahren auch automatisch vollständig klassifiziert. Weit über eine Million HTML-Dokumente von wissenschaftlich relevanten Servern in Deutschland können wie bei anderen Suchmaschinen in der Datenbank gesucht, aber auch über die Navigation in der dreisprachigen Universalen Dezimalklassifikation (ETH-Bibliothek Zürich) recherchiert werden
  6. Ozmutlu, S.; Cosar, G.C.: Analyzing the results of automatic new topic identification (2008) 0.01
    0.007576 = product of:
      0.045456 = sum of:
        0.045456 = weight(_text_:suchmaschinen in 2604) [ClassicSimilarity], result of:
          0.045456 = score(doc=2604,freq=2.0), product of:
            0.15347718 = queryWeight, product of:
              4.4677734 = idf(docFreq=1378, maxDocs=44218)
              0.03435205 = queryNorm
            0.29617432 = fieldWeight in 2604, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.4677734 = idf(docFreq=1378, maxDocs=44218)
              0.046875 = fieldNorm(doc=2604)
      0.16666667 = coord(1/6)
    
    Theme
    Suchmaschinen
  7. Savic, D.: Designing an expert system for classifying office documents (1994) 0.01
    0.005102382 = product of:
      0.030614292 = sum of:
        0.030614292 = product of:
          0.07653573 = sum of:
            0.03896392 = weight(_text_:28 in 2655) [ClassicSimilarity], result of:
              0.03896392 = score(doc=2655,freq=2.0), product of:
                0.12305808 = queryWeight, product of:
                  3.5822632 = idf(docFreq=3342, maxDocs=44218)
                  0.03435205 = queryNorm
                0.31663033 = fieldWeight in 2655, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5822632 = idf(docFreq=3342, maxDocs=44218)
                  0.0625 = fieldNorm(doc=2655)
            0.03757181 = weight(_text_:29 in 2655) [ClassicSimilarity], result of:
              0.03757181 = score(doc=2655,freq=2.0), product of:
                0.12083977 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.03435205 = queryNorm
                0.31092256 = fieldWeight in 2655, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0625 = fieldNorm(doc=2655)
          0.4 = coord(2/5)
      0.16666667 = coord(1/6)
    
    Source
    Records management quarterly. 28(1994) no.3, S.20-29
  8. Savic, D.: Automatic classification of office documents : review of available methods and techniques (1995) 0.00
    0.004464585 = product of:
      0.026787508 = sum of:
        0.026787508 = product of:
          0.06696877 = sum of:
            0.034093432 = weight(_text_:28 in 2219) [ClassicSimilarity], result of:
              0.034093432 = score(doc=2219,freq=2.0), product of:
                0.12305808 = queryWeight, product of:
                  3.5822632 = idf(docFreq=3342, maxDocs=44218)
                  0.03435205 = queryNorm
                0.27705154 = fieldWeight in 2219, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5822632 = idf(docFreq=3342, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2219)
            0.032875333 = weight(_text_:29 in 2219) [ClassicSimilarity], result of:
              0.032875333 = score(doc=2219,freq=2.0), product of:
                0.12083977 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.03435205 = queryNorm
                0.27205724 = fieldWeight in 2219, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2219)
          0.4 = coord(2/5)
      0.16666667 = coord(1/6)
    
    Date
    23. 7.1996 10:28:09
    Source
    Records management quarterly. 29(1995) no.4, S.3-18
  9. Yi, K.: Automatic text classification using library classification schemes : trends, issues and challenges (2007) 0.00
    0.0044448692 = product of:
      0.026669214 = sum of:
        0.026669214 = product of:
          0.06667303 = sum of:
            0.034093432 = weight(_text_:28 in 2560) [ClassicSimilarity], result of:
              0.034093432 = score(doc=2560,freq=2.0), product of:
                0.12305808 = queryWeight, product of:
                  3.5822632 = idf(docFreq=3342, maxDocs=44218)
                  0.03435205 = queryNorm
                0.27705154 = fieldWeight in 2560, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5822632 = idf(docFreq=3342, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2560)
            0.0325796 = weight(_text_:22 in 2560) [ClassicSimilarity], result of:
              0.0325796 = score(doc=2560,freq=2.0), product of:
                0.120295025 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03435205 = queryNorm
                0.2708308 = fieldWeight in 2560, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2560)
          0.4 = coord(2/5)
      0.16666667 = coord(1/6)
    
    Date
    28. 9.2003 11:42:17
    22. 9.2008 18:31:54
  10. Pfeffer, M.: Automatische Vergabe von RVK-Notationen mittels fallbasiertem Schließen (2009) 0.00
    0.0038098875 = product of:
      0.022859324 = sum of:
        0.022859324 = product of:
          0.057148308 = sum of:
            0.02922294 = weight(_text_:28 in 3051) [ClassicSimilarity], result of:
              0.02922294 = score(doc=3051,freq=2.0), product of:
                0.12305808 = queryWeight, product of:
                  3.5822632 = idf(docFreq=3342, maxDocs=44218)
                  0.03435205 = queryNorm
                0.23747274 = fieldWeight in 3051, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5822632 = idf(docFreq=3342, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3051)
            0.02792537 = weight(_text_:22 in 3051) [ClassicSimilarity], result of:
              0.02792537 = score(doc=3051,freq=2.0), product of:
                0.120295025 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03435205 = queryNorm
                0.23214069 = fieldWeight in 3051, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3051)
          0.4 = coord(2/5)
      0.16666667 = coord(1/6)
    
    Date
    22. 8.2009 19:51:28
  11. Ibekwe-SanJuan, F.; SanJuan, E.: From term variants to research topics (2002) 0.00
    0.0031889891 = product of:
      0.019133935 = sum of:
        0.019133935 = product of:
          0.047834836 = sum of:
            0.024352452 = weight(_text_:28 in 1853) [ClassicSimilarity], result of:
              0.024352452 = score(doc=1853,freq=2.0), product of:
                0.12305808 = queryWeight, product of:
                  3.5822632 = idf(docFreq=3342, maxDocs=44218)
                  0.03435205 = queryNorm
                0.19789396 = fieldWeight in 1853, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5822632 = idf(docFreq=3342, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1853)
            0.023482382 = weight(_text_:29 in 1853) [ClassicSimilarity], result of:
              0.023482382 = score(doc=1853,freq=2.0), product of:
                0.12083977 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.03435205 = queryNorm
                0.19432661 = fieldWeight in 1853, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1853)
          0.4 = coord(2/5)
      0.16666667 = coord(1/6)
    
    Date
    6. 1.1997 18:30:28
    Source
    Knowledge organization. 29(2002) nos.3/4, S.181-197
  12. Giorgetti, D.; Sebastiani, F.: Automating survey coding by multiclass text categorization techniques (2003) 0.00
    0.0031889891 = product of:
      0.019133935 = sum of:
        0.019133935 = product of:
          0.047834836 = sum of:
            0.024352452 = weight(_text_:28 in 5172) [ClassicSimilarity], result of:
              0.024352452 = score(doc=5172,freq=2.0), product of:
                0.12305808 = queryWeight, product of:
                  3.5822632 = idf(docFreq=3342, maxDocs=44218)
                  0.03435205 = queryNorm
                0.19789396 = fieldWeight in 5172, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5822632 = idf(docFreq=3342, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5172)
            0.023482382 = weight(_text_:29 in 5172) [ClassicSimilarity], result of:
              0.023482382 = score(doc=5172,freq=2.0), product of:
                0.12083977 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.03435205 = queryNorm
                0.19432661 = fieldWeight in 5172, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5172)
          0.4 = coord(2/5)
      0.16666667 = coord(1/6)
    
    Abstract
    In this issue Giorgetti, and Sebastiani suggest that answers to open ended questions in survey instruments can be coded automatically by creating classifiers which learn from training sets of manually coded answers. The manual effort required is only that of classifying a representative set of documents, not creating a dictionary of words that trigger an assignment. They use a naive Bayesian probabilistic learner from Mc Callum's RAINBOW package and the multi-class support vector machine learner from Hsu and Lin's BSVM package, both examples of text categorization techniques. Data from the 1996 General Social Survey by the U.S. National Opinion Research Center provided a set of answers to three questions (previously tested by Viechnicki using a dictionary approach), their associated manually assigned category codes, and a complete set of predefined category codes. The learners were run on three random disjoint subsets of the answer sets to create the classifiers and a remaining set was used as a test set. The dictionary approach is out preformed by 18% for RAINBOW and by 17% for BSVM, while the standard deviation of the results is reduced by 28% and 34% respectively over the dictionary approach.
    Date
    9. 7.2006 10:29:12
  13. Khoo, C.S.G.; Ng, K.; Ou, S.: ¬An exploratory study of human clustering of Web pages (2003) 0.00
    0.0025399253 = product of:
      0.015239551 = sum of:
        0.015239551 = product of:
          0.038098875 = sum of:
            0.01948196 = weight(_text_:28 in 2741) [ClassicSimilarity], result of:
              0.01948196 = score(doc=2741,freq=2.0), product of:
                0.12305808 = queryWeight, product of:
                  3.5822632 = idf(docFreq=3342, maxDocs=44218)
                  0.03435205 = queryNorm
                0.15831517 = fieldWeight in 2741, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5822632 = idf(docFreq=3342, maxDocs=44218)
                  0.03125 = fieldNorm(doc=2741)
            0.018616915 = weight(_text_:22 in 2741) [ClassicSimilarity], result of:
              0.018616915 = score(doc=2741,freq=2.0), product of:
                0.120295025 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03435205 = queryNorm
                0.15476047 = fieldWeight in 2741, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03125 = fieldNorm(doc=2741)
          0.4 = coord(2/5)
      0.16666667 = coord(1/6)
    
    Date
    6. 1.1997 18:30:28
    12. 9.2004 9:56:22
  14. Panyr, J.: STEINADLER: ein Verfahren zur automatischen Deskribierung und zur automatischen thematischen Klassifikation (1978) 0.00
    0.0025047874 = product of:
      0.015028724 = sum of:
        0.015028724 = product of:
          0.07514362 = sum of:
            0.07514362 = weight(_text_:29 in 5169) [ClassicSimilarity], result of:
              0.07514362 = score(doc=5169,freq=2.0), product of:
                0.12083977 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.03435205 = queryNorm
                0.6218451 = fieldWeight in 5169, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.125 = fieldNorm(doc=5169)
          0.2 = coord(1/5)
      0.16666667 = coord(1/6)
    
    Source
    Nachrichten für Dokumentation. 29(1978), S.92-96
  15. Kleinoeder, H.H.; Puzicha, J.: Automatische Katalogisierung am Beispiel einer Pilotanwendung (2002) 0.00
    0.0022728955 = product of:
      0.013637373 = sum of:
        0.013637373 = product of:
          0.068186864 = sum of:
            0.068186864 = weight(_text_:28 in 1154) [ClassicSimilarity], result of:
              0.068186864 = score(doc=1154,freq=2.0), product of:
                0.12305808 = queryWeight, product of:
                  3.5822632 = idf(docFreq=3342, maxDocs=44218)
                  0.03435205 = queryNorm
                0.5541031 = fieldWeight in 1154, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5822632 = idf(docFreq=3342, maxDocs=44218)
                  0.109375 = fieldNorm(doc=1154)
          0.2 = coord(1/5)
      0.16666667 = coord(1/6)
    
    Date
    11. 7.2003 13:27:28
  16. Subramanian, S.; Shafer, K.E.: Clustering (2001) 0.00
    0.0018616914 = product of:
      0.011170148 = sum of:
        0.011170148 = product of:
          0.05585074 = sum of:
            0.05585074 = weight(_text_:22 in 1046) [ClassicSimilarity], result of:
              0.05585074 = score(doc=1046,freq=2.0), product of:
                0.120295025 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03435205 = queryNorm
                0.46428138 = fieldWeight in 1046, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=1046)
          0.2 = coord(1/5)
      0.16666667 = coord(1/6)
    
    Date
    5. 5.2003 14:17:22
  17. HaCohen-Kerner, Y. et al.: Classification using various machine learning methods and combinations of key-phrases and visual features (2016) 0.00
    0.0015514096 = product of:
      0.009308457 = sum of:
        0.009308457 = product of:
          0.046542287 = sum of:
            0.046542287 = weight(_text_:22 in 2748) [ClassicSimilarity], result of:
              0.046542287 = score(doc=2748,freq=2.0), product of:
                0.120295025 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03435205 = queryNorm
                0.38690117 = fieldWeight in 2748, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=2748)
          0.2 = coord(1/5)
      0.16666667 = coord(1/6)
    
    Date
    1. 2.2016 18:25:22
  18. Golub, K.; Hamon, T.; Ardö, A.: Automated classification of textual documents based on a controlled vocabulary in engineering (2007) 0.00
    0.0013775828 = product of:
      0.008265496 = sum of:
        0.008265496 = product of:
          0.04132748 = sum of:
            0.04132748 = weight(_text_:28 in 1461) [ClassicSimilarity], result of:
              0.04132748 = score(doc=1461,freq=4.0), product of:
                0.12305808 = queryWeight, product of:
                  3.5822632 = idf(docFreq=3342, maxDocs=44218)
                  0.03435205 = queryNorm
                0.3358372 = fieldWeight in 1461, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.5822632 = idf(docFreq=3342, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1461)
          0.2 = coord(1/5)
      0.16666667 = coord(1/6)
    
    Date
    6. 1.1997 18:30:28
    28. 2.2008 14:21:51
  19. Desale, S.K.; Kumbhar, R.: Research on automatic classification of documents in library environment : a literature review (2013) 0.00
    0.0013775828 = product of:
      0.008265496 = sum of:
        0.008265496 = product of:
          0.04132748 = sum of:
            0.04132748 = weight(_text_:28 in 1071) [ClassicSimilarity], result of:
              0.04132748 = score(doc=1071,freq=4.0), product of:
                0.12305808 = queryWeight, product of:
                  3.5822632 = idf(docFreq=3342, maxDocs=44218)
                  0.03435205 = queryNorm
                0.3358372 = fieldWeight in 1071, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.5822632 = idf(docFreq=3342, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1071)
          0.2 = coord(1/5)
      0.16666667 = coord(1/6)
    
    Date
    6. 1.1997 18:30:28
    19. 9.2013 19:28:15
  20. Aphinyanaphongs, Y.; Fu, L.D.; Li, Z.; Peskin, E.R.; Efstathiadis, E.; Aliferis, C.F.; Statnikov, A.: ¬A comprehensive empirical comparison of modern supervised classification and feature selection methods for text categorization (2014) 0.00
    0.0013775828 = product of:
      0.008265496 = sum of:
        0.008265496 = product of:
          0.04132748 = sum of:
            0.04132748 = weight(_text_:28 in 1496) [ClassicSimilarity], result of:
              0.04132748 = score(doc=1496,freq=4.0), product of:
                0.12305808 = queryWeight, product of:
                  3.5822632 = idf(docFreq=3342, maxDocs=44218)
                  0.03435205 = queryNorm
                0.3358372 = fieldWeight in 1496, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.5822632 = idf(docFreq=3342, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1496)
          0.2 = coord(1/5)
      0.16666667 = coord(1/6)
    
    Abstract
    An important aspect to performing text categorization is selecting appropriate supervised classification and feature selection methods. A comprehensive benchmark is needed to inform best practices in this broad application field. Previous benchmarks have evaluated performance for a few supervised classification and feature selection methods and limited ways to optimize them. The present work updates prior benchmarks by increasing the number of classifiers and feature selection methods order of magnitude, including adding recently developed, state-of-the-art methods. Specifically, this study used 229 text categorization data sets/tasks, and evaluated 28 classification methods (both well-established and proprietary/commercial) and 19 feature selection methods according to 4 classification performance metrics. We report several key findings that will be helpful in establishing best methodological practices for text categorization.
    Date
    26. 9.2014 18:28:57

Years

Languages

  • e 41
  • d 7