Search (2 results, page 1 of 1)

  • × author_ss:"Sebastiani, F."
  • × theme_ss:"Automatisches Klassifizieren"
  1. Sebastiani, F.: ¬A tutorial an automated text categorisation (1999) 0.00
    0.0030688148 = product of:
      0.07365155 = sum of:
        0.07365155 = weight(_text_:1960 in 3390) [ClassicSimilarity], result of:
          0.07365155 = score(doc=3390,freq=2.0), product of:
            0.15622076 = queryWeight, product of:
              7.11192 = idf(docFreq=97, maxDocs=44218)
              0.021966046 = queryNorm
            0.47145814 = fieldWeight in 3390, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              7.11192 = idf(docFreq=97, maxDocs=44218)
              0.046875 = fieldNorm(doc=3390)
      0.041666668 = coord(1/24)
    
    Abstract
    The automated categorisation (or classification) of texts into topical categories has a long history, dating back at least to 1960. Until the late '80s, the dominant approach to the problem involved knowledge-engineering automatic categorisers, i.e. manually building a set of rules encoding expert knowledge an how to classify documents. In the '90s, with the booming production and availability of on-line documents, automated text categorisation has witnessed an increased and renewed interest. A newer paradigm based an machine learning has superseded the previous approach. Within this paradigm, a general inductive process automatically builds a classifier by "learning", from a set of previously classified documents, the characteristics of one or more categories; the advantages are a very good effectiveness, a considerable savings in terms of expert manpower, and domain independence. In this tutorial we look at the main approaches that have been taken towards automatic text categorisation within the general machine learning paradigm. Issues of document indexing, classifier construction, and classifier evaluation, will be touched upon.
  2. Giorgetti, D.; Sebastiani, F.: Automating survey coding by multiclass text categorization techniques (2003) 0.00
    3.1282406E-4 = product of:
      0.0075077773 = sum of:
        0.0075077773 = product of:
          0.015015555 = sum of:
            0.015015555 = weight(_text_:29 in 5172) [ClassicSimilarity], result of:
              0.015015555 = score(doc=5172,freq=2.0), product of:
                0.07726968 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.021966046 = queryNorm
                0.19432661 = fieldWeight in 5172, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5172)
          0.5 = coord(1/2)
      0.041666668 = coord(1/24)
    
    Date
    9. 7.2006 10:29:12

Types