Search (1 results, page 1 of 1)

Did you mean:
precises%3a%2fdocuments %2f subject classification schemes%3a bliss%2c henry evelyn %2f bliss bibliographic classification %2f texts%22 1
precises%3a%2f_documents %2f subject classification schemes%3a bliss%2c henry evelyn %2f bliss bibliographic classification %2f texts%22 1
precises%3a%2fdocuments %2f subject classification schemes%3a bliss%2c heery evelyn %2f bliss bibliographic classification %2f texts%22 1
precises%3a%2fdocuments %2f subject classification schemes%3a bliss%2c henry evelyn %2f bloss bibliographic classification %2f texts%22 1
precises%3a%2fdocuments %2f subject classification schemes%3a bloss%2c henry evelyn %2f bliss bibliographic classification %2f texts%22 1

Finn, A.; Kushmerick, N.: Learning to classify documents according to genre (2006) 0.01
```
0.008156957 = product of:
  0.057098698 = sum of:
    0.028549349 = weight(_text_:classification in 6010) [ClassicSimilarity], result of:
      0.028549349 = score(doc=6010,freq=4.0), product of:
        0.09562149 = queryWeight, product of:
          3.1847067 = idf(docFreq=4974, maxDocs=44218)
          0.03002521 = queryNorm
        0.29856625 = fieldWeight in 6010, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.1847067 = idf(docFreq=4974, maxDocs=44218)
          0.046875 = fieldNorm(doc=6010)
    0.028549349 = weight(_text_:classification in 6010) [ClassicSimilarity], result of:
      0.028549349 = score(doc=6010,freq=4.0), product of:
        0.09562149 = queryWeight, product of:
          3.1847067 = idf(docFreq=4974, maxDocs=44218)
          0.03002521 = queryNorm
        0.29856625 = fieldWeight in 6010, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.1847067 = idf(docFreq=4974, maxDocs=44218)
          0.046875 = fieldNorm(doc=6010)
  0.14285715 = coord(2/14)
```
Abstract

Current document-retrieval tools succeed in locating large numbers of documents relevant to a given query. While search results may be relevant according to the topic of the documents, it is more difficult to identify which of the relevant documents are most suitable for a particular user. Automatic genre analysis (i.e., the ability to distinguish documents according to style) would be a useful tool for identifying documents that are most suitable for a particular user. We investigate the use of machine learning for automatic genre classification. We introduce the idea of domain transfer-genre classifiers should be reusable across multiple topics-which does not arise in standard text classification. We investigate different features for building genre classifiers and their ability to transfer across multiple-topic domains. We also show how different feature-sets can be used in conjunction with each other to improve performance and reduce the number of documents that need to be labeled.