May, A.D.: Automatic classification of e-mail messages by message type (1997)
0.01
0.009894149 = product of:
0.019788299 = sum of:
0.013760565 = weight(_text_:a in 6493) [ClassicSimilarity], result of:
0.013760565 = score(doc=6493,freq=14.0), product of:
0.05832264 = queryWeight, product of:
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.05058132 = queryNorm
0.23593865 = fieldWeight in 6493, product of:
3.7416575 = tf(freq=14.0), with freq of:
14.0 = termFreq=14.0
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.0546875 = fieldNorm(doc=6493)
0.006027733 = product of:
0.012055466 = sum of:
0.012055466 = weight(_text_:information in 6493) [ClassicSimilarity], result of:
0.012055466 = score(doc=6493,freq=2.0), product of:
0.088794395 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.05058132 = queryNorm
0.13576832 = fieldWeight in 6493, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0546875 = fieldNorm(doc=6493)
0.5 = coord(1/2)
0.5 = coord(2/4)
- Abstract
- This article describes a system that automatically classifies e-mail messages in the HUMANIST electronic discussion group into one of 4 classes: questions, responses, announcement or administartive. A total of 1.372 messages were analyzed. The automatic classification of a message was based on string matching between a message text and predefined string sets for each of the massage types. The system's automated ability to accurately classify a message was compared against manually assigned codes. The Cohen's Kappa of .55 suggested that there was a statistical agreement between the automatic and manually assigned codes
- Source
- Journal of the American Society for Information Science. 48(1997) no.1, S.32-39
- Type
- a