-
Koch, T.; Ardö, A.: Automatic classification of full-text HTML-documents from one specific subject area : DESIRE II D3.6a, Working Paper 2 (2000)
0.01
0.014486472 = product of:
0.028972944 = sum of:
0.028972944 = product of:
0.05794589 = sum of:
0.05794589 = weight(_text_:b in 1667) [ClassicSimilarity], result of:
0.05794589 = score(doc=1667,freq=2.0), product of:
0.18503809 = queryWeight, product of:
3.542962 = idf(docFreq=3476, maxDocs=44218)
0.052226946 = queryNorm
0.31315655 = fieldWeight in 1667, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.542962 = idf(docFreq=3476, maxDocs=44218)
0.0625 = fieldNorm(doc=1667)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Content
- 1 Introduction / 2 Method overview / 3 Ei thesaurus preprocessing / 4 Automatic classification process: 4.1 Matching -- 4.2 Weighting -- 4.3 Preparation for display / 5 Results of the classification process / 6 Evaluations / 7 Software / 8 Other applications / 9 Experiments with universal classification systems / References / Appendix A: Ei classification service: Software / Appendix B: Use of the classification software as subject filter in a WWW harvester.
-
Automatic classification research at OCLC (2002)
0.01
0.012383052 = product of:
0.024766104 = sum of:
0.024766104 = product of:
0.04953221 = sum of:
0.04953221 = weight(_text_:22 in 1563) [ClassicSimilarity], result of:
0.04953221 = score(doc=1563,freq=2.0), product of:
0.18288986 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.052226946 = queryNorm
0.2708308 = fieldWeight in 1563, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0546875 = fieldNorm(doc=1563)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Date
- 5. 5.2003 9:22:09
-
Lindholm, J.; Schönthal, T.; Jansson , K.: Experiences of harvesting Web resources in engineering using automatic classification (2003)
0.01
0.011992325 = product of:
0.02398465 = sum of:
0.02398465 = product of:
0.0959386 = sum of:
0.0959386 = weight(_text_:authors in 4088) [ClassicSimilarity], result of:
0.0959386 = score(doc=4088,freq=2.0), product of:
0.23809293 = queryWeight, product of:
4.558814 = idf(docFreq=1258, maxDocs=44218)
0.052226946 = queryNorm
0.40294603 = fieldWeight in 4088, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.558814 = idf(docFreq=1258, maxDocs=44218)
0.0625 = fieldNorm(doc=4088)
0.25 = coord(1/4)
0.5 = coord(1/2)
- Abstract
- Authors describe the background and the work involved in setting up Engine-e, a Web index that uses automatic classification as a mean for the selection of resources in Engineering. Considerations in offering a robot-generated Web index as a successor to a manually indexed quality-controlled subject gateway are also discussed