-
Hotho, A.; Bloehdorn, S.: Data Mining 2004 : Text classification by boosting weak learners based on terms and concepts (2004)
0.10
0.104570225 = sum of:
0.08326228 = product of:
0.24978682 = sum of:
0.24978682 = weight(_text_:3a in 562) [ClassicSimilarity], result of:
0.24978682 = score(doc=562,freq=2.0), product of:
0.44444627 = queryWeight, product of:
8.478011 = idf(docFreq=24, maxDocs=44218)
0.05242341 = queryNorm
0.56201804 = fieldWeight in 562, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
8.478011 = idf(docFreq=24, maxDocs=44218)
0.046875 = fieldNorm(doc=562)
0.33333334 = coord(1/3)
0.021307945 = product of:
0.04261589 = sum of:
0.04261589 = weight(_text_:22 in 562) [ClassicSimilarity], result of:
0.04261589 = score(doc=562,freq=2.0), product of:
0.18357785 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.05242341 = queryNorm
0.23214069 = fieldWeight in 562, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.046875 = fieldNorm(doc=562)
0.5 = coord(1/2)
- Content
- Vgl.: http://www.google.de/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&ved=0CEAQFjAA&url=http%3A%2F%2Fciteseerx.ist.psu.edu%2Fviewdoc%2Fdownload%3Fdoi%3D10.1.1.91.4940%26rep%3Drep1%26type%3Dpdf&ei=dOXrUMeIDYHDtQahsIGACg&usg=AFQjCNHFWVh6gNPvnOrOS9R3rkrXCNVD-A&sig2=5I2F5evRfMnsttSgFF9g7Q&bvm=bv.1357316858,d.Yms.
- Date
- 8. 1.2013 10:22:32
-
Dubin, D.: Dimensions and discriminability (1998)
0.04
0.03722297 = product of:
0.07444594 = sum of:
0.07444594 = sum of:
0.024727406 = weight(_text_:2 in 2338) [ClassicSimilarity], result of:
0.024727406 = score(doc=2338,freq=2.0), product of:
0.1294644 = queryWeight, product of:
2.4695914 = idf(docFreq=10170, maxDocs=44218)
0.05242341 = queryNorm
0.19099772 = fieldWeight in 2338, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4695914 = idf(docFreq=10170, maxDocs=44218)
0.0546875 = fieldNorm(doc=2338)
0.049718536 = weight(_text_:22 in 2338) [ClassicSimilarity], result of:
0.049718536 = score(doc=2338,freq=2.0), product of:
0.18357785 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.05242341 = queryNorm
0.2708308 = fieldWeight in 2338, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0546875 = fieldNorm(doc=2338)
0.5 = coord(1/2)
- Date
- 22. 9.1997 19:16:05
- Source
- Visualizing subject access for 21st century information resources: Papers presented at the 1997 Clinic on Library Applications of Data Processing, 2-4 Mar 1997, Graduate School of Library and Information Science, University of Illinois at Urbana-Champaign. Ed.: P.A. Cochrane et al
-
Pfeffer, M.: Automatische Vergabe von RVK-Notationen mittels fallbasiertem Schließen (2009)
0.03
0.031905405 = product of:
0.06381081 = sum of:
0.06381081 = sum of:
0.021194918 = weight(_text_:2 in 3051) [ClassicSimilarity], result of:
0.021194918 = score(doc=3051,freq=2.0), product of:
0.1294644 = queryWeight, product of:
2.4695914 = idf(docFreq=10170, maxDocs=44218)
0.05242341 = queryNorm
0.16371232 = fieldWeight in 3051, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4695914 = idf(docFreq=10170, maxDocs=44218)
0.046875 = fieldNorm(doc=3051)
0.04261589 = weight(_text_:22 in 3051) [ClassicSimilarity], result of:
0.04261589 = score(doc=3051,freq=2.0), product of:
0.18357785 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.05242341 = queryNorm
0.23214069 = fieldWeight in 3051, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.046875 = fieldNorm(doc=3051)
0.5 = coord(1/2)
- Date
- 22. 8.2009 19:51:28
- Footnote
- Vgl. auch die Präsentationen unter: http://www.bibliothek.uni-regensburg.de/Systematik/pdf/Anw2008_PPT1.pdf. http://blog.bib.uni-mannheim.de/Classification/wp-content/uploads/2007/10/hu-berlin-2007-2.pdf. Volltexte unter:
-
Egbert, J.; Biber, D.; Davies, M.: Developing a bottom-up, user-based method of web register classification (2015)
0.03
0.031905405 = product of:
0.06381081 = sum of:
0.06381081 = sum of:
0.021194918 = weight(_text_:2 in 2158) [ClassicSimilarity], result of:
0.021194918 = score(doc=2158,freq=2.0), product of:
0.1294644 = queryWeight, product of:
2.4695914 = idf(docFreq=10170, maxDocs=44218)
0.05242341 = queryNorm
0.16371232 = fieldWeight in 2158, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4695914 = idf(docFreq=10170, maxDocs=44218)
0.046875 = fieldNorm(doc=2158)
0.04261589 = weight(_text_:22 in 2158) [ClassicSimilarity], result of:
0.04261589 = score(doc=2158,freq=2.0), product of:
0.18357785 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.05242341 = queryNorm
0.23214069 = fieldWeight in 2158, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.046875 = fieldNorm(doc=2158)
0.5 = coord(1/2)
- Abstract
- This paper introduces a project to develop a reliable, cost-effective method for classifying Internet texts into register categories, and apply that approach to the analysis of a large corpus of web documents. To date, the project has proceeded in 2 key phases. First, we developed a bottom-up method for web register classification, asking end users of the web to utilize a decision-tree survey to code relevant situational characteristics of web documents, resulting in a bottom-up identification of register and subregister categories. We present details regarding the development and testing of this method through a series of 10 pilot studies. Then, in the second phase of our project we applied this procedure to a corpus of 53,000 web documents. An analysis of the results demonstrates the effectiveness of these methods for web register classification and provides a preliminary description of the types and distribution of registers on the web.
- Date
- 4. 8.2015 19:22:04
-
Subramanian, S.; Shafer, K.E.: Clustering (2001)
0.02
0.021307945 = product of:
0.04261589 = sum of:
0.04261589 = product of:
0.08523178 = sum of:
0.08523178 = weight(_text_:22 in 1046) [ClassicSimilarity], result of:
0.08523178 = score(doc=1046,freq=2.0), product of:
0.18357785 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.05242341 = queryNorm
0.46428138 = fieldWeight in 1046, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.09375 = fieldNorm(doc=1046)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Date
- 5. 5.2003 14:17:22
-
Reiner, U.: Automatische DDC-Klassifizierung von bibliografischen Titeldatensätzen (2009)
0.02
0.01775662 = product of:
0.03551324 = sum of:
0.03551324 = product of:
0.07102648 = sum of:
0.07102648 = weight(_text_:22 in 611) [ClassicSimilarity], result of:
0.07102648 = score(doc=611,freq=2.0), product of:
0.18357785 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.05242341 = queryNorm
0.38690117 = fieldWeight in 611, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.078125 = fieldNorm(doc=611)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Date
- 22. 8.2009 12:54:24
-
HaCohen-Kerner, Y. et al.: Classification using various machine learning methods and combinations of key-phrases and visual features (2016)
0.02
0.01775662 = product of:
0.03551324 = sum of:
0.03551324 = product of:
0.07102648 = sum of:
0.07102648 = weight(_text_:22 in 2748) [ClassicSimilarity], result of:
0.07102648 = score(doc=2748,freq=2.0), product of:
0.18357785 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.05242341 = queryNorm
0.38690117 = fieldWeight in 2748, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.078125 = fieldNorm(doc=2748)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Date
- 1. 2.2016 18:25:22
-
Bock, H.-H.: Datenanalyse zur Strukturierung und Ordnung von Information (1989)
0.01
0.012429634 = product of:
0.024859268 = sum of:
0.024859268 = product of:
0.049718536 = sum of:
0.049718536 = weight(_text_:22 in 141) [ClassicSimilarity], result of:
0.049718536 = score(doc=141,freq=2.0), product of:
0.18357785 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.05242341 = queryNorm
0.2708308 = fieldWeight in 141, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0546875 = fieldNorm(doc=141)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Pages
- S.1-22
-
Automatic classification research at OCLC (2002)
0.01
0.012429634 = product of:
0.024859268 = sum of:
0.024859268 = product of:
0.049718536 = sum of:
0.049718536 = weight(_text_:22 in 1563) [ClassicSimilarity], result of:
0.049718536 = score(doc=1563,freq=2.0), product of:
0.18357785 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.05242341 = queryNorm
0.2708308 = fieldWeight in 1563, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0546875 = fieldNorm(doc=1563)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Date
- 5. 5.2003 9:22:09
-
Jenkins, C.: Automatic classification of Web resources using Java and Dewey Decimal Classification (1998)
0.01
0.012429634 = product of:
0.024859268 = sum of:
0.024859268 = product of:
0.049718536 = sum of:
0.049718536 = weight(_text_:22 in 1673) [ClassicSimilarity], result of:
0.049718536 = score(doc=1673,freq=2.0), product of:
0.18357785 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.05242341 = queryNorm
0.2708308 = fieldWeight in 1673, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0546875 = fieldNorm(doc=1673)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Date
- 1. 8.1996 22:08:06
-
Yoon, Y.; Lee, C.; Lee, G.G.: ¬An effective procedure for constructing a hierarchical text classification system (2006)
0.01
0.012429634 = product of:
0.024859268 = sum of:
0.024859268 = product of:
0.049718536 = sum of:
0.049718536 = weight(_text_:22 in 5273) [ClassicSimilarity], result of:
0.049718536 = score(doc=5273,freq=2.0), product of:
0.18357785 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.05242341 = queryNorm
0.2708308 = fieldWeight in 5273, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0546875 = fieldNorm(doc=5273)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Date
- 22. 7.2006 16:24:52
-
Yi, K.: Automatic text classification using library classification schemes : trends, issues and challenges (2007)
0.01
0.012429634 = product of:
0.024859268 = sum of:
0.024859268 = product of:
0.049718536 = sum of:
0.049718536 = weight(_text_:22 in 2560) [ClassicSimilarity], result of:
0.049718536 = score(doc=2560,freq=2.0), product of:
0.18357785 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.05242341 = queryNorm
0.2708308 = fieldWeight in 2560, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0546875 = fieldNorm(doc=2560)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Date
- 22. 9.2008 18:31:54
-
Liu, R.-L.: Context recognition for hierarchical text classification (2009)
0.01
0.010653973 = product of:
0.021307945 = sum of:
0.021307945 = product of:
0.04261589 = sum of:
0.04261589 = weight(_text_:22 in 2760) [ClassicSimilarity], result of:
0.04261589 = score(doc=2760,freq=2.0), product of:
0.18357785 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.05242341 = queryNorm
0.23214069 = fieldWeight in 2760, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.046875 = fieldNorm(doc=2760)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Date
- 22. 3.2009 19:11:54
-
Zhu, W.Z.; Allen, R.B.: Document clustering using the LSI subspace signature model (2013)
0.01
0.010653973 = product of:
0.021307945 = sum of:
0.021307945 = product of:
0.04261589 = sum of:
0.04261589 = weight(_text_:22 in 690) [ClassicSimilarity], result of:
0.04261589 = score(doc=690,freq=2.0), product of:
0.18357785 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.05242341 = queryNorm
0.23214069 = fieldWeight in 690, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.046875 = fieldNorm(doc=690)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Date
- 23. 3.2013 13:22:36
-
Shafer, K.E.: Automatic Subject Assignment via the Scorpion System (2001)
0.01
0.010597459 = product of:
0.021194918 = sum of:
0.021194918 = product of:
0.042389836 = sum of:
0.042389836 = weight(_text_:2 in 1043) [ClassicSimilarity], result of:
0.042389836 = score(doc=1043,freq=2.0), product of:
0.1294644 = queryWeight, product of:
2.4695914 = idf(docFreq=10170, maxDocs=44218)
0.05242341 = queryNorm
0.32742465 = fieldWeight in 1043, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4695914 = idf(docFreq=10170, maxDocs=44218)
0.09375 = fieldNorm(doc=1043)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Source
- Journal of library administration. 34(2001) nos.1/2, S.187-189
-
Fangmeyer, H.; Gloden, R.: Bewertung und Vergleich von Klassifikationsergebnissen bei automatischen Verfahren (1978)
0.01
0.00999138 = product of:
0.01998276 = sum of:
0.01998276 = product of:
0.03996552 = sum of:
0.03996552 = weight(_text_:2 in 81) [ClassicSimilarity], result of:
0.03996552 = score(doc=81,freq=4.0), product of:
0.1294644 = queryWeight, product of:
2.4695914 = idf(docFreq=10170, maxDocs=44218)
0.05242341 = queryNorm
0.30869892 = fieldWeight in 81, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
2.4695914 = idf(docFreq=10170, maxDocs=44218)
0.0625 = fieldNorm(doc=81)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Series
- Studien zur Klassifikation; Bd.2
- Source
- Kooperation in der Klassifikation I. Proc. der Sekt.1-3 der 2. Fachtagung der Gesellschaft für Klassifikation, Frankfurt-Hoechst, 6.-7.4.1978. Bearb.: W. Dahlberg
-
Bollmann, P.; Konrad, E.; Schneider, H.-J.; Zuse, H.: Anwendung automatischer Klassifikationsverfahren mit dem System FAKYR (1978)
0.01
0.00999138 = product of:
0.01998276 = sum of:
0.01998276 = product of:
0.03996552 = sum of:
0.03996552 = weight(_text_:2 in 82) [ClassicSimilarity], result of:
0.03996552 = score(doc=82,freq=4.0), product of:
0.1294644 = queryWeight, product of:
2.4695914 = idf(docFreq=10170, maxDocs=44218)
0.05242341 = queryNorm
0.30869892 = fieldWeight in 82, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
2.4695914 = idf(docFreq=10170, maxDocs=44218)
0.0625 = fieldNorm(doc=82)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Series
- Studien zur Klassifikation; Bd.2
- Source
- Kooperation in der Klassifikation I. Proc. der Sekt.1-3 der 2. Fachtagung der Gesellschaft für Klassifikation, Frankfurt-Hoechst, 6.-7.4.1978. Bearb.: W. Dahlberg
-
Schulze, U.: Erfahrungen bei der Anwendung automatischer Klassifizierungsverfahren zur Inhaltsanalyse einer Dokumentenmenge (1978)
0.01
0.00999138 = product of:
0.01998276 = sum of:
0.01998276 = product of:
0.03996552 = sum of:
0.03996552 = weight(_text_:2 in 83) [ClassicSimilarity], result of:
0.03996552 = score(doc=83,freq=4.0), product of:
0.1294644 = queryWeight, product of:
2.4695914 = idf(docFreq=10170, maxDocs=44218)
0.05242341 = queryNorm
0.30869892 = fieldWeight in 83, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
2.4695914 = idf(docFreq=10170, maxDocs=44218)
0.0625 = fieldNorm(doc=83)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Series
- Studien zur Klassifikation; Bd.2
- Source
- Kooperation in der Klassifikation I. Proc. der Sekt.1-3 der 2. Fachtagung der Gesellschaft für Klassifikation, Frankfurt-Hoechst, 6.-7.4.1978. Bearb.: W. Dahlberg
-
Koch, T.; Ardö, A.: Automatic classification of full-text HTML-documents from one specific subject area : DESIRE II D3.6a, Working Paper 2 (2000)
0.01
0.00999138 = product of:
0.01998276 = sum of:
0.01998276 = product of:
0.03996552 = sum of:
0.03996552 = weight(_text_:2 in 1667) [ClassicSimilarity], result of:
0.03996552 = score(doc=1667,freq=4.0), product of:
0.1294644 = queryWeight, product of:
2.4695914 = idf(docFreq=10170, maxDocs=44218)
0.05242341 = queryNorm
0.30869892 = fieldWeight in 1667, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
2.4695914 = idf(docFreq=10170, maxDocs=44218)
0.0625 = fieldNorm(doc=1667)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Content
- 1 Introduction / 2 Method overview / 3 Ei thesaurus preprocessing / 4 Automatic classification process: 4.1 Matching -- 4.2 Weighting -- 4.3 Preparation for display / 5 Results of the classification process / 6 Evaluations / 7 Software / 8 Other applications / 9 Experiments with universal classification systems / References / Appendix A: Ei classification service: Software / Appendix B: Use of the classification software as subject filter in a WWW harvester.
-
Mengle, S.; Goharian, N.: Passage detection using text classification (2009)
0.01
0.00887831 = product of:
0.01775662 = sum of:
0.01775662 = product of:
0.03551324 = sum of:
0.03551324 = weight(_text_:22 in 2765) [ClassicSimilarity], result of:
0.03551324 = score(doc=2765,freq=2.0), product of:
0.18357785 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.05242341 = queryNorm
0.19345059 = fieldWeight in 2765, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0390625 = fieldNorm(doc=2765)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Date
- 22. 3.2009 19:14:43