-
Hotho, A.; Bloehdorn, S.: Data Mining 2004 : Text classification by boosting weak learners based on terms and concepts (2004)
0.10
0.1038222 = sum of:
0.08266668 = product of:
0.24800003 = sum of:
0.24800003 = weight(_text_:3a in 562) [ClassicSimilarity], result of:
0.24800003 = score(doc=562,freq=2.0), product of:
0.441267 = queryWeight, product of:
8.478011 = idf(docFreq=24, maxDocs=44218)
0.05204841 = queryNorm
0.56201804 = fieldWeight in 562, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
8.478011 = idf(docFreq=24, maxDocs=44218)
0.046875 = fieldNorm(doc=562)
0.33333334 = coord(1/3)
0.021155523 = product of:
0.042311046 = sum of:
0.042311046 = weight(_text_:22 in 562) [ClassicSimilarity], result of:
0.042311046 = score(doc=562,freq=2.0), product of:
0.18226467 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.05204841 = queryNorm
0.23214069 = fieldWeight in 562, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.046875 = fieldNorm(doc=562)
0.5 = coord(1/2)
- Content
- Cf.: http://www.google.de/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&ved=0CEAQFjAA&url=http%3A%2F%2Fciteseerx.ist.psu.edu%2Fviewdoc%2Fdownload%3Fdoi%3D10.1.1.91.4940%26rep%3Drep1%26type%3Dpdf&ei=dOXrUMeIDYHDtQahsIGACg&usg=AFQjCNHFWVh6gNPvnOrOS9R3rkrXCNVD-A&sig2=5I2F5evRfMnsttSgFF9g7Q&bvm=bv.1357316858,d.Yms.
- Date
- 8. 1.2013 10:22:32
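The score breakdown for this entry (doc 562) can be reproduced by hand. The sketch below assumes the usual ClassicSimilarity definitions, tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), and takes queryNorm, fieldNorm, and the coord factors directly from the listing. The function names are hypothetical and chosen only for illustration; small differences in the last digits are expected because the listing was computed in single precision.

```python
import math

def idf(doc_freq: int, max_docs: int) -> float:
    # Inverse document frequency as shown in the explain output.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_weight(freq: float, doc_freq: int, max_docs: int,
                query_norm: float, field_norm: float) -> float:
    # weight = queryWeight * fieldWeight
    tf = math.sqrt(freq)                        # 1.4142135 for freq=2.0
    term_idf = idf(doc_freq, max_docs)
    query_weight = term_idf * query_norm        # idf * queryNorm
    field_weight = tf * term_idf * field_norm   # tf * idf * fieldNorm
    return query_weight * field_weight

# Values copied from the listing for doc 562.
MAX_DOCS = 44218
QUERY_NORM = 0.05204841
FIELD_NORM = 0.046875

w_3a = term_weight(2.0, 24, MAX_DOCS, QUERY_NORM, FIELD_NORM)    # term "3a"
w_22 = term_weight(2.0, 3622, MAX_DOCS, QUERY_NORM, FIELD_NORM)  # term "22"

# Each clause is scaled by its coord factor before the final sum.
total = w_3a * (1 / 3) + w_22 * (1 / 2)
print(f"{w_3a:.6f} {w_22:.6f} {total:.6f}")
```

The rare term "3a" (docFreq=24) contributes far more than the common term "22" (docFreq=3622) because idf enters the weight twice, once through queryWeight and once through fieldWeight.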
-
Liu, R.-L.: Context recognition for hierarchical text classification (2009)
0.04
0.04075531 = product of:
0.08151062 = sum of:
0.08151062 = sum of:
0.039199576 = weight(_text_:management in 2760) [ClassicSimilarity], result of:
0.039199576 = score(doc=2760,freq=2.0), product of:
0.17543502 = queryWeight, product of:
3.3706124 = idf(docFreq=4130, maxDocs=44218)
0.05204841 = queryNorm
0.22344214 = fieldWeight in 2760, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.3706124 = idf(docFreq=4130, maxDocs=44218)
0.046875 = fieldNorm(doc=2760)
0.042311046 = weight(_text_:22 in 2760) [ClassicSimilarity], result of:
0.042311046 = score(doc=2760,freq=2.0), product of:
0.18226467 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.05204841 = queryNorm
0.23214069 = fieldWeight in 2760, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.046875 = fieldNorm(doc=2760)
0.5 = coord(1/2)
- Abstract
- Information is often organized as a text hierarchy. A hierarchical text-classification system is thus essential for the management, sharing, and dissemination of information. It aims to automatically classify each incoming document into zero, one, or several categories in the text hierarchy. In this paper, we present a technique called CRHTC (context recognition for hierarchical text classification) that performs hierarchical text classification by recognizing the context of discussion (COD) of each category. A category's COD is governed by its ancestor categories, whose contents indicate contextual backgrounds of the category. A document may be classified into a category only if its content matches the category's COD. CRHTC does not require any trials to manually set parameters, and hence is more portable and easier to implement than other methods. It is empirically evaluated under various conditions. The results show that CRHTC achieves both better and more stable performance than several hierarchical and nonhierarchical text-classification methodologies.
- Date
- 22. 3.2009 19:11:54
-
Wu, M.; Fuller, M.; Wilkinson, R.: Using clustering and classification approaches in interactive retrieval (2001)
0.02
0.022866419 = product of:
0.045732837 = sum of:
0.045732837 = product of:
0.091465674 = sum of:
0.091465674 = weight(_text_:management in 2666) [ClassicSimilarity], result of:
0.091465674 = score(doc=2666,freq=2.0), product of:
0.17543502 = queryWeight, product of:
3.3706124 = idf(docFreq=4130, maxDocs=44218)
0.05204841 = queryNorm
0.521365 = fieldWeight in 2666, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.3706124 = idf(docFreq=4130, maxDocs=44218)
0.109375 = fieldNorm(doc=2666)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Source
- Information processing and management. 37(2001) no.3, pp.459-484
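As a worked check of this entry (doc 2666), the single matching term "management" can be followed from its components to the final score. The figures are copied from the listing; reading the two 0.5 values as coordination factors of nested query clauses is an interpretation of the explain output, not something the listing states.

```latex
\begin{align*}
\text{queryWeight} &= \text{idf} \times \text{queryNorm}
  = 3.3706124 \times 0.05204841 \approx 0.17543502 \\
\text{fieldWeight} &= \text{tf} \times \text{idf} \times \text{fieldNorm}
  = 1.4142135 \times 3.3706124 \times 0.109375 \approx 0.521365 \\
\text{weight} &= \text{queryWeight} \times \text{fieldWeight}
  \approx 0.091465674 \\
\text{score} &= \tfrac{1}{2} \times \tfrac{1}{2} \times \text{weight}
  \approx 0.022866419
\end{align*}
```

The other entries that match only "management" use the same idf and queryNorm, so their scores differ from this one only through fieldNorm.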
-
Subramanian, S.; Shafer, K.E.: Clustering (2001)
0.02
0.021155523 = product of:
0.042311046 = sum of:
0.042311046 = product of:
0.08462209 = sum of:
0.08462209 = weight(_text_:22 in 1046) [ClassicSimilarity], result of:
0.08462209 = score(doc=1046,freq=2.0), product of:
0.18226467 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.05204841 = queryNorm
0.46428138 = fieldWeight in 1046, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.09375 = fieldNorm(doc=1046)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Date
- 5. 5.2003 14:17:22
-
Major, R.L.; Ragsdale, C.T.: An aggregation approach to the classification problem using multiple prediction experts (2000)
0.02
0.019599788 = product of:
0.039199576 = sum of:
0.039199576 = product of:
0.07839915 = sum of:
0.07839915 = weight(_text_:management in 3789) [ClassicSimilarity], result of:
0.07839915 = score(doc=3789,freq=2.0), product of:
0.17543502 = queryWeight, product of:
3.3706124 = idf(docFreq=4130, maxDocs=44218)
0.05204841 = queryNorm
0.44688427 = fieldWeight in 3789, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.3706124 = idf(docFreq=4130, maxDocs=44218)
0.09375 = fieldNorm(doc=3789)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Source
- Information processing and management. 36(2000) no.4, pp.683-696
-
Guerrero-Bote, V.P.; Moya Anegón, F. de; Herrero Solana, V.: Document organization using Kohonen's algorithm (2002)
0.01
0.013066526 = product of:
0.026133051 = sum of:
0.026133051 = product of:
0.052266102 = sum of:
0.052266102 = weight(_text_:management in 2564) [ClassicSimilarity], result of:
0.052266102 = score(doc=2564,freq=2.0), product of:
0.17543502 = queryWeight, product of:
3.3706124 = idf(docFreq=4130, maxDocs=44218)
0.05204841 = queryNorm
0.29792285 = fieldWeight in 2564, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.3706124 = idf(docFreq=4130, maxDocs=44218)
0.0625 = fieldNorm(doc=2564)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Source
- Information processing and management. 38(2002) no.1, pp.79-89
-
Yoon, Y.; Lee, C.; Lee, G.G.: An effective procedure for constructing a hierarchical text classification system (2006)
0.01
0.012340722 = product of:
0.024681443 = sum of:
0.024681443 = product of:
0.049362887 = sum of:
0.049362887 = weight(_text_:22 in 5273) [ClassicSimilarity], result of:
0.049362887 = score(doc=5273,freq=2.0), product of:
0.18226467 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.05204841 = queryNorm
0.2708308 = fieldWeight in 5273, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0546875 = fieldNorm(doc=5273)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Date
- 22. 7.2006 16:24:52
-
Yi, K.: Automatic text classification using library classification schemes : trends, issues and challenges (2007)
0.01
0.012340722 = product of:
0.024681443 = sum of:
0.024681443 = product of:
0.049362887 = sum of:
0.049362887 = weight(_text_:22 in 2560) [ClassicSimilarity], result of:
0.049362887 = score(doc=2560,freq=2.0), product of:
0.18226467 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.05204841 = queryNorm
0.2708308 = fieldWeight in 2560, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0546875 = fieldNorm(doc=2560)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Date
- 22. 9.2008 18:31:54
-
Miyamoto, S.: Information clustering based on fuzzy multisets (2003)
0.01
0.011433209 = product of:
0.022866419 = sum of:
0.022866419 = product of:
0.045732837 = sum of:
0.045732837 = weight(_text_:management in 1071) [ClassicSimilarity], result of:
0.045732837 = score(doc=1071,freq=2.0), product of:
0.17543502 = queryWeight, product of:
3.3706124 = idf(docFreq=4130, maxDocs=44218)
0.05204841 = queryNorm
0.2606825 = fieldWeight in 1071, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.3706124 = idf(docFreq=4130, maxDocs=44218)
0.0546875 = fieldNorm(doc=1071)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Source
- Information processing and management. 39(2003) no.2, pp.195-213
-
Hu, G.; Zhou, S.; Guan, J.; Hu, X.: Towards effective document clustering : a constrained K-means based approach (2008)
0.01
0.011433209 = product of:
0.022866419 = sum of:
0.022866419 = product of:
0.045732837 = sum of:
0.045732837 = weight(_text_:management in 2113) [ClassicSimilarity], result of:
0.045732837 = score(doc=2113,freq=2.0), product of:
0.17543502 = queryWeight, product of:
3.3706124 = idf(docFreq=4130, maxDocs=44218)
0.05204841 = queryNorm
0.2606825 = fieldWeight in 2113, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.3706124 = idf(docFreq=4130, maxDocs=44218)
0.0546875 = fieldNorm(doc=2113)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Source
- Information processing and management. 44(2008) no.4, pp.1397-1409
-
Pfeffer, M.: Automatische Vergabe von RVK-Notationen mittels fallbasiertem Schließen (2009)
0.01
0.010577762 = product of:
0.021155523 = sum of:
0.021155523 = product of:
0.042311046 = sum of:
0.042311046 = weight(_text_:22 in 3051) [ClassicSimilarity], result of:
0.042311046 = score(doc=3051,freq=2.0), product of:
0.18226467 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.05204841 = queryNorm
0.23214069 = fieldWeight in 3051, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.046875 = fieldNorm(doc=3051)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Date
- 22. 8.2009 19:51:28
-
Wu, K.J.; Chen, M.-C.; Sun, Y.: Automatic topics discovery from hyperlinked documents (2004)
0.01
0.009799894 = product of:
0.019599788 = sum of:
0.019599788 = product of:
0.039199576 = sum of:
0.039199576 = weight(_text_:management in 2563) [ClassicSimilarity], result of:
0.039199576 = score(doc=2563,freq=2.0), product of:
0.17543502 = queryWeight, product of:
3.3706124 = idf(docFreq=4130, maxDocs=44218)
0.05204841 = queryNorm
0.22344214 = fieldWeight in 2563, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.3706124 = idf(docFreq=4130, maxDocs=44218)
0.046875 = fieldNorm(doc=2563)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Source
- Information processing and management. 40(2004) no.2, pp.239-255
-
Liu, R.-L.: Dynamic category profiling for text filtering and classification (2007)
0.01
0.009799894 = product of:
0.019599788 = sum of:
0.019599788 = product of:
0.039199576 = sum of:
0.039199576 = weight(_text_:management in 900) [ClassicSimilarity], result of:
0.039199576 = score(doc=900,freq=2.0), product of:
0.17543502 = queryWeight, product of:
3.3706124 = idf(docFreq=4130, maxDocs=44218)
0.05204841 = queryNorm
0.22344214 = fieldWeight in 900, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.3706124 = idf(docFreq=4130, maxDocs=44218)
0.046875 = fieldNorm(doc=900)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Source
- Information processing and management. 43(2007) no.1, pp.154-168
-
Yoon, Y.; Lee, G.G.: Efficient implementation of associative classifiers for document classification (2007)
0.01
0.009799894 = product of:
0.019599788 = sum of:
0.019599788 = product of:
0.039199576 = sum of:
0.039199576 = weight(_text_:management in 909) [ClassicSimilarity], result of:
0.039199576 = score(doc=909,freq=2.0), product of:
0.17543502 = queryWeight, product of:
3.3706124 = idf(docFreq=4130, maxDocs=44218)
0.05204841 = queryNorm
0.22344214 = fieldWeight in 909, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.3706124 = idf(docFreq=4130, maxDocs=44218)
0.046875 = fieldNorm(doc=909)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Source
- Information processing and management. 43(2007) no.2, pp.393-405
-
Denoyer, L.; Gallinari, P.: Bayesian network model for semi-structured document classification (2004)
0.01
0.009799894 = product of:
0.019599788 = sum of:
0.019599788 = product of:
0.039199576 = sum of:
0.039199576 = weight(_text_:management in 995) [ClassicSimilarity], result of:
0.039199576 = score(doc=995,freq=2.0), product of:
0.17543502 = queryWeight, product of:
3.3706124 = idf(docFreq=4130, maxDocs=44218)
0.05204841 = queryNorm
0.22344214 = fieldWeight in 995, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.3706124 = idf(docFreq=4130, maxDocs=44218)
0.046875 = fieldNorm(doc=995)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Source
- Information processing and management. 40(2004) no.5, pp.807-827
-
Malenica, M.; Smuc, T.; Snajder, J.; Basic, B.D.: Language morphology offset : text classification on a Croatian-English parallel corpus (2008)
0.01
0.009799894 = product of:
0.019599788 = sum of:
0.019599788 = product of:
0.039199576 = sum of:
0.039199576 = weight(_text_:management in 2035) [ClassicSimilarity], result of:
0.039199576 = score(doc=2035,freq=2.0), product of:
0.17543502 = queryWeight, product of:
3.3706124 = idf(docFreq=4130, maxDocs=44218)
0.05204841 = queryNorm
0.22344214 = fieldWeight in 2035, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.3706124 = idf(docFreq=4130, maxDocs=44218)
0.046875 = fieldNorm(doc=2035)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Source
- Information processing and management. 44(2008) no.1, pp.325-339
-
Zhou, G.D.; Zhang, M.; Ji, D.H.; Zhu, Q.M.: Hierarchical learning strategy in semantic relation extraction (2008)
0.01
0.009799894 = product of:
0.019599788 = sum of:
0.019599788 = product of:
0.039199576 = sum of:
0.039199576 = weight(_text_:management in 2077) [ClassicSimilarity], result of:
0.039199576 = score(doc=2077,freq=2.0), product of:
0.17543502 = queryWeight, product of:
3.3706124 = idf(docFreq=4130, maxDocs=44218)
0.05204841 = queryNorm
0.22344214 = fieldWeight in 2077, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.3706124 = idf(docFreq=4130, maxDocs=44218)
0.046875 = fieldNorm(doc=2077)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Source
- Information processing and management. 44(2008) no.3, pp.1008-1021
-
Montesi, M.; Navarrete, T.: Classifying web genres in context : A case study documenting the web genres used by a software engineer (2008)
0.01
0.009799894 = product of:
0.019599788 = sum of:
0.019599788 = product of:
0.039199576 = sum of:
0.039199576 = weight(_text_:management in 2100) [ClassicSimilarity], result of:
0.039199576 = score(doc=2100,freq=2.0), product of:
0.17543502 = queryWeight, product of:
3.3706124 = idf(docFreq=4130, maxDocs=44218)
0.05204841 = queryNorm
0.22344214 = fieldWeight in 2100, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.3706124 = idf(docFreq=4130, maxDocs=44218)
0.046875 = fieldNorm(doc=2100)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Source
- Information processing and management. 44(2008) no.4, pp.1410-1430
-
Ko, Y.; Seo, J.: Text classification from unlabeled documents with bootstrapping and feature projection techniques (2009)
0.01
0.009799894 = product of:
0.019599788 = sum of:
0.019599788 = product of:
0.039199576 = sum of:
0.039199576 = weight(_text_:management in 2452) [ClassicSimilarity], result of:
0.039199576 = score(doc=2452,freq=2.0), product of:
0.17543502 = queryWeight, product of:
3.3706124 = idf(docFreq=4130, maxDocs=44218)
0.05204841 = queryNorm
0.22344214 = fieldWeight in 2452, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.3706124 = idf(docFreq=4130, maxDocs=44218)
0.046875 = fieldNorm(doc=2452)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Source
- Information processing and management. 45(2009) no.1, pp.70-83
-
Mengle, S.; Goharian, N.: Passage detection using text classification (2009)
0.01
0.008814801 = product of:
0.017629603 = sum of:
0.017629603 = product of:
0.035259206 = sum of:
0.035259206 = weight(_text_:22 in 2765) [ClassicSimilarity], result of:
0.035259206 = score(doc=2765,freq=2.0), product of:
0.18226467 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.05204841 = queryNorm
0.19345059 = fieldWeight in 2765, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0390625 = fieldNorm(doc=2765)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Date
- 22. 3.2009 19:14:43