-
Hotho, A.; Bloehdorn, S.: Data Mining 2004 : Text classification by boosting weak learners based on terms and concepts (2004)
0.08
0.081290185 = product of:
  0.21677382 = sum of:
    0.050934732 = product of:
      0.1528042 = sum of:
        0.1528042 = weight(_text_:3a in 562) [ClassicSimilarity], result of:
          0.1528042 = score(doc=562,freq=2.0), product of:
            0.27188486 = queryWeight, product of:
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.032069415 = queryNorm
            0.56201804 = fieldWeight in 562, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.046875 = fieldNorm(doc=562)
      0.33333334 = coord(1/3)
    0.1528042 = weight(_text_:2f in 562) [ClassicSimilarity], result of:
      0.1528042 = score(doc=562,freq=2.0), product of:
        0.27188486 = queryWeight, product of:
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.032069415 = queryNorm
        0.56201804 = fieldWeight in 562, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.046875 = fieldNorm(doc=562)
    0.013034889 = product of:
      0.026069777 = sum of:
        0.026069777 = weight(_text_:22 in 562) [ClassicSimilarity], result of:
          0.026069777 = score(doc=562,freq=2.0), product of:
            0.112301625 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.032069415 = queryNorm
            0.23214069 = fieldWeight in 562, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=562)
      0.5 = coord(1/2)
  0.375 = coord(3/8)
- Content
- Cf.: http://www.google.de/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&ved=0CEAQFjAA&url=http%3A%2F%2Fciteseerx.ist.psu.edu%2Fviewdoc%2Fdownload%3Fdoi%3D10.1.1.91.4940%26rep%3Drep1%26type%3Dpdf&ei=dOXrUMeIDYHDtQahsIGACg&usg=AFQjCNHFWVh6gNPvnOrOS9R3rkrXCNVD-A&sig2=5I2F5evRfMnsttSgFF9g7Q&bvm=bv.1357316858,d.Yms.
- Date
- 8. 1.2013 10:22:32
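The score breakdown for this entry sums three clause scores, two of them scaled by an inner coordination factor, and then scales the sum by an outer coord(3/8). A minimal sketch of that arithmetic follows, using only the values printed in the breakdown. The helper name `scale_by_coord` is hypothetical and is not a Lucene function.

```python
def scale_by_coord(score_sum, matched, total):
    """Scale a summed clause score by coord(matched/total)."""
    return score_sum * (matched / total)

# Term scores copied from the breakdown for doc 562.
score_3a = 0.1528042
score_2f = 0.1528042
score_22 = 0.026069777

# The "3a" and "22" clauses each carry their own inner coord factor.
clause_3a = scale_by_coord(score_3a, 1, 3)  # coord(1/3)
clause_22 = scale_by_coord(score_22, 1, 2)  # coord(1/2)

# The "2f" clause enters the sum unscaled.
clause_sum = clause_3a + score_2f + clause_22

# Outer coord(3/8): 3 of 8 top-level query clauses matched.
total = scale_by_coord(clause_sum, 3, 8)

print(clause_sum, total)
```

The printed values should agree with the 0.21677382 and 0.081290185 shown in the breakdown up to floating-point rounding, since the listing itself appears to use single-precision arithmetic.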
-
Ko, Y.: A new term-weighting scheme for text classification using the odds of positive and negative class probabilities (2015)
0.00
0.0048631052 = product of:
  0.038904842 = sum of:
    0.038904842 = weight(_text_:retrieval in 2339) [ClassicSimilarity], result of:
      0.038904842 = score(doc=2339,freq=8.0), product of:
        0.09700725 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.032069415 = queryNorm
        0.40105087 = fieldWeight in 2339, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.046875 = fieldNorm(doc=2339)
  0.125 = coord(1/8)
- Abstract
- Text classification (TC) is a core technique for text mining and information retrieval. It has been applied to many applications in many different research and industrial areas. Term-weighting schemes assign an appropriate weight to each term to obtain a high TC performance. Although term weighting is one of the important modules for TC and TC has different peculiarities from those in information retrieval, many term-weighting schemes used in information retrieval, such as term frequency-inverse document frequency (tf-idf), have been used in TC in the same manner. The peculiarity of TC that differs most from information retrieval is the existence of class information. This article proposes a new term-weighting scheme that uses class information using positive and negative class distributions. As a result, the proposed scheme, log tf-TRR, consistently performs better than do other schemes using class information as well as traditional schemes such as tf-idf.
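The score breakdown for this entry is itself a tf-idf computation of the kind the abstract mentions. As a rough sketch, the single-term score can be recomputed from the printed inputs, assuming Lucene's ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The function names are illustrative, and queryNorm and fieldNorm are taken from the listing as given rather than derived.

```python
import math

def classic_idf(doc_freq, max_docs):
    """Inverse document frequency as ClassicSimilarity defines it."""
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))

def classic_tf(freq):
    """Term frequency factor: square root of the raw count."""
    return math.sqrt(freq)

def term_score(freq, doc_freq, max_docs, query_norm, field_norm):
    """score = queryWeight * fieldWeight, as in the breakdown."""
    idf = classic_idf(doc_freq, max_docs)
    query_weight = idf * query_norm                      # idf * queryNorm
    field_weight = classic_tf(freq) * idf * field_norm   # tf * idf * fieldNorm
    return query_weight * field_weight

# Inputs copied from the breakdown for the term "retrieval" in doc 2339.
score = term_score(freq=8.0, doc_freq=5836, max_docs=44218,
                   query_norm=0.032069415, field_norm=0.046875)

# Only 1 of 8 query clauses matched, hence coord(1/8).
final = score * (1 / 8)

print(score, final)
```

Up to floating-point rounding, the two values should match the 0.038904842 and 0.0048631052 in the listing. Note that idf enters the product twice, once through queryWeight and once through fieldWeight.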
-
Ruiz, M.E.; Srinivasan, P.: Combining machine learning and hierarchical indexing structures for text categorization (2001)
0.00
0.0019181764 = product of:
  0.015345411 = sum of:
    0.015345411 = product of:
      0.030690823 = sum of:
        0.030690823 = weight(_text_:29 in 1595) [ClassicSimilarity], result of:
          0.030690823 = score(doc=1595,freq=2.0), product of:
            0.11281017 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.032069415 = queryNorm
            0.27205724 = fieldWeight in 1595, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1595)
      0.5 = coord(1/2)
  0.125 = coord(1/8)
- Date
- 11. 5.2003 18:29:44
-
Ibekwe-SanJuan, F.; SanJuan, E.: From term variants to research topics (2002)
0.00
0.0013701261 = product of:
  0.010961009 = sum of:
    0.010961009 = product of:
      0.021922018 = sum of:
        0.021922018 = weight(_text_:29 in 1853) [ClassicSimilarity], result of:
          0.021922018 = score(doc=1853,freq=2.0), product of:
            0.11281017 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.032069415 = queryNorm
            0.19432661 = fieldWeight in 1853, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1853)
      0.5 = coord(1/2)
  0.125 = coord(1/8)
- Source
- Knowledge organization. 29(2002) nos. 3/4, pp. 181-197