Search (17 results, page 1 of 1)

  • Filter: language_ss:"e"
  • Filter: theme_ss:"Automatisches Klassifizieren"
  1. Hotho, A.; Bloehdorn, S.: Data Mining 2004 : Text classification by boosting weak learners based on terms and concepts (2004) 0.10
    0.10123343 = sum of:
      0.08060541 = product of:
        0.24181622 = sum of:
          0.24181622 = weight(_text_:3a in 562) [ClassicSimilarity], result of:
            0.24181622 = score(doc=562,freq=2.0), product of:
              0.43026417 = queryWeight, product of:
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.050750602 = queryNorm
              0.56201804 = fieldWeight in 562, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.046875 = fieldNorm(doc=562)
        0.33333334 = coord(1/3)
      0.020628018 = product of:
        0.041256037 = sum of:
          0.041256037 = weight(_text_:22 in 562) [ClassicSimilarity], result of:
            0.041256037 = score(doc=562,freq=2.0), product of:
              0.17771997 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.050750602 = queryNorm
              0.23214069 = fieldWeight in 562, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.046875 = fieldNorm(doc=562)
        0.5 = coord(1/2)
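    The breakdown above is Lucene ClassicSimilarity (TF-IDF) scoring: tf(freq) = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = idf * queryNorm, fieldWeight = tf * idf * fieldNorm, and each matching term contributes queryWeight * fieldWeight, scaled by a coord factor for the fraction of query clauses that matched. A minimal Python sketch reproduces the displayed total; the queryNorm value is taken from the output above, since it depends on the full query and cannot be derived from this entry alone:

        import math

        def idf(doc_freq: int, max_docs: int) -> float:
            # ClassicSimilarity: idf = 1 + ln(maxDocs / (docFreq + 1))
            return 1.0 + math.log(max_docs / (doc_freq + 1))

        def term_score(freq, doc_freq, max_docs, query_norm, field_norm):
            tf = math.sqrt(freq)                # tf(freq=2.0) = 1.4142135
            w = idf(doc_freq, max_docs)
            query_weight = w * query_norm       # queryWeight
            field_weight = tf * w * field_norm  # fieldWeight
            return query_weight * field_weight

        # Reproduce doc 562: term "3a" (docFreq=24) and term "22" (docFreq=3622).
        QUERY_NORM = 0.050750602
        s1 = term_score(2.0, 24, 44218, QUERY_NORM, 0.046875)    # ~0.24181622
        s2 = term_score(2.0, 3622, 44218, QUERY_NORM, 0.046875)  # ~0.041256037
        total = s1 * (1 / 3) + s2 * (1 / 2)                      # coord(1/3), coord(1/2)
        print(total)                                             # ~0.10123343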
    
    Content
Cf.: http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.91.4940&rep=rep1&type=pdf.
    Date
    8. 1.2013 10:22:32
  2. Liu, R.-L.: Context-based term frequency assessment for text classification (2010) 0.04
    0.0444055 = product of:
      0.088811 = sum of:
        0.088811 = product of:
          0.177622 = sum of:
            0.177622 = weight(_text_:assessment in 3331) [ClassicSimilarity], result of:
              0.177622 = score(doc=3331,freq=6.0), product of:
                0.2801951 = queryWeight, product of:
                  5.52102 = idf(docFreq=480, maxDocs=44218)
                  0.050750602 = queryNorm
                0.63392264 = fieldWeight in 3331, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  5.52102 = idf(docFreq=480, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3331)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
Automatic text classification (TC) is essential for the management of information. To classify a document d properly, it is essential to identify the semantics of each term t in d, and those semantics depend heavily on the context (neighboring terms) of t in d. We therefore present CTFA (Context-based Term Frequency Assessment), a technique that improves text classifiers by considering term contexts in test documents. The results of term context recognition are used to assess the frequencies of terms, so CTFA can work with various kinds of text classifiers that base their TC decisions on term frequencies, without requiring any modification of the classifiers. Moreover, CTFA is efficient and requires neither large amounts of memory nor domain-specific knowledge. Empirical results show that CTFA successfully enhances the performance of several kinds of text classifiers on different experimental data.
    Object
    Context-based Term Frequency Assessment
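    The CTFA idea from the abstract above can be read as a classifier-independent preprocessing step: score each term occurrence by how strongly its neighboring terms support the expected context, then feed the adjusted frequencies to any frequency-based classifier. A rough sketch of that idea follows; the window size, the overlap-based context score, and all names are illustrative assumptions, not the paper's actual procedure:

        from collections import Counter

        def context_weighted_tf(tokens, context_terms, window=3):
            """Weight each occurrence by the fraction of its neighbors
            that belong to a set of category-typical context terms.
            Illustrative simplification, not the published CTFA algorithm."""
            weights = Counter()
            for i, t in enumerate(tokens):
                left = tokens[max(0, i - window):i]
                right = tokens[i + 1:i + 1 + window]
                neighbors = left + right
                support = sum(1 for n in neighbors if n in context_terms)
                weights[t] += 1.0 + support / max(1, len(neighbors))
            return weights

        # The adjusted counts drop into any classifier that consumes term
        # frequencies (e.g. multinomial Naive Bayes) without modifying it.
        doc = "bank interest rate loan river bank".split()
        print(context_weighted_tf(doc, {"interest", "rate", "loan"}))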
  3. Barthel, S.; Tönnies, S.; Balke, W.-T.: Large-scale experiments for mathematical document classification (2013) 0.02
    0.021364605 = product of:
      0.04272921 = sum of:
        0.04272921 = product of:
          0.08545842 = sum of:
            0.08545842 = weight(_text_:assessment in 1056) [ClassicSimilarity], result of:
              0.08545842 = score(doc=1056,freq=2.0), product of:
                0.2801951 = queryWeight, product of:
                  5.52102 = idf(docFreq=480, maxDocs=44218)
                  0.050750602 = queryNorm
                0.30499613 = fieldWeight in 1056, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.52102 = idf(docFreq=480, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1056)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
The ever-increasing amount of digitally available information is a curse and a blessing at the same time. On the one hand, users have increasingly large amounts of information at their fingertips. On the other hand, the assessment and refinement of web search results becomes more and more tiresome and difficult for non-experts in a domain. Established digital libraries therefore offer specialized collections with a certain degree of quality. This quality can largely be attributed to the great effort invested in the semantic enrichment of the provided documents, e.g. by annotating them with respect to a domain-specific taxonomy. In many domains this is still done manually, e.g. with CAS in chemistry, MeSH in medicine, or MSC in mathematics. But due to the growing amount of data, this manual task becomes ever more time-consuming and expensive. The only solution to this problem seems to be the use of automated classification algorithms, but the evaluations done in previous research make it difficult to draw conclusions for real-world scenarios. We therefore conducted a large-scale feasibility study on a real-world data set from one of the largest mathematical digital libraries, Zentralblatt MATH, with special focus on its practical applicability.
  4. Golub, K.; Soergel, D.; Buchanan, G.; Tudhope, D.; Lykke, M.; Hiom, D.: ¬A framework for evaluating automatic indexing or classification in the context of retrieval (2016) 0.02
    0.021364605 = product of:
      0.04272921 = sum of:
        0.04272921 = product of:
          0.08545842 = sum of:
            0.08545842 = weight(_text_:assessment in 3311) [ClassicSimilarity], result of:
              0.08545842 = score(doc=3311,freq=2.0), product of:
                0.2801951 = queryWeight, product of:
                  5.52102 = idf(docFreq=480, maxDocs=44218)
                  0.050750602 = queryNorm
                0.30499613 = fieldWeight in 3311, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.52102 = idf(docFreq=480, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3311)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Tools for automatic subject assignment help deal with scale and sustainability in creating and enriching metadata, establishing more connections across and between resources and enhancing consistency. Although some software vendors and experimental researchers claim the tools can replace manual subject indexing, hard scientific evidence of their performance in operating information environments is scarce. A major reason for this is that research is usually conducted in laboratory conditions, excluding the complexities of real-life systems and situations. The article reviews and discusses issues with existing evaluation approaches such as problems of aboutness and relevance assessments, implying the need to use more than a single "gold standard" method when evaluating indexing and retrieval, and proposes a comprehensive evaluation framework. The framework is informed by a systematic review of the literature on evaluation approaches: evaluating indexing quality directly through assessment by an evaluator or through comparison with a gold standard, evaluating the quality of computer-assisted indexing directly in the context of an indexing workflow, and evaluating indexing quality indirectly through analyzing retrieval performance.
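    The simplest of the evaluation approaches reviewed above, comparing automatically assigned subject terms against a single gold standard, reduces to per-document set-overlap metrics; the framework's point is that this alone is not enough. A minimal sketch of that baseline, with illustrative names:

        def indexing_quality(assigned: set, gold: set) -> dict:
            """Precision, recall and F1 of automatically assigned subject
            terms against one gold-standard term set for the same document."""
            tp = len(assigned & gold)
            precision = tp / len(assigned) if assigned else 0.0
            recall = tp / len(gold) if gold else 0.0
            f1 = (2 * precision * recall / (precision + recall)
                  if precision + recall else 0.0)
            return {"precision": precision, "recall": recall, "f1": f1}

        print(indexing_quality({"Automatic classification", "Text mining"},
                               {"Automatic classification", "Machine learning"}))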
  5. Subramanian, S.; Shafer, K.E.: Clustering (2001) 0.02
    0.020628018 = product of:
      0.041256037 = sum of:
        0.041256037 = product of:
          0.08251207 = sum of:
            0.08251207 = weight(_text_:22 in 1046) [ClassicSimilarity], result of:
              0.08251207 = score(doc=1046,freq=2.0), product of:
                0.17771997 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050750602 = queryNorm
                0.46428138 = fieldWeight in 1046, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=1046)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    5. 5.2003 14:17:22
  6. HaCohen-Kerner, Y. et al.: Classification using various machine learning methods and combinations of key-phrases and visual features (2016) 0.02
    0.017190015 = product of:
      0.03438003 = sum of:
        0.03438003 = product of:
          0.06876006 = sum of:
            0.06876006 = weight(_text_:22 in 2748) [ClassicSimilarity], result of:
              0.06876006 = score(doc=2748,freq=2.0), product of:
                0.17771997 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050750602 = queryNorm
                0.38690117 = fieldWeight in 2748, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=2748)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    1. 2.2016 18:25:22
  7. Dubin, D.: Dimensions and discriminability (1998) 0.01
    0.012033011 = product of:
      0.024066022 = sum of:
        0.024066022 = product of:
          0.048132043 = sum of:
            0.048132043 = weight(_text_:22 in 2338) [ClassicSimilarity], result of:
              0.048132043 = score(doc=2338,freq=2.0), product of:
                0.17771997 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050750602 = queryNorm
                0.2708308 = fieldWeight in 2338, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2338)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 9.1997 19:16:05
  8. Automatic classification research at OCLC (2002) 0.01
    0.012033011 = product of:
      0.024066022 = sum of:
        0.024066022 = product of:
          0.048132043 = sum of:
            0.048132043 = weight(_text_:22 in 1563) [ClassicSimilarity], result of:
              0.048132043 = score(doc=1563,freq=2.0), product of:
                0.17771997 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050750602 = queryNorm
                0.2708308 = fieldWeight in 1563, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1563)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    5. 5.2003 9:22:09
  9. Jenkins, C.: Automatic classification of Web resources using Java and Dewey Decimal Classification (1998) 0.01
    0.012033011 = product of:
      0.024066022 = sum of:
        0.024066022 = product of:
          0.048132043 = sum of:
            0.048132043 = weight(_text_:22 in 1673) [ClassicSimilarity], result of:
              0.048132043 = score(doc=1673,freq=2.0), product of:
                0.17771997 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050750602 = queryNorm
                0.2708308 = fieldWeight in 1673, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1673)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    1. 8.1996 22:08:06
  10. Yoon, Y.; Lee, C.; Lee, G.G.: ¬An effective procedure for constructing a hierarchical text classification system (2006) 0.01
    0.012033011 = product of:
      0.024066022 = sum of:
        0.024066022 = product of:
          0.048132043 = sum of:
            0.048132043 = weight(_text_:22 in 5273) [ClassicSimilarity], result of:
              0.048132043 = score(doc=5273,freq=2.0), product of:
                0.17771997 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050750602 = queryNorm
                0.2708308 = fieldWeight in 5273, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=5273)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 7.2006 16:24:52
  11. Yi, K.: Automatic text classification using library classification schemes : trends, issues and challenges (2007) 0.01
    0.012033011 = product of:
      0.024066022 = sum of:
        0.024066022 = product of:
          0.048132043 = sum of:
            0.048132043 = weight(_text_:22 in 2560) [ClassicSimilarity], result of:
              0.048132043 = score(doc=2560,freq=2.0), product of:
                0.17771997 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050750602 = queryNorm
                0.2708308 = fieldWeight in 2560, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2560)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 9.2008 18:31:54
  12. Liu, R.-L.: Context recognition for hierarchical text classification (2009) 0.01
    0.010314009 = product of:
      0.020628018 = sum of:
        0.020628018 = product of:
          0.041256037 = sum of:
            0.041256037 = weight(_text_:22 in 2760) [ClassicSimilarity], result of:
              0.041256037 = score(doc=2760,freq=2.0), product of:
                0.17771997 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050750602 = queryNorm
                0.23214069 = fieldWeight in 2760, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2760)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 3.2009 19:11:54
  13. Zhu, W.Z.; Allen, R.B.: Document clustering using the LSI subspace signature model (2013) 0.01
    0.010314009 = product of:
      0.020628018 = sum of:
        0.020628018 = product of:
          0.041256037 = sum of:
            0.041256037 = weight(_text_:22 in 690) [ClassicSimilarity], result of:
              0.041256037 = score(doc=690,freq=2.0), product of:
                0.17771997 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050750602 = queryNorm
                0.23214069 = fieldWeight in 690, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=690)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    23. 3.2013 13:22:36
  14. Egbert, J.; Biber, D.; Davies, M.: Developing a bottom-up, user-based method of web register classification (2015) 0.01
    0.010314009 = product of:
      0.020628018 = sum of:
        0.020628018 = product of:
          0.041256037 = sum of:
            0.041256037 = weight(_text_:22 in 2158) [ClassicSimilarity], result of:
              0.041256037 = score(doc=2158,freq=2.0), product of:
                0.17771997 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050750602 = queryNorm
                0.23214069 = fieldWeight in 2158, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2158)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    4. 8.2015 19:22:04
  15. Mengle, S.; Goharian, N.: Passage detection using text classification (2009) 0.01
    0.0085950075 = product of:
      0.017190015 = sum of:
        0.017190015 = product of:
          0.03438003 = sum of:
            0.03438003 = weight(_text_:22 in 2765) [ClassicSimilarity], result of:
              0.03438003 = score(doc=2765,freq=2.0), product of:
                0.17771997 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050750602 = queryNorm
                0.19345059 = fieldWeight in 2765, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2765)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 3.2009 19:14:43
  16. Liu, R.-L.: ¬A passage extractor for classification of disease aspect information (2013) 0.01
    0.0085950075 = product of:
      0.017190015 = sum of:
        0.017190015 = product of:
          0.03438003 = sum of:
            0.03438003 = weight(_text_:22 in 1107) [ClassicSimilarity], result of:
              0.03438003 = score(doc=1107,freq=2.0), product of:
                0.17771997 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050750602 = queryNorm
                0.19345059 = fieldWeight in 1107, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1107)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    28.10.2013 19:22:57
  17. Khoo, C.S.G.; Ng, K.; Ou, S.: ¬An exploratory study of human clustering of Web pages (2003) 0.01
    0.0068760063 = product of:
      0.0137520125 = sum of:
        0.0137520125 = product of:
          0.027504025 = sum of:
            0.027504025 = weight(_text_:22 in 2741) [ClassicSimilarity], result of:
              0.027504025 = score(doc=2741,freq=2.0), product of:
                0.17771997 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050750602 = queryNorm
                0.15476047 = fieldWeight in 2741, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03125 = fieldNorm(doc=2741)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    12. 9.2004 9:56:22