Search (34 results, page 1 of 2)

Hotho, A.; Bloehdorn, S.: Data Mining 2004 : Text classification by boosting weak learners based on terms and concepts (2004) 0.08

0.08062537 = sum of:
  0.06011354 = product of:
    0.24045417 = sum of:
      0.24045417 = weight(_text_:3a in 562) [ClassicSimilarity], result of:
        0.24045417 = score(doc=562,freq=2.0), product of:
          0.42784065 = queryWeight, product of:
            8.478011 = idf(docFreq=24, maxDocs=44218)
            0.050464742 = queryNorm
          0.56201804 = fieldWeight in 562, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            8.478011 = idf(docFreq=24, maxDocs=44218)
            0.046875 = fieldNorm(doc=562)
    0.25 = coord(1/4)
  0.020511828 = product of:
    0.041023657 = sum of:
      0.041023657 = weight(_text_:22 in 562) [ClassicSimilarity], result of:
        0.041023657 = score(doc=562,freq=2.0), product of:
          0.17671894 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.050464742 = queryNorm
          0.23214069 = fieldWeight in 562, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.046875 = fieldNorm(doc=562)
    0.5 = coord(1/2)

Content: Vgl.: http://www.google.de/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&ved=0CEAQFjAA&url=http%3A%2F%2Fciteseerx.ist.psu.edu%2Fviewdoc%2Fdownload%3Fdoi%3D10.1.1.91.4940%26rep%3Drep1%26type%3Dpdf&ei=dOXrUMeIDYHDtQahsIGACg&usg=AFQjCNHFWVh6gNPvnOrOS9R3rkrXCNVD-A&sig2=5I2F5evRfMnsttSgFF9g7Q&bvm=bv.1357316858,d.Yms.
Date: 8. 1.2013 10:22:32

Liu, R.-L.: ¬A passage extractor for classification of disease aspect information (2013) 0.05
```
0.045136496 = product of:
  0.09027299 = sum of:
    0.09027299 = sum of:
      0.056086615 = weight(_text_:i in 1107) [ClassicSimilarity], result of:
        0.056086615 = score(doc=1107,freq=4.0), product of:
          0.19033937 = queryWeight, product of:
            3.7717297 = idf(docFreq=2765, maxDocs=44218)
            0.050464742 = queryNorm
          0.29466638 = fieldWeight in 1107, product of:
            2.0 = tf(freq=4.0), with freq of:
              4.0 = termFreq=4.0
            3.7717297 = idf(docFreq=2765, maxDocs=44218)
            0.0390625 = fieldNorm(doc=1107)
      0.03418638 = weight(_text_:22 in 1107) [ClassicSimilarity], result of:
        0.03418638 = score(doc=1107,freq=2.0), product of:
          0.17671894 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.050464742 = queryNorm
          0.19345059 = fieldWeight in 1107, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0390625 = fieldNorm(doc=1107)
  0.5 = coord(1/2)
```
Abstract

Retrieval of disease information is often based on several key aspects such as etiology, diagnosis, treatment, prevention, and symptoms of diseases. Automatic identification of disease aspect information is thus essential. In this article, I model the aspect identification problem as a text classification (TC) problem in which a disease aspect corresponds to a category. The disease aspect classification problem poses two challenges to classifiers: (a) a medical text often contains information about multiple aspects of a disease and hence produces noise for the classifiers and (b) text classifiers often cannot extract the textual parts (i.e., passages) about the categories of interest. I thus develop a technique, PETC (Passage Extractor for Text Classification), that extracts passages (from medical texts) for the underlying text classifiers to classify. Case studies on thousands of Chinese and English medical texts show that PETC enhances a support vector machine (SVM) classifier in classifying disease aspect information. PETC also performs better than three state-of-the-art classifier enhancement techniques, including two passage extraction techniques for text classifiers and a technique that employs term proximity information to enhance text classifiers. The contribution is of significance to evidence-based medicine, health education, and healthcare decision support. PETC can be used in those application domains in which a text to be classified may have several parts about different categories.

Date

28.10.2013 19:22:57

Shafer, K.E.: Automatic Subject Assignment via the Scorpion System (2001) 0.02

0.023795536 = product of:
  0.04759107 = sum of:
    0.04759107 = product of:
      0.09518214 = sum of:
        0.09518214 = weight(_text_:i in 1043) [ClassicSimilarity], result of:
          0.09518214 = score(doc=1043,freq=2.0), product of:
            0.19033937 = queryWeight, product of:
              3.7717297 = idf(docFreq=2765, maxDocs=44218)
              0.050464742 = queryNorm
            0.50006545 = fieldWeight in 1043, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.7717297 = idf(docFreq=2765, maxDocs=44218)
              0.09375 = fieldNorm(doc=1043)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Footnote: Teil eines Themenheftes: OCLC and the Internet: An Historical Overview of Research Activities, 1990-1999 - Part I

Subramanian, S.; Shafer, K.E.: Clustering (2001) 0.02

0.020511828 = product of:
  0.041023657 = sum of:
    0.041023657 = product of:
      0.08204731 = sum of:
        0.08204731 = weight(_text_:22 in 1046) [ClassicSimilarity], result of:
          0.08204731 = score(doc=1046,freq=2.0), product of:
            0.17671894 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050464742 = queryNorm
            0.46428138 = fieldWeight in 1046, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.09375 = fieldNorm(doc=1046)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 5. 5.2003 14:17:22

Reiner, U.: Automatische DDC-Klassifizierung von bibliografischen Titeldatensätzen (2009) 0.02

0.01709319 = product of:
  0.03418638 = sum of:
    0.03418638 = product of:
      0.06837276 = sum of:
        0.06837276 = weight(_text_:22 in 611) [ClassicSimilarity], result of:
          0.06837276 = score(doc=611,freq=2.0), product of:
            0.17671894 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050464742 = queryNorm
            0.38690117 = fieldWeight in 611, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=611)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 22. 8.2009 12:54:24

HaCohen-Kerner, Y. et al.: Classification using various machine learning methods and combinations of key-phrases and visual features (2016) 0.02

0.01709319 = product of:
  0.03418638 = sum of:
    0.03418638 = product of:
      0.06837276 = sum of:
        0.06837276 = weight(_text_:22 in 2748) [ClassicSimilarity], result of:
          0.06837276 = score(doc=2748,freq=2.0), product of:
            0.17671894 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050464742 = queryNorm
            0.38690117 = fieldWeight in 2748, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=2748)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 1. 2.2016 18:25:22

Fangmeyer, H.; Gloden, R.: Bewertung und Vergleich von Klassifikationsergebnissen bei automatischen Verfahren (1978) 0.02

0.015863689 = product of:
  0.031727377 = sum of:
    0.031727377 = product of:
      0.063454755 = sum of:
        0.063454755 = weight(_text_:i in 81) [ClassicSimilarity], result of:
          0.063454755 = score(doc=81,freq=2.0), product of:
            0.19033937 = queryWeight, product of:
              3.7717297 = idf(docFreq=2765, maxDocs=44218)
              0.050464742 = queryNorm
            0.33337694 = fieldWeight in 81, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.7717297 = idf(docFreq=2765, maxDocs=44218)
              0.0625 = fieldNorm(doc=81)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Kooperation in der Klassifikation I. Proc. der Sekt.1-3 der 2. Fachtagung der Gesellschaft für Klassifikation, Frankfurt-Hoechst, 6.-7.4.1978. Bearb.: W. Dahlberg

Bollmann, P.; Konrad, E.; Schneider, H.-J.; Zuse, H.: Anwendung automatischer Klassifikationsverfahren mit dem System FAKYR (1978) 0.02

0.015863689 = product of:
  0.031727377 = sum of:
    0.031727377 = product of:
      0.063454755 = sum of:
        0.063454755 = weight(_text_:i in 82) [ClassicSimilarity], result of:
          0.063454755 = score(doc=82,freq=2.0), product of:
            0.19033937 = queryWeight, product of:
              3.7717297 = idf(docFreq=2765, maxDocs=44218)
              0.050464742 = queryNorm
            0.33337694 = fieldWeight in 82, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.7717297 = idf(docFreq=2765, maxDocs=44218)
              0.0625 = fieldNorm(doc=82)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Kooperation in der Klassifikation I. Proc. der Sekt.1-3 der 2. Fachtagung der Gesellschaft für Klassifikation, Frankfurt-Hoechst, 6.-7.4.1978. Bearb.: W. Dahlberg

Schulze, U.: Erfahrungen bei der Anwendung automatischer Klassifizierungsverfahren zur Inhaltsanalyse einer Dokumentenmenge (1978) 0.02

0.015863689 = product of:
  0.031727377 = sum of:
    0.031727377 = product of:
      0.063454755 = sum of:
        0.063454755 = weight(_text_:i in 83) [ClassicSimilarity], result of:
          0.063454755 = score(doc=83,freq=2.0), product of:
            0.19033937 = queryWeight, product of:
              3.7717297 = idf(docFreq=2765, maxDocs=44218)
              0.050464742 = queryNorm
            0.33337694 = fieldWeight in 83, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.7717297 = idf(docFreq=2765, maxDocs=44218)
              0.0625 = fieldNorm(doc=83)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Kooperation in der Klassifikation I. Proc. der Sekt.1-3 der 2. Fachtagung der Gesellschaft für Klassifikation, Frankfurt-Hoechst, 6.-7.4.1978. Bearb.: W. Dahlberg

Cheng, P.T.K.; Wu, A.K.W.: ACS: an automatic classification system (1995) 0.02
```
0.015863689 = product of:
  0.031727377 = sum of:
    0.031727377 = product of:
      0.063454755 = sum of:
        0.063454755 = weight(_text_:i in 2188) [ClassicSimilarity], result of:
          0.063454755 = score(doc=2188,freq=2.0), product of:
            0.19033937 = queryWeight, product of:
              3.7717297 = idf(docFreq=2765, maxDocs=44218)
              0.050464742 = queryNorm
            0.33337694 = fieldWeight in 2188, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.7717297 = idf(docFreq=2765, maxDocs=44218)
              0.0625 = fieldNorm(doc=2188)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

In this paper, we introduce ACS, an automatic classification system for school libraries. First, various approaches towards automatic classification, namely (i) rule-based, (ii) browse and search, and (iii) partial match, are critically reviewed. The central issues of scheme selection, text analysis and similarity measures are discussed. A novel approach towards detecting book-class similarity with Modified Overlap Coefficient (MOC) is also proposed. Finally, the design and implementation of ACS is presented. The test result of over 80% correctness in automatic classification and a cost reduction of 75% compared to manual classification suggest that ACS is highly adoptable

Panyr, J.: Automatische Indexierung und Klassifikation (1983) 0.02

0.015863689 = product of:
  0.031727377 = sum of:
    0.031727377 = product of:
      0.063454755 = sum of:
        0.063454755 = weight(_text_:i in 7692) [ClassicSimilarity], result of:
          0.063454755 = score(doc=7692,freq=2.0), product of:
            0.19033937 = queryWeight, product of:
              3.7717297 = idf(docFreq=2765, maxDocs=44218)
              0.050464742 = queryNorm
            0.33337694 = fieldWeight in 7692, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.7717297 = idf(docFreq=2765, maxDocs=44218)
              0.0625 = fieldNorm(doc=7692)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Automatisierung in der Klassifikation. Proc. 7. Jahrestagung der Gesellschaft für Klassifikation (Teil 1), Königswinter, 5.-8.4.1983. Hrsg.: I. Dahlberg u.a

Ingwersen, P.; Wormell, I.: Ranganathan in the perspective of advanced information retrieval (1992) 0.02

0.015863689 = product of:
  0.031727377 = sum of:
    0.031727377 = product of:
      0.063454755 = sum of:
        0.063454755 = weight(_text_:i in 7695) [ClassicSimilarity], result of:
          0.063454755 = score(doc=7695,freq=2.0), product of:
            0.19033937 = queryWeight, product of:
              3.7717297 = idf(docFreq=2765, maxDocs=44218)
              0.050464742 = queryNorm
            0.33337694 = fieldWeight in 7695, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.7717297 = idf(docFreq=2765, maxDocs=44218)
              0.0625 = fieldNorm(doc=7695)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Fuhr, N.: Klassifikationsverfahren bei der automatischen Indexierung (1983) 0.02

0.015863689 = product of:
  0.031727377 = sum of:
    0.031727377 = product of:
      0.063454755 = sum of:
        0.063454755 = weight(_text_:i in 7697) [ClassicSimilarity], result of:
          0.063454755 = score(doc=7697,freq=2.0), product of:
            0.19033937 = queryWeight, product of:
              3.7717297 = idf(docFreq=2765, maxDocs=44218)
              0.050464742 = queryNorm
            0.33337694 = fieldWeight in 7697, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.7717297 = idf(docFreq=2765, maxDocs=44218)
              0.0625 = fieldNorm(doc=7697)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Automatisierung in der Klassifikation. Proc. 7. Jahrestagung der Gesellschaft für Klassifikation (Teil 1), Königswinter, 5.-8.4.1983. Hrsg.: I. Dahlberg u.a

Krauth, J.: Evaluation von Verfahren der automatischen Klassifikation (1983) 0.02

0.015863689 = product of:
  0.031727377 = sum of:
    0.031727377 = product of:
      0.063454755 = sum of:
        0.063454755 = weight(_text_:i in 111) [ClassicSimilarity], result of:
          0.063454755 = score(doc=111,freq=2.0), product of:
            0.19033937 = queryWeight, product of:
              3.7717297 = idf(docFreq=2765, maxDocs=44218)
              0.050464742 = queryNorm
            0.33337694 = fieldWeight in 111, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.7717297 = idf(docFreq=2765, maxDocs=44218)
              0.0625 = fieldNorm(doc=111)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Automatisierung in der Klassifikation. Proc. 7. Jahrestagung der Gesellschaft für Klassifikation (Teil 1), Königswinter, 5.-8.4.1983. Hrsg.: I. Dahlberg u.a

Golub, K.: Automated subject classification of textual web documents (2006) 0.02
```
0.015737344 = product of:
  0.031474687 = sum of:
    0.031474687 = product of:
      0.12589875 = sum of:
        0.12589875 = weight(_text_:author's in 5600) [ClassicSimilarity], result of:
          0.12589875 = score(doc=5600,freq=2.0), product of:
            0.3391308 = queryWeight, product of:
              6.7201533 = idf(docFreq=144, maxDocs=44218)
              0.050464742 = queryNorm
            0.3712395 = fieldWeight in 5600, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              6.7201533 = idf(docFreq=144, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5600)
      0.25 = coord(1/4)
  0.5 = coord(1/2)
```
Abstract

Purpose - To provide an integrated perspective to similarities and differences between approaches to automated classification in different research communities (machine learning, information retrieval and library science), and point to problems with the approaches and automated classification as such. Design/methodology/approach - A range of works dealing with automated classification of full-text web documents are discussed. Explorations of individual approaches are given in the following sections: special features (description, differences, evaluation), application and characteristics of web pages. Findings - Provides major similarities and differences between the three approaches: document pre-processing and utilization of web-specific document characteristics is common to all the approaches; major differences are in applied algorithms, employment or not of the vector space model and of controlled vocabularies. Problems of automated classification are recognized. Research limitations/implications - The paper does not attempt to provide an exhaustive bibliography of related resources. Practical implications - As an integrated overview of approaches from different research communities with application examples, it is very useful for students in library and information science and computer science, as well as for practitioners. Researchers from one community have the information on how similar tasks are conducted in different communities. Originality/value - To the author's knowledge, no review paper on automated text classification attempted to discuss more than one community's approach from an integrated perspective.

Bock, H.-H.: Datenanalyse zur Strukturierung und Ordnung von Information (1989) 0.01

0.011965233 = product of:
  0.023930466 = sum of:
    0.023930466 = product of:
      0.04786093 = sum of:
        0.04786093 = weight(_text_:22 in 141) [ClassicSimilarity], result of:
          0.04786093 = score(doc=141,freq=2.0), product of:
            0.17671894 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050464742 = queryNorm
            0.2708308 = fieldWeight in 141, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=141)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Pages: S.1-22

Dubin, D.: Dimensions and discriminability (1998) 0.01

0.011965233 = product of:
  0.023930466 = sum of:
    0.023930466 = product of:
      0.04786093 = sum of:
        0.04786093 = weight(_text_:22 in 2338) [ClassicSimilarity], result of:
          0.04786093 = score(doc=2338,freq=2.0), product of:
            0.17671894 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050464742 = queryNorm
            0.2708308 = fieldWeight in 2338, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2338)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 22. 9.1997 19:16:05

Automatic classification research at OCLC (2002) 0.01

0.011965233 = product of:
  0.023930466 = sum of:
    0.023930466 = product of:
      0.04786093 = sum of:
        0.04786093 = weight(_text_:22 in 1563) [ClassicSimilarity], result of:
          0.04786093 = score(doc=1563,freq=2.0), product of:
            0.17671894 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050464742 = queryNorm
            0.2708308 = fieldWeight in 1563, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1563)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 5. 5.2003 9:22:09

Jenkins, C.: Automatic classification of Web resources using Java and Dewey Decimal Classification (1998) 0.01

0.011965233 = product of:
  0.023930466 = sum of:
    0.023930466 = product of:
      0.04786093 = sum of:
        0.04786093 = weight(_text_:22 in 1673) [ClassicSimilarity], result of:
          0.04786093 = score(doc=1673,freq=2.0), product of:
            0.17671894 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050464742 = queryNorm
            0.2708308 = fieldWeight in 1673, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1673)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 1. 8.1996 22:08:06

Yoon, Y.; Lee, C.; Lee, G.G.: ¬An effective procedure for constructing a hierarchical text classification system (2006) 0.01

0.011965233 = product of:
  0.023930466 = sum of:
    0.023930466 = product of:
      0.04786093 = sum of:
        0.04786093 = weight(_text_:22 in 5273) [ClassicSimilarity], result of:
          0.04786093 = score(doc=5273,freq=2.0), product of:
            0.17671894 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050464742 = queryNorm
            0.2708308 = fieldWeight in 5273, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5273)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 22. 7.2006 16:24:52

Search (34 results, page 1 of 2)

Authors

Years

Languages

Types

Themes