Search (33 results, page 1 of 2)

  • × theme_ss:"Automatisches Klassifizieren"
  1. Hotho, A.; Bloehdorn, S.: Data Mining 2004 : Text classification by boosting weak learners based on terms and concepts (2004) 0.07
    0.07161492 = sum of:
      0.053395435 = product of:
        0.21358174 = sum of:
          0.21358174 = weight(_text_:3a in 562) [ClassicSimilarity], result of:
            0.21358174 = score(doc=562,freq=2.0), product of:
              0.3800265 = queryWeight, product of:
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.044824958 = queryNorm
              0.56201804 = fieldWeight in 562, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.046875 = fieldNorm(doc=562)
        0.25 = coord(1/4)
      0.01821949 = product of:
        0.03643898 = sum of:
          0.03643898 = weight(_text_:22 in 562) [ClassicSimilarity], result of:
            0.03643898 = score(doc=562,freq=2.0), product of:
              0.15696937 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.044824958 = queryNorm
              0.23214069 = fieldWeight in 562, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.046875 = fieldNorm(doc=562)
        0.5 = coord(1/2)
    
    Content
    Vgl.: http://www.google.de/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&ved=0CEAQFjAA&url=http%3A%2F%2Fciteseerx.ist.psu.edu%2Fviewdoc%2Fdownload%3Fdoi%3D10.1.1.91.4940%26rep%3Drep1%26type%3Dpdf&ei=dOXrUMeIDYHDtQahsIGACg&usg=AFQjCNHFWVh6gNPvnOrOS9R3rkrXCNVD-A&sig2=5I2F5evRfMnsttSgFF9g7Q&bvm=bv.1357316858,d.Yms.
    Date
    8. 1.2013 10:22:32
  2. Khoo, C.S.G.; Ng, K.; Ou, S.: ¬An exploratory study of human clustering of Web pages (2003) 0.03
    0.034995675 = product of:
      0.06999135 = sum of:
        0.06999135 = sum of:
          0.0456987 = weight(_text_:2003 in 2741) [ClassicSimilarity], result of:
            0.0456987 = score(doc=2741,freq=3.0), product of:
              0.19453894 = queryWeight, product of:
                4.339969 = idf(docFreq=1566, maxDocs=44218)
                0.044824958 = queryNorm
              0.23490772 = fieldWeight in 2741, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.339969 = idf(docFreq=1566, maxDocs=44218)
                0.03125 = fieldNorm(doc=2741)
          0.024292652 = weight(_text_:22 in 2741) [ClassicSimilarity], result of:
            0.024292652 = score(doc=2741,freq=2.0), product of:
              0.15696937 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.044824958 = queryNorm
              0.15476047 = fieldWeight in 2741, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.03125 = fieldNorm(doc=2741)
      0.5 = coord(1/2)
    
    Date
    12. 9.2004 9:56:22
    Year
    2003
  3. Brückner, T.; Dambeck, H.: Sortierautomaten : Grundlagen der Textklassifizierung (2003) 0.03
    0.029498382 = product of:
      0.058996763 = sum of:
        0.058996763 = product of:
          0.117993526 = sum of:
            0.117993526 = weight(_text_:2003 in 2398) [ClassicSimilarity], result of:
              0.117993526 = score(doc=2398,freq=5.0), product of:
                0.19453894 = queryWeight, product of:
                  4.339969 = idf(docFreq=1566, maxDocs=44218)
                  0.044824958 = queryNorm
                0.6065291 = fieldWeight in 2398, product of:
                  2.236068 = tf(freq=5.0), with freq of:
                    5.0 = termFreq=5.0
                  4.339969 = idf(docFreq=1566, maxDocs=44218)
                  0.0625 = fieldNorm(doc=2398)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    c't. 2003, H.19, S.192-197
    Year
    2003
  4. Lindholm, J.; Schönthal, T.; Jansson , K.: Experiences of harvesting Web resources in engineering using automatic classification (2003) 0.03
    0.029498382 = product of:
      0.058996763 = sum of:
        0.058996763 = product of:
          0.117993526 = sum of:
            0.117993526 = weight(_text_:2003 in 4088) [ClassicSimilarity], result of:
              0.117993526 = score(doc=4088,freq=5.0), product of:
                0.19453894 = queryWeight, product of:
                  4.339969 = idf(docFreq=1566, maxDocs=44218)
                  0.044824958 = queryNorm
                0.6065291 = fieldWeight in 4088, product of:
                  2.236068 = tf(freq=5.0), with freq of:
                    5.0 = termFreq=5.0
                  4.339969 = idf(docFreq=1566, maxDocs=44218)
                  0.0625 = fieldNorm(doc=4088)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Ariadne magazine. 2003, no.37
    Year
    2003
  5. Miyamoto, S.: Information clustering based an fuzzy multisets (2003) 0.03
    0.025811084 = product of:
      0.051622167 = sum of:
        0.051622167 = product of:
          0.103244334 = sum of:
            0.103244334 = weight(_text_:2003 in 1071) [ClassicSimilarity], result of:
              0.103244334 = score(doc=1071,freq=5.0), product of:
                0.19453894 = queryWeight, product of:
                  4.339969 = idf(docFreq=1566, maxDocs=44218)
                  0.044824958 = queryNorm
                0.53071296 = fieldWeight in 1071, product of:
                  2.236068 = tf(freq=5.0), with freq of:
                    5.0 = termFreq=5.0
                  4.339969 = idf(docFreq=1566, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1071)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Information processing and management. 39(2003) no.2, S.195-213
    Year
    2003
  6. Drori, O.; Alon, N.: Using document classification for displaying search results (2003) 0.02
    0.022123788 = product of:
      0.044247575 = sum of:
        0.044247575 = product of:
          0.08849515 = sum of:
            0.08849515 = weight(_text_:2003 in 1565) [ClassicSimilarity], result of:
              0.08849515 = score(doc=1565,freq=5.0), product of:
                0.19453894 = queryWeight, product of:
                  4.339969 = idf(docFreq=1566, maxDocs=44218)
                  0.044824958 = queryNorm
                0.45489684 = fieldWeight in 1565, product of:
                  2.236068 = tf(freq=5.0), with freq of:
                    5.0 = termFreq=5.0
                  4.339969 = idf(docFreq=1566, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1565)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Journal of information science. 29(2003) no.2, S.97-106
    Year
    2003
  7. Chung, Y.-M.; Noh, Y.-H.: Developing a specialized directory system by automatically classifying Web documents (2003) 0.02
    0.022123788 = product of:
      0.044247575 = sum of:
        0.044247575 = product of:
          0.08849515 = sum of:
            0.08849515 = weight(_text_:2003 in 1566) [ClassicSimilarity], result of:
              0.08849515 = score(doc=1566,freq=5.0), product of:
                0.19453894 = queryWeight, product of:
                  4.339969 = idf(docFreq=1566, maxDocs=44218)
                  0.044824958 = queryNorm
                0.45489684 = fieldWeight in 1566, product of:
                  2.236068 = tf(freq=5.0), with freq of:
                    5.0 = termFreq=5.0
                  4.339969 = idf(docFreq=1566, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1566)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Journal of information science. 29(2003) no.2, S.117-126
    Year
    2003
  8. Mukhopadhyay, S.; Peng, S.; Raje, R.; Palakal, M.; Mostafa, J.: Multi-agent information classification using dynamic acquaintance lists (2003) 0.02
    0.022123788 = product of:
      0.044247575 = sum of:
        0.044247575 = product of:
          0.08849515 = sum of:
            0.08849515 = weight(_text_:2003 in 1755) [ClassicSimilarity], result of:
              0.08849515 = score(doc=1755,freq=5.0), product of:
                0.19453894 = queryWeight, product of:
                  4.339969 = idf(docFreq=1566, maxDocs=44218)
                  0.044824958 = queryNorm
                0.45489684 = fieldWeight in 1755, product of:
                  2.236068 = tf(freq=5.0), with freq of:
                    5.0 = termFreq=5.0
                  4.339969 = idf(docFreq=1566, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1755)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Journal of the American Society for Information Science and technology. 54(2003) no.10, S.966-975
    Year
    2003
  9. Sun, A.; Lim, E.-P.; Ng, W.-K.: Performance measurement framework for hierarchical text classification (2003) 0.02
    0.022123788 = product of:
      0.044247575 = sum of:
        0.044247575 = product of:
          0.08849515 = sum of:
            0.08849515 = weight(_text_:2003 in 1808) [ClassicSimilarity], result of:
              0.08849515 = score(doc=1808,freq=5.0), product of:
                0.19453894 = queryWeight, product of:
                  4.339969 = idf(docFreq=1566, maxDocs=44218)
                  0.044824958 = queryNorm
                0.45489684 = fieldWeight in 1808, product of:
                  2.236068 = tf(freq=5.0), with freq of:
                    5.0 = termFreq=5.0
                  4.339969 = idf(docFreq=1566, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1808)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Journal of the American Society for Information Science and technology. 54(2003) no.11, S.1014-1028
    Year
    2003
  10. Godby, C.J.; Stuler, J.: ¬The Library of Congress Classification as a knowledge base for automatic subject categorization : subject access issues (2003) 0.02
    0.01999318 = product of:
      0.03998636 = sum of:
        0.03998636 = product of:
          0.07997272 = sum of:
            0.07997272 = weight(_text_:2003 in 3962) [ClassicSimilarity], result of:
              0.07997272 = score(doc=3962,freq=3.0), product of:
                0.19453894 = queryWeight, product of:
                  4.339969 = idf(docFreq=1566, maxDocs=44218)
                  0.044824958 = queryNorm
                0.4110885 = fieldWeight in 3962, product of:
                  1.7320508 = tf(freq=3.0), with freq of:
                    3.0 = termFreq=3.0
                  4.339969 = idf(docFreq=1566, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=3962)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Year
    2003
  11. Zhang, X: Rough set theory based automatic text categorization (2005) 0.02
    0.018656416 = product of:
      0.03731283 = sum of:
        0.03731283 = product of:
          0.07462566 = sum of:
            0.07462566 = weight(_text_:2003 in 2822) [ClassicSimilarity], result of:
              0.07462566 = score(doc=2822,freq=2.0), product of:
                0.19453894 = queryWeight, product of:
                  4.339969 = idf(docFreq=1566, maxDocs=44218)
                  0.044824958 = queryNorm
                0.3836027 = fieldWeight in 2822, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.339969 = idf(docFreq=1566, maxDocs=44218)
                  0.0625 = fieldNorm(doc=2822)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Der Forschungsbericht "Rough Set Theory Based Automatic Text Categorization and the Handling of Semantic Heterogeneity" von Xueying Zhang ist in Buchform auf Englisch erschienen. Zhang hat in ihrer Arbeit ein Verfahren basierend auf der Rough Set Theory entwickelt, das Beziehungen zwischen Schlagwörtern verschiedener Vokabulare herstellt. Sie war von 2003 bis 2005 Mitarbeiterin des IZ und ist seit Oktober 2005 Associate Professor an der Nanjing University of Science and Technology.
  12. Giorgetti, D.; Sebastiani, F.: Automating survey coding by multiclass text categorization techniques (2003) 0.02
    0.01843649 = product of:
      0.03687298 = sum of:
        0.03687298 = product of:
          0.07374596 = sum of:
            0.07374596 = weight(_text_:2003 in 5172) [ClassicSimilarity], result of:
              0.07374596 = score(doc=5172,freq=5.0), product of:
                0.19453894 = queryWeight, product of:
                  4.339969 = idf(docFreq=1566, maxDocs=44218)
                  0.044824958 = queryNorm
                0.3790807 = fieldWeight in 5172, product of:
                  2.236068 = tf(freq=5.0), with freq of:
                    5.0 = termFreq=5.0
                  4.339969 = idf(docFreq=1566, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5172)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Journal of the American Society for Information Science and technology. 54(2003) no.14, S.1269-1277
    Year
    2003
  13. Kwon, O.W.; Lee, J.H.: Text categorization based on k-nearest neighbor approach for web site classification (2003) 0.02
    0.01843649 = product of:
      0.03687298 = sum of:
        0.03687298 = product of:
          0.07374596 = sum of:
            0.07374596 = weight(_text_:2003 in 1070) [ClassicSimilarity], result of:
              0.07374596 = score(doc=1070,freq=5.0), product of:
                0.19453894 = queryWeight, product of:
                  4.339969 = idf(docFreq=1566, maxDocs=44218)
                  0.044824958 = queryNorm
                0.3790807 = fieldWeight in 1070, product of:
                  2.236068 = tf(freq=5.0), with freq of:
                    5.0 = termFreq=5.0
                  4.339969 = idf(docFreq=1566, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1070)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Information processing and management. 39(2003) no.1, S.25-44
    Year
    2003
  14. Subramanian, S.; Shafer, K.E.: Clustering (2001) 0.02
    0.01821949 = product of:
      0.03643898 = sum of:
        0.03643898 = product of:
          0.07287796 = sum of:
            0.07287796 = weight(_text_:22 in 1046) [ClassicSimilarity], result of:
              0.07287796 = score(doc=1046,freq=2.0), product of:
                0.15696937 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044824958 = queryNorm
                0.46428138 = fieldWeight in 1046, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=1046)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    5. 5.2003 14:17:22
  15. Reiner, U.: Automatische DDC-Klassifizierung von bibliografischen Titeldatensätzen (2009) 0.02
    0.015182908 = product of:
      0.030365815 = sum of:
        0.030365815 = product of:
          0.06073163 = sum of:
            0.06073163 = weight(_text_:22 in 611) [ClassicSimilarity], result of:
              0.06073163 = score(doc=611,freq=2.0), product of:
                0.15696937 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044824958 = queryNorm
                0.38690117 = fieldWeight in 611, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=611)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 8.2009 12:54:24
  16. HaCohen-Kerner, Y. et al.: Classification using various machine learning methods and combinations of key-phrases and visual features (2016) 0.02
    0.015182908 = product of:
      0.030365815 = sum of:
        0.030365815 = product of:
          0.06073163 = sum of:
            0.06073163 = weight(_text_:22 in 2748) [ClassicSimilarity], result of:
              0.06073163 = score(doc=2748,freq=2.0), product of:
                0.15696937 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044824958 = queryNorm
                0.38690117 = fieldWeight in 2748, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=2748)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    1. 2.2016 18:25:22
  17. Adams, K.C.: Word wranglers : Automatic classification tools transform enterprise documents from "bags of words" into knowledge resources (2003) 0.01
    0.014280844 = product of:
      0.028561687 = sum of:
        0.028561687 = product of:
          0.057123374 = sum of:
            0.057123374 = weight(_text_:2003 in 1665) [ClassicSimilarity], result of:
              0.057123374 = score(doc=1665,freq=3.0), product of:
                0.19453894 = queryWeight, product of:
                  4.339969 = idf(docFreq=1566, maxDocs=44218)
                  0.044824958 = queryNorm
                0.29363465 = fieldWeight in 1665, product of:
                  1.7320508 = tf(freq=3.0), with freq of:
                    3.0 = termFreq=3.0
                  4.339969 = idf(docFreq=1566, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1665)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Year
    2003
  18. Zhou, G.D.; Zhang, M.; Ji, D.H.; Zhu, Q.M.: Hierarchical learning strategy in semantic relation extraction (2008) 0.01
    0.013992311 = product of:
      0.027984623 = sum of:
        0.027984623 = product of:
          0.055969246 = sum of:
            0.055969246 = weight(_text_:2003 in 2077) [ClassicSimilarity], result of:
              0.055969246 = score(doc=2077,freq=2.0), product of:
                0.19453894 = queryWeight, product of:
                  4.339969 = idf(docFreq=1566, maxDocs=44218)
                  0.044824958 = queryNorm
                0.28770202 = fieldWeight in 2077, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.339969 = idf(docFreq=1566, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2077)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    This paper proposes a novel hierarchical learning strategy to deal with the data sparseness problem in semantic relation extraction by modeling the commonality among related classes. For each class in the hierarchy either manually predefined or automatically clustered, a discriminative function is determined in a top-down way. As the upper-level class normally has much more positive training examples than the lower-level class, the corresponding discriminative function can be determined more reliably and guide the discriminative function learning in the lower-level one more effectively, which otherwise might suffer from limited training data. In this paper, two classifier learning approaches, i.e. the simple perceptron algorithm and the state-of-the-art Support Vector Machines, are applied using the hierarchical learning strategy. Moreover, several kinds of class hierarchies either manually predefined or automatically clustered are explored and compared. Evaluation on the ACE RDC 2003 and 2004 corpora shows that the hierarchical learning strategy much improves the performance on least- and medium-frequent relations.
  19. Reiner, U.: VZG-Projekt Colibri : Bewertung von automatisch DDC-klassifizierten Titeldatensätzen der Deutschen Nationalbibliothek (DNB) (2009) 0.01
    0.01166026 = product of:
      0.02332052 = sum of:
        0.02332052 = product of:
          0.04664104 = sum of:
            0.04664104 = weight(_text_:2003 in 2675) [ClassicSimilarity], result of:
              0.04664104 = score(doc=2675,freq=2.0), product of:
                0.19453894 = queryWeight, product of:
                  4.339969 = idf(docFreq=1566, maxDocs=44218)
                  0.044824958 = queryNorm
                0.2397517 = fieldWeight in 2675, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.339969 = idf(docFreq=1566, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2675)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Das VZG-Projekt Colibri/DDC beschäftigt sich seit 2003 mit automatischen Verfahren zur Dewey-Dezimalklassifikation (Dewey Decimal Classification, kurz DDC). Ziel des Projektes ist eine einheitliche DDC-Erschließung von bibliografischen Titeldatensätzen und eine Unterstützung der DDC-Expert(inn)en und DDC-Laien, z. B. bei der Analyse und Synthese von DDC-Notationen und deren Qualitätskontrolle und der DDC-basierten Suche. Der vorliegende Bericht konzentriert sich auf die erste größere automatische DDC-Klassifizierung und erste automatische und intellektuelle Bewertung mit der Klassifizierungskomponente vc_dcl1. Grundlage hierfür waren die von der Deutschen Nationabibliothek (DNB) im November 2007 zur Verfügung gestellten 25.653 Titeldatensätze (12 Wochen-/Monatslieferungen) der Deutschen Nationalbibliografie der Reihen A, B und H. Nach Erläuterung der automatischen DDC-Klassifizierung und automatischen Bewertung in Kapitel 2 wird in Kapitel 3 auf den DNB-Bericht "Colibri_Auswertung_DDC_Endbericht_Sommer_2008" eingegangen. Es werden Sachverhalte geklärt und Fragen gestellt, deren Antworten die Weichen für den Verlauf der weiteren Klassifizierungstests stellen werden. Über das Kapitel 3 hinaus führende weitergehende Betrachtungen und Gedanken zur Fortführung der automatischen DDC-Klassifizierung werden in Kapitel 4 angestellt. Der Bericht dient dem vertieften Verständnis für die automatischen Verfahren.
  20. Bock, H.-H.: Datenanalyse zur Strukturierung und Ordnung von Information (1989) 0.01
    0.010628035 = product of:
      0.02125607 = sum of:
        0.02125607 = product of:
          0.04251214 = sum of:
            0.04251214 = weight(_text_:22 in 141) [ClassicSimilarity], result of:
              0.04251214 = score(doc=141,freq=2.0), product of:
                0.15696937 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044824958 = queryNorm
                0.2708308 = fieldWeight in 141, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=141)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Pages
    S.1-22