Search (83 results, page 1 of 5)

  • × language_ss:"e"
  • × theme_ss:"Automatisches Klassifizieren"
  • × year_i:[2000 TO 2010}
  1. Hotho, A.; Bloehdorn, S.: Data Mining 2004 : Text classification by boosting weak learners based on terms and concepts (2004) 0.05
    0.0489438 = product of:
      0.081572995 = sum of:
        0.060778037 = product of:
          0.18233411 = sum of:
            0.18233411 = weight(_text_:3a in 562) [ClassicSimilarity], result of:
              0.18233411 = score(doc=562,freq=2.0), product of:
                0.32442752 = queryWeight, product of:
                  8.478011 = idf(docFreq=24, maxDocs=44218)
                  0.03826694 = queryNorm
                0.56201804 = fieldWeight in 562, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  8.478011 = idf(docFreq=24, maxDocs=44218)
                  0.046875 = fieldNorm(doc=562)
          0.33333334 = coord(1/3)
        0.0052410355 = weight(_text_:e in 562) [ClassicSimilarity], result of:
          0.0052410355 = score(doc=562,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.09528506 = fieldWeight in 562, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.046875 = fieldNorm(doc=562)
        0.015553925 = product of:
          0.03110785 = sum of:
            0.03110785 = weight(_text_:22 in 562) [ClassicSimilarity], result of:
              0.03110785 = score(doc=562,freq=2.0), product of:
                0.1340043 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03826694 = queryNorm
                0.23214069 = fieldWeight in 562, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=562)
          0.5 = coord(1/2)
      0.6 = coord(3/5)
    
    Content
    Vgl.: http://www.google.de/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&ved=0CEAQFjAA&url=http%3A%2F%2Fciteseerx.ist.psu.edu%2Fviewdoc%2Fdownload%3Fdoi%3D10.1.1.91.4940%26rep%3Drep1%26type%3Dpdf&ei=dOXrUMeIDYHDtQahsIGACg&usg=AFQjCNHFWVh6gNPvnOrOS9R3rkrXCNVD-A&sig2=5I2F5evRfMnsttSgFF9g7Q&bvm=bv.1357316858,d.Yms.
    Date
    8. 1.2013 10:22:32
    Language
    e
  2. Subramanian, S.; Shafer, K.E.: Clustering (2001) 0.02
    0.01663597 = product of:
      0.041589923 = sum of:
        0.010482071 = weight(_text_:e in 1046) [ClassicSimilarity], result of:
          0.010482071 = score(doc=1046,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.19057012 = fieldWeight in 1046, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.09375 = fieldNorm(doc=1046)
        0.03110785 = product of:
          0.0622157 = sum of:
            0.0622157 = weight(_text_:22 in 1046) [ClassicSimilarity], result of:
              0.0622157 = score(doc=1046,freq=2.0), product of:
                0.1340043 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03826694 = queryNorm
                0.46428138 = fieldWeight in 1046, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=1046)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Date
    5. 5.2003 14:17:22
    Language
    e
  3. Ibekwe-SanJuan, F.; SanJuan, E.: From term variants to research topics (2002) 0.01
    0.010378103 = product of:
      0.025945257 = sum of:
        0.00617662 = weight(_text_:e in 1853) [ClassicSimilarity], result of:
          0.00617662 = score(doc=1853,freq=4.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.112294525 = fieldWeight in 1853, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1853)
        0.019768637 = product of:
          0.05930591 = sum of:
            0.05930591 = weight(_text_:evolution in 1853) [ClassicSimilarity], result of:
              0.05930591 = score(doc=1853,freq=2.0), product of:
                0.2026858 = queryWeight, product of:
                  5.29663 = idf(docFreq=601, maxDocs=44218)
                  0.03826694 = queryNorm
                0.2926002 = fieldWeight in 1853, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.29663 = idf(docFreq=601, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1853)
          0.33333334 = coord(1/3)
      0.4 = coord(2/5)
    
    Abstract
    In a scientific and technological watch (STW) task, an expert user needs to survey the evolution of research topics in his area of specialisation in order to detect interesting changes. The majority of methods proposing evaluation metrics (bibliometrics and scientometrics studies) for STW rely solely an statistical data analysis methods (Co-citation analysis, co-word analysis). Such methods usually work an structured databases where the units of analysis (words, keywords) are already attributed to documents by human indexers. The advent of huge amounts of unstructured textual data has rendered necessary the integration of natural language processing (NLP) techniques to first extract meaningful units from texts. We propose a method for STW which is NLP-oriented. The method not only analyses texts linguistically in order to extract terms from them, but also uses linguistic relations (syntactic variations) as the basis for clustering. Terms and variation relations are formalised as weighted di-graphs which the clustering algorithm, CPCL (Classification by Preferential Clustered Link) will seek to reduce in order to produces classes. These classes ideally represent the research topics present in the corpus. The results of the classification are subjected to validation by an expert in STW.
    Language
    e
  4. Automatic classification research at OCLC (2002) 0.01
    0.009704315 = product of:
      0.024260787 = sum of:
        0.006114541 = weight(_text_:e in 1563) [ClassicSimilarity], result of:
          0.006114541 = score(doc=1563,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.1111659 = fieldWeight in 1563, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1563)
        0.018146247 = product of:
          0.036292493 = sum of:
            0.036292493 = weight(_text_:22 in 1563) [ClassicSimilarity], result of:
              0.036292493 = score(doc=1563,freq=2.0), product of:
                0.1340043 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03826694 = queryNorm
                0.2708308 = fieldWeight in 1563, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1563)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Date
    5. 5.2003 9:22:09
    Language
    e
  5. Yoon, Y.; Lee, C.; Lee, G.G.: ¬An effective procedure for constructing a hierarchical text classification system (2006) 0.01
    0.009704315 = product of:
      0.024260787 = sum of:
        0.006114541 = weight(_text_:e in 5273) [ClassicSimilarity], result of:
          0.006114541 = score(doc=5273,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.1111659 = fieldWeight in 5273, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5273)
        0.018146247 = product of:
          0.036292493 = sum of:
            0.036292493 = weight(_text_:22 in 5273) [ClassicSimilarity], result of:
              0.036292493 = score(doc=5273,freq=2.0), product of:
                0.1340043 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03826694 = queryNorm
                0.2708308 = fieldWeight in 5273, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=5273)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Date
    22. 7.2006 16:24:52
    Language
    e
  6. Yi, K.: Automatic text classification using library classification schemes : trends, issues and challenges (2007) 0.01
    0.009704315 = product of:
      0.024260787 = sum of:
        0.006114541 = weight(_text_:e in 2560) [ClassicSimilarity], result of:
          0.006114541 = score(doc=2560,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.1111659 = fieldWeight in 2560, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2560)
        0.018146247 = product of:
          0.036292493 = sum of:
            0.036292493 = weight(_text_:22 in 2560) [ClassicSimilarity], result of:
              0.036292493 = score(doc=2560,freq=2.0), product of:
                0.1340043 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03826694 = queryNorm
                0.2708308 = fieldWeight in 2560, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2560)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Date
    22. 9.2008 18:31:54
    Language
    e
  7. Liu, R.-L.: Context recognition for hierarchical text classification (2009) 0.01
    0.008317985 = product of:
      0.020794962 = sum of:
        0.0052410355 = weight(_text_:e in 2760) [ClassicSimilarity], result of:
          0.0052410355 = score(doc=2760,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.09528506 = fieldWeight in 2760, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.046875 = fieldNorm(doc=2760)
        0.015553925 = product of:
          0.03110785 = sum of:
            0.03110785 = weight(_text_:22 in 2760) [ClassicSimilarity], result of:
              0.03110785 = score(doc=2760,freq=2.0), product of:
                0.1340043 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03826694 = queryNorm
                0.23214069 = fieldWeight in 2760, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2760)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Date
    22. 3.2009 19:11:54
    Language
    e
  8. Mengle, S.; Goharian, N.: Passage detection using text classification (2009) 0.01
    0.0069316537 = product of:
      0.017329134 = sum of:
        0.00436753 = weight(_text_:e in 2765) [ClassicSimilarity], result of:
          0.00436753 = score(doc=2765,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.07940422 = fieldWeight in 2765, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2765)
        0.012961605 = product of:
          0.02592321 = sum of:
            0.02592321 = weight(_text_:22 in 2765) [ClassicSimilarity], result of:
              0.02592321 = score(doc=2765,freq=2.0), product of:
                0.1340043 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03826694 = queryNorm
                0.19345059 = fieldWeight in 2765, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2765)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Date
    22. 3.2009 19:14:43
    Language
    e
  9. Khoo, C.S.G.; Ng, K.; Ou, S.: ¬An exploratory study of human clustering of Web pages (2003) 0.01
    0.005545323 = product of:
      0.0138633065 = sum of:
        0.0034940236 = weight(_text_:e in 2741) [ClassicSimilarity], result of:
          0.0034940236 = score(doc=2741,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.063523374 = fieldWeight in 2741, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03125 = fieldNorm(doc=2741)
        0.010369283 = product of:
          0.020738566 = sum of:
            0.020738566 = weight(_text_:22 in 2741) [ClassicSimilarity], result of:
              0.020738566 = score(doc=2741,freq=2.0), product of:
                0.1340043 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03826694 = queryNorm
                0.15476047 = fieldWeight in 2741, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03125 = fieldNorm(doc=2741)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Date
    12. 9.2004 9:56:22
    Language
    e
  10. Wu, M.; Fuller, M.; Wilkinson, R.: Using clustering and classification approaches in interactive retrieval (2001) 0.00
    0.0024458165 = product of:
      0.012229082 = sum of:
        0.012229082 = weight(_text_:e in 2666) [ClassicSimilarity], result of:
          0.012229082 = score(doc=2666,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.2223318 = fieldWeight in 2666, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.109375 = fieldNorm(doc=2666)
      0.2 = coord(1/5)
    
    Language
    e
  11. Fong, A.C.M.: Mining a Web citation database for document clustering (2002) 0.00
    0.0024458165 = product of:
      0.012229082 = sum of:
        0.012229082 = weight(_text_:e in 3940) [ClassicSimilarity], result of:
          0.012229082 = score(doc=3940,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.2223318 = fieldWeight in 3940, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.109375 = fieldNorm(doc=3940)
      0.2 = coord(1/5)
    
    Language
    e
  12. Major, R.L.; Ragsdale, C.T.: ¬An aggregation approach to the classification problem using multiple prediction experts (2000) 0.00
    0.0020964143 = product of:
      0.010482071 = sum of:
        0.010482071 = weight(_text_:e in 3789) [ClassicSimilarity], result of:
          0.010482071 = score(doc=3789,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.19057012 = fieldWeight in 3789, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.09375 = fieldNorm(doc=3789)
      0.2 = coord(1/5)
    
    Language
    e
  13. Chan, L.M.; Lin, X.; Zeng, M.L.: Structural and multilingual approaches to subject access on the Web (2000) 0.00
    0.0020964143 = product of:
      0.010482071 = sum of:
        0.010482071 = weight(_text_:e in 507) [ClassicSimilarity], result of:
          0.010482071 = score(doc=507,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.19057012 = fieldWeight in 507, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.09375 = fieldNorm(doc=507)
      0.2 = coord(1/5)
    
    Language
    e
  14. Shafer, K.E.: Automatic Subject Assignment via the Scorpion System (2001) 0.00
    0.0020964143 = product of:
      0.010482071 = sum of:
        0.010482071 = weight(_text_:e in 1043) [ClassicSimilarity], result of:
          0.010482071 = score(doc=1043,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.19057012 = fieldWeight in 1043, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.09375 = fieldNorm(doc=1043)
      0.2 = coord(1/5)
    
    Language
    e
  15. Yu, W.; Gong, Y.: Document clustering by concept factorization (2004) 0.00
    0.0020964143 = product of:
      0.010482071 = sum of:
        0.010482071 = weight(_text_:e in 4084) [ClassicSimilarity], result of:
          0.010482071 = score(doc=4084,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.19057012 = fieldWeight in 4084, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.09375 = fieldNorm(doc=4084)
      0.2 = coord(1/5)
    
    Language
    e
  16. Reiner, U.: Automatic analysis of DDC notations (2007) 0.00
    0.0020964143 = product of:
      0.010482071 = sum of:
        0.010482071 = weight(_text_:e in 118) [ClassicSimilarity], result of:
          0.010482071 = score(doc=118,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.19057012 = fieldWeight in 118, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.09375 = fieldNorm(doc=118)
      0.2 = coord(1/5)
    
    Language
    e
  17. Lindholm, J.; Schönthal, T.; Jansson , K.: Experiences of harvesting Web resources in engineering using automatic classification (2003) 0.00
    0.0019765184 = product of:
      0.009882592 = sum of:
        0.009882592 = weight(_text_:e in 4088) [ClassicSimilarity], result of:
          0.009882592 = score(doc=4088,freq=4.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.17967124 = fieldWeight in 4088, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0625 = fieldNorm(doc=4088)
      0.2 = coord(1/5)
    
    Abstract
    Authors describe the background and the work involved in setting up Engine-e, a Web index that uses automatic classification as a mean for the selection of resources in Engineering. Considerations in offering a robot-generated Web index as a successor to a manually indexed quality-controlled subject gateway are also discussed
    Language
    e
  18. Shafer, K.E.: Evaluating Scorpion Results (2001) 0.00
    0.001747012 = product of:
      0.00873506 = sum of:
        0.00873506 = weight(_text_:e in 4085) [ClassicSimilarity], result of:
          0.00873506 = score(doc=4085,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.15880844 = fieldWeight in 4085, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.078125 = fieldNorm(doc=4085)
      0.2 = coord(1/5)
    
    Language
    e
  19. Shen, D.; Chen, Z.; Yang, Q.; Zeng, H.J.; Zhang, B.; Lu, Y.; Ma, W.Y.: Web page classification through summarization (2004) 0.00
    0.001747012 = product of:
      0.00873506 = sum of:
        0.00873506 = weight(_text_:e in 4132) [ClassicSimilarity], result of:
          0.00873506 = score(doc=4132,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.15880844 = fieldWeight in 4132, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.078125 = fieldNorm(doc=4132)
      0.2 = coord(1/5)
    
    Language
    e
  20. Sebastiani, F.: Classification of text, automatic (2006) 0.00
    0.0017294536 = product of:
      0.008647268 = sum of:
        0.008647268 = weight(_text_:e in 5003) [ClassicSimilarity], result of:
          0.008647268 = score(doc=5003,freq=4.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.15721233 = fieldWeight in 5003, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5003)
      0.2 = coord(1/5)
    
    Abstract
    Automatic text classification (ATC) is a discipline at the crossroads of information retrieval (IR), machine learning (ML), and computational linguistics (CL), and consists in the realization of text classifiers, i.e. software systems capable of assigning texts to one or more categories, or classes, from a predefined set. Applications range from the automated indexing of scientific articles, to e-mail routing, spam filtering, authorship attribution, and automated survey coding. This article will focus on the ML approach to ATC, whereby a software system (called the learner) automatically builds a classifier for the categories of interest by generalizing from a "training" set of pre-classified texts.
    Language
    e

Types

  • a 75
  • el 8
  • m 2
  • s 1
  • More… Less…