Search (150 results, page 2 of 8)

  • × language_ss:"e"
  • × theme_ss:"Automatisches Klassifizieren"
  • × type_ss:"a"
  1. Major, R.L.; Ragsdale, C.T.: ¬An aggregation approach to the classification problem using multiple prediction experts (2000) 0.00
    0.0031514994 = product of:
      0.006302999 = sum of:
        0.006302999 = product of:
          0.012605998 = sum of:
            0.012605998 = weight(_text_:e in 3789) [ClassicSimilarity], result of:
              0.012605998 = score(doc=3789,freq=2.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.19057012 = fieldWeight in 3789, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.09375 = fieldNorm(doc=3789)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  2. Ardö, A.; Koch, T.: Automatic classification applied to full-text Internet documents in a robot-generated subject index (1999) 0.00
    0.0031514994 = product of:
      0.006302999 = sum of:
        0.006302999 = product of:
          0.012605998 = sum of:
            0.012605998 = weight(_text_:e in 382) [ClassicSimilarity], result of:
              0.012605998 = score(doc=382,freq=2.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.19057012 = fieldWeight in 382, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.09375 = fieldNorm(doc=382)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  3. Chan, L.M.; Lin, X.; Zeng, M.L.: Structural and multilingual approaches to subject access on the Web (2000) 0.00
    0.0031514994 = product of:
      0.006302999 = sum of:
        0.006302999 = product of:
          0.012605998 = sum of:
            0.012605998 = weight(_text_:e in 507) [ClassicSimilarity], result of:
              0.012605998 = score(doc=507,freq=2.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.19057012 = fieldWeight in 507, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.09375 = fieldNorm(doc=507)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  4. Shafer, K.E.: Automatic Subject Assignment via the Scorpion System (2001) 0.00
    0.0031514994 = product of:
      0.006302999 = sum of:
        0.006302999 = product of:
          0.012605998 = sum of:
            0.012605998 = weight(_text_:e in 1043) [ClassicSimilarity], result of:
              0.012605998 = score(doc=1043,freq=2.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.19057012 = fieldWeight in 1043, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.09375 = fieldNorm(doc=1043)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  5. Yu, W.; Gong, Y.: Document clustering by concept factorization (2004) 0.00
    0.0031514994 = product of:
      0.006302999 = sum of:
        0.006302999 = product of:
          0.012605998 = sum of:
            0.012605998 = weight(_text_:e in 4084) [ClassicSimilarity], result of:
              0.012605998 = score(doc=4084,freq=2.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.19057012 = fieldWeight in 4084, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.09375 = fieldNorm(doc=4084)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  6. Cortez, E.; Herrera, M.R.; Silva, A.S. da; Moura, E.S. de; Neubert, M.: Lightweight methods for large-scale product categorization (2011) 0.00
    0.0031514994 = product of:
      0.006302999 = sum of:
        0.006302999 = product of:
          0.012605998 = sum of:
            0.012605998 = weight(_text_:e in 4758) [ClassicSimilarity], result of:
              0.012605998 = score(doc=4758,freq=8.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.19057012 = fieldWeight in 4758, product of:
                  2.828427 = tf(freq=8.0), with freq of:
                    8.0 = termFreq=8.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.046875 = fieldNorm(doc=4758)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    In this article, we present a study about classification methods for large-scale categorization of product offers on e-shopping web sites. We present a study about the performance of previously proposed approaches and deployed a probabilistic approach to model the classification problem. We also studied an alternative way of modeling information about the description of product offers and investigated the usage of price and store of product offers as features adopted in the classification process. Our experiments used two collections of over a million product offers previously categorized by human editors and taxonomies of hundreds of categories from a real e-shopping web site. In these experiments, our method achieved an improvement of up to 9% in the quality of the categorization in comparison with the best baseline we have found.
    Language
    e
  7. Lindholm, J.; Schönthal, T.; Jansson , K.: Experiences of harvesting Web resources in engineering using automatic classification (2003) 0.00
    0.0029712624 = product of:
      0.005942525 = sum of:
        0.005942525 = product of:
          0.01188505 = sum of:
            0.01188505 = weight(_text_:e in 4088) [ClassicSimilarity], result of:
              0.01188505 = score(doc=4088,freq=4.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.17967124 = fieldWeight in 4088, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.0625 = fieldNorm(doc=4088)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Authors describe the background and the work involved in setting up Engine-e, a Web index that uses automatic classification as a mean for the selection of resources in Engineering. Considerations in offering a robot-generated Web index as a successor to a manually indexed quality-controlled subject gateway are also discussed
    Language
    e
  8. Teich, E.; Degaetano-Ortlieb, S.; Fankhauser, P.; Kermes, H.; Lapshinova-Koltunski, E.: ¬The linguistic construal of disciplinarity : a data-mining approach using register features (2016) 0.00
    0.002729279 = product of:
      0.005458558 = sum of:
        0.005458558 = product of:
          0.010917116 = sum of:
            0.010917116 = weight(_text_:e in 3015) [ClassicSimilarity], result of:
              0.010917116 = score(doc=3015,freq=6.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.16503859 = fieldWeight in 3015, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3015)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  9. McKiernan, G.: Automated categorisation of Web resources : a profile of selected projects, research, products, and services (1996) 0.00
    0.0026262498 = product of:
      0.0052524996 = sum of:
        0.0052524996 = product of:
          0.010504999 = sum of:
            0.010504999 = weight(_text_:e in 2533) [ClassicSimilarity], result of:
              0.010504999 = score(doc=2533,freq=2.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.15880844 = fieldWeight in 2533, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.078125 = fieldNorm(doc=2533)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  10. Vizine-Goetz, D.: NetLab / OCLC collaboration seeks to improve Web searching (1999) 0.00
    0.0026262498 = product of:
      0.0052524996 = sum of:
        0.0052524996 = product of:
          0.010504999 = sum of:
            0.010504999 = weight(_text_:e in 4180) [ClassicSimilarity], result of:
              0.010504999 = score(doc=4180,freq=2.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.15880844 = fieldWeight in 4180, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.078125 = fieldNorm(doc=4180)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  11. Möller, G.: Automatic classification of the World Wide Web using Universal Decimal Classification (1999) 0.00
    0.0026262498 = product of:
      0.0052524996 = sum of:
        0.0052524996 = product of:
          0.010504999 = sum of:
            0.010504999 = weight(_text_:e in 494) [ClassicSimilarity], result of:
              0.010504999 = score(doc=494,freq=2.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.15880844 = fieldWeight in 494, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.078125 = fieldNorm(doc=494)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  12. Shafer, K.E.: Evaluating Scorpion Results (2001) 0.00
    0.0026262498 = product of:
      0.0052524996 = sum of:
        0.0052524996 = product of:
          0.010504999 = sum of:
            0.010504999 = weight(_text_:e in 4085) [ClassicSimilarity], result of:
              0.010504999 = score(doc=4085,freq=2.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.15880844 = fieldWeight in 4085, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.078125 = fieldNorm(doc=4085)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  13. Shen, D.; Chen, Z.; Yang, Q.; Zeng, H.J.; Zhang, B.; Lu, Y.; Ma, W.Y.: Web page classification through summarization (2004) 0.00
    0.0026262498 = product of:
      0.0052524996 = sum of:
        0.0052524996 = product of:
          0.010504999 = sum of:
            0.010504999 = weight(_text_:e in 4132) [ClassicSimilarity], result of:
              0.010504999 = score(doc=4132,freq=2.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.15880844 = fieldWeight in 4132, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.078125 = fieldNorm(doc=4132)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  14. Sebastiani, F.: Classification of text, automatic (2006) 0.00
    0.0025998545 = product of:
      0.005199709 = sum of:
        0.005199709 = product of:
          0.010399418 = sum of:
            0.010399418 = weight(_text_:e in 5003) [ClassicSimilarity], result of:
              0.010399418 = score(doc=5003,freq=4.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.15721233 = fieldWeight in 5003, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=5003)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Automatic text classification (ATC) is a discipline at the crossroads of information retrieval (IR), machine learning (ML), and computational linguistics (CL), and consists in the realization of text classifiers, i.e. software systems capable of assigning texts to one or more categories, or classes, from a predefined set. Applications range from the automated indexing of scientific articles, to e-mail routing, spam filtering, authorship attribution, and automated survey coding. This article will focus on the ML approach to ATC, whereby a software system (called the learner) automatically builds a classifier for the categories of interest by generalizing from a "training" set of pre-classified texts.
    Language
    e
  15. Sun, A.; Lim, E.-P.; Ng, W.-K.: Performance measurement framework for hierarchical text classification (2003) 0.00
    0.0022284468 = product of:
      0.0044568935 = sum of:
        0.0044568935 = product of:
          0.008913787 = sum of:
            0.008913787 = weight(_text_:e in 1808) [ClassicSimilarity], result of:
              0.008913787 = score(doc=1808,freq=4.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.13475344 = fieldWeight in 1808, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1808)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  16. Frank, E.; Paynter, G.W.: Predicting Library of Congress Classifications from Library of Congress Subject Headings (2004) 0.00
    0.0022284468 = product of:
      0.0044568935 = sum of:
        0.0044568935 = product of:
          0.008913787 = sum of:
            0.008913787 = weight(_text_:e in 2218) [ClassicSimilarity], result of:
              0.008913787 = score(doc=2218,freq=4.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.13475344 = fieldWeight in 2218, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2218)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  17. Díaz, I.; Ranilla, J.; Montañes, E.; Fernández, J.; Combarro, E.F.: Improving performance of text categorization by combining filtering and support vector machines (2004) 0.00
    0.0022284468 = product of:
      0.0044568935 = sum of:
        0.0044568935 = product of:
          0.008913787 = sum of:
            0.008913787 = weight(_text_:e in 2234) [ClassicSimilarity], result of:
              0.008913787 = score(doc=2234,freq=4.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.13475344 = fieldWeight in 2234, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2234)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  18. Wu, K.J.; Chen, M.-C.; Sun, Y.: Automatic topics discovery from hyperlinked documents (2004) 0.00
    0.0022284468 = product of:
      0.0044568935 = sum of:
        0.0044568935 = product of:
          0.008913787 = sum of:
            0.008913787 = weight(_text_:e in 2563) [ClassicSimilarity], result of:
              0.008913787 = score(doc=2563,freq=4.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.13475344 = fieldWeight in 2563, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2563)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Topic discovery is an important means for marketing, e-Business and social science studies. As well, it can be applied to various purposes, such as identifying a group with certain properties and observing the emergence and diminishment of a certain cyber community. Previous topic discovery work (J.M. Kleinberg, Proceedings of the 9th Annual ACM-SIAM Symposium on Discrete Algorithms, San Francisco, California, p. 668) requires manual judgment of usefulness of outcomes and is thus incapable of handling the explosive growth of the Internet. In this paper, we propose the Automatic Topic Discovery (ATD) method, which combines a method of base set construction, a clustering algorithm and an iterative principal eigenvector computation method to discover the topics relevant to a given query without using manual examination. Given a query, ATD returns with topics associated with the query and top representative pages for each topic. Our experiments show that the ATD method performs better than the traditional eigenvector method in terms of computation time and topic discovery quality.
    Language
    e
  19. Aphinyanaphongs, Y.; Fu, L.D.; Li, Z.; Peskin, E.R.; Efstathiadis, E.; Aliferis, C.F.; Statnikov, A.: ¬A comprehensive empirical comparison of modern supervised classification and feature selection methods for text categorization (2014) 0.00
    0.0022284468 = product of:
      0.0044568935 = sum of:
        0.0044568935 = product of:
          0.008913787 = sum of:
            0.008913787 = weight(_text_:e in 1496) [ClassicSimilarity], result of:
              0.008913787 = score(doc=1496,freq=4.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.13475344 = fieldWeight in 1496, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1496)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e
  20. Barbu, E.: What kind of knowledge is in Wikipedia? : unsupervised extraction of properties for similar concepts (2014) 0.00
    0.0022284468 = product of:
      0.0044568935 = sum of:
        0.0044568935 = product of:
          0.008913787 = sum of:
            0.008913787 = weight(_text_:e in 1547) [ClassicSimilarity], result of:
              0.008913787 = score(doc=1547,freq=4.0), product of:
                0.06614887 = queryWeight, product of:
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.04602077 = queryNorm
                0.13475344 = fieldWeight in 1547, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.43737 = idf(docFreq=28552, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1547)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    e

Years