Search (38 results, page 2 of 2)

  • × language_ss:"e"
  • × theme_ss:"Automatisches Klassifizieren"
  1. Dolin, R.; Agrawal, D.; El Abbadi, A.; Pearlman, J.: Using automated classification for summarizing and selecting heterogeneous information sources (1998) 0.00
    0.003886916 = product of:
      0.007773832 = sum of:
        0.007773832 = product of:
          0.015547664 = sum of:
            0.015547664 = weight(_text_:d in 316) [ClassicSimilarity], result of:
              0.015547664 = score(doc=316,freq=4.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.178113 = fieldWeight in 316, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.046875 = fieldNorm(doc=316)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    D-Lib magazine. 4(1998) no.1
  2. Hagedorn, K.; Chapman, S.; Newman, D.: Enhancing search and browse using automated clustering of subject metadata (2007) 0.00
    0.003886916 = product of:
      0.007773832 = sum of:
        0.007773832 = product of:
          0.015547664 = sum of:
            0.015547664 = weight(_text_:d in 1168) [ClassicSimilarity], result of:
              0.015547664 = score(doc=1168,freq=4.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.178113 = fieldWeight in 1168, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1168)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    D-Lib magazine. 13(2007) nos.7/8, x S
  3. Savic, D.: Designing an expert system for classifying office documents (1994) 0.00
    0.0036646193 = product of:
      0.0073292386 = sum of:
        0.0073292386 = product of:
          0.014658477 = sum of:
            0.014658477 = weight(_text_:d in 2655) [ClassicSimilarity], result of:
              0.014658477 = score(doc=2655,freq=2.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.16792654 = fieldWeight in 2655, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.0625 = fieldNorm(doc=2655)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  4. Golub, K.; Hansson, J.; Soergel, D.; Tudhope, D.: Managing classification in libraries : a methodological outline for evaluating automatic subject indexing and classification in Swedish library catalogues (2015) 0.00
    0.0032390966 = product of:
      0.006478193 = sum of:
        0.006478193 = product of:
          0.012956386 = sum of:
            0.012956386 = weight(_text_:d in 2300) [ClassicSimilarity], result of:
              0.012956386 = score(doc=2300,freq=4.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.1484275 = fieldWeight in 2300, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2300)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  5. Savic, D.: Automatic classification of office documents : review of available methods and techniques (1995) 0.00
    0.0032065418 = product of:
      0.0064130835 = sum of:
        0.0064130835 = product of:
          0.012826167 = sum of:
            0.012826167 = weight(_text_:d in 2219) [ClassicSimilarity], result of:
              0.012826167 = score(doc=2219,freq=2.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.14693572 = fieldWeight in 2219, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2219)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  6. Koch, T.; Vizine-Goetz, D.: Automatic classification and content navigation support for Web services : DESIRE II cooperates with OCLC (1998) 0.00
    0.0032065418 = product of:
      0.0064130835 = sum of:
        0.0064130835 = product of:
          0.012826167 = sum of:
            0.012826167 = weight(_text_:d in 1568) [ClassicSimilarity], result of:
              0.012826167 = score(doc=1568,freq=2.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.14693572 = fieldWeight in 1568, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1568)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  7. Koch, T.; Vizine-Goetz, D.: DDC and knowledge organization in the digital library : Research and development. Demonstration pages (1999) 0.00
    0.0027484642 = product of:
      0.0054969285 = sum of:
        0.0054969285 = product of:
          0.010993857 = sum of:
            0.010993857 = weight(_text_:d in 942) [ClassicSimilarity], result of:
              0.010993857 = score(doc=942,freq=2.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.1259449 = fieldWeight in 942, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.046875 = fieldNorm(doc=942)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  8. Prabowo, R.; Jackson, M.; Burden, P.; Knoell, H.-D.: Ontology-based automatic classification for the Web pages : design, implementation and evaluation (2002) 0.00
    0.0027484642 = product of:
      0.0054969285 = sum of:
        0.0054969285 = product of:
          0.010993857 = sum of:
            0.010993857 = weight(_text_:d in 3383) [ClassicSimilarity], result of:
              0.010993857 = score(doc=3383,freq=2.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.1259449 = fieldWeight in 3383, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3383)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  9. Reiner, U.: DDC-based search in the data of the German National Bibliography (2008) 0.00
    0.0027484642 = product of:
      0.0054969285 = sum of:
        0.0054969285 = product of:
          0.010993857 = sum of:
            0.010993857 = weight(_text_:d in 2166) [ClassicSimilarity], result of:
              0.010993857 = score(doc=2166,freq=2.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.1259449 = fieldWeight in 2166, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2166)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Location
    D
  10. Giorgetti, D.; Sebastiani, F.: Automating survey coding by multiclass text categorization techniques (2003) 0.00
    0.002290387 = product of:
      0.004580774 = sum of:
        0.004580774 = product of:
          0.009161548 = sum of:
            0.009161548 = weight(_text_:d in 5172) [ClassicSimilarity], result of:
              0.009161548 = score(doc=5172,freq=2.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.104954086 = fieldWeight in 5172, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5172)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  11. Cathey, R.J.; Jensen, E.C.; Beitzel, S.M.; Frieder, O.; Grossman, D.: Exploiting parallelism to support scalable hierarchical clustering (2007) 0.00
    0.002290387 = product of:
      0.004580774 = sum of:
        0.004580774 = product of:
          0.009161548 = sum of:
            0.009161548 = weight(_text_:d in 448) [ClassicSimilarity], result of:
              0.009161548 = score(doc=448,freq=2.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.104954086 = fieldWeight in 448, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=448)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  12. Rooney, N.; Patterson, D.; Galushka, M.; Dobrynin, V.; Smirnova, E.: An investigation into the stability of contextual document clustering (2008) 0.00
    0.002290387 = product of:
      0.004580774 = sum of:
        0.004580774 = product of:
          0.009161548 = sum of:
            0.009161548 = weight(_text_:d in 1356) [ClassicSimilarity], result of:
              0.009161548 = score(doc=1356,freq=2.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.104954086 = fieldWeight in 1356, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1356)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  13. HaCohen-Kerner, Y.; Beck, H.; Yehudai, E.; Rosenstein, M.; Mughaz, D.: Cuisine : classification using stylistic feature sets and/or name-based feature sets (2010) 0.00
    0.002290387 = product of:
      0.004580774 = sum of:
        0.004580774 = product of:
          0.009161548 = sum of:
            0.009161548 = weight(_text_:d in 3706) [ClassicSimilarity], result of:
              0.009161548 = score(doc=3706,freq=2.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.104954086 = fieldWeight in 3706, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3706)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  14. Qu, B.; Cong, G.; Li, C.; Sun, A.; Chen, H.: An evaluation of classification models for question topic categorization (2012) 0.00
    0.002290387 = product of:
      0.004580774 = sum of:
        0.004580774 = product of:
          0.009161548 = sum of:
            0.009161548 = weight(_text_:d in 237) [ClassicSimilarity], result of:
              0.009161548 = score(doc=237,freq=2.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.104954086 = fieldWeight in 237, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=237)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    We study the problem of question topic classification using a very large real-world Community Question Answering (CQA) dataset from Yahoo! Answers. The dataset comprises 3.9 million questions, organized into more than 1,000 categories in a hierarchy. To the best of our knowledge, this is the first systematic evaluation of the performance of different classification methods on question topic classification as well as short texts. Specifically, we empirically evaluate the following in classifying questions into CQA categories: (a) the usefulness of n-gram features and bag-of-words features; (b) the performance of three standard classification algorithms (naive Bayes, maximum entropy, and support vector machines); (c) the performance of state-of-the-art hierarchical classification algorithms; (d) the effect of training data size on performance; and (e) the effectiveness of the different components of CQA data, including subject, content, asker, and the best answer. The experimental results show which aspects are important for question topic classification in terms of both effectiveness and efficiency. We believe that the experimental findings from this study will be useful in real-world classification problems.
  15. Alberts, I.; Forest, D.: Email pragmatics and automatic classification : a study in the organizational context (2012) 0.00
    0.002290387 = product of:
      0.004580774 = sum of:
        0.004580774 = product of:
          0.009161548 = sum of:
            0.009161548 = weight(_text_:d in 238) [ClassicSimilarity], result of:
              0.009161548 = score(doc=238,freq=2.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.104954086 = fieldWeight in 238, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=238)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  16. Vilares, D.; Alonso, M.A.; Gómez-Rodríguez, C.: On the usefulness of lexical and syntactic processing in polarity classification of Twitter messages (2015) 0.00
    0.002290387 = product of:
      0.004580774 = sum of:
        0.004580774 = product of:
          0.009161548 = sum of:
            0.009161548 = weight(_text_:d in 2161) [ClassicSimilarity], result of:
              0.009161548 = score(doc=2161,freq=2.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.104954086 = fieldWeight in 2161, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2161)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  17. Han, K.; Rezapour, R.; Nakamura, K.; Devkota, D.; Miller, D.C.; Diesner, J.: An expert-in-the-loop method for domain-specific document categorization based on small training data (2023) 0.00
    0.002290387 = product of:
      0.004580774 = sum of:
        0.004580774 = product of:
          0.009161548 = sum of:
            0.009161548 = weight(_text_:d in 967) [ClassicSimilarity], result of:
              0.009161548 = score(doc=967,freq=2.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.104954086 = fieldWeight in 967, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=967)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  18. Dolin, R.; Agrawal, D.; El Abbadi, A.; Pearlman, J.: Using automated classification for summarizing and selecting heterogeneous information sources (1998) 0.00
    0.001943458 = product of:
      0.003886916 = sum of:
        0.003886916 = product of:
          0.007773832 = sum of:
            0.007773832 = weight(_text_:d in 1253) [ClassicSimilarity], result of:
              0.007773832 = score(doc=1253,freq=4.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.0890565 = fieldWeight in 1253, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.0234375 = fieldNorm(doc=1253)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    D-Lib magazine. 4(1998) no.1, xx S
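
Note on the score breakdowns: each hit above carries a [ClassicSimilarity] explain tree for the term "d" in the _text_ field. The following is a minimal sketch of how those numbers appear to combine, assuming the standard ClassicSimilarity formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The function names are hypothetical and are not part of Lucene or Solr. queryNorm, fieldNorm and the two coord(1/2) factors are copied from the listing rather than derived, and the double coord is read here as one of two clauses matching at two nesting levels, which is an assumption. Lucene works in 32-bit floats, so the last printed digits may differ slightly.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as ClassicSimilarity defines it."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def explain_score(freq, doc_freq, max_docs, query_norm, field_norm,
                  coords=(0.5, 0.5)):
    """Recombine the factors shown in one explain tree (hypothetical helper).

    query_norm, field_norm and coords are taken from the listing as given.
    """
    idf = classic_idf(doc_freq, max_docs)
    query_weight = idf * query_norm                      # queryWeight line
    field_weight = classic_tf(freq) * idf * field_norm   # fieldWeight line
    score = query_weight * field_weight                  # weight(_text_:d ...)
    for coord in coords:                                 # each coord(1/2) line
        score *= coord
    return score


# Hit 1 (doc=316): freq=4.0, fieldNorm=0.046875
score_316 = explain_score(4.0, 17979, 44218, 0.045945734, 0.046875)
# Hit 3 (doc=2655): freq=2.0, fieldNorm=0.0625
score_2655 = explain_score(2.0, 17979, 44218, 0.045945734, 0.0625)

print(f"doc 316:  {score_316:.9f}")
print(f"doc 2655: {score_2655:.9f}")
```

Because queryWeight is the same for every hit, the ranking on this page is driven entirely by tf and fieldNorm: a larger fieldNorm (typically a shorter field) or a higher term frequency moves a record up the list.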