Search (83 results, page 4 of 5)

  • × theme_ss:"Automatisches Klassifizieren"
  1. Helmbrecht-Schaar, A.: Entwicklung eines Verfahrens der automatischen Klassifizierung für Textdokumente aus dem Fachbereich Informatik mithilfe eines fachspezifischen Klassifikationssystems (2007) 0.00
    0.0027484642 = product of:
      0.0054969285 = sum of:
        0.0054969285 = product of:
          0.010993857 = sum of:
            0.010993857 = weight(_text_:d in 1410) [ClassicSimilarity], result of:
              0.010993857 = score(doc=1410,freq=2.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.1259449 = fieldWeight in 1410, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1410)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    d
  2. Reiner, U.: DDC-based search in the data of the German National Bibliography (2008) 0.00
    0.0027484642 = product of:
      0.0054969285 = sum of:
        0.0054969285 = product of:
          0.010993857 = sum of:
            0.010993857 = weight(_text_:d in 2166) [ClassicSimilarity], result of:
              0.010993857 = score(doc=2166,freq=2.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.1259449 = fieldWeight in 2166, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2166)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Location
    D
  3. Puzicha, J.: Informationen finden! : Intelligente Suchmaschinentechnologie & automatische Kategorisierung (2007) 0.00
    0.0027484642 = product of:
      0.0054969285 = sum of:
        0.0054969285 = product of:
          0.010993857 = sum of:
            0.010993857 = weight(_text_:d in 2817) [ClassicSimilarity], result of:
              0.010993857 = score(doc=2817,freq=2.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.1259449 = fieldWeight in 2817, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2817)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    d
  4. Sommer, M.: Automatische Generierung von DDC-Notationen für Hochschulveröffentlichungen (2012) 0.00
    0.0027484642 = product of:
      0.0054969285 = sum of:
        0.0054969285 = product of:
          0.010993857 = sum of:
            0.010993857 = weight(_text_:d in 587) [ClassicSimilarity], result of:
              0.010993857 = score(doc=587,freq=2.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.1259449 = fieldWeight in 587, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.046875 = fieldNorm(doc=587)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    d
  5. Kasprzik, A.: Automatisierte und semiautomatisierte Klassifizierung : eine Analyse aktueller Projekte (2014) 0.00
    0.0027484642 = product of:
      0.0054969285 = sum of:
        0.0054969285 = product of:
          0.010993857 = sum of:
            0.010993857 = weight(_text_:d in 2470) [ClassicSimilarity], result of:
              0.010993857 = score(doc=2470,freq=2.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.1259449 = fieldWeight in 2470, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2470)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    d
  6. Schek, M.: Automatische Klassifizierung in Erschließung und Recherche eines Pressearchivs (2006) 0.00
    0.0025912772 = product of:
      0.0051825545 = sum of:
        0.0051825545 = product of:
          0.010365109 = sum of:
            0.010365109 = weight(_text_:d in 6043) [ClassicSimilarity], result of:
              0.010365109 = score(doc=6043,freq=4.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.118742 = fieldWeight in 6043, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.03125 = fieldNorm(doc=6043)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    d
    Location
    D
  7. Groß, T.; Faden, M.: Automatische Indexierung elektronischer Dokumente an der Deutschen Zentralbibliothek für Wirtschaftswissenschaften : Bericht über die Jahrestagung der Internationalen Buchwissenschaftlichen Gesellschaft (2010) 0.00
    0.0025912772 = product of:
      0.0051825545 = sum of:
        0.0051825545 = product of:
          0.010365109 = sum of:
            0.010365109 = weight(_text_:d in 4051) [ClassicSimilarity], result of:
              0.010365109 = score(doc=4051,freq=4.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.118742 = fieldWeight in 4051, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.03125 = fieldNorm(doc=4051)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    d
    Location
    D
  8. Krüger, C.: Evaluation des WWW-Suchdienstes GERHARD unter besonderer Beachtung automatischer Indexierung (1999) 0.00
    0.002290387 = product of:
      0.004580774 = sum of:
        0.004580774 = product of:
          0.009161548 = sum of:
            0.009161548 = weight(_text_:d in 1777) [ClassicSimilarity], result of:
              0.009161548 = score(doc=1777,freq=2.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.104954086 = fieldWeight in 1777, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1777)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    d
  9. Automatische Klassifikation und Extraktion in Documentum (2005) 0.00
    0.002290387 = product of:
      0.004580774 = sum of:
        0.004580774 = product of:
          0.009161548 = sum of:
            0.009161548 = weight(_text_:d in 3974) [ClassicSimilarity], result of:
              0.009161548 = score(doc=3974,freq=2.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.104954086 = fieldWeight in 3974, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3974)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    d
  10. Giorgetti, D.; Sebastiani, F.: Automating survey coding by multiclass text categorization techniques (2003) 0.00
    0.002290387 = product of:
      0.004580774 = sum of:
        0.004580774 = product of:
          0.009161548 = sum of:
            0.009161548 = weight(_text_:d in 5172) [ClassicSimilarity], result of:
              0.009161548 = score(doc=5172,freq=2.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.104954086 = fieldWeight in 5172, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5172)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  11. Cathey, R.J.; Jensen, E.C.; Beitzel, S.M.; Frieder, O.; Grossman, D.: Exploiting parallelism to support scalable hierarchical clustering (2007) 0.00
    0.002290387 = product of:
      0.004580774 = sum of:
        0.004580774 = product of:
          0.009161548 = sum of:
            0.009161548 = weight(_text_:d in 448) [ClassicSimilarity], result of:
              0.009161548 = score(doc=448,freq=2.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.104954086 = fieldWeight in 448, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=448)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  12. Rooney, N.; Patterson, D.; Galushka, M.; Dobrynin, V.; Smirnova, E.: ¬An investigation into the stability of contextual document clustering (2008) 0.00
    0.002290387 = product of:
      0.004580774 = sum of:
        0.004580774 = product of:
          0.009161548 = sum of:
            0.009161548 = weight(_text_:d in 1356) [ClassicSimilarity], result of:
              0.009161548 = score(doc=1356,freq=2.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.104954086 = fieldWeight in 1356, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1356)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  13. Reiner, U.: VZG-Projekt Colibri : Bewertung von automatisch DDC-klassifizierten Titeldatensätzen der Deutschen Nationalbibliothek (DNB) (2009) 0.00
    0.002290387 = product of:
      0.004580774 = sum of:
        0.004580774 = product of:
          0.009161548 = sum of:
            0.009161548 = weight(_text_:d in 2675) [ClassicSimilarity], result of:
              0.009161548 = score(doc=2675,freq=2.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.104954086 = fieldWeight in 2675, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2675)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    d
  14. HaCohen-Kerner, Y.; Beck, H.; Yehudai, E.; Rosenstein, M.; Mughaz, D.: Cuisine : classification using stylistic feature sets and/or name-based feature sets (2010) 0.00
    0.002290387 = product of:
      0.004580774 = sum of:
        0.004580774 = product of:
          0.009161548 = sum of:
            0.009161548 = weight(_text_:d in 3706) [ClassicSimilarity], result of:
              0.009161548 = score(doc=3706,freq=2.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.104954086 = fieldWeight in 3706, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3706)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  15. Qu, B.; Cong, G.; Li, C.; Sun, A.; Chen, H.: ¬An evaluation of classification models for question topic categorization (2012) 0.00
    0.002290387 = product of:
      0.004580774 = sum of:
        0.004580774 = product of:
          0.009161548 = sum of:
            0.009161548 = weight(_text_:d in 237) [ClassicSimilarity], result of:
              0.009161548 = score(doc=237,freq=2.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.104954086 = fieldWeight in 237, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=237)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    We study the problem of question topic classification using a very large real-world Community Question Answering (CQA) dataset from Yahoo! Answers. The dataset comprises 3.9 million questions and these questions are organized into more than 1,000 categories in a hierarchy. To the best knowledge, this is the first systematic evaluation of the performance of different classification methods on question topic classification as well as short texts. Specifically, we empirically evaluate the following in classifying questions into CQA categories: (a) the usefulness of n-gram features and bag-of-word features; (b) the performance of three standard classification algorithms (naive Bayes, maximum entropy, and support vector machines); (c) the performance of the state-of-the-art hierarchical classification algorithms; (d) the effect of training data size on performance; and (e) the effectiveness of the different components of CQA data, including subject, content, asker, and the best answer. The experimental results show what aspects are important for question topic classification in terms of both effectiveness and efficiency. We believe that the experimental findings from this study will be useful in real-world classification problems.
  16. Alberts, I.; Forest, D.: Email pragmatics and automatic classification : a study in the organizational context (2012) 0.00
    0.002290387 = product of:
      0.004580774 = sum of:
        0.004580774 = product of:
          0.009161548 = sum of:
            0.009161548 = weight(_text_:d in 238) [ClassicSimilarity], result of:
              0.009161548 = score(doc=238,freq=2.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.104954086 = fieldWeight in 238, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=238)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  17. Vilares, D.; Alonso, M.A.; Gómez-Rodríguez, C.: On the usefulness of lexical and syntactic processing in polarity classification of Twitter messages (2015) 0.00
    0.002290387 = product of:
      0.004580774 = sum of:
        0.004580774 = product of:
          0.009161548 = sum of:
            0.009161548 = weight(_text_:d in 2161) [ClassicSimilarity], result of:
              0.009161548 = score(doc=2161,freq=2.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.104954086 = fieldWeight in 2161, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2161)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  18. Han, K.; Rezapour, R.; Nakamura, K.; Devkota, D.; Miller, D.C.; Diesner, J.: ¬An expert-in-the-loop method for domain-specific document categorization based on small training data (2023) 0.00
    0.002290387 = product of:
      0.004580774 = sum of:
        0.004580774 = product of:
          0.009161548 = sum of:
            0.009161548 = weight(_text_:d in 967) [ClassicSimilarity], result of:
              0.009161548 = score(doc=967,freq=2.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.104954086 = fieldWeight in 967, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=967)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  19. Dolin, R.; Agrawal, D.; El Abbadi, A.; Pearlman, J.: Using automated classification for summarizing and selecting heterogeneous information sources (1998) 0.00
    0.001943458 = product of:
      0.003886916 = sum of:
        0.003886916 = product of:
          0.007773832 = sum of:
            0.007773832 = weight(_text_:d in 1253) [ClassicSimilarity], result of:
              0.007773832 = score(doc=1253,freq=4.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.0890565 = fieldWeight in 1253, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.0234375 = fieldNorm(doc=1253)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    D-Lib magazine. 4(1998) no.1, xx S
  20. Oberhauser, O.: Automatisches Klassifizieren : Verfahren zur Erschließung elektronischer Dokumente (2004) 0.00
    0.0018323096 = product of:
      0.0036646193 = sum of:
        0.0036646193 = product of:
          0.0073292386 = sum of:
            0.0073292386 = weight(_text_:d in 2487) [ClassicSimilarity], result of:
              0.0073292386 = score(doc=2487,freq=2.0), product of:
                0.08729101 = queryWeight, product of:
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.045945734 = queryNorm
                0.08396327 = fieldWeight in 2487, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.899872 = idf(docFreq=17979, maxDocs=44218)
                  0.03125 = fieldNorm(doc=2487)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Language
    d

Years

Languages

  • d 45
  • e 38

Types

  • a 60
  • el 15
  • x 9
  • m 3
  • r 3
  • d 1
  • More… Less…