Search (60 results, page 3 of 3)

Golub, K.; Hansson, J.; Soergel, D.; Tudhope, D.: Managing classification in libraries : a methodological outline for evaluating automatic subject indexing and classification in Swedish library catalogues (2015) 0.00

0.0032390966 = product of:
  0.006478193 = sum of:
    0.006478193 = product of:
      0.012956386 = sum of:
        0.012956386 = weight(_text_:d in 2300) [ClassicSimilarity], result of:
          0.012956386 = score(doc=2300,freq=4.0), product of:
            0.08729101 = queryWeight, product of:
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.045945734 = queryNorm
            0.1484275 = fieldWeight in 2300, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2300)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Savic, D.: Automatic classification of office documents : review of available methods and techniques (1995) 0.00

0.0032065418 = product of:
  0.0064130835 = sum of:
    0.0064130835 = product of:
      0.012826167 = sum of:
        0.012826167 = weight(_text_:d in 2219) [ClassicSimilarity], result of:
          0.012826167 = score(doc=2219,freq=2.0), product of:
            0.08729101 = queryWeight, product of:
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.045945734 = queryNorm
            0.14693572 = fieldWeight in 2219, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2219)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Panyr, J.: Automatische thematische Textklassifikation und ihre Interpretation in der Dokumentengrobrecherche (1980) 0.00

0.0032065418 = product of:
  0.0064130835 = sum of:
    0.0064130835 = product of:
      0.012826167 = sum of:
        0.012826167 = weight(_text_:d in 100) [ClassicSimilarity], result of:
          0.012826167 = score(doc=100,freq=2.0), product of:
            0.08729101 = queryWeight, product of:
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.045945734 = queryNorm
            0.14693572 = fieldWeight in 100, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.0546875 = fieldNorm(doc=100)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Language: d

Wätjen, H.-J.: GERHARD : Automatisches Sammeln, Klassifizieren und Indexieren von wissenschaftlich relevanten Informationsressourcen im deutschen World Wide Web (1998) 0.00

0.0032065418 = product of:
  0.0064130835 = sum of:
    0.0064130835 = product of:
      0.012826167 = sum of:
        0.012826167 = weight(_text_:d in 3064) [ClassicSimilarity], result of:
          0.012826167 = score(doc=3064,freq=2.0), product of:
            0.08729101 = queryWeight, product of:
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.045945734 = queryNorm
            0.14693572 = fieldWeight in 3064, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3064)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Language: d

Oberhauser, O.: Automatisches Klassifizieren und Bibliothekskataloge (2005) 0.00

0.0032065418 = product of:
  0.0064130835 = sum of:
    0.0064130835 = product of:
      0.012826167 = sum of:
        0.012826167 = weight(_text_:d in 4099) [ClassicSimilarity], result of:
          0.012826167 = score(doc=4099,freq=2.0), product of:
            0.08729101 = queryWeight, product of:
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.045945734 = queryNorm
            0.14693572 = fieldWeight in 4099, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.0546875 = fieldNorm(doc=4099)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Language: d

Reiner, U.: DDC-based search in the data of the German National Bibliography (2008) 0.00

0.0027484642 = product of:
  0.0054969285 = sum of:
    0.0054969285 = product of:
      0.010993857 = sum of:
        0.010993857 = weight(_text_:d in 2166) [ClassicSimilarity], result of:
          0.010993857 = score(doc=2166,freq=2.0), product of:
            0.08729101 = queryWeight, product of:
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.045945734 = queryNorm
            0.1259449 = fieldWeight in 2166, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.046875 = fieldNorm(doc=2166)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Location: D

Kasprzik, A.: Automatisierte und semiautomatisierte Klassifizierung : eine Analyse aktueller Projekte (2014) 0.00

0.0027484642 = product of:
  0.0054969285 = sum of:
    0.0054969285 = product of:
      0.010993857 = sum of:
        0.010993857 = weight(_text_:d in 2470) [ClassicSimilarity], result of:
          0.010993857 = score(doc=2470,freq=2.0), product of:
            0.08729101 = queryWeight, product of:
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.045945734 = queryNorm
            0.1259449 = fieldWeight in 2470, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.046875 = fieldNorm(doc=2470)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Language: d

Schek, M.: Automatische Klassifizierung in Erschließung und Recherche eines Pressearchivs (2006) 0.00

0.0025912772 = product of:
  0.0051825545 = sum of:
    0.0051825545 = product of:
      0.010365109 = sum of:
        0.010365109 = weight(_text_:d in 6043) [ClassicSimilarity], result of:
          0.010365109 = score(doc=6043,freq=4.0), product of:
            0.08729101 = queryWeight, product of:
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.045945734 = queryNorm
            0.118742 = fieldWeight in 6043, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.03125 = fieldNorm(doc=6043)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Language: d
Location: D

Groß, T.; Faden, M.: Automatische Indexierung elektronischer Dokumente an der Deutschen Zentralbibliothek für Wirtschaftswissenschaften : Bericht über die Jahrestagung der Internationalen Buchwissenschaftlichen Gesellschaft (2010) 0.00

0.0025912772 = product of:
  0.0051825545 = sum of:
    0.0051825545 = product of:
      0.010365109 = sum of:
        0.010365109 = weight(_text_:d in 4051) [ClassicSimilarity], result of:
          0.010365109 = score(doc=4051,freq=4.0), product of:
            0.08729101 = queryWeight, product of:
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.045945734 = queryNorm
            0.118742 = fieldWeight in 4051, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.03125 = fieldNorm(doc=4051)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Language: d
Location: D

Automatische Klassifikation und Extraktion in Documentum (2005) 0.00

0.002290387 = product of:
  0.004580774 = sum of:
    0.004580774 = product of:
      0.009161548 = sum of:
        0.009161548 = weight(_text_:d in 3974) [ClassicSimilarity], result of:
          0.009161548 = score(doc=3974,freq=2.0), product of:
            0.08729101 = queryWeight, product of:
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.045945734 = queryNorm
            0.104954086 = fieldWeight in 3974, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3974)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Language: d

Giorgetti, D.; Sebastiani, F.: Automating survey coding by multiclass text categorization techniques (2003) 0.00

0.002290387 = product of:
  0.004580774 = sum of:
    0.004580774 = product of:
      0.009161548 = sum of:
        0.009161548 = weight(_text_:d in 5172) [ClassicSimilarity], result of:
          0.009161548 = score(doc=5172,freq=2.0), product of:
            0.08729101 = queryWeight, product of:
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.045945734 = queryNorm
            0.104954086 = fieldWeight in 5172, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5172)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Cathey, R.J.; Jensen, E.C.; Beitzel, S.M.; Frieder, O.; Grossman, D.: Exploiting parallelism to support scalable hierarchical clustering (2007) 0.00

0.002290387 = product of:
  0.004580774 = sum of:
    0.004580774 = product of:
      0.009161548 = sum of:
        0.009161548 = weight(_text_:d in 448) [ClassicSimilarity], result of:
          0.009161548 = score(doc=448,freq=2.0), product of:
            0.08729101 = queryWeight, product of:
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.045945734 = queryNorm
            0.104954086 = fieldWeight in 448, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.0390625 = fieldNorm(doc=448)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Rooney, N.; Patterson, D.; Galushka, M.; Dobrynin, V.; Smirnova, E.: ¬An investigation into the stability of contextual document clustering (2008) 0.00

0.002290387 = product of:
  0.004580774 = sum of:
    0.004580774 = product of:
      0.009161548 = sum of:
        0.009161548 = weight(_text_:d in 1356) [ClassicSimilarity], result of:
          0.009161548 = score(doc=1356,freq=2.0), product of:
            0.08729101 = queryWeight, product of:
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.045945734 = queryNorm
            0.104954086 = fieldWeight in 1356, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1356)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

HaCohen-Kerner, Y.; Beck, H.; Yehudai, E.; Rosenstein, M.; Mughaz, D.: Cuisine : classification using stylistic feature sets and/or name-based feature sets (2010) 0.00

0.002290387 = product of:
  0.004580774 = sum of:
    0.004580774 = product of:
      0.009161548 = sum of:
        0.009161548 = weight(_text_:d in 3706) [ClassicSimilarity], result of:
          0.009161548 = score(doc=3706,freq=2.0), product of:
            0.08729101 = queryWeight, product of:
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.045945734 = queryNorm
            0.104954086 = fieldWeight in 3706, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3706)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Qu, B.; Cong, G.; Li, C.; Sun, A.; Chen, H.: ¬An evaluation of classification models for question topic categorization (2012) 0.00
```
0.002290387 = product of:
  0.004580774 = sum of:
    0.004580774 = product of:
      0.009161548 = sum of:
        0.009161548 = weight(_text_:d in 237) [ClassicSimilarity], result of:
          0.009161548 = score(doc=237,freq=2.0), product of:
            0.08729101 = queryWeight, product of:
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.045945734 = queryNorm
            0.104954086 = fieldWeight in 237, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.0390625 = fieldNorm(doc=237)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

We study the problem of question topic classification using a very large real-world Community Question Answering (CQA) dataset from Yahoo! Answers. The dataset comprises 3.9 million questions and these questions are organized into more than 1,000 categories in a hierarchy. To the best knowledge, this is the first systematic evaluation of the performance of different classification methods on question topic classification as well as short texts. Specifically, we empirically evaluate the following in classifying questions into CQA categories: (a) the usefulness of n-gram features and bag-of-word features; (b) the performance of three standard classification algorithms (naive Bayes, maximum entropy, and support vector machines); (c) the performance of the state-of-the-art hierarchical classification algorithms; (d) the effect of training data size on performance; and (e) the effectiveness of the different components of CQA data, including subject, content, asker, and the best answer. The experimental results show what aspects are important for question topic classification in terms of both effectiveness and efficiency. We believe that the experimental findings from this study will be useful in real-world classification problems.

Alberts, I.; Forest, D.: Email pragmatics and automatic classification : a study in the organizational context (2012) 0.00

0.002290387 = product of:
  0.004580774 = sum of:
    0.004580774 = product of:
      0.009161548 = sum of:
        0.009161548 = weight(_text_:d in 238) [ClassicSimilarity], result of:
          0.009161548 = score(doc=238,freq=2.0), product of:
            0.08729101 = queryWeight, product of:
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.045945734 = queryNorm
            0.104954086 = fieldWeight in 238, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.0390625 = fieldNorm(doc=238)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Vilares, D.; Alonso, M.A.; Gómez-Rodríguez, C.: On the usefulness of lexical and syntactic processing in polarity classification of Twitter messages (2015) 0.00

0.002290387 = product of:
  0.004580774 = sum of:
    0.004580774 = product of:
      0.009161548 = sum of:
        0.009161548 = weight(_text_:d in 2161) [ClassicSimilarity], result of:
          0.009161548 = score(doc=2161,freq=2.0), product of:
            0.08729101 = queryWeight, product of:
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.045945734 = queryNorm
            0.104954086 = fieldWeight in 2161, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2161)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Han, K.; Rezapour, R.; Nakamura, K.; Devkota, D.; Miller, D.C.; Diesner, J.: ¬An expert-in-the-loop method for domain-specific document categorization based on small training data (2023) 0.00

0.002290387 = product of:
  0.004580774 = sum of:
    0.004580774 = product of:
      0.009161548 = sum of:
        0.009161548 = weight(_text_:d in 967) [ClassicSimilarity], result of:
          0.009161548 = score(doc=967,freq=2.0), product of:
            0.08729101 = queryWeight, product of:
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.045945734 = queryNorm
            0.104954086 = fieldWeight in 967, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.0390625 = fieldNorm(doc=967)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Dolin, R.; Agrawal, D.; El Abbadi, A.; Pearlman, J.: Using automated classification for summarizing and selecting heterogeneous information sources (1998) 0.00

0.001943458 = product of:
  0.003886916 = sum of:
    0.003886916 = product of:
      0.007773832 = sum of:
        0.007773832 = weight(_text_:d in 1253) [ClassicSimilarity], result of:
          0.007773832 = score(doc=1253,freq=4.0), product of:
            0.08729101 = queryWeight, product of:
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.045945734 = queryNorm
            0.0890565 = fieldWeight in 1253, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.0234375 = fieldNorm(doc=1253)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: D-Lib magazine. 4(1998) no.1, xx S

Schek, M.: Automatische Klassifizierung und Visualisierung im Archiv der Süddeutschen Zeitung (2005) 0.00

0.0016032709 = product of:
  0.0032065418 = sum of:
    0.0032065418 = product of:
      0.0064130835 = sum of:
        0.0064130835 = weight(_text_:d in 4884) [ClassicSimilarity], result of:
          0.0064130835 = score(doc=4884,freq=2.0), product of:
            0.08729101 = queryWeight, product of:
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.045945734 = queryNorm
            0.07346786 = fieldWeight in 4884, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.02734375 = fieldNorm(doc=4884)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Language: d

Search (60 results, page 3 of 3)

Authors

Years

Languages

Themes