Search (20 results, page 1 of 1)

  • × year_i:[2000 TO 2010}
  • × theme_ss:"Automatisches Klassifizieren"
  1. Hotho, A.; Bloehdorn, S.: Data Mining 2004 : Text classification by boosting weak learners based on terms and concepts (2004) 0.07
    0.07193616 = sum of:
      0.053634945 = product of:
        0.21453978 = sum of:
          0.21453978 = weight(_text_:3a in 562) [ClassicSimilarity], result of:
            0.21453978 = score(doc=562,freq=2.0), product of:
              0.38173112 = queryWeight, product of:
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.045026023 = queryNorm
              0.56201804 = fieldWeight in 562, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.046875 = fieldNorm(doc=562)
        0.25 = coord(1/4)
      0.018301213 = product of:
        0.036602426 = sum of:
          0.036602426 = weight(_text_:22 in 562) [ClassicSimilarity], result of:
            0.036602426 = score(doc=562,freq=2.0), product of:
              0.15767346 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.045026023 = queryNorm
              0.23214069 = fieldWeight in 562, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.046875 = fieldNorm(doc=562)
        0.5 = coord(1/2)
    
    Content
    Vgl.: http://www.google.de/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&ved=0CEAQFjAA&url=http%3A%2F%2Fciteseerx.ist.psu.edu%2Fviewdoc%2Fdownload%3Fdoi%3D10.1.1.91.4940%26rep%3Drep1%26type%3Dpdf&ei=dOXrUMeIDYHDtQahsIGACg&usg=AFQjCNHFWVh6gNPvnOrOS9R3rkrXCNVD-A&sig2=5I2F5evRfMnsttSgFF9g7Q&bvm=bv.1357316858,d.Yms.
    Date
    8. 1.2013 10:22:32
  2. Yoon, Y.; Lee, C.; Lee, G.G.: ¬An effective procedure for constructing a hierarchical text classification system (2006) 0.06
    0.061675094 = product of:
      0.12335019 = sum of:
        0.12335019 = sum of:
          0.08064736 = weight(_text_:y in 5273) [ClassicSimilarity], result of:
            0.08064736 = score(doc=5273,freq=2.0), product of:
              0.21668325 = queryWeight, product of:
                4.8124003 = idf(docFreq=976, maxDocs=44218)
                0.045026023 = queryNorm
              0.3721901 = fieldWeight in 5273, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.8124003 = idf(docFreq=976, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5273)
          0.04270283 = weight(_text_:22 in 5273) [ClassicSimilarity], result of:
            0.04270283 = score(doc=5273,freq=2.0), product of:
              0.15767346 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.045026023 = queryNorm
              0.2708308 = fieldWeight in 5273, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5273)
      0.5 = coord(1/2)
    
    Date
    22. 7.2006 16:24:52
  3. Yu, W.; Gong, Y.: Document clustering by concept factorization (2004) 0.03
    0.034563154 = product of:
      0.06912631 = sum of:
        0.06912631 = product of:
          0.13825262 = sum of:
            0.13825262 = weight(_text_:y in 4084) [ClassicSimilarity], result of:
              0.13825262 = score(doc=4084,freq=2.0), product of:
                0.21668325 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.045026023 = queryNorm
                0.6380402 = fieldWeight in 4084, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.09375 = fieldNorm(doc=4084)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  4. Shen, D.; Chen, Z.; Yang, Q.; Zeng, H.J.; Zhang, B.; Lu, Y.; Ma, W.Y.: Web page classification through summarization (2004) 0.03
    0.02880263 = product of:
      0.05760526 = sum of:
        0.05760526 = product of:
          0.11521052 = sum of:
            0.11521052 = weight(_text_:y in 4132) [ClassicSimilarity], result of:
              0.11521052 = score(doc=4132,freq=2.0), product of:
                0.21668325 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.045026023 = queryNorm
                0.53170013 = fieldWeight in 4132, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.078125 = fieldNorm(doc=4132)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  5. Chung, Y.-M.; Noh, Y.-H.: Developing a specialized directory system by automatically classifying Web documents (2003) 0.02
    0.02443984 = product of:
      0.04887968 = sum of:
        0.04887968 = product of:
          0.09775936 = sum of:
            0.09775936 = weight(_text_:y in 1566) [ClassicSimilarity], result of:
              0.09775936 = score(doc=1566,freq=4.0), product of:
                0.21668325 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.045026023 = queryNorm
                0.45116252 = fieldWeight in 1566, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1566)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  6. Chung, Y.M.; Lee, J.Y.: ¬A corpus-based approach to comparative evaluation of statistical term association measures (2001) 0.02
    0.020366535 = product of:
      0.04073307 = sum of:
        0.04073307 = product of:
          0.08146614 = sum of:
            0.08146614 = weight(_text_:y in 5769) [ClassicSimilarity], result of:
              0.08146614 = score(doc=5769,freq=4.0), product of:
                0.21668325 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.045026023 = queryNorm
                0.37596878 = fieldWeight in 5769, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5769)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Statistical association measures have been widely applied in information retrieval research, usually employing a clustering of documents or terms on the basis of their relationships. Applications of the association measures for term clustering include automatic thesaurus construction and query expansion. This research evaluates the similarity of six association measures by comparing the relationship and behavior they demonstrate in various analyses of a test corpus. Analysis techniques include comparisons of highly ranked term pairs and term clusters, analyses of the correlation among the association measures using Pearson's correlation coefficient and MDS mapping, and an analysis of the impact of a term frequency on the association values by means of z-score. The major findings of the study are as follows: First, the most similar association measures are mutual information and Yule's coefficient of colligation Y, whereas cosine and Jaccard coefficients, as well as X**2 statistic and likelihood ratio, demonstrate quite similar behavior for terms with high frequency. Second, among all the measures, the X**2 statistic is the least affected by the frequency of terms. Third, although cosine and Jaccard coefficients tend to emphasize high frequency terms, mutual information and Yule's Y seem to overestimate rare terms
  7. Subramanian, S.; Shafer, K.E.: Clustering (2001) 0.02
    0.018301213 = product of:
      0.036602426 = sum of:
        0.036602426 = product of:
          0.07320485 = sum of:
            0.07320485 = weight(_text_:22 in 1046) [ClassicSimilarity], result of:
              0.07320485 = score(doc=1046,freq=2.0), product of:
                0.15767346 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045026023 = queryNorm
                0.46428138 = fieldWeight in 1046, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=1046)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    5. 5.2003 14:17:22
  8. Wu, K.J.; Chen, M.-C.; Sun, Y.: Automatic topics discovery from hyperlinked documents (2004) 0.02
    0.017281577 = product of:
      0.034563154 = sum of:
        0.034563154 = product of:
          0.06912631 = sum of:
            0.06912631 = weight(_text_:y in 2563) [ClassicSimilarity], result of:
              0.06912631 = score(doc=2563,freq=2.0), product of:
                0.21668325 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.045026023 = queryNorm
                0.3190201 = fieldWeight in 2563, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2563)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  9. Yoon, Y.; Lee, G.G.: Efficient implementation of associative classifiers for document classification (2007) 0.02
    0.017281577 = product of:
      0.034563154 = sum of:
        0.034563154 = product of:
          0.06912631 = sum of:
            0.06912631 = weight(_text_:y in 909) [ClassicSimilarity], result of:
              0.06912631 = score(doc=909,freq=2.0), product of:
                0.21668325 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.045026023 = queryNorm
                0.3190201 = fieldWeight in 909, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.046875 = fieldNorm(doc=909)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  10. Ko, Y.; Seo, J.: Text classification from unlabeled documents with bootstrapping and feature projection techniques (2009) 0.02
    0.017281577 = product of:
      0.034563154 = sum of:
        0.034563154 = product of:
          0.06912631 = sum of:
            0.06912631 = weight(_text_:y in 2452) [ClassicSimilarity], result of:
              0.06912631 = score(doc=2452,freq=2.0), product of:
                0.21668325 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.045026023 = queryNorm
                0.3190201 = fieldWeight in 2452, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2452)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  11. Xu, Y.; Bernard, A.: Knowledge organization through statistical computation : a new approach (2009) 0.02
    0.017281577 = product of:
      0.034563154 = sum of:
        0.034563154 = product of:
          0.06912631 = sum of:
            0.06912631 = weight(_text_:y in 3252) [ClassicSimilarity], result of:
              0.06912631 = score(doc=3252,freq=2.0), product of:
                0.21668325 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.045026023 = queryNorm
                0.3190201 = fieldWeight in 3252, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3252)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  12. Reiner, U.: Automatische DDC-Klassifizierung von bibliografischen Titeldatensätzen (2009) 0.02
    0.015251012 = product of:
      0.030502023 = sum of:
        0.030502023 = product of:
          0.061004046 = sum of:
            0.061004046 = weight(_text_:22 in 611) [ClassicSimilarity], result of:
              0.061004046 = score(doc=611,freq=2.0), product of:
                0.15767346 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045026023 = queryNorm
                0.38690117 = fieldWeight in 611, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=611)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 8.2009 12:54:24
  13. Na, J.-C.; Sui, H.; Khoo, C.; Chan, S.; Zhou, Y.: Effectiveness of simple linguistic processing in automatic sentiment classification of product reviews (2004) 0.01
    0.014401315 = product of:
      0.02880263 = sum of:
        0.02880263 = product of:
          0.05760526 = sum of:
            0.05760526 = weight(_text_:y in 2624) [ClassicSimilarity], result of:
              0.05760526 = score(doc=2624,freq=2.0), product of:
                0.21668325 = queryWeight, product of:
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.045026023 = queryNorm
                0.26585007 = fieldWeight in 2624, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.8124003 = idf(docFreq=976, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2624)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  14. Automatic classification research at OCLC (2002) 0.01
    0.010675708 = product of:
      0.021351416 = sum of:
        0.021351416 = product of:
          0.04270283 = sum of:
            0.04270283 = weight(_text_:22 in 1563) [ClassicSimilarity], result of:
              0.04270283 = score(doc=1563,freq=2.0), product of:
                0.15767346 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045026023 = queryNorm
                0.2708308 = fieldWeight in 1563, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1563)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    5. 5.2003 9:22:09
  15. Yi, K.: Automatic text classification using library classification schemes : trends, issues and challenges (2007) 0.01
    0.010675708 = product of:
      0.021351416 = sum of:
        0.021351416 = product of:
          0.04270283 = sum of:
            0.04270283 = weight(_text_:22 in 2560) [ClassicSimilarity], result of:
              0.04270283 = score(doc=2560,freq=2.0), product of:
                0.15767346 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045026023 = queryNorm
                0.2708308 = fieldWeight in 2560, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2560)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 9.2008 18:31:54
  16. Liu, R.-L.: Context recognition for hierarchical text classification (2009) 0.01
    0.009150607 = product of:
      0.018301213 = sum of:
        0.018301213 = product of:
          0.036602426 = sum of:
            0.036602426 = weight(_text_:22 in 2760) [ClassicSimilarity], result of:
              0.036602426 = score(doc=2760,freq=2.0), product of:
                0.15767346 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045026023 = queryNorm
                0.23214069 = fieldWeight in 2760, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2760)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 3.2009 19:11:54
  17. Pfeffer, M.: Automatische Vergabe von RVK-Notationen mittels fallbasiertem Schließen (2009) 0.01
    0.009150607 = product of:
      0.018301213 = sum of:
        0.018301213 = product of:
          0.036602426 = sum of:
            0.036602426 = weight(_text_:22 in 3051) [ClassicSimilarity], result of:
              0.036602426 = score(doc=3051,freq=2.0), product of:
                0.15767346 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045026023 = queryNorm
                0.23214069 = fieldWeight in 3051, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3051)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 8.2009 19:51:28
  18. Mengle, S.; Goharian, N.: Passage detection using text classification (2009) 0.01
    0.007625506 = product of:
      0.015251012 = sum of:
        0.015251012 = product of:
          0.030502023 = sum of:
            0.030502023 = weight(_text_:22 in 2765) [ClassicSimilarity], result of:
              0.030502023 = score(doc=2765,freq=2.0), product of:
                0.15767346 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045026023 = queryNorm
                0.19345059 = fieldWeight in 2765, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2765)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 3.2009 19:14:43
  19. Khoo, C.S.G.; Ng, K.; Ou, S.: ¬An exploratory study of human clustering of Web pages (2003) 0.01
    0.0061004045 = product of:
      0.012200809 = sum of:
        0.012200809 = product of:
          0.024401618 = sum of:
            0.024401618 = weight(_text_:22 in 2741) [ClassicSimilarity], result of:
              0.024401618 = score(doc=2741,freq=2.0), product of:
                0.15767346 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045026023 = queryNorm
                0.15476047 = fieldWeight in 2741, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03125 = fieldNorm(doc=2741)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    12. 9.2004 9:56:22
  20. Reiner, U.: Automatische DDC-Klassifizierung bibliografischer Titeldatensätze der Deutschen Nationalbibliografie (2009) 0.01
    0.0061004045 = product of:
      0.012200809 = sum of:
        0.012200809 = product of:
          0.024401618 = sum of:
            0.024401618 = weight(_text_:22 in 3284) [ClassicSimilarity], result of:
              0.024401618 = score(doc=3284,freq=2.0), product of:
                0.15767346 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045026023 = queryNorm
                0.15476047 = fieldWeight in 3284, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03125 = fieldNorm(doc=3284)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 1.2010 14:41:24