Search (40 results, page 1 of 2)

  • theme_ss:"Automatisches Klassifizieren"
  1. Hotho, A.; Bloehdorn, S.: Data Mining 2004 : Text classification by boosting weak learners based on terms and concepts (2004) 0.07
    0.07232786 = sum of:
      0.053926993 = product of:
        0.21570797 = sum of:
          0.21570797 = weight(_text_:3a in 562) [ClassicSimilarity], result of:
            0.21570797 = score(doc=562,freq=2.0), product of:
              0.3838097 = queryWeight, product of:
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.045271195 = queryNorm
              0.56201804 = fieldWeight in 562, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.046875 = fieldNorm(doc=562)
        0.25 = coord(1/4)
      0.018400865 = product of:
        0.03680173 = sum of:
          0.03680173 = weight(_text_:22 in 562) [ClassicSimilarity], result of:
            0.03680173 = score(doc=562,freq=2.0), product of:
              0.15853201 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.045271195 = queryNorm
              0.23214069 = fieldWeight in 562, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.046875 = fieldNorm(doc=562)
        0.5 = coord(1/2)
    
    Content
    Cf.: http://www.google.de/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&ved=0CEAQFjAA&url=http%3A%2F%2Fciteseerx.ist.psu.edu%2Fviewdoc%2Fdownload%3Fdoi%3D10.1.1.91.4940%26rep%3Drep1%26type%3Dpdf&ei=dOXrUMeIDYHDtQahsIGACg&usg=AFQjCNHFWVh6gNPvnOrOS9R3rkrXCNVD-A&sig2=5I2F5evRfMnsttSgFF9g7Q&bvm=bv.1357316858,d.Yms.
    Date
    8. 1.2013 10:22:32
  2. Chan, L.M.; Lin, X.; Zeng, M.L.: Structural and multilingual approaches to subject access on the Web (2000) 0.03
    0.026756238 = product of:
      0.053512476 = sum of:
        0.053512476 = product of:
          0.10702495 = sum of:
            0.10702495 = weight(_text_:x in 507) [ClassicSimilarity], result of:
              0.10702495 = score(doc=507,freq=2.0), product of:
                0.19116588 = queryWeight, product of:
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.045271195 = queryNorm
                0.55985385 = fieldWeight in 507, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.09375 = fieldNorm(doc=507)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  3. Zhang, X: Rough set theory based automatic text categorization (2005) 0.03
    0.025226025 = product of:
      0.05045205 = sum of:
        0.05045205 = product of:
          0.1009041 = sum of:
            0.1009041 = weight(_text_:x in 2822) [ClassicSimilarity], result of:
              0.1009041 = score(doc=2822,freq=4.0), product of:
                0.19116588 = queryWeight, product of:
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.045271195 = queryNorm
                0.5278353 = fieldWeight in 2822, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.0625 = fieldNorm(doc=2822)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Isbn
    3-8206-0149-X
  4. Subramanian, S.; Shafer, K.E.: Clustering (2001) 0.02
    0.018400865 = product of:
      0.03680173 = sum of:
        0.03680173 = product of:
          0.07360346 = sum of:
            0.07360346 = weight(_text_:22 in 1046) [ClassicSimilarity], result of:
              0.07360346 = score(doc=1046,freq=2.0), product of:
                0.15853201 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045271195 = queryNorm
                0.46428138 = fieldWeight in 1046, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=1046)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    5. 5.2003 14:17:22
  5. Chan, L.M.; Lin, X.; Zeng, M.: Structural and multilingual approaches to subject access on the Web (1999) 0.02
    0.017837493 = product of:
      0.035674985 = sum of:
        0.035674985 = product of:
          0.07134997 = sum of:
            0.07134997 = weight(_text_:x in 162) [ClassicSimilarity], result of:
              0.07134997 = score(doc=162,freq=2.0), product of:
                0.19116588 = queryWeight, product of:
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.045271195 = queryNorm
                0.3732359 = fieldWeight in 162, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.0625 = fieldNorm(doc=162)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  6. Jersek, T.: Automatische DDC-Klassifizierung mit Lingo : Vorgehensweise und Ergebnisse (2012) 0.02
    0.017837493 = product of:
      0.035674985 = sum of:
        0.035674985 = product of:
          0.07134997 = sum of:
            0.07134997 = weight(_text_:x in 122) [ClassicSimilarity], result of:
              0.07134997 = score(doc=122,freq=2.0), product of:
                0.19116588 = queryWeight, product of:
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.045271195 = queryNorm
                0.3732359 = fieldWeight in 122, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.0625 = fieldNorm(doc=122)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Type
    x
  7. Chung, Y.M.; Lee, J.Y.: A corpus-based approach to comparative evaluation of statistical term association measures (2001) 0.02
    0.015766267 = product of:
      0.031532533 = sum of:
        0.031532533 = product of:
          0.06306507 = sum of:
            0.06306507 = weight(_text_:x in 5769) [ClassicSimilarity], result of:
              0.06306507 = score(doc=5769,freq=4.0), product of:
                0.19116588 = queryWeight, product of:
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.045271195 = queryNorm
                0.32989708 = fieldWeight in 5769, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5769)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Statistical association measures have been widely applied in information retrieval research, usually employing a clustering of documents or terms on the basis of their relationships. Applications of the association measures for term clustering include automatic thesaurus construction and query expansion. This research evaluates the similarity of six association measures by comparing the relationship and behavior they demonstrate in various analyses of a test corpus. Analysis techniques include comparisons of highly ranked term pairs and term clusters, analyses of the correlation among the association measures using Pearson's correlation coefficient and MDS mapping, and an analysis of the impact of term frequency on the association values by means of z-score. The major findings of the study are as follows: First, the most similar association measures are mutual information and Yule's coefficient of colligation Y, whereas cosine and Jaccard coefficients, as well as the χ² statistic and likelihood ratio, demonstrate quite similar behavior for terms with high frequency. Second, among all the measures, the χ² statistic is the least affected by the frequency of terms. Third, although cosine and Jaccard coefficients tend to emphasize high-frequency terms, mutual information and Yule's Y seem to overestimate rare terms.
  8. Walther, R.: Möglichkeiten und Grenzen automatischer Klassifikationen von Web-Dokumenten (2001) 0.02
    0.015607806 = product of:
      0.031215612 = sum of:
        0.031215612 = product of:
          0.062431224 = sum of:
            0.062431224 = weight(_text_:x in 1562) [ClassicSimilarity], result of:
              0.062431224 = score(doc=1562,freq=2.0), product of:
                0.19116588 = queryWeight, product of:
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.045271195 = queryNorm
                0.32658142 = fieldWeight in 1562, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1562)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Type
    x
  9. Yang, Y.; Liu, X.: A re-examination of text categorization methods (1999) 0.02
    0.015607806 = product of:
      0.031215612 = sum of:
        0.031215612 = product of:
          0.062431224 = sum of:
            0.062431224 = weight(_text_:x in 3386) [ClassicSimilarity], result of:
              0.062431224 = score(doc=3386,freq=2.0), product of:
                0.19116588 = queryWeight, product of:
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.045271195 = queryNorm
                0.32658142 = fieldWeight in 3386, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=3386)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  10. Wille, J.: Automatisches Klassifizieren bibliographischer Beschreibungsdaten : Vorgehensweise und Ergebnisse (2006) 0.02
    0.015607806 = product of:
      0.031215612 = sum of:
        0.031215612 = product of:
          0.062431224 = sum of:
            0.062431224 = weight(_text_:x in 6090) [ClassicSimilarity], result of:
              0.062431224 = score(doc=6090,freq=2.0), product of:
                0.19116588 = queryWeight, product of:
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.045271195 = queryNorm
                0.32658142 = fieldWeight in 6090, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=6090)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Type
    x
  11. Hu, G.; Zhou, S.; Guan, J.; Hu, X.: Towards effective document clustering : a constrained K-means based approach (2008) 0.02
    0.015607806 = product of:
      0.031215612 = sum of:
        0.031215612 = product of:
          0.062431224 = sum of:
            0.062431224 = weight(_text_:x in 2113) [ClassicSimilarity], result of:
              0.062431224 = score(doc=2113,freq=2.0), product of:
                0.19116588 = queryWeight, product of:
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.045271195 = queryNorm
                0.32658142 = fieldWeight in 2113, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2113)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  12. Reiner, U.: Automatische DDC-Klassifizierung von bibliografischen Titeldatensätzen (2009) 0.02
    0.015334055 = product of:
      0.03066811 = sum of:
        0.03066811 = product of:
          0.06133622 = sum of:
            0.06133622 = weight(_text_:22 in 611) [ClassicSimilarity], result of:
              0.06133622 = score(doc=611,freq=2.0), product of:
                0.15853201 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045271195 = queryNorm
                0.38690117 = fieldWeight in 611, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=611)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 8.2009 12:54:24
  13. HaCohen-Kerner, Y. et al.: Classification using various machine learning methods and combinations of key-phrases and visual features (2016) 0.02
    0.015334055 = product of:
      0.03066811 = sum of:
        0.03066811 = product of:
          0.06133622 = sum of:
            0.06133622 = weight(_text_:22 in 2748) [ClassicSimilarity], result of:
              0.06133622 = score(doc=2748,freq=2.0), product of:
                0.15853201 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045271195 = queryNorm
                0.38690117 = fieldWeight in 2748, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=2748)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    1. 2.2016 18:25:22
  14. Pfeffer, M.: Automatische Vergabe von RVK-Notationen anhand von bibliografischen Daten mittels fallbasiertem Schließen (2007) 0.01
    0.013378119 = product of:
      0.026756238 = sum of:
        0.026756238 = product of:
          0.053512476 = sum of:
            0.053512476 = weight(_text_:x in 558) [ClassicSimilarity], result of:
              0.053512476 = score(doc=558,freq=2.0), product of:
                0.19116588 = queryWeight, product of:
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.045271195 = queryNorm
                0.27992693 = fieldWeight in 558, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.046875 = fieldNorm(doc=558)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Type
    x
  15. Hagedorn, K.; Chapman, S.; Newman, D.: Enhancing search and browse using automated clustering of subject metadata (2007) 0.01
    0.013378119 = product of:
      0.026756238 = sum of:
        0.026756238 = product of:
          0.053512476 = sum of:
            0.053512476 = weight(_text_:x in 1168) [ClassicSimilarity], result of:
              0.053512476 = score(doc=1168,freq=2.0), product of:
                0.19116588 = queryWeight, product of:
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.045271195 = queryNorm
                0.27992693 = fieldWeight in 1168, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1168)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    D-Lib magazine. 13(2007) nos.7/8, x S
  16. Helmbrecht-Schaar, A.: Entwicklung eines Verfahrens der automatischen Klassifizierung für Textdokumente aus dem Fachbereich Informatik mithilfe eines fachspezifischen Klassifikationssystems (2007) 0.01
    0.013378119 = product of:
      0.026756238 = sum of:
        0.026756238 = product of:
          0.053512476 = sum of:
            0.053512476 = weight(_text_:x in 1410) [ClassicSimilarity], result of:
              0.053512476 = score(doc=1410,freq=2.0), product of:
                0.19116588 = queryWeight, product of:
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.045271195 = queryNorm
                0.27992693 = fieldWeight in 1410, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1410)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Type
    x
  17. Choi, B.; Peng, X.: Dynamic and hierarchical classification of Web pages (2004) 0.01
    0.013378119 = product of:
      0.026756238 = sum of:
        0.026756238 = product of:
          0.053512476 = sum of:
            0.053512476 = weight(_text_:x in 2555) [ClassicSimilarity], result of:
              0.053512476 = score(doc=2555,freq=2.0), product of:
                0.19116588 = queryWeight, product of:
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.045271195 = queryNorm
                0.27992693 = fieldWeight in 2555, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2555)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  18. Liu, X.; Yu, S.; Janssens, F.; Glänzel, W.; Moreau, Y.; Moor, B.de: Weighted hybrid clustering by combining text mining and bibliometrics on a large-scale journal database (2010) 0.01
    0.013378119 = product of:
      0.026756238 = sum of:
        0.026756238 = product of:
          0.053512476 = sum of:
            0.053512476 = weight(_text_:x in 3464) [ClassicSimilarity], result of:
              0.053512476 = score(doc=3464,freq=2.0), product of:
                0.19116588 = queryWeight, product of:
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.045271195 = queryNorm
                0.27992693 = fieldWeight in 3464, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3464)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  19. Sommer, M.: Automatische Generierung von DDC-Notationen für Hochschulveröffentlichungen (2012) 0.01
    0.013378119 = product of:
      0.026756238 = sum of:
        0.026756238 = product of:
          0.053512476 = sum of:
            0.053512476 = weight(_text_:x in 587) [ClassicSimilarity], result of:
              0.053512476 = score(doc=587,freq=2.0), product of:
                0.19116588 = queryWeight, product of:
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.045271195 = queryNorm
                0.27992693 = fieldWeight in 587, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.046875 = fieldNorm(doc=587)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Type
    x
  20. Wu, M.; Liu, Y.-H.; Brownlee, R.; Zhang, X.: Evaluating utility and automatic classification of subject metadata from Research Data Australia (2021) 0.01
    0.013378119 = product of:
      0.026756238 = sum of:
        0.026756238 = product of:
          0.053512476 = sum of:
            0.053512476 = weight(_text_:x in 453) [ClassicSimilarity], result of:
              0.053512476 = score(doc=453,freq=2.0), product of:
                0.19116588 = queryWeight, product of:
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.045271195 = queryNorm
                0.27992693 = fieldWeight in 453, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.046875 = fieldNorm(doc=453)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
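
The score explanations above all follow the same pattern, which can be retraced by hand. The following is a minimal sketch, assuming the standard ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); the function names are hypothetical and are not part of Lucene's API. All input values (freq, docFreq, maxDocs, queryNorm, fieldNorm, coord) are copied from the explanations for results 1 and 3. Lucene computes in 32-bit floats, so the last digits may differ slightly from what this double-precision code produces.

```python
import math

MAX_DOCS = 44218          # maxDocs, as shown in every idf(...) line
QUERY_NORM = 0.045271195  # queryNorm, as shown in every queryWeight block


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as used by ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def tf(freq: float) -> float:
    """Term frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def term_score(freq: float, doc_freq: int, field_norm: float) -> float:
    """weight(term in doc) = queryWeight * fieldWeight (hypothetical helper)."""
    term_idf = idf(doc_freq, MAX_DOCS)
    query_weight = term_idf * QUERY_NORM               # idf * queryNorm
    field_weight = tf(freq) * term_idf * field_norm    # tf * idf * fieldNorm
    return query_weight * field_weight


# Result 1 (doc 562): two clauses, each scaled by its own coord factor.
clause_3a = term_score(freq=2.0, doc_freq=24, field_norm=0.046875) * (1 / 4)
clause_22 = term_score(freq=2.0, doc_freq=3622, field_norm=0.046875) * (1 / 2)
total_562 = clause_3a + clause_22   # listing shows 0.07232786

# Result 3 (doc 2822): one clause, nested coord(1/2) applied twice.
total_2822 = term_score(freq=4.0, doc_freq=1761, field_norm=0.0625) * 0.5 * 0.5
# listing shows 0.025226025

print(f"doc 562:  {total_562:.6f}")
print(f"doc 2822: {total_2822:.6f}")
```

The coord(m/n) factor scales a clause by the fraction of its sub-clauses that matched. This is why the rare term "3a" (docFreq=24) dominates the score of result 1 even though it is multiplied by 0.25.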