Search (8 results, page 1 of 1)

  • × theme_ss:"Automatisches Klassifizieren"
  • × year_i:[1990 TO 2000}
  1. Wätjen, H.-J.; Diekmann, B.; Möller, G.; Carstensen, K.-U.: Bericht zum DFG-Projekt: GERHARD : German Harvest Automated Retrieval and Directory (1998) 0.03
    0.03150274 = product of:
      0.06300548 = sum of:
        0.06300548 = product of:
          0.094508216 = sum of:
            0.06366888 = weight(_text_:k in 3065) [ClassicSimilarity], result of:
              0.06366888 = score(doc=3065,freq=2.0), product of:
                0.16142878 = queryWeight, product of:
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.045220956 = queryNorm
                0.39440846 = fieldWeight in 3065, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.078125 = fieldNorm(doc=3065)
            0.030839335 = weight(_text_:h in 3065) [ClassicSimilarity], result of:
              0.030839335 = score(doc=3065,freq=2.0), product of:
                0.11234917 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.045220956 = queryNorm
                0.27449545 = fieldWeight in 3065, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.078125 = fieldNorm(doc=3065)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
  2. Yang, Y.; Liu, X.: ¬A re-examination of text categorization methods (1999) 0.01
    0.0074280365 = product of:
      0.014856073 = sum of:
        0.014856073 = product of:
          0.04456822 = sum of:
            0.04456822 = weight(_text_:k in 3386) [ClassicSimilarity], result of:
              0.04456822 = score(doc=3386,freq=2.0), product of:
                0.16142878 = queryWeight, product of:
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.045220956 = queryNorm
                0.27608594 = fieldWeight in 3386, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=3386)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Abstract
    This paper reports a controlled study with statistical significance tests an five text categorization methods: the Support Vector Machines (SVM), a k-Nearest Neighbor (kNN) classifier, a neural network (NNet) approach, the Linear Leastsquares Fit (LLSF) mapping and a Naive Bayes (NB) classifier. We focus an the robustness of these methods in dealing with a skewed category distribution, and their performance as function of the training-set category frequency. Our results show that SVM, kNN and LLSF significantly outperform NNet and NB when the number of positive training instances per category are small (less than ten, and that all the methods perform comparably when the categories are sufficiently common (over 300 instances).
  3. Dubin, D.: Dimensions and discriminability (1998) 0.01
    0.007147951 = product of:
      0.014295902 = sum of:
        0.014295902 = product of:
          0.042887706 = sum of:
            0.042887706 = weight(_text_:22 in 2338) [ClassicSimilarity], result of:
              0.042887706 = score(doc=2338,freq=2.0), product of:
                0.15835609 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045220956 = queryNorm
                0.2708308 = fieldWeight in 2338, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2338)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Date
    22. 9.1997 19:16:05
  4. Jenkins, C.: Automatic classification of Web resources using Java and Dewey Decimal Classification (1998) 0.01
    0.007147951 = product of:
      0.014295902 = sum of:
        0.014295902 = product of:
          0.042887706 = sum of:
            0.042887706 = weight(_text_:22 in 1673) [ClassicSimilarity], result of:
              0.042887706 = score(doc=1673,freq=2.0), product of:
                0.15835609 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045220956 = queryNorm
                0.2708308 = fieldWeight in 1673, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1673)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Date
    1. 8.1996 22:08:06
  5. Wätjen, H.-J.: Automatisches Sammeln, Klassifizieren und Indexieren von wissenschaftlich relevanten Informationsressourcen im deutschen World Wide Web : das DFG-Projekt GERHARD (1998) 0.01
    0.005139889 = product of:
      0.010279778 = sum of:
        0.010279778 = product of:
          0.030839335 = sum of:
            0.030839335 = weight(_text_:h in 3066) [ClassicSimilarity], result of:
              0.030839335 = score(doc=3066,freq=2.0), product of:
                0.11234917 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.045220956 = queryNorm
                0.27449545 = fieldWeight in 3066, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.078125 = fieldNorm(doc=3066)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  6. Wätjen, H.-J.: GERHARD : Automatisches Sammeln, Klassifizieren und Indexieren von wissenschaftlich relevanten Informationsressourcen im deutschen World Wide Web (1998) 0.01
    0.0050882306 = product of:
      0.010176461 = sum of:
        0.010176461 = product of:
          0.030529384 = sum of:
            0.030529384 = weight(_text_:h in 3064) [ClassicSimilarity], result of:
              0.030529384 = score(doc=3064,freq=4.0), product of:
                0.11234917 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.045220956 = queryNorm
                0.27173662 = fieldWeight in 3064, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=3064)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Source
    B.I.T.online. 1(1998) H.4, S.279-290
  7. Koch, T.: Nutzung von Klassifikationssystemen zur verbesserten Beschreibung, Organisation und Suche von Internetressourcen (1998) 0.00
    0.004111911 = product of:
      0.008223822 = sum of:
        0.008223822 = product of:
          0.024671467 = sum of:
            0.024671467 = weight(_text_:h in 1030) [ClassicSimilarity], result of:
              0.024671467 = score(doc=1030,freq=2.0), product of:
                0.11234917 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.045220956 = queryNorm
                0.21959636 = fieldWeight in 1030, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1030)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Source
    BuB. 50(1998) H.5, S.326-335
  8. Orwig, R.E.; Chen, H.; Nunamaker, J.F.: ¬A graphical, self-organizing approach to classifying electronic meeting output (1997) 0.00
    0.0035979224 = product of:
      0.007195845 = sum of:
        0.007195845 = product of:
          0.021587534 = sum of:
            0.021587534 = weight(_text_:h in 6928) [ClassicSimilarity], result of:
              0.021587534 = score(doc=6928,freq=2.0), product of:
                0.11234917 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.045220956 = queryNorm
                0.19214681 = fieldWeight in 6928, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=6928)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)