Search (13 results, page 1 of 1)

  • × theme_ss:"Automatisches Klassifizieren"
  • × type_ss:"el"
  • × year_i:[1990 TO 2000}
  1. Wätjen, H.-J.; Diekmann, B.; Möller, G.; Carstensen, K.-U.: Bericht zum DFG-Projekt: GERHARD : German Harvest Automated Retrieval and Directory (1998) 0.03
    0.028472245 = product of:
      0.085416734 = sum of:
        0.039029416 = weight(_text_:u in 3065) [ClassicSimilarity], result of:
          0.039029416 = score(doc=3065,freq=2.0), product of:
            0.107882105 = queryWeight, product of:
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.03294669 = queryNorm
            0.3617784 = fieldWeight in 3065, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.078125 = fieldNorm(doc=3065)
        0.046387315 = weight(_text_:k in 3065) [ClassicSimilarity], result of:
          0.046387315 = score(doc=3065,freq=2.0), product of:
            0.11761237 = queryWeight, product of:
              3.569778 = idf(docFreq=3384, maxDocs=44218)
              0.03294669 = queryNorm
            0.39440846 = fieldWeight in 3065, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.569778 = idf(docFreq=3384, maxDocs=44218)
              0.078125 = fieldNorm(doc=3065)
      0.33333334 = coord(2/6)
    
  2. Yang, Y.; Liu, X.: ¬A re-examination of text categorization methods (1999) 0.01
    0.01257852 = product of:
      0.03773556 = sum of:
        0.0052644373 = weight(_text_:e in 3386) [ClassicSimilarity], result of:
          0.0052644373 = score(doc=3386,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.1111659 = fieldWeight in 3386, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3386)
        0.03247112 = weight(_text_:k in 3386) [ClassicSimilarity], result of:
          0.03247112 = score(doc=3386,freq=2.0), product of:
            0.11761237 = queryWeight, product of:
              3.569778 = idf(docFreq=3384, maxDocs=44218)
              0.03294669 = queryNorm
            0.27608594 = fieldWeight in 3386, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.569778 = idf(docFreq=3384, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3386)
      0.33333334 = coord(2/6)
    
    Abstract
    This paper reports a controlled study with statistical significance tests an five text categorization methods: the Support Vector Machines (SVM), a k-Nearest Neighbor (kNN) classifier, a neural network (NNet) approach, the Linear Leastsquares Fit (LLSF) mapping and a Naive Bayes (NB) classifier. We focus an the robustness of these methods in dealing with a skewed category distribution, and their performance as function of the training-set category frequency. Our results show that SVM, kNN and LLSF significantly outperform NNet and NB when the number of positive training instances per category are small (less than ten, and that all the methods perform comparably when the categories are sufficiently common (over 300 instances).
    Language
    e
  3. Subramanian, S.; Shafer, K.E.: Clustering (1998) 0.00
    0.0012534375 = product of:
      0.007520625 = sum of:
        0.007520625 = weight(_text_:e in 1103) [ClassicSimilarity], result of:
          0.007520625 = score(doc=1103,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.15880844 = fieldWeight in 1103, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.078125 = fieldNorm(doc=1103)
      0.16666667 = coord(1/6)
    
    Language
    e
  4. Shafer, K.E.: Evaluating Scorpion results (1998) 0.00
    0.0012534375 = product of:
      0.007520625 = sum of:
        0.007520625 = weight(_text_:e in 1569) [ClassicSimilarity], result of:
          0.007520625 = score(doc=1569,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.15880844 = fieldWeight in 1569, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.078125 = fieldNorm(doc=1569)
      0.16666667 = coord(1/6)
    
    Language
    e
  5. Chan, L.M.; Lin, X.; Zeng, M.: Structural and multilingual approaches to subject access on the Web (1999) 0.00
    0.00100275 = product of:
      0.0060165 = sum of:
        0.0060165 = weight(_text_:e in 162) [ClassicSimilarity], result of:
          0.0060165 = score(doc=162,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.12704675 = fieldWeight in 162, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0625 = fieldNorm(doc=162)
      0.16666667 = coord(1/6)
    
    Language
    e
  6. Koch, T.; Vizine-Goetz, D.: Automatic classification and content navigation support for Web services : DESIRE II cooperates with OCLC (1998) 0.00
    8.774062E-4 = product of:
      0.0052644373 = sum of:
        0.0052644373 = weight(_text_:e in 1568) [ClassicSimilarity], result of:
          0.0052644373 = score(doc=1568,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.1111659 = fieldWeight in 1568, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1568)
      0.16666667 = coord(1/6)
    
    Language
    e
  7. Dolin, R.; Agrawal, D.; El Abbadi, A.; Pearlman, J.: Using automated classification for summarizing and selecting heterogeneous information sources (1998) 0.00
    7.520625E-4 = product of:
      0.0045123748 = sum of:
        0.0045123748 = weight(_text_:e in 316) [ClassicSimilarity], result of:
          0.0045123748 = score(doc=316,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.09528506 = fieldWeight in 316, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.046875 = fieldNorm(doc=316)
      0.16666667 = coord(1/6)
    
    Language
    e
  8. Koch, T.; Vizine-Goetz, D.: DDC and knowledge organization in the digital library : Research and development. Demonstration pages (1999) 0.00
    7.520625E-4 = product of:
      0.0045123748 = sum of:
        0.0045123748 = weight(_text_:e in 942) [ClassicSimilarity], result of:
          0.0045123748 = score(doc=942,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.09528506 = fieldWeight in 942, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.046875 = fieldNorm(doc=942)
      0.16666667 = coord(1/6)
    
    Language
    e
  9. Koch, T.; Ardö, A.; Noodén, L.: ¬The construction of a robot-generated subject index : DESIRE II D3.6a, Working Paper 1 (1999) 0.00
    7.520625E-4 = product of:
      0.0045123748 = sum of:
        0.0045123748 = weight(_text_:e in 1668) [ClassicSimilarity], result of:
          0.0045123748 = score(doc=1668,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.09528506 = fieldWeight in 1668, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.046875 = fieldNorm(doc=1668)
      0.16666667 = coord(1/6)
    
    Language
    e
  10. Sebastiani, F.: ¬A tutorial an automated text categorisation (1999) 0.00
    7.520625E-4 = product of:
      0.0045123748 = sum of:
        0.0045123748 = weight(_text_:e in 3390) [ClassicSimilarity], result of:
          0.0045123748 = score(doc=3390,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.09528506 = fieldWeight in 3390, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.046875 = fieldNorm(doc=3390)
      0.16666667 = coord(1/6)
    
    Language
    e
  11. Koch, T.; Ardö, A.; Brümmer, A.: ¬The building and maintenance of robot based internet search services : A review of current indexing and data collection methods. Prepared to meet the requirements of Work Package 3 of EU Telematics for Research, project DESIRE. Version D3.11v0.3 (Draft version 3) (1996) 0.00
    5.01375E-4 = product of:
      0.00300825 = sum of:
        0.00300825 = weight(_text_:e in 1669) [ClassicSimilarity], result of:
          0.00300825 = score(doc=1669,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.063523374 = fieldWeight in 1669, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03125 = fieldNorm(doc=1669)
      0.16666667 = coord(1/6)
    
    Language
    e
  12. Search Engines and Beyond : Developing efficient knowledge management systems, April 19-20 1999, Boston, Mass (1999) 0.00
    5.01375E-4 = product of:
      0.00300825 = sum of:
        0.00300825 = weight(_text_:e in 2596) [ClassicSimilarity], result of:
          0.00300825 = score(doc=2596,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.063523374 = fieldWeight in 2596, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03125 = fieldNorm(doc=2596)
      0.16666667 = coord(1/6)
    
    Language
    e
  13. Dolin, R.; Agrawal, D.; El Abbadi, A.; Pearlman, J.: Using automated classification for summarizing and selecting heterogeneous information sources (1998) 0.00
    3.7603124E-4 = product of:
      0.0022561874 = sum of:
        0.0022561874 = weight(_text_:e in 1253) [ClassicSimilarity], result of:
          0.0022561874 = score(doc=1253,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.04764253 = fieldWeight in 1253, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0234375 = fieldNorm(doc=1253)
      0.16666667 = coord(1/6)
    
    Language
    e