Search (32 results, page 1 of 2)

  • × theme_ss:"Automatisches Klassifizieren"
  • × year_i:[1990 TO 2000}
  1. Wätjen, H.-J.; Diekmann, B.; Möller, G.; Carstensen, K.-U.: Bericht zum DFG-Projekt: GERHARD : German Harvest Automated Retrieval and Directory (1998) 0.03
    0.028472245 = product of:
      0.085416734 = sum of:
        0.039029416 = weight(_text_:u in 3065) [ClassicSimilarity], result of:
          0.039029416 = score(doc=3065,freq=2.0), product of:
            0.107882105 = queryWeight, product of:
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.03294669 = queryNorm
            0.3617784 = fieldWeight in 3065, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.078125 = fieldNorm(doc=3065)
        0.046387315 = weight(_text_:k in 3065) [ClassicSimilarity], result of:
          0.046387315 = score(doc=3065,freq=2.0), product of:
            0.11761237 = queryWeight, product of:
              3.569778 = idf(docFreq=3384, maxDocs=44218)
              0.03294669 = queryNorm
            0.39440846 = fieldWeight in 3065, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.569778 = idf(docFreq=3384, maxDocs=44218)
              0.078125 = fieldNorm(doc=3065)
      0.33333334 = coord(2/6)
    
  2. Yang, Y.; Liu, X.: ¬A re-examination of text categorization methods (1999) 0.01
    0.01257852 = product of:
      0.03773556 = sum of:
        0.0052644373 = weight(_text_:e in 3386) [ClassicSimilarity], result of:
          0.0052644373 = score(doc=3386,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.1111659 = fieldWeight in 3386, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3386)
        0.03247112 = weight(_text_:k in 3386) [ClassicSimilarity], result of:
          0.03247112 = score(doc=3386,freq=2.0), product of:
            0.11761237 = queryWeight, product of:
              3.569778 = idf(docFreq=3384, maxDocs=44218)
              0.03294669 = queryNorm
            0.27608594 = fieldWeight in 3386, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.569778 = idf(docFreq=3384, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3386)
      0.33333334 = coord(2/6)
    
    Abstract
    This paper reports a controlled study with statistical significance tests an five text categorization methods: the Support Vector Machines (SVM), a k-Nearest Neighbor (kNN) classifier, a neural network (NNet) approach, the Linear Leastsquares Fit (LLSF) mapping and a Naive Bayes (NB) classifier. We focus an the robustness of these methods in dealing with a skewed category distribution, and their performance as function of the training-set category frequency. Our results show that SVM, kNN and LLSF significantly outperform NNet and NB when the number of positive training instances per category are small (less than ten, and that all the methods perform comparably when the categories are sufficiently common (over 300 instances).
    Language
    e
  3. Savic, D.: Designing an expert system for classifying office documents (1994) 0.01
    0.006009359 = product of:
      0.018028077 = sum of:
        0.0060165 = weight(_text_:e in 2655) [ClassicSimilarity], result of:
          0.0060165 = score(doc=2655,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.12704675 = fieldWeight in 2655, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0625 = fieldNorm(doc=2655)
        0.012011576 = product of:
          0.03603473 = sum of:
            0.03603473 = weight(_text_:29 in 2655) [ClassicSimilarity], result of:
              0.03603473 = score(doc=2655,freq=2.0), product of:
                0.11589616 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.03294669 = queryNorm
                0.31092256 = fieldWeight in 2655, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0625 = fieldNorm(doc=2655)
          0.33333334 = coord(1/3)
      0.33333334 = coord(2/6)
    
    Language
    e
    Source
    Records management quarterly. 28(1994) no.3, S.20-29
  4. Savic, D.: Automatic classification of office documents : review of available methods and techniques (1995) 0.01
    0.005258189 = product of:
      0.015774567 = sum of:
        0.0052644373 = weight(_text_:e in 2219) [ClassicSimilarity], result of:
          0.0052644373 = score(doc=2219,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.1111659 = fieldWeight in 2219, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2219)
        0.01051013 = product of:
          0.031530388 = sum of:
            0.031530388 = weight(_text_:29 in 2219) [ClassicSimilarity], result of:
              0.031530388 = score(doc=2219,freq=2.0), product of:
                0.11589616 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.03294669 = queryNorm
                0.27205724 = fieldWeight in 2219, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2219)
          0.33333334 = coord(1/3)
      0.33333334 = coord(2/6)
    
    Language
    e
    Source
    Records management quarterly. 29(1995) no.4, S.3-18
  5. Ruocco, A.S.; Frieder, O.: Clustering and classification of large document bases in a parallel environment (1997) 0.01
    0.005258189 = product of:
      0.015774567 = sum of:
        0.0052644373 = weight(_text_:e in 1661) [ClassicSimilarity], result of:
          0.0052644373 = score(doc=1661,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.1111659 = fieldWeight in 1661, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1661)
        0.01051013 = product of:
          0.031530388 = sum of:
            0.031530388 = weight(_text_:29 in 1661) [ClassicSimilarity], result of:
              0.031530388 = score(doc=1661,freq=2.0), product of:
                0.11589616 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.03294669 = queryNorm
                0.27205724 = fieldWeight in 1661, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1661)
          0.33333334 = coord(1/3)
      0.33333334 = coord(2/6)
    
    Date
    29. 7.1998 17:45:02
    Language
    e
  6. Dubin, D.: Dimensions and discriminability (1998) 0.01
    0.0052266745 = product of:
      0.015680023 = sum of:
        0.0052644373 = weight(_text_:e in 2338) [ClassicSimilarity], result of:
          0.0052644373 = score(doc=2338,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.1111659 = fieldWeight in 2338, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2338)
        0.010415585 = product of:
          0.031246753 = sum of:
            0.031246753 = weight(_text_:22 in 2338) [ClassicSimilarity], result of:
              0.031246753 = score(doc=2338,freq=2.0), product of:
                0.1153737 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03294669 = queryNorm
                0.2708308 = fieldWeight in 2338, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2338)
          0.33333334 = coord(1/3)
      0.33333334 = coord(2/6)
    
    Date
    22. 9.1997 19:16:05
    Language
    e
  7. Jenkins, C.: Automatic classification of Web resources using Java and Dewey Decimal Classification (1998) 0.01
    0.0052266745 = product of:
      0.015680023 = sum of:
        0.0052644373 = weight(_text_:e in 1673) [ClassicSimilarity], result of:
          0.0052644373 = score(doc=1673,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.1111659 = fieldWeight in 1673, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1673)
        0.010415585 = product of:
          0.031246753 = sum of:
            0.031246753 = weight(_text_:22 in 1673) [ClassicSimilarity], result of:
              0.031246753 = score(doc=1673,freq=2.0), product of:
                0.1153737 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03294669 = queryNorm
                0.2708308 = fieldWeight in 1673, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1673)
          0.33333334 = coord(1/3)
      0.33333334 = coord(2/6)
    
    Date
    1. 8.1996 22:08:06
    Language
    e
  8. May, A.D.: Automatic classification of e-mail messages by message type (1997) 0.00
    0.0015197124 = product of:
      0.009118274 = sum of:
        0.009118274 = weight(_text_:e in 6493) [ClassicSimilarity], result of:
          0.009118274 = score(doc=6493,freq=6.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.19254501 = fieldWeight in 6493, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0546875 = fieldNorm(doc=6493)
      0.16666667 = coord(1/6)
    
    Abstract
    This article describes a system that automatically classifies e-mail messages in the HUMANIST electronic discussion group into one of 4 classes: questions, responses, announcement or administartive. A total of 1.372 messages were analyzed. The automatic classification of a message was based on string matching between a message text and predefined string sets for each of the massage types. The system's automated ability to accurately classify a message was compared against manually assigned codes. The Cohen's Kappa of .55 suggested that there was a statistical agreement between the automatic and manually assigned codes
    Language
    e
  9. Ardö, A.; Koch, T.: Automatic classification applied to full-text Internet documents in a robot-generated subject index (1999) 0.00
    0.001504125 = product of:
      0.0090247495 = sum of:
        0.0090247495 = weight(_text_:e in 382) [ClassicSimilarity], result of:
          0.0090247495 = score(doc=382,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.19057012 = fieldWeight in 382, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.09375 = fieldNorm(doc=382)
      0.16666667 = coord(1/6)
    
    Language
    e
  10. McKiernan, G.: Automated categorisation of Web resources : a profile of selected projects, research, products, and services (1996) 0.00
    0.0012534375 = product of:
      0.007520625 = sum of:
        0.007520625 = weight(_text_:e in 2533) [ClassicSimilarity], result of:
          0.007520625 = score(doc=2533,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.15880844 = fieldWeight in 2533, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.078125 = fieldNorm(doc=2533)
      0.16666667 = coord(1/6)
    
    Language
    e
  11. Vizine-Goetz, D.: NetLab / OCLC collaboration seeks to improve Web searching (1999) 0.00
    0.0012534375 = product of:
      0.007520625 = sum of:
        0.007520625 = weight(_text_:e in 4180) [ClassicSimilarity], result of:
          0.007520625 = score(doc=4180,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.15880844 = fieldWeight in 4180, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.078125 = fieldNorm(doc=4180)
      0.16666667 = coord(1/6)
    
    Language
    e
  12. Möller, G.: Automatic classification of the World Wide Web using Universal Decimal Classification (1999) 0.00
    0.0012534375 = product of:
      0.007520625 = sum of:
        0.007520625 = weight(_text_:e in 494) [ClassicSimilarity], result of:
          0.007520625 = score(doc=494,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.15880844 = fieldWeight in 494, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.078125 = fieldNorm(doc=494)
      0.16666667 = coord(1/6)
    
    Language
    e
  13. Subramanian, S.; Shafer, K.E.: Clustering (1998) 0.00
    0.0012534375 = product of:
      0.007520625 = sum of:
        0.007520625 = weight(_text_:e in 1103) [ClassicSimilarity], result of:
          0.007520625 = score(doc=1103,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.15880844 = fieldWeight in 1103, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.078125 = fieldNorm(doc=1103)
      0.16666667 = coord(1/6)
    
    Language
    e
  14. Shafer, K.E.: Evaluating Scorpion results (1998) 0.00
    0.0012534375 = product of:
      0.007520625 = sum of:
        0.007520625 = weight(_text_:e in 1569) [ClassicSimilarity], result of:
          0.007520625 = score(doc=1569,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.15880844 = fieldWeight in 1569, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.078125 = fieldNorm(doc=1569)
      0.16666667 = coord(1/6)
    
    Language
    e
  15. Cheng, P.T.K.; Wu, A.K.W.: ACS: an automatic classification system (1995) 0.00
    0.00100275 = product of:
      0.0060165 = sum of:
        0.0060165 = weight(_text_:e in 2188) [ClassicSimilarity], result of:
          0.0060165 = score(doc=2188,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.12704675 = fieldWeight in 2188, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0625 = fieldNorm(doc=2188)
      0.16666667 = coord(1/6)
    
    Language
    e
  16. Losee, R.M.; Haas, S.W.: Sublanguage terms : dictionaries, usage, and automatic classification (1995) 0.00
    0.00100275 = product of:
      0.0060165 = sum of:
        0.0060165 = weight(_text_:e in 2650) [ClassicSimilarity], result of:
          0.0060165 = score(doc=2650,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.12704675 = fieldWeight in 2650, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0625 = fieldNorm(doc=2650)
      0.16666667 = coord(1/6)
    
    Language
    e
  17. Ingwersen, P.; Wormell, I.: Ranganathan in the perspective of advanced information retrieval (1992) 0.00
    0.00100275 = product of:
      0.0060165 = sum of:
        0.0060165 = weight(_text_:e in 7695) [ClassicSimilarity], result of:
          0.0060165 = score(doc=7695,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.12704675 = fieldWeight in 7695, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0625 = fieldNorm(doc=7695)
      0.16666667 = coord(1/6)
    
    Language
    e
  18. Chan, L.M.; Lin, X.; Zeng, M.: Structural and multilingual approaches to subject access on the Web (1999) 0.00
    0.00100275 = product of:
      0.0060165 = sum of:
        0.0060165 = weight(_text_:e in 162) [ClassicSimilarity], result of:
          0.0060165 = score(doc=162,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.12704675 = fieldWeight in 162, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0625 = fieldNorm(doc=162)
      0.16666667 = coord(1/6)
    
    Language
    e
  19. Koch, T.: Experiments with automatic classification of WAIS databases and indexing of WWW : some results from the Nordic WAIS/WWW project (1994) 0.00
    8.774062E-4 = product of:
      0.0052644373 = sum of:
        0.0052644373 = weight(_text_:e in 7209) [ClassicSimilarity], result of:
          0.0052644373 = score(doc=7209,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.1111659 = fieldWeight in 7209, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0546875 = fieldNorm(doc=7209)
      0.16666667 = coord(1/6)
    
    Language
    e
  20. Losee, R.M.: Text windows and phrases differing by discipline, location in document, and syntactic structure (1996) 0.00
    8.774062E-4 = product of:
      0.0052644373 = sum of:
        0.0052644373 = weight(_text_:e in 6962) [ClassicSimilarity], result of:
          0.0052644373 = score(doc=6962,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.1111659 = fieldWeight in 6962, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0546875 = fieldNorm(doc=6962)
      0.16666667 = coord(1/6)
    
    Language
    e