Search (4 results, page 1 of 1)

  • × theme_ss:"Automatisches Indexieren"
  • × year_i:[1970 TO 1980}
  1. Sparck Jones, K.: Automatic keyword classification for information retrieval (1971) 0.02
    0.019226132 = product of:
      0.13458292 = sum of:
        0.06729146 = weight(_text_:classification in 5176) [ClassicSimilarity], result of:
          0.06729146 = score(doc=5176,freq=2.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.70372736 = fieldWeight in 5176, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.15625 = fieldNorm(doc=5176)
        0.06729146 = weight(_text_:classification in 5176) [ClassicSimilarity], result of:
          0.06729146 = score(doc=5176,freq=2.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.70372736 = fieldWeight in 5176, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.15625 = fieldNorm(doc=5176)
      0.14285715 = coord(2/14)
    
  2. Sparck Jones, K.; Jackson, D.M.: ¬The use of automatically obtained keyword classification for information retrieval (1970) 0.02
    0.015380906 = product of:
      0.107666336 = sum of:
        0.053833168 = weight(_text_:classification in 5177) [ClassicSimilarity], result of:
          0.053833168 = score(doc=5177,freq=2.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.5629819 = fieldWeight in 5177, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.125 = fieldNorm(doc=5177)
        0.053833168 = weight(_text_:classification in 5177) [ClassicSimilarity], result of:
          0.053833168 = score(doc=5177,freq=2.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.5629819 = fieldWeight in 5177, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.125 = fieldNorm(doc=5177)
      0.14285715 = coord(2/14)
    
  3. Salton, G.: Fast document classification in automatic information retrieval (1978) 0.01
    0.007690453 = product of:
      0.053833168 = sum of:
        0.026916584 = weight(_text_:classification in 2331) [ClassicSimilarity], result of:
          0.026916584 = score(doc=2331,freq=2.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.28149095 = fieldWeight in 2331, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.0625 = fieldNorm(doc=2331)
        0.026916584 = weight(_text_:classification in 2331) [ClassicSimilarity], result of:
          0.026916584 = score(doc=2331,freq=2.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.28149095 = fieldWeight in 2331, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.0625 = fieldNorm(doc=2331)
      0.14285715 = coord(2/14)
    
  4. Hafer, M.A.; Weiss, S.F.: Word segmentation by letter successor varieties (1974) 0.01
    0.0067291465 = product of:
      0.047104023 = sum of:
        0.023552012 = weight(_text_:classification in 4997) [ClassicSimilarity], result of:
          0.023552012 = score(doc=4997,freq=2.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.24630459 = fieldWeight in 4997, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.0546875 = fieldNorm(doc=4997)
        0.023552012 = weight(_text_:classification in 4997) [ClassicSimilarity], result of:
          0.023552012 = score(doc=4997,freq=2.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.24630459 = fieldWeight in 4997, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.0546875 = fieldNorm(doc=4997)
      0.14285715 = coord(2/14)
    
    Abstract
    This paper describes a method for automatically segmenting words into their stems and affixes. The process uses certain statistical properties of corpus (successor and predecessor letter variety counts) to indicate where words should be divided. Consequently, this process is less reliant on human intervention than are other methods for automated stemming. The segmentation system is used to construct stem dictionariesfor documnet classification. Information retrieval experiments are then performed using documents and queries so classified. Results show not only that this method is capable of high quality word segmentation, but also that its use in information retrieval produce results that are at least as good as thosse obtained using the more traditional stemming process.