Search (1 results, page 1 of 1)

  • × author_ss:"Cheng, K.-H."
  • × theme_ss:"Automatisches Indexieren"
  1. Cheng, K.-H.: Automatic identification for topics of electronic documents (1997) 0.03
    0.027969502 = product of:
      0.083908506 = sum of:
        0.083908506 = weight(_text_:electronic in 1811) [ClassicSimilarity], result of:
          0.083908506 = score(doc=1811,freq=4.0), product of:
            0.19623034 = queryWeight, product of:
              3.9095051 = idf(docFreq=2409, maxDocs=44218)
              0.05019314 = queryNorm
            0.4276021 = fieldWeight in 1811, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.9095051 = idf(docFreq=2409, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1811)
      0.33333334 = coord(1/3)
    
    Abstract
    With the rapid rise in numbers of electronic documents on the Internet, how to effectively assign topics to documents become an important issue. Current research in this area focuses on the behaviour of nouns in documents. Proposes, however, that nouns and verbs together contribute to the process of topic identification. Constructs a mathematical model taking into account the following factors: word importance, word frequency, word co-occurence, and word distance. Preliminary experiments ahow that the performance of the proposed model is equivalent to that of a human being