Search (1 results, page 1 of 1)

  • × author_ss:"Srinivasan, P."
  • × theme_ss:"Automatisches Indexieren"
  1. Srinivasan, P.: On generalizing the Two-Poisson Model (1990) 0.01
    0.012324194 = product of:
      0.036972582 = sum of:
        0.036972582 = product of:
          0.073945165 = sum of:
            0.073945165 = weight(_text_:index in 2880) [ClassicSimilarity], result of:
              0.073945165 = score(doc=2880,freq=2.0), product of:
                0.21880072 = queryWeight, product of:
                  4.369764 = idf(docFreq=1520, maxDocs=44218)
                  0.050071523 = queryNorm
                0.33795667 = fieldWeight in 2880, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.369764 = idf(docFreq=1520, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2880)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Abstract
    Automatic indexing is one of the important functions of a modern document retrieval system. Numerous techniques for this function have been proposed in the literature ranging from purely statistical to linguistically complex mechanisms. Most result from examining properties of terms. Examines term distribution within the framework of the Poisson models. Specifically examines the effectiveness of the Two-Poisson and the Three-Poisson model to see if generalisation results in increased effectiveness. The results show that the Two-Poisson model is only moderately effective in identifying index terms. In addition, generalisation to the Three-Poisson does not give any additional power. The only Poisson model which consistently works well is the basic One-Poisson model. Also discusses term distribution information.