Search (1 result, page 1 of 1)

Yeh, J.-Y.; Ke, H.-R.; Yang, W.-P.; Meng, I.-H.: Text summarization using a trainable summarizer and latent semantic analysis (2005) 0.00
```
2.4720532E-4 = product of:
  0.005685722 = sum of:
    0.005685722 = product of:
      0.011371444 = sum of:
        0.011371444 = weight(_text_:1 in 1003) [ClassicSimilarity], result of:
          0.011371444 = score(doc=1003,freq=4.0), product of:
            0.059252728 = queryWeight, product of:
              2.4565027 = idf(docFreq=10304, maxDocs=44218)
              0.024120767 = queryNorm
            0.19191428 = fieldWeight in 1003, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              2.4565027 = idf(docFreq=10304, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1003)
      0.5 = coord(1/2)
  0.04347826 = coord(1/23)
```
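The arithmetic in the explain output above can be checked by hand. The following is a minimal sketch, assuming the standard ClassicSimilarity formulas (`tf = sqrt(freq)` and `idf = 1 + ln(maxDocs / (docFreq + 1))`). The `queryNorm` and `fieldNorm` values are copied from the output rather than derived, and the variable names are illustrative only.

```python
import math

# Values copied from the explain output above.
freq = 4.0                # termFreq of "1" in doc 1003
doc_freq = 10304          # documents containing the term
max_docs = 44218          # documents in the index
query_norm = 0.024120767  # taken as given, not re-derived here
field_norm = 0.0390625    # taken as given, not re-derived here

# ClassicSimilarity term statistics.
tf = math.sqrt(freq)                             # 2.0
idf = 1.0 + math.log(max_docs / (doc_freq + 1))  # about 2.4565

# The two factors of the per-term score.
query_weight = idf * query_norm
field_weight = tf * idf * field_norm
term_score = query_weight * field_weight

# Coordination factors: 1 of 2 clauses matched, then 1 of 23.
final_score = term_score * (1 / 2) * (1 / 23)

print(f"idf={idf:.7f} term_score={term_score:.9f} final={final_score:.7e}")
```

The result is tiny (about 2.47E-4) because only one of 23 top-level query clauses matched, which explains the 0.00 displayed next to the record title.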
Abstract

This paper proposes two approaches to text summarization: a modified corpus-based approach (MCBA) and an LSA-based T.R.M. approach (LSA + T.R.M.). The first is a trainable summarizer that generates summaries from several features: position, positive keyword, negative keyword, centrality, and resemblance to the title. Two new ideas are exploited: (1) sentence positions are ranked to emphasize the significance of different sentence positions, and (2) the score function is trained by a genetic algorithm (GA) to obtain a suitable combination of feature weights. The second uses latent semantic analysis (LSA) to derive the semantic matrix of a document or a corpus and uses semantic sentence representation to construct a semantic text relationship map. We evaluate LSA + T.R.M. both with single documents and at the corpus level to investigate the competence of LSA in text summarization. The two novel approaches were measured at several compression rates on a data corpus composed of 100 political articles. At a compression rate of 30%, the average f-measures were 49% for MCBA, 52% for MCBA + GA, and 44% and 40% for LSA + T.R.M. at the single-document and corpus levels, respectively.
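The feature-weighted scoring idea in the abstract can be illustrated with a small sketch. It is not the paper's implementation. The linear score function, the feature values, the weights, and the reference summary are all invented for illustration, and the genetic-algorithm training of the weights is left out (the weights are fixed by hand).

```python
# Feature names follow the abstract's list; everything else is hypothetical.
FEATURES = ("position", "positive_keyword", "negative_keyword",
            "centrality", "title_resemblance")

def score(features, weights):
    """Linear combination of feature values (an assumed score form)."""
    return sum(weights[name] * features[name] for name in FEATURES)

def summarize(sentence_features, weights, compression_rate):
    """Return indices of the top-scoring sentences, in document order."""
    n = len(sentence_features)
    k = max(1, round(compression_rate * n))
    ranked = sorted(range(n),
                    key=lambda i: score(sentence_features[i], weights),
                    reverse=True)
    return sorted(ranked[:k])

def f_measure(selected, reference):
    """Harmonic mean of precision and recall over sentence indices."""
    selected, reference = set(selected), set(reference)
    overlap = len(selected & reference)
    if overlap == 0:
        return 0.0
    precision = overlap / len(selected)
    recall = overlap / len(reference)
    return 2 * precision * recall / (precision + recall)

# Hand-picked weights; the negative keyword feature is penalized.
weights = {"position": 0.3, "positive_keyword": 0.3,
           "negative_keyword": -0.2, "centrality": 0.1,
           "title_resemblance": 0.1}

# Made-up feature values for a five-sentence document.
rows = [
    (1.0, 0.8, 0.0, 0.6, 0.9),
    (0.5, 0.1, 0.7, 0.2, 0.1),
    (0.4, 0.9, 0.1, 0.8, 0.5),
    (0.2, 0.2, 0.5, 0.3, 0.0),
    (0.6, 0.3, 0.0, 0.4, 0.2),
]
sentences = [dict(zip(FEATURES, row)) for row in rows]

summary = summarize(sentences, weights, compression_rate=0.4)
reference = [0, 4]  # hypothetical human-selected sentences
print(summary, f_measure(summary, reference))
```

In a trainable setting, a search procedure such as a GA would adjust `weights` to maximize the average f-measure over a training corpus; here that step is only implied.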

Source

Information processing and management. 41(2005) no.1, pp.75-95