
Sentence Compression for the LSA-based Summarizer
We present a simple sentence compression approach for our summarizer based on latentsemantic analysis (LSA). The summarization method assesses each sentence by an LSA score.The compression algorithm removes unimportant clauses from a full sentence. Firstly, a sentence isdivided into clauses by Charniak parser, then compression candidates are generated and finally, thebest candidate is selected to represent the sentence. The candidates gain an importance score whichis directly proportional to its LSA score and indirectly to its length. We evaluated the approachin two ways. By intrinsic evaluation we found that the compressions produced by our algorithmare better than baseline ones but still worse than what humans can make. Then we compared theresulting summaries with human abstracts by a standard n-gram based ROUGE measure.
Keywords: Sentence compression, summarization, latent semantic analysis
Year: 2006

Authors of this publication:

Josef Steinberger
E-mail: jstein@kiv.zcu.cz

Karel Ježek
Phone: +420 377632475
E-mail: jezek_ka@kiv.zcu.cz
WWW: https://cs.wikipedia.org/wiki/Karel_Je%C5%BEek_(informatik)
Related Projects:

Automatic Text Summarisation | |
Authors: | Josef Steinberger, Karel Ježek, Michal Campr, Jiří Hynek |
Desc.: | Automatic text summarisation using various text mining methods, mainly Latent Semantic Analysis (LSA). |