
Multilingual Statistical News Summarisation: Preliminary Experiments with English
In this paper we present a generic approach for summarising multilingual news clusters such as the ones produced by the Europe Media Monitor (EMM) system. It is generic because it uses robust statistical techniques to perform the summarisation step, and its multilinguality is inherited from the multilingual entity disambiguation system used to build the source representation. We ran preliminary experiments with the TAC 2008 data, an English corpus for summarisation research, and we obtained promising improvements over a summarisation system ranked in the top 20% at the TAC 2008 competition.
Keywords: Text Summarization; Multilingual Text Mining; Entity Disambiguation; Latent Semantic Analysis
Year: 2009
Authors of this publication:

Josef Steinberger
E-mail: jstein@kiv.zcu.cz
Related Projects:

Automatic Text Summarisation
Authors: Josef Steinberger, Karel Ježek, Michal Campr, Jiří Hynek
Description: Automatic text summarisation using various text mining methods, mainly Latent Semantic Analysis (LSA).