
Automatic Text Summarisation | |
Keywords: | summary, latent semantic analysis, summarisation, summarization, summary evaluation, sentence compression, paraphrasing, news, social media |
Description: | As the Internet grows, a huge amount of information becomes available online, and automatic summarisation can help curb the resulting information overload. Topics studied in this project are: language-independent summarisation (LSA, LDA) of news, social media and scientific papers; summarisation evaluation in multiple languages; opinion and comparative summarisation; using coreference for summarisation; and summary generation (sentence compression and paraphrasing). |
Status: | Finished |
People on this project:

Josef Steinberger
E-mail: jstein@kiv.zcu.cz
Josef is an associate professor at the Department of Computer Science and Engineering at the University of West Bohemia in Pilsen, Czech Republic. He is interested in media monitoring and analysis, mainly automatic text summarisation, sentiment analysis and coreference resolution.

Karel Ježek
Phone: +420 377632475
E-mail: jezek_ka@kiv.zcu.cz
WWW: https://cs.wikipedia.org/wiki/Karel_Je%C5%BEek_(informatik)
Karel is the former group coordinator and a supervisor of PhD students working on the group's research projects.

Michal Campr
E-mail: mcampr@kiv.zcu.cz
WWW: http://home.zcu.cz/~mcampr/
Michal graduated from the University of West Bohemia in 2011, specialising in software engineering. He is interested in text summarization.

Jiří Hynek
Phone: +420 603492837
E-mail: jhynek@kiv.zcu.cz
WWW: http://www.kiv.zcu.cz/staff/osobni.php?id_osoby=147&lang=EN
Jiří, a co-founder of the Text-Mining Research Group, works as a lecturer at the Dept. of Computer Science and Engineering. His research interests include machine learning and language-related problems. Jiří’s teaching focuses on good writing style and technical writing in general.
Publications:

Aspects of Multilingual News Summarisation | |
Authors: | Josef Steinberger, Hristo Tanev, Ralf Steinberger, Vanni Zavarella, Marco Turchi |
Source: | Steinberger, J., Tanev, H., Zavarella, V., Steinberger, R., Turchi, M. (2014): Aspects of Multilingual News Summarisation. In: Alessandro Fiori (ed.): Innovative Document Summarization Techniques: Revolutionizing Knowledge Understanding, Advances in Data Mining and Database Management series, pages 277-294, ISSN: 2327-1981, ISBN 978-1-4666-5019-0, DOI: 10.4018/978-1-4666-5019-0.ch012, IGI Global. |

Comparative Summarization via Latent Dirichlet Allocation | |
Authors: | Michal Campr, Karel Ježek |
Source: | Dateso 2013, pp. 80–86, ISBN 978-80-248-2968-5 |

Multi-document multilingual summarization corpus preparation, Part 2: Czech, Hebrew and Spanish | |
Authors: | M. Elhadad, S. Miranda-Jiménez, Josef Steinberger, G. Giannakopoulos |
Source: | Elhadad, M., Miranda-Jiménez, S., Steinberger, J. and Giannakopoulos, G.: Multi-document multilingual summarization corpus preparation, Part 2: Czech, Hebrew and Spanish. In: Proceedings of the MultiLing 2013 Workshop on Multilingual Multi-document Summarization, pages 13-19, ACL. Sofia, Bulgaria, 2013. |

Multilingual Statistical News Summarization | |
Authors: | Mijail Alexandrov Kabadjov, Josef Steinberger, Ralf Steinberger |
Source: | In: Thierry Poibeau, Horacio Saggion, Jakub Piskorski & Roman Yangarber (eds), Multi-source, Multilingual Information Extraction and Summarization, pages 229-252, ISSN: 2192-032X, DOI: 10.1007/978-3-642-28569-1_11, Springer. 2013. |
ISSN: | 2192-032X |

Multilingual Summarisation and Sentiment Analysis | |
Authors: | Josef Steinberger |
Source: | Josef Steinberger: Multilingual Summarisation and Sentiment Analysis, Habilitation thesis. University of West Bohemia, April, 2013. |

The UWB Summariser at Multiling-2013 | |
Authors: | Josef Steinberger |
Source: | Steinberger, J.: The UWB Summariser at Multiling-2013. In: Proceedings of the MultiLing 2013 Workshop on Multilingual Multi-document Summarization, pages 50-54, ACL. Sofia, Bulgaria, 2013. |

Topic models for comparative summarization | |
Authors: | Michal Campr, Karel Ježek |
Source: | Text, Speech, and Dialogue - 16th International Conference, TSD 2013, Pilsen, Czech Republic, September 1-5, 2013. Proceedings. Springer 2013 Lecture Notes in Computer Science ISBN 978-3-642-40584-6 |

Challenges and solutions in the opinion summarization of user-generated content | |
Authors: | Alexandra Balahur, Mijail Alexandrov Kabadjov, Josef Steinberger, Ralf Steinberger, Andrés Montoyo |
Source: | In: Journal of Intelligent Information Systems 39(2), pages 375-398, ISSN: 0925-9902, DOI: 10.1007/s10844-011-0194-z, Springer. 2012. |
ISSN: | 0925-9902 |

Comparative Summarization via Latent Semantic Analysis | |
Authors: | Karel Ježek, Michal Campr |
Source: | Latest Trends in Information Technology, pp. 279-284, ISBN 978-1-61804-134-0, 2012 |

JRC’s Participation at TAC 2011: Guided and Multilingual Summarization Tasks | |
Authors: | Josef Steinberger, Mijail Alexandrov Kabadjov, Ralf Steinberger, Hristo Tanev, Marco Turchi, Vanni Zavarella |
Source: | In: Proceedings of the Text Analysis Conference 2011, National Institute of Standards and Technology (NIST). Gaithersburg, USA, 2012. |

Machine Translation for Multilingual Summary Content Evaluation | |
Authors: | Josef Steinberger, Marco Turchi |
Source: | In: Proceedings of the NAACL Workshop on Evaluation Metrics and System Comparison for Automatic Summarization, pages 19-27, ACL. Montreal, Canada, 2012. |

Relevance Ranking for Translated Texts | |
Authors: | Marco Turchi, Josef Steinberger, Lucia Specia |
Source: | In: Proceedings of 16th Annual Conference of the European Association for Machine Translation. Trento, Italy, 2012. |

TAC 2011 multiling pilot overview | |
Authors: | G. Giannakopoulos, M. El-Haj, B. Favre, M. Litvak, Josef Steinberger, V. Varma |
Source: | In: Proceedings of the Text Analysis Conference 2011, National Institute of Standards and Technology (NIST). Gaithersburg, USA, 2012. |

Aspect-Driven News Summarization | |
Authors: | Josef Steinberger, Hristo Tanev, Mijail Alexandrov Kabadjov, Ralf Steinberger |
Source: | In: International Journal of Computational Linguistics and Applications 2 (1-2), ISSN: 0976-0962, Bahri Publications. 2011. |
ISSN: | 0976-0962 |

Highly Multilingual Coreference Resolution Exploiting a Mature Entity Repository | |
Authors: | Josef Steinberger, Jenya Belyaeva, Jonathan Crawley, Leonida Della-Rocca, Mohamed Ebrahim, Maud Ehrmann, Mijail Alexandrov Kabadjov, Ralf Steinberger, Erik van der Goot |
Source: | In: Proceedings of the 8th International Conference Recent Advances in Natural Language Processing, pages 254-260. Hissar, Bulgaria, 2011. |

JRC’s Participation in the Guided Summarization Task at TAC 2010 | |
Authors: | Josef Steinberger, Hristo Tanev, Mijail Alexandrov Kabadjov, Ralf Steinberger |
Source: | In: Proceedings of the Text Analysis Conference 2010, National Institute of Standards and Technology (NIST). Gaithersburg, USA, 2011. |

Enhancing N-Gram-based Summary Evaluation Using Information Content and a Taxonomy | |
Authors: | Mijail Alexandrov Kabadjov, Josef Steinberger, Ralf Steinberger, Massimo Poesio, Bruno Pouliquen |
Source: | In: Advances in Information Retrieval, Lecture Notes in Computer Science 5993, pages 662-666, ISSN: 0302-9743, DOI: 10.1007/978-3-642-12275-0_71, Springer, 2010. |
ISSN: | 0302-9743 |

Exploiting Higher-level Semantic Information for the Opinion-oriented Summarization of Blogs | |
Authors: | Alexandra Balahur, Mijail Alexandrov Kabadjov, Josef Steinberger |
Source: | In: International Journal of Computational Linguistics and Applications 1 (1-2), pages 45-59, ISSN: 0976-0962, Bahri Publications, 2010. |
ISSN: | 0976-0962 |

In Czech: Sumarizace textů | |
Authors: | Karel Ježek, Josef Steinberger |
Source: | Proceedings of Annual Database Conference DATAKON, October 16-19, 2010, Mikulov, Czech Rep., pp. 3-23, ISBN 978-80-7368-424-2 |

NewsGist: A Multilingual Statistical News Summarizer | |
Authors: | Mijail Alexandrov Kabadjov, Martin Atkinson, Josef Steinberger, Ralf Steinberger, Erik van der Goot |
Source: | In: Machine Learning and Knowledge Discovery in Databases, Lecture Notes in Computer Science 6323, pages 591-594, ISSN: 0302-9743, DOI: 10.1007/978-3-642-15939-8_40, Springer. 2010. |
ISSN: | 0302-9743 |

Using Parallel Corpora for Multilingual (Multi-Document) Summarisation Evaluation | |
Authors: | Marco Turchi, Josef Steinberger, Mijail Alexandrov Kabadjov, Ralf Steinberger |
Source: | In: Multilingual and Multimodal Information Access Evaluation, Springer Lecture Notes in Computer Science 6360/2010, pages 52-63, ISSN: 0302-9743, DOI: 10.1007/978-3-642-15998-5_7, Springer. 2010. |
ISSN: | 0302-9743 |

WB-JRC-UT’s Participation in TAC 2009: Update Summarization and AESOP Tasks | |
Authors: | Josef Steinberger, Mijail Alexandrov Kabadjov, Bruno Pouliquen, Ralf Steinberger, Massimo Poesio |
Source: | In: Proceedings of the Text Analysis Conference 2009, National Institute of Standards and Technology. Gaithersburg, USA, 2010. |

Wrapping up a Summary: from Representation to Generation | |
Authors: | Josef Steinberger, Marco Turchi, Mijail Alexandrov Kabadjov, Ralf Steinberger |
Source: | In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pages 382-386, ACL. Uppsala, Sweden, 2010. |

Evaluation Measures for Text Summarization | |
Authors: | Josef Steinberger, Karel Ježek |
Source: | In Computing and Informatics, volume 28 (2009), number 2, pages 251-275, Slovak Academy of Sciences, ISSN 1335-9150. |
ISSN: | 1335-9150 |

In Czech: Aktualizační sumarizace textů | |
Authors: | Josef Steinberger |
Source: | In Proceedings of Znalosti 2009, Brno, Czech Republic, February 2009, pp. 234–245, ISBN 978-80-227-3015-0. |

Multilingual Statistical News Summarisation: Preliminary Experiments with English | |
Authors: | Mijail Alexandrov Kabadjov, Josef Steinberger, Bruno Pouliquen, Ralf Steinberger, Massimo Poesio |
Source: | In Proceedings of the workshop 'Intelligent Analysis and Processing of Web News Content' (WI-IAT'09). Milano, Italy, IEEE-CS Press, September 2009. ISBN 978-0-7695-3801-3. |

SUTLER: Update Summarizer Based on Latent Topics | |
Authors: | Josef Steinberger, Karel Ježek |
Source: | In Proceedings of TAC'08, NIST, Gaithersburg, United States, 2009. |

Summarizing Opinions in Blog Threads | |
Authors: | Alexandra Balahur, Mijail Alexandrov Kabadjov, Josef Steinberger, Ralf Steinberger, Andrés Montoyo |
Source: | In: Proceedings of the 23rd Pacific Asia Conference on Language, Information and Computation (PACLIC), pages 606-613. Hong Kong, 2009. |

Text Summarization: An Old Challenge and New Approaches | |
Authors: | Josef Steinberger, Karel Ježek |
Source: | Foundations of Computational Intelligence Vol. 6, pages 127-149, Data Mining Book Series, Springer 2009, ISSN 1860-949X, ISBN 978-3-642-01090-3 |
ISSN: | 1860-949X |

Update Summarization Based on Latent Semantic Analysis | |
Authors: | Josef Steinberger, Karel Ježek |
Source: | In Proceedings of 12th International Conference, TSD 2009, Pilsen, Czech Republic, September 2009. LNAI 5729, Springer-Verlag Berlin Heidelberg New York, ISBN 978-3-642-04207-2, ISSN 0302-9743. |
ISSN: | 0302-9743 |

Update Summarization Based on Novel Topic Distribution | |
Authors: | Josef Steinberger, Karel Ježek |
Source: | In Proceedings of the 2009 ACM Symposium on Document Engineering, Munich, Germany, September 2009. Association for Computing Machinery, ISBN 978-1-60558-575-8. |

Automatic Text Summarization (The state of the art 2007 and new challenges) | |
Authors: | Karel Ježek, Josef Steinberger |
Source: | In Proceedings of Znalosti 2008, Bratislava, Slovakia, February 2008, pp. 1–12, ISBN 978-80-227-2827-0. |

Web Topic Summarization | |
Authors: | Josef Steinberger, Karel Ježek, Martin Sloup |
Source: | Proceedings of the 12th International Conference on Electronic Publishing, ISBN 978-0-7727-6315-0, pp. 322-334, Toronto, Canada 2008 |

Identifying Novel Information using Latent Semantic Analysis in the WiQA Task at CLEF 2006 | |
Authors: | Richard F. E. Sutcliffe, Josef Steinberger, Udo Kruschwitz, Massimo Poesio, Mijail Alexandrov Kabadjov |
Source: | In Lecture Notes in Computer Science 4730, 2007, pp. 541-549, Springer-Verlag Berlin Heidelberg, ISSN 0302-9743, ISBN 978-3-540-74998-1. |
ISSN: | 0302-9743 |

Knowledge-poor Multilingual Sentence Compression | |
Authors: | Josef Steinberger, Roman Tesař |
Source: | In Proceedings of 7th Conference on Language Engineering, Cairo, Egypt, December 2007, pp. 369-379, The Egyptian Society of Language Engineering. |

LSA-Based Multi-Document Summarization | |
Authors: | Josef Steinberger, Martin Křišťan |
Source: | In Proceedings of 8th International PhD Workshop on Systems and Control, a Young Generation Viewpoint, Balatonfured, Hungary, September 2007, pp. 87-91, ISBN 978-963-311-365-3. |

Text Summarization within the LSA Framework | |
Authors: | Josef Steinberger |
Source: | PhD Thesis, University of West Bohemia in Pilsen, Czech Republic, January 2007. |

Two Uses of Anaphora Resolution in Summarization | |
Authors: | Josef Steinberger, Massimo Poesio, Mijail Alexandrov Kabadjov, Karel Ježek |
Source: | In Special Issue of Information Processing & Management on Summarization, volume 43, issue 6, November 2007, Elsevier Ltd., pp. 1663-1680. ISSN 0306-4573. |
ISSN: | 0306-4573 |

Searching and Summarizing in Multilingual Environment | |
Authors: | Michal Toman, Josef Steinberger, Karel Ježek |
Source: | In Proceedings of the 10th International Conference on Electronic Publishing, Bansko, Bulgaria, June 2006, pp. 257-265, FOI-Commerce, ISBN 954-16-0049-9. |

Sentence Compression for the LSA-based Summarizer | |
Authors: | Josef Steinberger, Karel Ježek |
Source: | In Proceedings of the 7th International Conference on Information Systems Implementation and Modelling, Přerov, Czech Republic, April 2006, pp. 141-148, MARQ Ostrava, ISBN 80-86840-19-0. |

Improving LSA-based Summarization with Anaphora Resolution | |
Authors: | Josef Steinberger, Mijail Alexandrov Kabadjov, Massimo Poesio |
Source: | In Proceedings of Human Language Technology Conference/Conference on Empirical Methods in Natural Language Processing, Vancouver, Canada, October 2005, pp. 1–8, The Association for Computational Linguistics, ISBN 1-932432-55-8. |

In Czech: Hodnocení kvality sumarizátorů textů | |
Authors: | Josef Steinberger, Karel Ježek |
Source: | In Proceedings of Znalosti 2005 Conference, Stará Lesná, Slovakia, February 2005, pp. 96–107, ISBN 80-248-0755-6. |

Text Summarization and Singular Value Decomposition | |
Authors: | Josef Steinberger, Karel Ježek |
Source: | Proceedings of the 3rd International Conference on Advances in Information Systems, Lecture Notes in Computer Science 2457, October 2004, pp. 245–254, Springer-Verlag Berlin Heidelberg, ISSN 0302-9743, ISBN 3-540-23478-0. |
ISSN: | 0302-9743 |

Using Latent Semantic Analysis in Text Summarization and Summary Evaluation | |
Authors: | Josef Steinberger, Karel Ježek |
Source: | In Proceedings of the 5th International Conference on Information Systems Implementation and Modelling, Rožnov p. Radhoštěm, Czech Republic, April 2004, pp. 93–100, MARQ Ostrava, ISBN 80-85988-99-2. |

A Practical Approach to Automatic Text Summarization | |
Authors: | Jiří Hynek, Karel Ježek |
Source: | Proceedings of the 7th ICCC/IFIP International Conference on Electronic Publishing – ELPUB2003 Guimaraes, Portugal, Sely Costa et al. (Eds). Universidade de Minho, Portugal, ISBN 972-98921-2-1 |
Related Downloads:

Almus: Automatic Text Summarizer | |
Size: | 2 kB |
Desc: | The system creates a summary of a set of documents dealing with the same topic. It can also generate an update summary when the basic document collection is specified. The summarization method is based on latent semantic analysis. |
Related: | Automatic Text Summarisation |
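
The Almus description names latent semantic analysis (LSA) as the basis of its summarization method. The Python sketch below is only an illustration of one common LSA sentence-scoring scheme, not the Almus code: each sentence is scored by the length of its vector in the reduced latent space, with components weighted by the singular values. All function names (`term_sentence_matrix`, `top_eigenpairs`, `lsa_scores`, `summarise`) are hypothetical. The sketch assumes raw term counts with no stop-word removal, and it replaces a library SVD with power iteration on the sentence-by-sentence Gram matrix so that it runs on the standard library alone.

```python
import math
import re


def term_sentence_matrix(sentences):
    """Raw term-frequency matrix: rows are terms, columns are sentences."""
    tokenised = [re.findall(r"\w+", s.lower()) for s in sentences]
    vocab = sorted({t for toks in tokenised for t in toks})
    index = {t: i for i, t in enumerate(vocab)}
    matrix = [[0.0] * len(sentences) for _ in vocab]
    for j, toks in enumerate(tokenised):
        for t in toks:
            matrix[index[t]][j] += 1.0
    return matrix


def top_eigenpairs(gram, dims, iterations=200):
    """Leading eigenpairs of a symmetric PSD matrix via power iteration
    with deflation (an assumption standing in for a library SVD)."""
    n = len(gram)
    g = [row[:] for row in gram]
    pairs = []
    for _dim in range(min(dims, n)):
        # Fixed, non-uniform start vector keeps the sketch deterministic.
        v = [1.0 / (i + 1) for i in range(n)]
        start_norm = math.sqrt(sum(x * x for x in v))
        v = [x / start_norm for x in v]
        for _step in range(iterations):
            w = [sum(g[i][j] * v[j] for j in range(n)) for i in range(n)]
            norm = math.sqrt(sum(x * x for x in w))
            if norm < 1e-12:
                break
            v = [x / norm for x in w]
        # Rayleigh quotient: the eigenvalue equals a squared singular value.
        eigval = sum(v[i] * sum(g[i][j] * v[j] for j in range(n)) for i in range(n))
        if eigval < 1e-12:
            break
        pairs.append((eigval, v))
        # Deflate so the next pass finds the next latent dimension.
        for i in range(n):
            for j in range(n):
                g[i][j] -= eigval * v[i] * v[j]
    return pairs


def lsa_scores(sentences, dims=2):
    """Score sentence k as sqrt(sum_i sigma_i^2 * v_i[k]^2) over the top dims."""
    a = term_sentence_matrix(sentences)
    n = len(sentences)
    gram = [[sum(row[i] * row[j] for row in a) for j in range(n)] for i in range(n)]
    scores = [0.0] * n
    for eigval, v in top_eigenpairs(gram, dims):
        for k in range(n):
            scores[k] += eigval * v[k] ** 2
    return [math.sqrt(s) for s in scores]


def summarise(sentences, n_sentences=1, dims=2):
    """Return the highest-scoring sentences, kept in document order."""
    scores = lsa_scores(sentences, dims)
    ranked = sorted(range(len(sentences)), key=lambda k: scores[k], reverse=True)
    return [sentences[k] for k in sorted(ranked[:n_sentences])]


docs = ["Cats chase mice.", "Cats chase birds.", "Stocks fell."]
print(summarise(docs, n_sentences=2, dims=1))
```

With a single latent dimension, the two sentences that share the dominant topic are selected and the unrelated third sentence scores close to zero. An update summary, as mentioned in the description, would additionally need to compare the new documents against the topics of the basic collection, which this sketch does not attempt.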