![In Czech: Klasifikace multilinguálních korpusů s využitím tezauru EuroWordNet](./img/book.gif)
In Czech: Klasifikace multilinguálních korpusů s využitím tezauru EuroWordNet
Classification of Multilingual Corpora using the EuroWordNet Thesaurus
This paper deals with experiment results for multilingual document categorization. We describe a comparison of algorithmic principles and the methodologies used in our classification system. The aim of experiments was to verify the impact of multilingual thesaurus use on the quality of cross-language categorization. We present our results at the end of this article.
Keywords: classification, text corpus, thesaurus, EuroWordNet
Year: 2004
Download:
Full text [139 kB]
![download](./img/folder_inbox.gif)
Authors of this publication:
![](./photos/930_Toman_Michal.png)
Michal Toman
E-mail: mtoman@kiv.zcu.cz
Michal graduated at UWB in 2003, specialized in software engineering. Currently, he is a PhD student interested in information retrieval, multilingual text processing, word sense disambiguation and knowledge discovery.
![](./photos/337_Jezek_Karel.jpg)
Karel Ježek
Phone: +420 377632475
E-mail: jezek_ka@kiv.zcu.cz
WWW: https://cs.wikipedia.org/wiki/Karel_Je%C5%BEek_(informatik)
Karel is the former group coordinator and a supervisor of PhD students working at research projects of this Group.