Retrieving Citations on the Web

A fundamental feature of research papers is how many times they are cited in other articles, i.e. how many later references to them there are. That is the only objective way of evaluating how important or novel a paper's ideas are. With an increasing number of articles available online, it has become possible to find these citations in a more or less automated way. This paper first describes existing possibilities of citation retrieval and indexing and then introduces CiteSeeker, a tool for fully automated citation retrieval. CiteSeeker starts crawling the World Wide Web from given start points and searches for specified authors and publications in a fuzzy manner. That means that certain inaccuracies in the inputs are taken into account. CiteSeeker handles all common Internet file formats, including PostScript and PDF documents and archives. The project is based on the .NET technology.

Keywords: Citations, Retrieval, Web, Fuzzy Search, .NET, C#

Year: 2004

Download: Full text [88 kB]

Authors of this publication:


Dalibor Fiala


E-mail: dalfia@kiv.zcu.cz
WWW: http://www.kiv.zcu.cz/~dalfia/

Dalibor is an associate professor at the Department of Computer Science and Engineering at the University of West Bohemia in Pilsen, Czech Republic. He is interested in web mining, information retrieval, and information science.

Karel Ježek


Phone:  +420 377632475, 377632400
E-mail: jezek_ka@kiv.zcu.cz
WWW: http://www-kiv.zcu.cz/~jezek_ka/

Karel is a group coordinator and a supervisor of PhD students working on the research projects of this group.