
Retrieving Citations on the Web
A fundamental feature of research papers ishow many times they are cited in other articles, i.e. howmany later references to them there are. That is the onlyobjective way of evaluation how important or novel a paper'sideas are. With an increasing number of articlesavailable online, it has become possible to find these citationsin a more or less automated way. This paper firstdescribes existing possibilities of citations retrieval andindexing and then introduces CiteSeeker – a tool for a fullyautomated citations retrieval. CiteSeeker starts crawlingthe World Wide Web from given start points and searchesfor specified authors and publications in a fuzzy manner.That means that certain inaccuracies in the inputs aretaken into account. CiteSeeker treats all common Internetfile formats, including PostScript and PDF documents andarchives. The project is based on the .NET technology.
Keywords: Citations, Retrieval, Web, Fuzzy Search, .NET, C#
Year: 2004

Authors of this publication:

Dalibor Fiala
E-mail: dalfia@kiv.zcu.cz
WWW: http://www.kiv.zcu.cz/~dalfia/

Karel Ježek
Phone: +420 377632475, 377632400
E-mail: jezek_ka@kiv.zcu.cz
WWW: http://www-kiv.zcu.cz/~jezek_ka/