
Retrieving Citations on the Web
A fundamental feature of research papers is how many times they are cited in other articles, i.e. how many later references to them there are. That is the only objective way of evaluating how important or novel a paper's ideas are. With an increasing number of articles available online, it has become possible to find these citations in a more or less automated way. This paper first describes existing possibilities for citation retrieval and indexing and then introduces CiteSeeker, a tool for fully automated citation retrieval. CiteSeeker starts crawling the World Wide Web from given start points and searches for specified authors and publications in a fuzzy manner. That means that certain inaccuracies in the inputs are taken into account. CiteSeeker handles all common Internet file formats, including PostScript and PDF documents and archives. The project is based on the .NET technology.
Keywords: Citations, Retrieval, Web, Fuzzy Search, .NET, C#
Year: 2004
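
The fuzzy search mentioned in the abstract can be illustrated with a minimal sketch. It assumes that "fuzzy" means tolerating a small number of character edits (insertions, deletions, substitutions) between the searched name and the text; the abstract does not say which matching method CiteSeeker actually uses. The function name fuzzy_contains and the parameter max_errors are hypothetical, and the sketch is written in Python for brevity rather than in the C# of the original project.

```python
def fuzzy_contains(text: str, query: str, max_errors: int = 1) -> bool:
    """Return True if some substring of `text` is within `max_errors`
    character edits of `query` (case-insensitive).

    Illustrative assumption only: this is a standard approximate substring
    search based on edit distance, not CiteSeeker's documented method.
    """
    text, query = text.lower(), query.lower()
    # column[i] = fewest edits needed to match query[:i] against some
    # substring of the text that ends at the current position.
    column = list(range(len(query) + 1))
    if column[-1] <= max_errors:
        return True  # query is so short that it matches trivially
    for ch in text:
        new_column = [0]  # a match may start at any position in the text
        for i, q in enumerate(query, start=1):
            cost = 0 if q == ch else 1
            new_column.append(min(
                column[i] + 1,         # extra character in the text
                new_column[i - 1] + 1, # query character missing from the text
                column[i - 1] + cost,  # match or substitution
            ))
        column = new_column
        if column[-1] <= max_errors:
            return True
    return False


# A misspelled author name is still found when one error is allowed.
print(fuzzy_contains("as shown by D. Fialla and K. Jezek", "Fiala", max_errors=1))
```

Raising max_errors makes the search more tolerant of misspellings but also more likely to report unrelated text, so a real crawler would presumably choose the tolerance relative to the length of the name being searched.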

Authors of this publication:

Dalibor Fiala
Phone: +420 377 632 429
E-mail: dalfia@kiv.zcu.cz
WWW: http://www.kiv.zcu.cz/~dalfia/

Karel Ježek
Phone: +420 377 632 475
E-mail: jezek_ka@kiv.zcu.cz
WWW: https://cs.wikipedia.org/wiki/Karel_Je%C5%BEek_(informatik)
Related Projects:

Social Networks Analysis
Authors: Karel Ježek, Dalibor Fiala, Michal Nykl
Description: Application of the PageRank algorithm and its modifications to the exploration of network structures, particularly citation and co-authorship networks.