Retrieving Citations on the Web

A fundamental feature of research papers is how many times they are cited in other articles, i.e. how many later references to them there are. That is the only objective way of evaluating how important or novel a paper's ideas are. With an increasing number of articles available online, it has become possible to find these citations in a more or less automated way. This paper first describes existing possibilities of citation retrieval and indexing and then introduces CiteSeeker, a tool for fully automated citation retrieval. CiteSeeker starts crawling the World Wide Web from given start points and searches for specified authors and publications in a fuzzy manner. That means that certain inaccuracies in the inputs are taken into account. CiteSeeker handles all common Internet file formats, including PostScript and PDF documents and archives. The project is based on the .NET technology.
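
The fuzzy search mentioned in the abstract can be illustrated with a minimal sketch. This is not CiteSeeker's actual algorithm: the tool itself is based on .NET, and the abstract does not specify its matching method. The sketch assumes Python's standard difflib.SequenceMatcher as a stand-in similarity measure; the function names best_fuzzy_match and mentions, and the 0.85 threshold, are hypothetical choices made for illustration only.

```python
from difflib import SequenceMatcher


def best_fuzzy_match(needle: str, text: str) -> tuple[float, str]:
    """Slide a window of len(needle) over text and return (score, window)
    for the window most similar to needle (case-insensitive).

    Illustrative stand-in only; not the matching method used by CiteSeeker.
    """
    needle_norm = needle.lower()
    width = len(needle_norm)
    matcher = SequenceMatcher(autojunk=False)
    # SequenceMatcher caches data about the second sequence, so the
    # fixed needle goes there and the moving window is sequence one.
    matcher.set_seq2(needle_norm)

    best_score, best_window = 0.0, ""
    # If text is shorter than the needle, a single window covers all of it.
    for start in range(max(len(text) - width, 0) + 1):
        window = text[start:start + width]
        matcher.set_seq1(window.lower())
        score = matcher.ratio()  # similarity in [0.0, 1.0]
        if score > best_score:
            best_score, best_window = score, window
    return best_score, best_window


def mentions(needle: str, text: str, threshold: float = 0.85) -> bool:
    """True if some window of text is at least `threshold` similar to needle.

    The 0.85 default is an assumed value, not one taken from the paper.
    """
    score, _ = best_fuzzy_match(needle, text)
    return score >= threshold


if __name__ == "__main__":
    # A reference string with a typo in the title ("Retreiving").
    reference = "D. Fiala and K. Jezek: Retreiving Citations on the Web, 2004."
    print(best_fuzzy_match("Retrieving Citations on the Web", reference))
    print(mentions("Retrieving Citations on the Web", reference))
```

With a threshold below 1.0, a misspelled title or author name in a crawled document can still count as a citation, which is the kind of input inaccuracy the abstract refers to. Comparing every window is quadratic in the worst case, so a real crawler would likely need a faster approximate-matching method.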

Keywords: Citations, Retrieval, Web, Fuzzy Search, .NET, C#

Year: 2004

Download: Full text [88 kB]

Authors of this publication:

Dalibor Fiala

Phone: +420 377 632 429

Dalibor is the research group coordinator and an associate professor at the Department of Computer Science and Engineering at the University of West Bohemia in Pilsen, Czech Republic. He is interested in data mining, web mining, information retrieval, informetrics, and information science.

Karel Ježek

Phone: +420 377 632 475

Karel is the former group coordinator and a supervisor of PhD students working on the research projects of this group.

Related Projects:


Social Networks Analysis

Authors: Karel Ježek, Dalibor Fiala, Michal Nykl
Desc.: Application of the PageRank algorithm and its modifications to the exploration of network structures, particularly citation and co-authorship networks.