Retrieving Citations on the Web

Retrieving Citations on the Web

A fundamental feature of research papers ishow many times they are cited in other articles, i.e. howmany later references to them there are. That is the onlyobjective way of evaluation how important or novel a paper'sideas are. With an increasing number of articlesavailable online, it has become possible to find these citationsin a more or less automated way. This paper firstdescribes existing possibilities of citations retrieval andindexing and then introduces CiteSeeker – a tool for a fullyautomated citations retrieval. CiteSeeker starts crawlingthe World Wide Web from given start points and searchesfor specified authors and publications in a fuzzy manner.That means that certain inaccuracies in the inputs aretaken into account. CiteSeeker treats all common Internetfile formats, including PostScript and PDF documents andarchives. The project is based on the .NET technology.

Keywords: Citations, Retrieval, Web, Fuzzy Search, .NET, C#

Year: 2004

Download: download Full text [88 kB]

Authors of this publication:


Dalibor Fiala


Phone: +420 377 63 2429
E-mail: dalfia@kiv.zcu.cz
WWW: http://www.kiv.zcu.cz/~dalfia/

Dalibor is the research group coordinator and an associate professor at the Department of Computer Science and Engineering at the University of West Bohemia in Pilsen, Czech Republic. He is interested in data mining, web mining, information retrieval, informetrics, and information science.

Karel Ježek


Phone:  +420 377632475
E-mail: jezek_ka@kiv.zcu.cz
WWW: https://cs.wikipedia.org/wiki/Karel_Je%C5%BEek_(informatik)

Karel is the former group coordinator and a supervisor of PhD students working at research projects of this Group.

Related Projects:


Project

Social Networks Analysis

Authors:  Karel Ježek, Dalibor Fiala, Michal Nykl
Desc.:Application of the PageRank algorithm and its modifications to the exploration of network structures, particularly citation and co-autorship networks.