Extracting information from CiteSeer’s textual data

This article deals with CiteSeer, a free online digital library and search engine for research papers, mainly in computer science. It first discusses CiteSeer’s features and structure and then presents the useful information on publications and author collaborations that can be extracted from its textual data. We show the basic properties of both the publication citation graph and the author citation graph. Moreover, we discuss several parameters based on the structure of the author collaboration graph and show their main statistical properties.
The available full text is a preprint of the article.

Keywords: CiteSeer, publications, citations, researchers, collaboration.

Year: 2013

Journal ISSN: 1992-8645
Download: Full text [304 kB]

Authors of this publication:

Dalibor Fiala

E-mail: dalfia@kiv.zcu.cz
WWW: http://www.kiv.zcu.cz/~dalfia/

Dalibor is an associate professor at the Department of Computer Science and Engineering at the University of West Bohemia in Pilsen, Czech Republic. He is interested in web mining, information retrieval, and information science.

Related Projects:


Social Networks Analysis

Authors: Karel Ježek, Dalibor Fiala, Michal Nykl
Desc.: Application of the PageRank algorithm and its modifications to the exploration of network structures, particularly citation and coauthorship networks.