Analysis of direct citation, co-citation and bibliographic coupling in scientific topic identification

Abstract

In our study, we examine the impact of citation network structures on the ability to discern valuable research topics in Computer Science literature. We use the bibliographic information available in the DBLP database to extract candidate phrases from scientific paper abstracts. Following that, we construct citation networks based on direct citation, co-citation and bibliographic coupling relationships between the papers. The candidate research topics, in the form of keyphrases and n-grammes, are subsequently ranked and filtered by a graph-text ranking algorithm. This selection of the highest ranked potential topics is further evaluated by domain experts and through the Wikipedia knowledge base. The results obtained from these citation networks are complementary, returning valid but non-overlapping output phrases between some pairs of networks. In particular, bibliographic coupling appears to capture more unique information than either direct citation or co-citation. These findings point towards the possible added value in combining bibliographic coupling analysis with other structures. At the same time, combining direct citation and co-citation is put into question. We expect our findings to be utilised in method design for research topic identification.

Publication
In Journal of Information Science
Rajmund Klemiński
Rajmund Klemiński
PhD Student

I am a PhD student with an interest in the analysis of science, otherwise known as scientometrics. My work revolves around networked data, its processing and representation.

Tomasz Kajdanowicz
Tomasz Kajdanowicz
Associate Professor, head of faculty’s doctoral studies, head of department

My research interests include representation learning, social network and media analysis, and machine learning.

Przemysław Kazienko
Przemysław Kazienko
Full Professor

My research interests include Social Network Anaysis and .

Related