Topological and Semantic Graph-based Author Disambiguation on DBLP Data in Neo4j

Franzoni, Valentina; Lepri, Michele; Milani, Alfredo

doi:10.1109/AIKE.2018.00054

Computer Science > Information Retrieval

arXiv:1901.08977 (cs)

[Submitted on 25 Jan 2019]

Title:Topological and Semantic Graph-based Author Disambiguation on DBLP Data in Neo4j

Authors:Valentina Franzoni, Michele Lepri, Alfredo Milani

View PDF

Abstract:In this work, we introduce a novel method for entity resolution author disambiguation in bibliographic networks. Such a method is based on a 2-steps network traversal using topological similarity measures for rating candidate nodes. Topological similarity is widely used in the Link Prediction application domain to assess the likelihood of an unknown link. A similarity function can be a good approximation for equality, therefore can be used to disambiguate, basing on the hypothesis that authors with many common co-authors are similar. Our method has experimented on a graph-based representation of the public DBLP Computer Science database. The results obtained are extremely encouraging regarding Precision, Accuracy, and Specificity. Further good aspects are the locality of the method for disambiguation assessment which avoids the need to know the global network, and the exploitation of only a few data, e.g. author name and paper title (i.e., co-authorship data).

Comments:	Pre-print of article presented at AIKE (Artificial Intelligence and Knowledge Engineering) IEEE Conference, September 2018, Laguna Hills, California (USA)
Subjects:	Information Retrieval (cs.IR); Digital Libraries (cs.DL); Social and Information Networks (cs.SI)
Cite as:	arXiv:1901.08977 [cs.IR]
	(or arXiv:1901.08977v1 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.1901.08977
Journal reference:	AIKE 2018: 239-243
Related DOI:	https://doi.org/10.1109/AIKE.2018.00054

Submission history

From: Valentina Franzoni [view email]
[v1] Fri, 25 Jan 2019 16:49:53 UTC (411 KB)

Computer Science > Information Retrieval

Title:Topological and Semantic Graph-based Author Disambiguation on DBLP Data in Neo4j

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Retrieval

Title:Topological and Semantic Graph-based Author Disambiguation on DBLP Data in Neo4j

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators