MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition

Guo, Yandong; Zhang, Lei; Hu, Yuxiao; He, Xiaodong; Gao, Jianfeng

Computer Science > Computer Vision and Pattern Recognition

arXiv:1607.08221 (cs)

[Submitted on 27 Jul 2016]

Title:MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition

Authors:Yandong Guo, Lei Zhang, Yuxiao Hu, Xiaodong He, Jianfeng Gao

View PDF

Abstract:In this paper, we design a benchmark task and provide the associated datasets for recognizing face images and link them to corresponding entity keys in a knowledge base. More specifically, we propose a benchmark task to recognize one million celebrities from their face images, by using all the possibly collected face images of this individual on the web as training data. The rich information provided by the knowledge base helps to conduct disambiguation and improve the recognition accuracy, and contributes to various real-world applications, such as image captioning and news video analysis. Associated with this task, we design and provide concrete measurement set, evaluation protocol, as well as training data. We also present in details our experiment setup and report promising baseline results. Our benchmark task could lead to one of the largest classification problems in computer vision. To the best of our knowledge, our training dataset, which contains 10M images in version 1, is the largest publicly available one in the world.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1607.08221 [cs.CV]
	(or arXiv:1607.08221v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1607.08221

Submission history

From: Yandong Guo [view email]
[v1] Wed, 27 Jul 2016 19:18:16 UTC (3,368 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2016-07

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Yandong Guo
Lei Zhang
Yuxiao Hu
Xiaodong He
Jianfeng Gao

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators