RhythmNet: End-to-end Heart Rate Estimation from Face via Spatial-temporal Representation

Niu, Xuesong; Shan, Shiguang; Han, Hu; Chen, Xilin

doi:10.1109/TIP.2019.2947204

Computer Science > Computer Vision and Pattern Recognition

arXiv:1910.11515 (cs)

[Submitted on 25 Oct 2019 (v1), last revised 4 Nov 2019 (this version, v2)]

Title:RhythmNet: End-to-end Heart Rate Estimation from Face via Spatial-temporal Representation

Authors:Xuesong Niu, Shiguang Shan, Hu Han, Xilin Chen

View PDF

Abstract:Heart rate (HR) is an important physiological signal that reflects the physical and emotional status of a person. Traditional HR measurements usually rely on contact monitors, which may cause inconvenience and discomfort. Recently, some methods have been proposed for remote HR estimation from face videos; however, most of them focus on well-controlled scenarios, their generalization ability into less-constrained scenarios (e.g., with head movement, and bad illumination) are not known. At the same time, lacking large-scale HR databases has limited the use of deep models for remote HR estimation. In this paper, we propose an end-to-end RhythmNet for remote HR estimation from the face. In RyhthmNet, we use a spatial-temporal representation encoding the HR signals from multiple ROI volumes as its input. Then the spatial-temporal representations are fed into a convolutional network for HR estimation. We also take into account the relationship of adjacent HR measurements from a video sequence via Gated Recurrent Unit (GRU) and achieves efficient HR measurement. In addition, we build a large-scale multi-modal HR database (named as VIPL-HR, available at 'this http URL), which contains 2,378 visible light videos (VIS) and 752 near-infrared (NIR) videos of 107 subjects. Our VIPL-HR database contains various variations such as head movements, illumination variations, and acquisition device changes, replicating a less-constrained scenario for HR estimation. The proposed approach outperforms the state-of-the-art methods on both the public-domain and our VIPL-HR databases.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1910.11515 [cs.CV]
	(or arXiv:1910.11515v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1910.11515
Related DOI:	https://doi.org/10.1109/TIP.2019.2947204

Submission history

From: Xuesong Niu [view email]
[v1] Fri, 25 Oct 2019 04:03:41 UTC (4,246 KB)
[v2] Mon, 4 Nov 2019 06:23:47 UTC (4,184 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:RhythmNet: End-to-end Heart Rate Estimation from Face via Spatial-temporal Representation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:RhythmNet: End-to-end Heart Rate Estimation from Face via Spatial-temporal Representation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators