Non-asymptotic Excess Risk Bounds for Classification with Deep Convolutional Neural Networks

Shen, Guohao; Jiao, Yuling; Lin, Yuanyuan; Huang, Jian

Computer Science > Machine Learning

arXiv:2105.00292 (cs)

[Submitted on 1 May 2021]

Title:Non-asymptotic Excess Risk Bounds for Classification with Deep Convolutional Neural Networks

Authors:Guohao Shen, Yuling Jiao, Yuanyuan Lin, Jian Huang

View PDF

Abstract:In this paper, we consider the problem of binary classification with a class of general deep convolutional neural networks, which includes fully-connected neural networks and fully convolutional neural networks as special cases. We establish non-asymptotic excess risk bounds for a class of convex surrogate losses and target functions with different modulus of continuity. An important feature of our results is that we clearly define the prefactors of the risk bounds in terms of the input data dimension and other model parameters and show that they depend polynomially on the dimensionality in some important models. We also show that the classification methods with CNNs can circumvent the curse of dimensionality if the input data is supported on an approximate low-dimensional manifold. To establish these results, we derive an upper bound for the covering number for the class of general convolutional neural networks with a bias term in each convolutional layer, and derive new results on the approximation power of CNNs for any uniformly-continuous target functions. These results provide further insights into the complexity and the approximation power of general convolutional neural networks, which are of independent interest and may have other applications. Finally, we apply our general results to analyze the non-asymptotic excess risk bounds for four widely used methods with different loss functions using CNNs, including the least squares, the logistic, the exponential and the SVM hinge losses.

Comments:	Guohao Shen and Yuling Jiao contributed equally to this work. Co-corresponding authors: Yuanyuan Lin (Email: ylin@sta.this http URL) and Jian Huang (Email: [email protected])
Subjects:	Machine Learning (cs.LG); Statistics Theory (math.ST)
MSC classes:	68T07, 62G05
Cite as:	arXiv:2105.00292 [cs.LG]
	(or arXiv:2105.00292v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2105.00292

Submission history

From: Jian Huang [view email]
[v1] Sat, 1 May 2021 15:55:04 UTC (569 KB)

Computer Science > Machine Learning

Title:Non-asymptotic Excess Risk Bounds for Classification with Deep Convolutional Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Non-asymptotic Excess Risk Bounds for Classification with Deep Convolutional Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators