Distribution of Classification Margins: Are All Data Equal?

Banburski, Andrzej; De La Torre, Fernanda; Pant, Nishka; Shastri, Ishana; Poggio, Tomaso

arXiv:2107.10199 (cs)

[Submitted on 21 Jul 2021]

Abstract: Recent theoretical results show that gradient descent on deep neural networks under exponential loss functions locally maximizes classification margin, which is equivalent to minimizing the norm of the weight matrices under margin constraints. This property of the solution, however, does not fully characterize generalization performance. We motivate theoretically and show empirically that the area under the curve of the margin distribution on the training set is in fact a good measure of generalization. We then show that, after data separation is achieved, it is possible to dynamically reduce the training set by more than 99% without significant loss of performance. Interestingly, the resulting subset of "high capacity" features varies across training runs, which is consistent with the theoretical claim that all training points should converge to the same asymptotic margin under SGD in the presence of both batch normalization and weight decay.
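The margin quantity the abstract refers to can be illustrated with a minimal sketch. It is not the authors' code and rests on several assumptions: the margin of an example is taken to be its true-class score minus the largest competing score, the scores are assumed to be already normalized (for instance by the weight norms), and the "area under the curve" is approximated as a Riemann sum over the sorted margins with the example index rescaled to [0, 1]. The function names are hypothetical.

```python
import numpy as np


def classification_margins(scores, labels):
    """Per-example margin: true-class score minus the best other-class score.

    Assumes `scores` has shape (n_examples, n_classes) and is already
    normalized; this sketch performs no normalization itself.
    """
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels)
    rows = np.arange(scores.shape[0])
    true_scores = scores[rows, labels]
    # Mask out the true class so the max runs over competing classes only.
    others = scores.copy()
    others[rows, labels] = -np.inf
    return true_scores - others.max(axis=1)


def margin_curve_area(margins):
    """Riemann-sum area under the sorted-margin curve on a [0, 1] index axis.

    With n points of width 1/n each, this equals the mean margin.
    """
    sorted_margins = np.sort(np.asarray(margins, dtype=float))
    return float(sorted_margins.sum() / sorted_margins.size)


# Toy data, invented purely for illustration.
scores = [[3.0, 1.0, 0.0],
          [0.0, 2.0, 1.0],
          [1.0, 0.0, 1.5]]
labels = [0, 1, 2]

margins = classification_margins(scores, labels)
separated = bool(np.all(margins > 0))  # every training example classified correctly
area = margin_curve_area(margins)
```

Under these assumptions, a training set is separated exactly when every margin is positive, which is the condition the abstract names before the training set is reduced.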

Comments: Previously online as CBMM Memo 115 on the CBMM MIT site
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as: arXiv:2107.10199 [cs.LG]
(or arXiv:2107.10199v1 [cs.LG] for this version)
https://doi.org/10.48550/arXiv.2107.10199

Submission history

From: Andrzej Banburski
[v1] Wed, 21 Jul 2021 16:41:57 UTC (32,401 KB)
