Exact high-dimensional asymptotics for support vector machine

Liu, Haoyang

arXiv:1905.05125v1 (stat)

[Submitted on 13 May 2019 (this version), latest version 31 Jul 2019 (v2)]

Abstract: The support vector machine (SVM) is one of the most widely used classification methods. In this paper, we consider the soft-margin support vector machine applied to data points with independent features, where the sample size $n$ and the feature dimension $p$ grow to $\infty$ in a fixed ratio $p/n\rightarrow \delta$. We propose a set of equations that exactly characterizes the asymptotic behavior of the support vector machine. In particular, we give exact formulas for (1) the variability of the optimal coefficients, (2) the proportion of data points lying on the margin boundary (i.e. the number of support vectors), (3) the final objective function value, and (4) the expected misclassification error on new data points, which in particular implies an exact formula for the optimal tuning parameter given a data-generating mechanism. The global null case is considered first, where the label $y\in\{+1,-1\}$ is independent of the feature $x$. Then the signaled case is considered, where the label $y\in\{+1,-1\}$ is allowed to have a general dependence on the feature $x$ through a linear combination $a_0^Tx$. These results for the non-smooth hinge loss serve as an analogue to the recent results of Sur and Candès (2018) for the smooth logistic loss. Our approach is based on heuristic leave-one-out calculations.
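The global null setting described in the abstract can be explored numerically at small scale. The following is a minimal sketch, not the paper's method: it assumes standard Gaussian features, an L2-regularized average hinge loss with a hypothetical regularization weight `lam`, no intercept, and a plain full-batch subgradient solver. The function names and all default parameter values are illustrative assumptions. It reports the fraction of training points on or inside the margin and the final objective value, two of the quantities the abstract says are characterized exactly in the limit.

```python
import random


def hinge_objective(w, X, y, lam):
    """Return lam/2 * ||w||^2 + mean_i max(0, 1 - y_i * <w, x_i>)."""
    n = len(X)
    reg = 0.5 * lam * sum(wj * wj for wj in w)
    loss = sum(
        max(0.0, 1.0 - yi * sum(wj * xj for wj, xj in zip(w, xi)))
        for xi, yi in zip(X, y)
    ) / n
    return reg + loss


def simulate_null_svm(n=80, delta=0.5, lam=0.1, epochs=100, seed=0):
    """Fit a linear soft-margin SVM on pure-noise labels (global null).

    Assumptions (not taken from the paper): i.i.d. N(0, 1) features,
    no intercept, and this particular scaling of the regularizer.
    """
    rng = random.Random(seed)
    p = max(1, int(round(delta * n)))  # feature dimension with p/n close to delta
    X = [[rng.gauss(0.0, 1.0) for _ in range(p)] for _ in range(n)]
    y = [rng.choice((-1, 1)) for _ in range(n)]  # labels independent of X

    w = [0.0] * p
    for t in range(1, epochs + 1):
        step = 1.0 / (lam * t)  # decaying step size for a strongly convex objective
        grad = [lam * wj for wj in w]  # gradient of the ridge term
        for xi, yi in zip(X, y):
            margin = yi * sum(wj * xj for wj, xj in zip(w, xi))
            if margin < 1.0:  # hinge loss is active for this point
                for j in range(p):
                    grad[j] -= yi * xi[j] / n
        w = [wj - step * gj for wj, gj in zip(w, grad)]

    margins = [yi * sum(wj * xj for wj, xj in zip(w, xi)) for xi, yi in zip(X, y)]
    # Points with margin <= 1 lie on or inside the margin boundary.
    frac_margin = sum(1 for m in margins if m <= 1.0) / n
    return w, frac_margin, hinge_objective(w, X, y, lam)


if __name__ == "__main__":
    for delta in (0.25, 0.5, 1.0):
        _, frac, obj = simulate_null_svm(delta=delta)
        print(f"delta={delta}: fraction on/inside margin={frac:.3f}, objective={obj:.3f}")
```

Because the solver is approximate and $n$ is small, the printed values are only rough finite-sample quantities; comparing them against the asymptotic formulas would require the equations from the paper itself, larger $n$, and an exact solver.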

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST)
Cite as:	arXiv:1905.05125 [stat.ML]
	(or arXiv:1905.05125v1 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1905.05125

Submission history

From: Haoyang Liu
[v1] Mon, 13 May 2019 16:25:44 UTC (431 KB)
[v2] Wed, 31 Jul 2019 21:54:03 UTC (431 KB)
