AMF: Aggregated Mondrian Forests for Online Learning

Mourtada, Jaouad; Gaïffas, Stéphane; Scornet, Erwan

Statistics > Machine Learning

arXiv:1906.10529 (stat)

[Submitted on 25 Jun 2019 (v1), last revised 15 May 2020 (this version, v2)]

Title:AMF: Aggregated Mondrian Forests for Online Learning

Authors:Jaouad Mourtada, Stéphane Gaïffas, Erwan Scornet

View PDF

Abstract:Random Forests (RF) is one of the algorithms of choice in many supervised learning applications, be it classification or regression. The appeal of such tree-ensemble methods comes from a combination of several characteristics: a remarkable accuracy in a variety of tasks, a small number of parameters to tune, robustness with respect to features scaling, a reasonable computational cost for training and prediction, and their suitability in high-dimensional settings. The most commonly used RF variants however are "offline" algorithms, which require the availability of the whole dataset at once. In this paper, we introduce AMF, an online random forest algorithm based on Mondrian Forests. Using a variant of the Context Tree Weighting algorithm, we show that it is possible to efficiently perform an exact aggregation over all prunings of the trees; in particular, this enables to obtain a truly online parameter-free algorithm which is competitive with the optimal pruning of the Mondrian tree, and thus adaptive to the unknown regularity of the regression function. Numerical experiments show that AMF is competitive with respect to several strong baselines on a large number of datasets for multi-class classification.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST)
Cite as:	arXiv:1906.10529 [stat.ML]
	(or arXiv:1906.10529v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1906.10529

Submission history

From: Stéphane Gaïffas [view email]
[v1] Tue, 25 Jun 2019 13:50:22 UTC (1,873 KB)
[v2] Fri, 15 May 2020 15:45:45 UTC (3,753 KB)

Statistics > Machine Learning

Title:AMF: Aggregated Mondrian Forests for Online Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:AMF: Aggregated Mondrian Forests for Online Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators