
AdaBoostStumpsSampler #124

Closed
wants to merge 21 commits
Closed
Conversation

glevv

@glevv glevv commented Jun 27, 2021

MC approximation of AdaBoost stump kernel #119

@glevv glevv closed this Jun 27, 2021
@glevv glevv reopened this Jun 30, 2021
@TimotheeMathieu
Contributor

Thank you @glevv for this, some comments:

  • Could you include in the documentation a short description and an explanation of the difference between this and Fastfood?
  • Also, on some datasets it may not be a good idea to use a scaling of 1/max(|x|), as this can vary a lot. More generally, I would personally use one of scikit-learn's scalers on the data (for instance StandardScaler; what you propose is close to MinMaxScaler). Maybe you could include a scale_X parameter, True or False, which by default applies StandardScaler, the most common scaler? The best preprocessing really depends on the dataset, so it should not be fixed in the algorithm.

Otherwise LGTM, thanks.
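A minimal sketch of the pipeline alternative discussed above: scaling is composed as a separate step rather than hard-coded into the sampler. RBFSampler stands in here for the proposed stump sampler, since both are kernel approximators with the same transformer interface.

```python
import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.kernel_approximation import RBFSampler

# Scaling delegated to a pipeline step instead of being fixed inside the
# sampler; any scaler (StandardScaler, MinMaxScaler, ...) can be swapped in.
pipe = make_pipeline(
    StandardScaler(),
    RBFSampler(n_components=100, random_state=0),
)

X = np.random.default_rng(0).normal(size=(50, 4))
Z = pipe.fit_transform(X)  # feature map computed on standardized data
```

This keeps the preprocessing choice in the user's hands, which matters because, as noted, the best scaler depends on the dataset.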

@glevv
Author

glevv commented Jul 30, 2021

They are completely different kernels, with different methods of computing them.
The stump kernel was presented in "Support Vector Machinery for Infinite Ensemble Learning", but it can be hard to compute exactly, so a Monte Carlo approximation was proposed in "Uniform Approximation of Functions with Random Bases" (the same paper that describes the MC approximation of the RBF kernel implemented as RBFSampler).
I think StumpKernelSampler/StumpSampler would be a shorter and more consistent name (similar to RBFSampler).

As for the scaling, I think it is possible to remove it altogether and let users build their own pipelines (while clearly stating in the docs that this method requires scaling). That would be consistent with other kernel methods/approximations (RBFSampler also requires scaling to give a proper approximation) and with the original formulation in the paper.
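The MC idea described above can be sketched roughly as follows: each random feature is a decision stump sign(x[d] - t) with a random coordinate d and a random threshold t, and averaging many such stumps approximates the stump kernel, in the same spirit as RBFSampler's random Fourier features. This is a hypothetical illustration, not the PR's actual code; the class name and its parameters are assumptions for the sketch.

```python
import numpy as np

class StumpSampler:
    """Illustrative Monte Carlo feature map for the stump kernel.

    Hypothetical sketch (not the PR's implementation): each component is a
    random decision stump sign(x[d] - t), with d drawn uniformly over the
    feature indices and t drawn uniformly over that feature's observed range.
    """

    def __init__(self, n_components=100, random_state=None):
        self.n_components = n_components
        self.random_state = random_state

    def fit(self, X):
        X = np.asarray(X, dtype=float)
        rng = np.random.default_rng(self.random_state)
        n_features = X.shape[1]
        # Random coordinate for each stump.
        self.dims_ = rng.integers(0, n_features, size=self.n_components)
        # Random threshold from the observed range of that coordinate.
        lo, hi = X.min(axis=0), X.max(axis=0)
        self.thresholds_ = rng.uniform(lo[self.dims_], hi[self.dims_])
        return self

    def transform(self, X):
        X = np.asarray(X, dtype=float)
        # Evaluate all stumps at once; scale so that the inner product of two
        # transformed points is the average over the sampled stumps.
        Z = np.sign(X[:, self.dims_] - self.thresholds_)
        return Z / np.sqrt(self.n_components)
```

With this sketch, `Z = StumpSampler(...).fit(X).transform(X)` gives `Z @ Z.T` as a Monte Carlo estimate of the random-stump kernel matrix (related to the stump kernel up to scaling and shift), which also makes clear why the input range, and hence the scaling discussed above, matters: the thresholds are drawn from it.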

@glevv
Author

glevv commented Jan 10, 2023

Closed due to inactivity

@glevv glevv closed this Jan 10, 2023
@glevv glevv deleted the abssampler branch May 14, 2024 14:18