Session 1 On CatBoost Practical Introduction Notes
Advantages
1. Easy to use
2. Good results with default settings, often without hyperparameter tuning
3. Improved accuracy on benchmarks
Technical Aspects
1. Handles categorical variables natively using Ordered Target Encoding (text data is also supported)
2. Uses Symmetric Trees
3. Uses a technique like Newton-Raphson to calculate the output values of the leaves
4. Can choose the learning rate automatically when none is specified
5. Snapshotting capability
6. Native integration with libraries such as SHAP, Plotly, etc.
7. Uses dedicated data structures such as 'Pool' to hold datasets
8. Built-in support for cross-validation and hyperparameter tuning
9. Can handle missing values out of the box
10. Built-in overfitting detector
11. Supports custom loss functions and metrics
12. Multi-threading and GPU support
13. Regularization (for example, L2 regularization on leaf values)
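
The idea behind Ordered Target Encoding (item 1) can be sketched in plain Python: each row's category is encoded using only the targets of rows that appear earlier in the ordering, blended with a prior, so a row never sees its own label. This is a simplified illustration, not CatBoost's implementation. The function name `ordered_target_encode` and the `prior` and `prior_weight` parameters are hypothetical, and a single fixed row order is assumed here, whereas CatBoost works with random permutations of the data.

```python
from collections import defaultdict

def ordered_target_encode(categories, targets, prior=0.5, prior_weight=1.0):
    """Encode each category value using only the targets of earlier rows."""
    sums = defaultdict(float)   # running sum of targets seen so far, per category
    counts = defaultdict(int)   # running count of rows seen so far, per category
    encoded = []
    for cat, y in zip(categories, targets):
        # Use statistics from previous rows only, so the row's own target never leaks.
        value = (sums[cat] + prior * prior_weight) / (counts[cat] + prior_weight)
        encoded.append(value)
        # Update the running statistics only after the current row is encoded.
        sums[cat] += y
        counts[cat] += 1
    return encoded

# Illustrative data (made up for this sketch).
colors = ["red", "blue", "red", "red", "blue"]
labels = [1, 0, 1, 0, 1]
encoded = ordered_target_encode(colors, labels)
```

The first occurrence of every category falls back to the prior, and later occurrences drift toward the mean target observed so far for that category.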
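
Symmetric trees (item 2) apply the same split to every node on a given level, so a prediction reduces to computing one bit per level and using the bits as a leaf index. The sketch below assumes a tree described by a list of hypothetical `(feature_index, threshold)` pairs and a flat list of leaf values. It illustrates the structure only and is not how CatBoost stores its models.

```python
def symmetric_tree_predict(row, splits, leaf_values):
    """Predict with a symmetric tree: one (feature_index, threshold) per level."""
    leaf_index = 0
    for feature_index, threshold in splits:
        # Every node on this level asks the same question, so each level adds one bit.
        bit = 1 if row[feature_index] > threshold else 0
        leaf_index = (leaf_index << 1) | bit
    return leaf_values[leaf_index]

# Illustrative depth-2 tree (values made up for this sketch).
splits = [(0, 5.0), (1, 0.5)]          # one shared split per level
leaf_values = [-1.0, -0.5, 0.5, 1.0]   # 2**depth leaves
prediction = symmetric_tree_predict([7.0, 0.2], splits, leaf_values)
```

Because the path to a leaf is just a short sequence of comparisons with no branching on node identity, this layout is cheap to evaluate.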
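
The Newton-Raphson-style leaf calculation (item 3) can be illustrated for binary log loss: the leaf value is the sum of first derivatives divided by the sum of second derivatives over the rows in that leaf. The sketch below takes a single Newton step and adds an L2 term to the denominator, which also shows one way regularization (item 13) can enter. The function name `newton_leaf_value` and the `l2_reg` parameter are assumptions for this example, not CatBoost API names.

```python
import math

def newton_leaf_value(labels, raw_scores, l2_reg=1.0):
    """One Newton-Raphson step for a leaf's output value under log loss."""
    grad_sum = 0.0
    hess_sum = 0.0
    for y, score in zip(labels, raw_scores):
        p = 1.0 / (1.0 + math.exp(-score))   # current predicted probability
        grad_sum += y - p                     # negative first derivative of log loss
        hess_sum += p * (1.0 - p)             # second derivative of log loss
    # Newton step: first-derivative sum over second-derivative sum, damped by L2.
    return grad_sum / (hess_sum + l2_reg)

# Three rows in one leaf, all starting from a raw score of 0 (probability 0.5).
leaf_value = newton_leaf_value([1, 1, 0], [0.0, 0.0, 0.0], l2_reg=1.0)
```

A larger `l2_reg` shrinks the leaf value toward zero, which is the usual effect of L2 regularization on leaf outputs.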
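
An overfitting detector (item 10) can be thought of as a rule that watches a validation metric and stops training once it has not improved for some number of iterations. The sketch below shows a simple patience-based rule of that kind. The function name `best_iteration_with_patience`, the `patience` parameter, and the loss values are all hypothetical, and CatBoost's own detector may use different criteria.

```python
def best_iteration_with_patience(validation_losses, patience=2):
    """Return (best_iteration, stopped_at) for a patience-based stopping rule."""
    best_loss = float("inf")
    best_iteration = -1
    for iteration, loss in enumerate(validation_losses):
        if loss < best_loss:
            # New best validation loss: remember where it happened.
            best_loss = loss
            best_iteration = iteration
        elif iteration - best_iteration >= patience:
            # No improvement for `patience` iterations since the best one: stop here.
            return best_iteration, iteration
    # Ran out of iterations without triggering the stopping rule.
    return best_iteration, len(validation_losses) - 1

# Illustrative validation losses (made up for this sketch).
losses = [0.70, 0.55, 0.50, 0.52, 0.53, 0.60]
best, stopped = best_iteration_with_patience(losses, patience=2)
```

Here the loss bottoms out at iteration 2, so training stops two iterations later and the model from iteration 2 would be the one to keep.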