
Reading: Reference guide: XGBoost tuning

Previously, you learned about gradient boosting machine models and studied how to build and
tune them with XGBoost’s scikit-learn API. This reading is a quick-reference guide to help you
when you’re building XGBoost models of your own. It includes information on the following
components:

● Import statements
● Hyperparameters

Import statements
The following are some of the most commonly used import statements for gradient boosting
models using the XGBoost library together with scikit-learn.

Models
For classification tasks:
from xgboost import XGBClassifier

For regression tasks:
from xgboost import XGBRegressor
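
For example, here is a minimal sketch of instantiating and fitting each model type. The training arrays X_train and y_train are hypothetical and assumed to already exist in your workspace:

from xgboost import XGBClassifier, XGBRegressor

# X_train and y_train are hypothetical training arrays assumed to exist already.
clf = XGBClassifier(objective='binary:logistic', random_state=0)
clf.fit(X_train, y_train)

reg = XGBRegressor(objective='reg:squarederror', random_state=0)
reg.fit(X_train, y_train)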

Evaluation metrics

For classification tasks:

from sklearn.metrics import (
    accuracy_score,
    average_precision_score,
    confusion_matrix,
    f1_score,
    fbeta_score,
    log_loss,
    multilabel_confusion_matrix,
    precision_recall_curve,
    precision_score,
    recall_score,
    roc_auc_score,
)

● accuracy_score(y_true, y_pred, *[, ...]): Accuracy classification score
● average_precision_score(y_true, ...): Compute average precision (AP) from prediction scores
● confusion_matrix(y_true, y_pred, *): Compute confusion matrix to evaluate the accuracy of a classification
● f1_score(y_true, y_pred, *[, ...]): Compute the F1 score, also known as balanced F-score or F-measure
● fbeta_score(y_true, y_pred, *, beta): Compute the F-beta score
● log_loss(y_true, y_pred, *[, eps, ...]): Log loss, aka logistic loss or cross-entropy loss
● multilabel_confusion_matrix(y_true, ...): Compute a confusion matrix for each class or sample
● precision_recall_curve(y_true, ...): Compute precision-recall pairs for different probability thresholds
● precision_score(y_true, y_pred, *[, ...]): Compute the precision
● recall_score(y_true, y_pred, *[, ...]): Compute the recall
● roc_auc_score(y_true, y_score, *[, ...]): Compute Area Under the Receiver Operating Characteristic Curve (ROC AUC) from prediction scores
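
A minimal usage sketch, assuming a fitted classifier clf and hypothetical test arrays X_test and y_test:

from sklearn.metrics import accuracy_score, f1_score, roc_auc_score

# clf, X_test, and y_test are hypothetical objects assumed to exist already.
y_pred = clf.predict(X_test)
y_proba = clf.predict_proba(X_test)[:, 1]  # probability of the positive class

print(accuracy_score(y_test, y_pred))
print(f1_score(y_test, y_pred))
print(roc_auc_score(y_test, y_proba))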

For regression tasks:

from sklearn.metrics import (
    mean_absolute_error,
    mean_squared_error,
    mean_squared_log_error,
    median_absolute_error,
    mean_absolute_percentage_error,
    r2_score,
)

● mean_absolute_error(y_true, y_pred, *): Mean absolute error regression loss
● mean_squared_error(y_true, y_pred, *): Mean squared error regression loss
● mean_squared_log_error(y_true, y_pred, *): Mean squared logarithmic error regression loss
● median_absolute_error(y_true, y_pred, *): Median absolute error regression loss
● mean_absolute_percentage_error(...): Mean absolute percentage error (MAPE) regression loss
● r2_score(y_true, y_pred, *[, ...]): R² (coefficient of determination) regression score function
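
A minimal usage sketch, assuming a fitted regressor reg and hypothetical test arrays X_test and y_test:

from sklearn.metrics import mean_absolute_error, mean_squared_error, r2_score

# reg, X_test, and y_test are hypothetical objects assumed to exist already.
y_pred = reg.predict(X_test)

print(mean_absolute_error(y_test, y_pred))
print(mean_squared_error(y_test, y_pred))
print(r2_score(y_test, y_pred))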

Hyperparameters
The following are some of the most important hyperparameters for gradient boosting
machine classification models built with the XGBoost library. These are the hyperparameters
that data professionals typically reach for first, because they are among the most intuitive and
they control the model at different levels through a variety of mechanisms.

n_estimators

Hyperparameter: n_estimators
What it does: Specifies the number of boosting rounds (i.e., the number of trees your model will build in its ensemble)
Input type: int
Default value: 100

Considerations:
A typical range is 50–500. Consider how much data you have, how deep the trees are allowed
to grow, and how many samples are bootstrapped from the overall data to grow each tree
(you generally need more trees if they’re shallow, and more trees if your bootstrap sample
size represents just a small fraction of your overall data). For an extreme but illustrative
example, if you have a dataset of 10,000 samples and each tree only bootstraps 20 samples, you'll
need more trees than if you gave each tree 5,000 samples. Also keep in mind that, unlike
random forest, which can grow base learners in parallel, gradient boosting grows base
learners successively, so training can take longer for more trees.
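
As a rough illustration (with arbitrary values, not recommendations), shallow trees are typically paired with larger ensembles, while deeper trees can get by with fewer boosting rounds:

from xgboost import XGBClassifier

# Arbitrary, illustrative values: shallow trees are weaker individually,
# so the ensemble usually needs more of them.
shallow_ensemble = XGBClassifier(n_estimators=500, max_depth=2)

# Deeper trees are stronger individual learners, so fewer rounds may suffice.
deeper_ensemble = XGBClassifier(n_estimators=100, max_depth=6)
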
max_depth

Hyperparameter: max_depth
What it does: Specifies how many levels your base learner trees can have. If None, trees grow until leaves are pure or until all leaves contain less weight than min_child_weight.
Input type: int
Default value: 3

Considerations: Controls the complexity of the model. Gradient boosting typically uses weak
learners: shallow trees or, in the extreme, single-split "decision stumps." Restricting tree depth
can reduce training times and serving latency as well as prevent overfitting. Consider values 2–6.

min_child_weight

Hyperparameter: min_child_weight
What it does: Controls the threshold below which a node becomes a leaf, based on the combined weight of the samples it contains. For regression models, this value is functionally equivalent to a number of samples. For the binary classification objective, the weight of a sample in a node depends on its probability of response as calculated by that tree; the weight of the sample decreases the more certain the model is (i.e., the closer the probability of response is to 0 or 1).
Input type: int or float
Default value: 1

Considerations: Higher values stop trees from splitting further, and lower values allow trees
to keep splitting. If your model is underfitting, you may want to lower this value to allow
for more complexity. Conversely, if your model is overfitting, increase it to stop your trees from
getting too finely divided.
learning_rate

Hyperparameter: learning_rate
What it does: Controls how much importance is given to each consecutive base learner in the ensemble's final prediction. Also known as eta or shrinkage.
Input type: float
Default value: 0.1

Considerations: Values fall in the range (0, 1]. Typical values range from 0.01 to 0.3. Lower
values mean less weight is given to each consecutive base learner. Consider how many trees
are in your ensemble; lower values typically benefit from more trees.
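
As a rough conceptual sketch (this is an illustration of shrinkage, not XGBoost's actual internals), the ensemble's prediction can be thought of as a running sum in which each tree's output is scaled by learning_rate:

# Conceptual illustration only; all values are hypothetical.
learning_rate = 0.1
base_prediction = 0.5                     # initial guess
tree_outputs = [0.8, -0.3, 0.4]           # outputs of successive trees

prediction = base_prediction
for output in tree_outputs:
    prediction += learning_rate * output  # smaller learning_rate gives each tree less say

print(prediction)  # 0.5 + 0.1 * (0.8 - 0.3 + 0.4) = 0.59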

colsample_bytree*

Hyperparameter: colsample_bytree*
What it does: Specifies the fraction (0, 1.0] of features that each tree randomly selects during training
Input type: float
Default value: 1.0

Considerations: Adds randomness to the model to make it robust to noise. Consider how
many features the dataset has and how many trees will be grown. The fewer features each tree
samples, the more base learners might be needed. Small colsample_bytree values on datasets with
many features can result in more weakly predictive trees in the ensemble.

subsample*

Hyperparameter: subsample*
What it does: Specifies the fraction (0, 1.0] of observations sampled from the dataset to train each base model
Input type: float
Default value: 1.0

Considerations: Adds randomness to the model to make it robust to noise. Consider the size
of your dataset. When working with large datasets, it can be beneficial to limit the number of
samples in each tree, because doing so can greatly reduce training time and yet still result in a
robust model. For example, 20% of 1 billion samples might be enough to capture the patterns in
the data, but if you only have 1,000 samples in your dataset, then you'll probably need to use them all.

*Note that colsample_bytree and subsample were not used in the Tune a GBM model video and
its accompanying notebook; they are included here so you can use these hyperparameters in
your own work. Remember that using fractions of the data to train each base learner can
possibly improve model predictions and certainly speed up training times.
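
The following is a minimal sketch of tuning these hyperparameters together with scikit-learn's GridSearchCV. The search values are illustrative rather than prescriptive, and the training arrays X_train and y_train are hypothetical:

from xgboost import XGBClassifier
from sklearn.model_selection import GridSearchCV

# X_train and y_train are hypothetical training arrays assumed to exist already.
xgb = XGBClassifier(objective='binary:logistic', random_state=0)

# Illustrative search space drawn from the ranges discussed above.
cv_params = {
    'n_estimators': [100, 300],
    'max_depth': [2, 4, 6],
    'min_child_weight': [1, 5],
    'learning_rate': [0.01, 0.1, 0.3],
    'colsample_bytree': [0.7, 1.0],
    'subsample': [0.7, 1.0],
}

clf = GridSearchCV(xgb, cv_params, scoring='f1', cv=5)
clf.fit(X_train, y_train)

print(clf.best_params_)
print(clf.best_score_)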

Key takeaways

When building machine learning models, it’s essential to have the right tools and understand
how to use them. Although there are numerous other hyperparameters to explore, the ones in
this reference guide are among the most important. Be inquisitive and try different
approaches. Discovering ways to improve your model is a lot of fun!

Resources for more information

More detailed information about XGBoost can be found here:


● scikit-learn model metrics: documentation for evaluation metrics in scikit-learn
● XGBoost classifier: XGBoost documentation for classification tasks using the scikit-learn
API
● XGBoost Regressor: XGBoost documentation for regression tasks using the scikit-learn
API
● Notes on parameter tuning from XGBoost
● XGBoost parameters: XGBoost parameters guide. NOTE: The information in this link is not
specific to the scikit-learn API. The default values listed in this resource are not always
the same as the ones in the scikit-learn API.
