Cheat Sheet Tutorial

PyCaret is an open-source Python library that allows users to prepare data, create and evaluate machine learning models, and deploy models within a notebook environment. It supports various types of supervised and unsupervised learning algorithms for regression, classification, clustering, and anomaly detection. PyCaret also has functionality for time series analysis and natural language processing. Key features include an easy-to-use API, model tuning, stacking/blending, interpretability functions, and support for GPU/distributed computing.

Uploaded by

Maria Farina

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

149 views2 pages

Cheat Sheet Tutorial

Uploaded by

Maria Farina

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

Loading data from PyCaret's repository (regression) (classification) * model: **plot= set_config() Natural Language Processing

Cheat sheet # loading data from pycaret

from pycaret.datasets import get_data
'omp'
'br'
'gpc'
'mlp
'naïve'
'grand_means'
'ts'
'cv'
save_config()
load_config()
# set up environment
from pycaret.nlp import *
PyCaret is an open source, low-code machine learning data = get_data('dataset_name') 'ard' 'ridge' 'snaive' 'acf' get_clusters() clf1 = setup(data=df, target='colunm')
library in Python that allows you to go preparing your 'par' 'rf' 'polytrend' 'acf' # create and evaluate model
data to deploying your model within minutes in Loading data using Pandas 'ransac' 'qda' 'arima' 'pacf' * model: **plot= model = create_model(*)
your choice of notebook environment # importing pandas 'tr' 'ada' 'exp_smooth' 'decomp_stl' ‘kmeans’ 'cluster' model_df = assign_model(*)
import pandas as pd 'huber' 'gbc' 'ets' 'diagnostics' ‘ap’ 'tsne' plot_model(model, plot=**)
Installing PyCaret df = pd.read_csv(r'dir/file_name.csv') 'kr' 'lda' 'theta' 'forecast' ‘meanshift’ 'elbow' evaluate_model(model)
# install pycaret 'svm' 'et' 'tbats' 'insample' ‘sc’ 'silhouette' model_tuned = tune_model(model=model,
pip install pycaret Supervised Learning 'knn' 'xgboost' 'bats' 'residuals' ‘hclust’ 'distance' supervised_target='column')
# install full version of pycaret 'dt' 'lightgbm' 'prophet' 'train_test_split' ‘dbscan’ 'distribution' # model deployment
pip install pycaret[full] Regression and Classification 'rf' 'catboost' 'lr_cds_dt' 'decomp_classical' ‘optics’ save_model(model, 'saved_model')
# install pycaret time series module # set up environment 'et' 'en_cds' ‘birch’ model_loaded = load_model('saved_model')
pip install pycaret-ts-alpha from pycaret.regression import * 'gbr' ‘ridge_cds_dt’ ‘kmodes’ # utils
from pycaret.classification import * 'mlp' ‘lasso_cds_dt’ pull()
PyCaret on GPU clf1 = setup(data = df, target='column') 'xgboost' ‘lar_cds_dt’ Anomaly Detection models()
# uninstall lightgbm CPU # create and evaluate model 'lightgbm' ‘llar_cds_dt’ # set up environment get_logs()
pip uninsstall lightgbm –y compare_models() 'catboost' ‘br_cds_dt’ from pycaret.anomaly import * get_config()
# install lightgbm GPU model = create_model(*) **plot= ‘huber_cds_dt’ clf1 = setup(data = df) set_config()
pip install lightgbm --install-option=--gpu model_tuned = tune_model(model) 'residuals_interactive' 'residuals' 'error' ‘par_cds_dt’ # create and evaluate model get_topics()
--install-option="--opencl-include-dir=/usr/ ens_model = ensemble_model(model, method=***) 'cooks' 'rfe' 'learning' 'vc' 'manifold' ‘omp_cds_dt’ model = create_model(*) * model:
/local/include/" --install-option="--opencl- blender = blend_models(top3) 'feature' 'feature_all' 'parameter' 'tree' ‘knn_cds_dt’ model_df = assign_model(*) 'lda' 'lsi' 'hdp' 'rp' 'nmf'
library=usr/local/cuda/lib64/libOpenCL.so" stacker = stack_models(top3) *** method= ‘dt_cds_dt’ plot_model(model, plot=**) 'frequency' 'distribution' 'bigram'
plot_model(model, plot=**) 'bagging' 'boosting' ‘rf_cds_dt’ evaluate_model(model) 'trigram' 'sentiment' 'pos' 'tsne'
Run PyCaret on a Docker Container evaluate_model(model) (1) classification only ‘et_cds_dt’ model_tuned = tune_model(model=model, 'topic_model' 'topic_distribution'
FROM python:3.7-slim interpret_model(model) ‘gbr_cds_dt’ supervised_target='column') 'wordcloud' 'umap'
WORKDIR /app (1) calibrate_model() Time Series Analysis ‘ada_cds_dt’ # make predictions ** plot=
ADD . /app (1) optimize_threshold() # set up environment ‘lightgbm_cds_dt’ df1 = predict_model(model=model, data=df) 'tsne' 'umap'
RUN apt-get update && apt-get install libgomp1 # make predictions from pycaret.time_series import * # model deployment
RUN pip install --trusted-host pypi.python.org df1 = predict_model(model=model, data=df) exp = setup(data = df, fh = 12) Unsupervised Learning save_model(model, 'saved_model') Association Rule
-r requirements.txt # model deployment # create and evaluate model model_loaded = load_model('saved_model') # set up environment
deploy_model(model=model,
compare_models()
CMD pytest # replace it with your entry point final_model = finalize_model(model) Clustering model_name=model_final, from pycaret.arules import *
save_model(model, 'saved_model') model = create_model(*) # set up environment platform = 'aws', authentication = {'bucket : clf1 = setup(data=df, transaction_id='colunm',
PyCaret Tutorials model_loaded = load_model('saved_model') model_tuned = tune_model(model) from pycaret.clustering import * 'S3-bucket-name'}) item_id='column')
deploy_model(model=model,
blender = blend_models(top3)
Classification model_name=model_final, clf1 = setup(data = df) # utils # create and evaluate model
Binary classification (Beginner) platform = 'aws', authentication = {'bucket : plot_model(model, plot=**) # create and evaluate model pull() model = create_model()
Binary classification (Intermediate) 'S3-bucket-name'}) final_model = finalize_model(model) model = create_model(*) models() plot_model(model, plot='2d')
Multiclass classification (Beginner) # utils # make predictions model_df = assign_model(*) get_metrics()
pull() pred_holdout = predict_model(arima) plot_model(model, plot=**) add_metric() Other Resources
Regression models() pred_unseen = predict_model(finalize_model( evaluate_model(model) remove_metric()
Regression (Beginner) get_metrics() arima), fh=24) model_tuned = tune_model(model=model, get_logs() PyCaret Github
Regression (Intermediate) add_metric() # model deployment supervised_target = 'column_name') get_config() PyCaert Slack
remove_metric() final_model = finalize_model(model) # make predictions set_config() Example Notebooks made by contributors
Clustering get_logs() save_model(model, 'saved_model') df1 = predict_model(model=model, data = df) save_config() Blog tutorials
Clustering (Beginner) get_config() model_loaded = load_model('saved_model') # model deployment load_config() Documentation 'The detailed API docs of PyCaret'
deploy_model(model=model,
set_config() get_clusters()
model_name=model_final, save_model(model, 'saved_model') Video Tutorials
Anomaly detection save_config() platform = 'aws', authentication = {'bucket : model_loaded = load_model('saved_model') * model: **plot= Discussions 'Have questions?'
deploy_model(model=model,
load_config()
Anomaly detection (Beginner) 'S3-bucket-name'}) model_name=model_final, 'abod' 'tsne' Changelog 'Changes and version history'
get_leaderboard() # utils platform = 'aws', authentication = {'bucket : 'cluster' 'umap' Roadmap of PyCaret
Natural Language Processing *model: pull() 'S3-bucket-name'}) 'histogram'
NLP (Beginner) (regression) (classification) models() # utils 'knn'
NLP (Intermediate) 'lr' 'lr' get_metrics() pull() 'lof'
'lasso' 'knn' add_metric() models() 'svm'
Association Rule Mining 'ridge' 'nb' remove_metric() get_metrics() 'pca'
Association Rule Mining (Beginner) 'en' 'dt' get_logs() add_metric() 'mcd'
'lar' 'svm' get_config() remove_metric() 'sod'
Time Series 'llar' 'rbfsvm' set_config() get_logs() 'sos'
Time series and forecasting (Beginner) save_config() get_config()
Parameters of setup() and its default values
pycaret.org
Clustering Anomaly Detection Regression & Classification Time Series
data, data,
preprocess = True, Preprocess = True,
imputation_type = 'simple’, imputation_type = 'simple’,
iterative_imputation_iters = 5, iterative_imputation_iters = 5, data = DataFrame, target = ’column_name’, create_clusters = False, data = [.Series, .DataFrame],
categorical_features = None, categorical_features = None, train_size = 0.7, cluster_iter = 20, preprocess = True,
categorical_imputation = 'mode’, categorical_imputation = 'mode’, test_data = None,
categorical_iterative_imputer = 'lightgbm’, categorical_iterative_imputer = 'lightgbm’, polynomial_features = False, imputation_type = 'simple’,
preprocess = True,
ordinal_features = None, ordinal_features = None, imputation_type = 'simple’,
polynomial_degree = 2, fold_strategy = 'expanding’,
high_cardinality_features = None, high_cardinality_features = None, iterative_imputation_iters = 5, trigonometry_features = False, fold = 3,
high_cardinality_method = 'frequency’, high_cardinality_method = 'frequency’, categorical_features = None, polynomial_threshold = 0.1, fh = 1,
numeric_features = None, numeric_features = None, categorical_imputation = 'constant’, group_features = None, seasonal_period = None,
numeric_imputation = 'mean’, numeric_imputation = 'mean’, categorical_iterative_imputer = 'lightgbm’, group_names = None, enforce_pi = False,
numeric_iterative_imputer = 'lightgbm’, numeric_iterative_imputer = 'lightgbm’, ordinal_features = None, feature_selection = False, n_jobs = -1,
date_features = None, date_features = None, high_cardinality_features = None,
ignore_features = None, ignore_features = None, feature_selection_threshold = 0.8, use_gpu = False,
high_cardinality_method = 'frequency’,
normalize = False, normalize = False, numeric_features = None,
feature_selection_method = 'classic’, custom_pipeline = None,
normalize_method = 'zscore’, normalize_method = 'zscore’, numeric_imputation = 'mean’, feature_interaction = False, html = True,
transformation = False, transformation = False, numeric_iterative_imputer = 'lightgbm’, feature_ratio = False, session_id = None,
transformation_method = 'yeo-johnson’, transformation_method = 'yeo-johnson’, date_features = None, interaction_threshold = 0.01, system_log = True,
handle_unknown_categorical = True, handle_unknown_categorical = True, ignore_features = None, transform_target = False, log_experiment = False,
unknown_categorical_method = 'least_frequent’, unknown_categorical_method = 'least_frequent’, normalize = False, transform_target_method = 'box-cox’, experiment_name = None,
pca = False, pca = False, normalize_method = 'zscore’,
pca_method = 'linear’, pca_method = 'linear’, data_split_shuffle = True, log_plots = False,
transformation = False,
pca_components = None, pca_components = None, transformation_method = 'yeo-johnson’,
data_split_stratify = False, log_profile = False,
ignore_low_variance = False, ignore_low_variance = False, handle_unknown_categorical = True, fold_strategy = 'kfold’, log_data = False,
combine_rare_levels = False, combine_rare_levels = False, unknown_categorical_method = 'least_frequent’, fold = 10, verbose = True,
rare_level_threshold = 0.1, rare_level_threshold = 0.1, pca = False, fold_shuffle = False, profile = False,
bin_numeric_features = None, bin_numeric_features = None, pca_method = 'linear’, fold_groups = None, profile_kwargs = None
remove_multicollinearity = False, remove_multicollinearity = False, pca_components = None, n_jobs = - 1,
multicollinearity_threshold = 0.9, multicollinearity_threshold = 0.9, ignore_low_variance = False, use_gpu = False,
remove_perfect_collinearity = False, remove_perfect_collinearity = False, combine_rare_levels = False, Association Rule
group_features = None, group_features = None, rare_level_threshold = 0.1,
custom_pipeline = None,
group_names = None, group_names = None, bin_numeric_features = None, html = True, data,
n_jobs = -1, n_jobs = - 1, remove_outliers = False, session_id = None, transaction_id =’column_name’,
use_gpu = False, use_gpu = False, outliers_threshold = 0.05, log_experiment = False, item_id = ’column_name’,
custom_pipeline = None, custom_pipeline = None, remove_multicollinearity = False, experiment_name = None, ignore_items = None,
html = True, html = True, multicollinearity_threshold = 0.9, session_id = None
log_plots = False,
session_id = None, session_id = None, remove_perfect_collinearity = True,
system_log = True, system_log = True, log_profile = False,
log_experiment = False, log_experiment = False, log_data = False, NLP
experiment_name = None, experiment_name = None, silent = False,
log_plots = False, log_plots = False, verbose = True, data,
log_profile = False, log_profile = False, profile = False, Target = ’column_name’,
log_data = False, log_data = False, profile_kwargs = None custom_stopwords = None,
silent = False, silent = False, Html = True,
verbose = True, verbose = True, session_id = None,
profile = False, profile = False, log_experiment = False,
profile_kwargs = None profile_kwargs = None experiment_name = None,
log_plots = False,
log_data = False,
Color code Verbose = True
required
optional

Machine Learning Lab Dlihebca6sem
100% (1)
Machine Learning Lab Dlihebca6sem
25 pages
Research Proposal
No ratings yet
Research Proposal
6 pages
AWS Machine Learning Specialty
100% (1)
AWS Machine Learning Specialty
67 pages
CIS-STA 3920 LN4.a Classification With KNN 7-20-21
No ratings yet
CIS-STA 3920 LN4.a Classification With KNN 7-20-21
12 pages
analysis-on-weight-capacity
No ratings yet
analysis-on-weight-capacity
4 pages
Machine Learning Business Report
75% (55)
Machine Learning Business Report
60 pages
multivariate
No ratings yet
multivariate
4 pages
ML Pgms_24Mar2025
No ratings yet
ML Pgms_24Mar2025
23 pages
gen ai 2 class
No ratings yet
gen ai 2 class
1 page
Min Chen, Yixue Hao, Kai Hwang, Fellow, IEEE, Lu Wang, and Lin Wang
No ratings yet
Min Chen, Yixue Hao, Kai Hwang, Fellow, IEEE, Lu Wang, and Lin Wang
7 pages
Adarsh
No ratings yet
Adarsh
6 pages
Prediction of Diabetes Using Machine Learning Analysis of 70000 Clinical Database Patient Record
No ratings yet
Prediction of Diabetes Using Machine Learning Analysis of 70000 Clinical Database Patient Record
5 pages
Tushar ML
No ratings yet
Tushar ML
52 pages
ML LAB
No ratings yet
ML LAB
33 pages
ClassificationofAppleQualityUsingXGB PDF
No ratings yet
ClassificationofAppleQualityUsingXGB PDF
10 pages
Data Analysis in Python_ML
No ratings yet
Data Analysis in Python_ML
21 pages
A Real-Time Intelligent System Based on Machine-Learning Methods for Improving Communication in Sign Language
No ratings yet
A Real-Time Intelligent System Based on Machine-Learning Methods for Improving Communication in Sign Language
19 pages
Session 2 Machine Learning Execution
No ratings yet
Session 2 Machine Learning Execution
12 pages
Report Intership Chapters
No ratings yet
Report Intership Chapters
39 pages
ML - Lab - Programs - J
No ratings yet
ML - Lab - Programs - J
18 pages
ML0101EN Clas Decision Trees Drug Py v1
No ratings yet
ML0101EN Clas Decision Trees Drug Py v1
12 pages
Heart Disease Prediction - Colab
No ratings yet
Heart Disease Prediction - Colab
18 pages
Fds Mannual
No ratings yet
Fds Mannual
39 pages
DenseNet For Brain Tumor Classification in MRI Images
100% (1)
DenseNet For Brain Tumor Classification in MRI Images
9 pages
Machine Learning
No ratings yet
Machine Learning
6 pages
Fraud Detection Paper English
No ratings yet
Fraud Detection Paper English
19 pages
k means
No ratings yet
k means
5 pages
Basic Data Prep and Pre-Processing (2)
No ratings yet
Basic Data Prep and Pre-Processing (2)
12 pages
Python Numbers
No ratings yet
Python Numbers
1 page
QB 1
No ratings yet
QB 1
11 pages
Prediction of Mental Health in Human Being Using Machine Learning
No ratings yet
Prediction of Mental Health in Human Being Using Machine Learning
4 pages
ML lab_abbs
No ratings yet
ML lab_abbs
23 pages
Sentiment Analysis For Therapy Chatbots A Comparison of Supervised Learning Approaches
No ratings yet
Sentiment Analysis For Therapy Chatbots A Comparison of Supervised Learning Approaches
6 pages
AI and Robotics Complete practice set final - converted
No ratings yet
AI and Robotics Complete practice set final - converted
12 pages
Indian Currency Detection Using KNN Classifier
No ratings yet
Indian Currency Detection Using KNN Classifier
4 pages
Python Tytorial PDF
No ratings yet
Python Tytorial PDF
23 pages
Worksheet 8
No ratings yet
Worksheet 8
17 pages
Practical Labs Guide
No ratings yet
Practical Labs Guide
34 pages
ml lab programs 2
No ratings yet
ml lab programs 2
16 pages
Ml Lab Manual(Vim)
No ratings yet
Ml Lab Manual(Vim)
13 pages
ML(sudhanshu)
No ratings yet
ML(sudhanshu)
24 pages
Document
No ratings yet
Document
91 pages
Roll NO 2020
No ratings yet
Roll NO 2020
8 pages
LinearReg33
No ratings yet
LinearReg33
3 pages
data-science-ai-revision-notes
No ratings yet
data-science-ai-revision-notes
8 pages
Python Machine Learning - Logistic Regression
No ratings yet
Python Machine Learning - Logistic Regression
1 page
vishnu. ml
No ratings yet
vishnu. ml
26 pages
Electronics 12 00488 v2
No ratings yet
Electronics 12 00488 v2
34 pages
Machine Learning Lab Manual
No ratings yet
Machine Learning Lab Manual
36 pages
Python Machine Learning Linear Regression
No ratings yet
Python Machine Learning Linear Regression
1 page
Pandas Tutorial
No ratings yet
Pandas Tutorial
1 page
machinelearning_lab manual
No ratings yet
machinelearning_lab manual
26 pages
Welcome To Colaboratory - Colaboratory
No ratings yet
Welcome To Colaboratory - Colaboratory
5 pages
MLT Answer Key
No ratings yet
MLT Answer Key
10 pages
J1(SkillDzire)
No ratings yet
J1(SkillDzire)
49 pages
List of Imported Libraries
No ratings yet
List of Imported Libraries
12 pages
ML LabManual (1)
No ratings yet
ML LabManual (1)
16 pages
Incremental Clustering by Fast Search and Find of Density Peaks
No ratings yet
Incremental Clustering by Fast Search and Find of Density Peaks
7 pages
linear
No ratings yet
linear
2 pages
Final Report
No ratings yet
Final Report
40 pages
ml file syllabus
No ratings yet
ml file syllabus
43 pages
Mlda - Lab
No ratings yet
Mlda - Lab
35 pages
ML 1
No ratings yet
ML 1
6 pages
ML MANUAL
No ratings yet
ML MANUAL
21 pages
ML Shristi File
No ratings yet
ML Shristi File
49 pages
Machine Learning Laboratory: Manual
No ratings yet
Machine Learning Laboratory: Manual
52 pages
An Example Machine Learning Notebook
No ratings yet
An Example Machine Learning Notebook
28 pages
Impact of Outliers On Machine Learning Models
No ratings yet
Impact of Outliers On Machine Learning Models
2 pages
Data Science Toc Srinivas
No ratings yet
Data Science Toc Srinivas
4 pages
ML Lab Manual
No ratings yet
ML Lab Manual
38 pages
ML_Exp
No ratings yet
ML_Exp
9 pages
Exercise and Experiment 3
No ratings yet
Exercise and Experiment 3
14 pages
1 An Introduction To Machine Learning With Scikit Learn
No ratings yet
1 An Introduction To Machine Learning With Scikit Learn
2 pages
Deep Learning Record
No ratings yet
Deep Learning Record
70 pages
MCQ of Machine Learning
100% (2)
MCQ of Machine Learning
151 pages
DNN ALL Practical 28
No ratings yet
DNN ALL Practical 28
34 pages
Datascienceusing Python Training
No ratings yet
Datascienceusing Python Training
11 pages
Unit 4 Basics of Feature Engineering
No ratings yet
Unit 4 Basics of Feature Engineering
33 pages
Unit-2 Feature Selection
No ratings yet
Unit-2 Feature Selection
92 pages
CO-367 Machine Learning Lab File: Submitted To: Submitted by
No ratings yet
CO-367 Machine Learning Lab File: Submitted To: Submitted by
12 pages
Unit 2 ML
No ratings yet
Unit 2 ML
93 pages
Project Report - Credit Card Fraud Detection
No ratings yet
Project Report - Credit Card Fraud Detection
11 pages
Pattern Recognition Lab
No ratings yet
Pattern Recognition Lab
24 pages
K-Nearest Neighbor
No ratings yet
K-Nearest Neighbor
16 pages
ML-Lab Manual - NEP - DSS
No ratings yet
ML-Lab Manual - NEP - DSS
23 pages
ML Question Bank - Beena Kapadia
No ratings yet
ML Question Bank - Beena Kapadia
3 pages
C Language Programming Codes
From Everand
C Language Programming Codes
Durgesh
No ratings yet
How to a Developers Guide to 4k: Developer edition, #3
From Everand
How to a Developers Guide to 4k: Developer edition, #3
Xinc Cyberwizard
No ratings yet
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
Computer Engineering Laboratory Solution Primer
From Everand
Computer Engineering Laboratory Solution Primer
Karan Bhandari
No ratings yet

Cheat Sheet Tutorial

Uploaded by

Cheat Sheet Tutorial

Uploaded by

Loading data from PyCaret's repository (regression) (classification) * model: **plot= set_config() Natural Language Processing

Cheat sheet # loading data from pycaret

You might also like