Lab 3-ML

1. Classification Algorithm: Random Forest Classifier

We'll use the Random Forest Classifier for the classification task. Random Forest is an ensemble learning method that creates multiple decision trees and combines their results to make predictions. It is robust and works well for both binary and multi-class classification problems.

Steps:

• Load the dataset
• Preprocess data (train-test split)
• Train the model
• Evaluate the model

Example Code:

We'll use the Iris dataset, a popular classification dataset, where we predict the species of Iris flowers based on their features.

# Import necessary libraries
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score, classification_report, confusion_matrix

# Step 1: Load the Iris dataset
iris = load_iris()
X = iris.data  # features (sepal length, sepal width, petal length, petal width)
y = iris.target  # target variable (species of the flower)

# Step 2: Train-test split
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)

# Step 3: Initialize the Random Forest Classifier model
rf_model = RandomForestClassifier(n_estimators=100, random_state=42)

# Step 4: Train the model
rf_model.fit(X_train, y_train)

# Step 5: Make predictions on the test set
y_pred = rf_model.predict(X_test)

# Step 6: Evaluate the model
accuracy = accuracy_score(y_test, y_pred)
print(f"Accuracy: {accuracy:.2f}")

# Additional evaluation metrics
print("\nClassification Report:\n", classification_report(y_test, y_pred))
print("\nConfusion Matrix:\n", confusion_matrix(y_test, y_pred))

Explanation:

1. Loading Data: We load the Iris dataset using load_iris() from sklearn.datasets. The dataset contains 150 samples of iris flowers, and each sample has 4 features (sepal length, sepal width, petal length, and petal width).

2. Data Preprocessing: We split the dataset into training and testing sets using train_test_split() (70% for training, 30% for testing).

3. Model Training: We use the RandomForestClassifier from sklearn.ensemble to train the model. We set n_estimators=100 for the number of trees in the forest.

4. Model Evaluation: We evaluate the model using accuracy_score, which measures how many predictions are correct. Additionally, we print the Classification Report and Confusion Matrix for detailed performance analysis.
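
As an optional, hedged sketch (not part of the original lab code), assuming the rf_model and iris objects from the code above, you can inspect which features the forest relied on most through the classifier's feature_importances_ attribute:

# Optional sketch: inspect feature importances of the trained model (assumes rf_model and iris from above)
import numpy as np

importances = rf_model.feature_importances_  # one importance score per feature
for name, score in zip(iris.feature_names, importances):
    print(f"{name}: {score:.3f}")

# The highest-scoring features contribute most to the ensemble's splits.
print("Most important feature:", iris.feature_names[np.argmax(importances)])
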
2. Regression Algorithm: Random Forest Regressor

Next, let's use the Random Forest Regressor for a regression task. The Random Forest Regressor works similarly to the Random Forest Classifier but for continuous target variables. It builds multiple decision trees and averages their predictions.

Steps:

• Load the dataset
• Preprocess data (train-test split)
• Train the model
• Evaluate the model

Example Code:

We'll use the Boston housing dataset, which contains information about various properties of homes and the corresponding house prices (target variable). Note that load_boston was deprecated in scikit-learn 1.0 and removed in version 1.2, so this example requires an older scikit-learn release; an alternative using the California housing dataset is sketched after the explanation below.

# Import necessary libraries
from sklearn.datasets import load_boston  # removed in scikit-learn 1.2; requires an older version
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestRegressor
from sklearn.metrics import mean_squared_error, r2_score

# Step 1: Load the Boston housing dataset
boston = load_boston()
X = boston.data  # features (e.g., crime rate, number of rooms, etc.)
y = boston.target  # target variable (house prices)

# Step 2: Train-test split
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)

# Step 3: Initialize the Random Forest Regressor model
rf_regressor = RandomForestRegressor(n_estimators=100, random_state=42)

# Step 4: Train the model
rf_regressor.fit(X_train, y_train)

# Step 5: Make predictions on the test set
y_pred = rf_regressor.predict(X_test)

# Step 6: Evaluate the model
mse = mean_squared_error(y_test, y_pred)
r2 = r2_score(y_test, y_pred)
print(f"Mean Squared Error: {mse:.2f}")
print(f"R-squared: {r2:.2f}")

Explanation:

1. Loading Data: We load the Boston housing dataset using load_boston() from sklearn.datasets. The dataset contains information about 506 homes and 13 features (such as crime rate, average number of rooms, etc.), and the target is the median value of homes in thousands of dollars.

2. Data Preprocessing: We split the dataset into training and testing sets (70% for training, 30% for testing).

3. Model Training: We use the RandomForestRegressor from sklearn.ensemble to train the model. Again, we set n_estimators=100 for the number of trees in the forest.

4. Model Evaluation: We evaluate the model using the Mean Squared Error (MSE), which measures the average squared difference between the actual and predicted values. We also use R-squared (R²), which tells us how well the model explains the variance in the data (a value close to 1 means a good fit).
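
Because load_boston was removed in scikit-learn 1.2, here is a minimal sketch of the same regression workflow on the California housing dataset (fetch_california_housing); the dataset swap is an assumption for newer scikit-learn versions, not part of the original lab:

# Alternative sketch for scikit-learn >= 1.2, where load_boston is no longer available
from sklearn.datasets import fetch_california_housing
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestRegressor
from sklearn.metrics import mean_squared_error, r2_score

# fetch_california_housing provides ~20,000 samples with 8 numeric features;
# the target is the median house value in units of $100,000.
housing = fetch_california_housing()
X_train, X_test, y_train, y_test = train_test_split(
    housing.data, housing.target, test_size=0.3, random_state=42)

rf_regressor = RandomForestRegressor(n_estimators=100, random_state=42)
rf_regressor.fit(X_train, y_train)
y_pred = rf_regressor.predict(X_test)

print(f"Mean Squared Error: {mean_squared_error(y_test, y_pred):.2f}")
print(f"R-squared: {r2_score(y_test, y_pred):.2f}")
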
Build an AutoAI Model in IBM Watson

IBM Watson AutoAI is a tool that allows users to automate the process of building, training, and deploying machine learning models. It provides an easy-to-use platform that performs tasks such as data preprocessing, model selection, and hyperparameter tuning automatically. This makes it a great tool for both novice and experienced data scientists who want to quickly create and deploy machine learning models.

Here's a step-by-step tutorial on how to build an AutoAI model in IBM Watson Studio.

Step 1: Log in to Your IBM Cloud Account

Before using IBM Watson AutoAI, you need an IBM Cloud account and access to Watson Studio. If you don't have an account, create one on IBM Cloud first, then follow these steps:

1. Log in to your IBM Cloud account.
2. Access Watson Studio:
   o Go to IBM Watson Studio.
   o Once logged in, you can access the tools for building AutoAI models.

Step 2: Create a New Project in Watson Studio

Once you have access to IBM Watson Studio, follow these steps to create a new project.

1. Login to Watson Studio:
   o Go to the IBM Watson Studio dashboard and sign in with your IBM Cloud credentials.
2. Create a New Project:
   o Click on "New Project".
   o Select "Create an empty project".
   o Choose the project name, description, and a location for your project. For example, name it "AutoAI_Model_Training".
   o Select the "Machine Learning" option.
   o Click Create.

Step 3: Upload Data to Your Project

Before building an AutoAI model, you need data. You can upload a dataset to Watson Studio for training.

1. Go to the "Assets" section in your project and click on "Add to project".
2. Choose "Data" and upload a CSV file or a dataset from a public repository.
   o For example, you can use the Iris dataset, Titanic dataset, or any other tabular dataset that you want to use for training the model.
   o Once uploaded, the data will appear in the "Data" section of the project.
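
If your data only exists in memory (for example, loaded with scikit-learn), a minimal sketch for exporting it to a CSV file suitable for upload, assuming pandas is available and using a hypothetical file name iris_for_autoai.csv, is:

# Sketch: export the Iris dataset to a CSV file for upload to Watson Studio (assumes pandas)
import pandas as pd
from sklearn.datasets import load_iris

iris = load_iris(as_frame=True)   # load features and target as a DataFrame
df = iris.frame                   # combined feature columns plus a "target" column
df.to_csv("iris_for_autoai.csv", index=False)  # hypothetical file name
print(df.head())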

Step 4: Create an AutoAI Experiment

Now you can create the AutoAI experiment to automatically build a model.

1. Go to the "Assets" tab of your project.
2. Click on "Add to project", then select "AutoAI experiment".
3. In the "Create AutoAI experiment" window:
   o Select the dataset you uploaded earlier.
   o Choose whether it is a classification or regression task. For example, if the target variable is categorical, choose Classification; if it's continuous, choose Regression (a quick way to check the target column is sketched after this list).
   o Provide a name for your experiment (e.g., "AutoAI_Iris_Experiment").
4. Start the experiment:
   o Click Create to start the experiment. The AutoAI system will automatically analyze the dataset and generate an optimal machine learning pipeline based on the problem type (classification or regression).
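
As a hedged sketch for the classification-versus-regression choice above, assuming a CSV with a column named target (such as the hypothetical iris_for_autoai.csv from Step 3), you can inspect the target column before creating the experiment:

# Sketch: decide between classification and regression from the target column
# (assumes a CSV with a column named "target"; adjust the names for your data)
import pandas as pd

df = pd.read_csv("iris_for_autoai.csv")
target = df["target"]

# A non-numeric dtype or a small number of distinct values usually indicates a
# categorical target -> classification; otherwise treat it as regression.
if target.dtype == object or target.nunique() <= 20:
    print("Target looks categorical -> choose Classification")
else:
    print("Target looks continuous -> choose Regression")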

Step 5: Data Preprocessing and Feature Engineering

AutoAI automatically performs data preprocessing tasks such as:

• Missing value imputation: Filling missing values with suitable replacements (mean, median, or mode).
• Feature scaling: Standardizing numerical features for models like SVM, Logistic Regression, etc.
• Feature selection: Identifying the most relevant features for model training.

You don't need to manually perform these steps, as AutoAI handles them during the experiment.
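
AutoAI performs these steps internally; for illustration only, a rough scikit-learn equivalent (a sketch assuming the numeric X_train/X_test/y_train/y_test split from the Iris example, not AutoAI's actual pipeline) could look like this:

# Sketch: the kind of preprocessing AutoAI automates, written out manually in scikit-learn
# (assumes numeric features and the train/test split from the Iris example above)
from sklearn.pipeline import Pipeline
from sklearn.impute import SimpleImputer
from sklearn.preprocessing import StandardScaler
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.linear_model import LogisticRegression

preprocess_and_model = Pipeline(steps=[
    ("impute", SimpleImputer(strategy="median")),         # missing value imputation
    ("scale", StandardScaler()),                          # feature scaling
    ("select", SelectKBest(score_func=f_classif, k=3)),   # feature selection
    ("model", LogisticRegression(max_iter=1000)),
])

preprocess_and_model.fit(X_train, y_train)
print("Held-out accuracy:", preprocess_and_model.score(X_test, y_test))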

Step 6: Model Selection and Hyperparameter Tuning

After preprocessing the data, AutoAI automatically runs several models and selects the best-performing ones based on your data. This includes:

• Model Selection: AutoAI uses several algorithms (e.g., Random Forest, XGBoost, Neural Networks, etc.) to automatically choose the best one for your task.
• Hyperparameter Tuning: AutoAI will perform hyperparameter optimization to fine-tune the model for the best possible performance.

This process can take anywhere from a few minutes to several hours, depending on the size and complexity of the dataset.
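
For context, the kind of search AutoAI automates can be approximated by hand; a minimal sketch using scikit-learn's GridSearchCV on a Random Forest (assuming X_train and y_train from the Iris example, and an illustrative parameter grid) is:

# Sketch: manual hyperparameter tuning, roughly what AutoAI automates
# (assumes X_train, y_train from the Iris example above; the grid is illustrative)
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV

param_grid = {
    "n_estimators": [50, 100, 200],
    "max_depth": [None, 5, 10],
}
search = GridSearchCV(
    RandomForestClassifier(random_state=42),
    param_grid,
    cv=5,                  # 5-fold cross-validation
    scoring="accuracy",
)
search.fit(X_train, y_train)
print("Best parameters:", search.best_params_)
print("Best cross-validated accuracy:", round(search.best_score_, 3))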

Step 7: Review and Compare the Models

Once the AutoAI experiment finishes running, you can view a Model Performance Dashboard that compares different models' performance on several metrics (e.g., accuracy, F1 score, ROC-AUC for classification, or MSE, R² for regression).

1. Go to the "Experiments" tab and open your experiment.
2. Review the list of trained models and their evaluation metrics.
   o You can view performance details like accuracy, precision, recall, F1-score (for classification), or R², mean squared error (for regression).
   o IBM Watson AutoAI will also show the top-performing pipeline, which is usually the best model.
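
Outside of AutoAI, a similar side-by-side comparison can be sketched with cross-validation; the following assumes X and y from the Iris example and an illustrative set of candidate models:

# Sketch: comparing several candidate models on the same data, similar in spirit
# to AutoAI's leaderboard (assumes X, y from the Iris example above)
from sklearn.model_selection import cross_val_score
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.svm import SVC

candidates = {
    "Random Forest": RandomForestClassifier(n_estimators=100, random_state=42),
    "Logistic Regression": LogisticRegression(max_iter=1000),
    "SVM": SVC(),
}
for name, model in candidates.items():
    scores = cross_val_score(model, X, y, cv=5, scoring="accuracy")
    print(f"{name}: mean accuracy = {scores.mean():.3f}")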
