This document demonstrates the scikit-learn 4-step modeling pattern for machine learning in Python. It shows how to (1) import an estimator class, (2) instantiate the model, (3) fit the model to training data, and (4) predict new observations. As an example, it loads the Iris dataset, fits a k-nearest neighbors model (with different values of K), and makes predictions on test data. It then repeats the process using a logistic regression model instead of k-NN.
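For quick reference, here is a compact, self-contained sketch of the same workflow that the document then walks through step by step (variable names are illustrative):

import numpy as np
from sklearn.datasets import load_iris
from sklearn.neighbors import KNeighborsClassifier   # Step 1: import the estimator class

iris = load_iris()
X, y = iris.data, iris.target

knn = KNeighborsClassifier(n_neighbors=1)             # Step 2: instantiate the estimator
knn.fit(X, y)                                         # Step 3: fit the model to the training data
knn.predict(np.array([[3, 5, 4, 2]]))                 # Step 4: predict for a new observation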
Loading the data
In [ ]: # import NumPy and the load_iris function from the datasets module
import numpy as np
from sklearn.datasets import load_iris
# save "bunch" object containing iris dataset and its attributes
iris = load_iris()
# store feature matrix in "X"
X = iris.data
# store response vector in "y"
y = iris.target

In [ ]: # print the shapes of X and y
print(X.shape)
print(y.shape)

scikit-learn 4-step modeling pattern

Step 1: Import the class you plan to use

In [ ]: from sklearn.neighbors import KNeighborsClassifier

Step 2: "Instantiate" the "estimator"

• "Estimator" is scikit-learn's term for model
• "Instantiate" means "make an instance of"

In [ ]: knn = KNeighborsClassifier(n_neighbors=1)

• Name of the object does not matter
• Can specify tuning parameters (aka "hyperparameters") during this step
• All parameters not specified are set to their defaults

In [ ]: print(knn)

Step 3: Fit the model with data (aka "model training")

• Model is learning the relationship between X and y
• Occurs in-place

In [ ]: knn.fit(X, y)

Step 4: Predict the response for a new observation

• New observations are called "out-of-sample" data
• Uses the information it learned during the model training process

In [ ]: X_new = np.array([3, 5, 4, 2]).reshape(1, 4)
knn.predict(X_new)

• Returns a NumPy array
• Can predict for multiple observations at once

In [ ]: X_new = np.array([[3, 5, 4, 2], [5, 4, 3, 2]])
knn.predict(X_new)

Using a different value for K

In [ ]: # instantiate the model (using the value K=5)
knn = KNeighborsClassifier(n_neighbors=5)
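# illustrative check (not part of the original cell): any parameter not specified
# above, such as weights or metric, keeps its default value; get_params() returns
# the estimator's full parameter dictionary
print(knn.get_params())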
# fit the model with data
knn.fit(X, y)
# predict the response for new observations
knn.predict(X_new)

Using a different classification model

In [ ]: # import the class
from sklearn.linear_model import LogisticRegression
# instantiate the model (using the default parameters)
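# sketch of how this truncated cell presumably continues, following the same
# 4-step pattern as above ("logreg" is an assumed variable name)
logreg = LogisticRegression()

# fit the model with data
logreg.fit(X, y)

# predict the response for the new observations
logreg.predict(X_new)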