Python - Vectorized - Tute - Jupyter Notebook

Uploaded by

Anvitha

Python Tutorial on Vectorized Implementation

Example 1: Illustrative Example of a Neural Network Implementation via Vectorization
In [22]:
import numpy as np

# Small number of data samples

# x0, x1, x2 = 1., 2., 3.
# bias, w1, w2 = 0.1, 0.3, 0.5

# x = [x0, x1, x2]
# w = [bias, w1, w2]
# x_vec, w_vec = np.array(x), np.array(w)

# Large number of data samples
x, w = np.random.rand(100000), np.random.rand(100000)
x_vec, w_vec = np.array(x), np.array(w)
In [23]:
# Python code to demonstrate the working of zip()

# # initializing lists
# name = [ "Manjeet", "Nikhil", "Shambhavi", "Astha" ]
# marks = [ 40, 50, 60, 70 ]

# # using zip() to map values
# mapped = zip(name, marks)
# # converting values to print as set
# mapped = set(mapped)

# # printing resultant values
# print ("The zipped result is : ",end="")
# print (mapped)

# # Unzipping the values using zip()
# c, v, = zip(*mapped)
# print('c =', c)
# print('v =',v)
In [24]:
# Neural network output with a for loop
def forloop(x, w):
    z = 0.
    for i in range(len(x)):
        z += x[i] * w[i]
    return z

# Neural network output with a comprehension-style generator expression over zip()
def listcomprehension(x, w):
    z = sum(x_i * w_i for x_i, w_i in zip(x, w))
    return z

# Neural network output in vectorized form
def vectorized(x, w):
    # dot product of the two NumPy arrays passed in
    z = x.dot(w)
    # z = (x.transpose()).dot(w)
    return z
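As a quick sanity check, the three formulations can be compared on the small input from the commented-out lines of the first cell, where the dot product is easy to verify by hand. This is only a minimal sketch: the names `dot_loop`, `dot_generator`, and `dot_numpy` are hypothetical stand-ins for the three functions above, redefined here so the block runs on its own.

```python
import numpy as np

def dot_loop(x, w):
    # explicit Python loop over the indices
    z = 0.0
    for i in range(len(x)):
        z += x[i] * w[i]
    return z

def dot_generator(x, w):
    # generator expression over paired elements
    return sum(x_i * w_i for x_i, w_i in zip(x, w))

def dot_numpy(x, w):
    # vectorized dot product
    return np.dot(x, w)

# Small sample from the tutorial's commented-out example
x_small = np.array([1.0, 2.0, 3.0])
w_small = np.array([0.1, 0.3, 0.5])

# By hand: 1*0.1 + 2*0.3 + 3*0.5 = 2.2
results = [
    dot_loop(x_small, w_small),
    dot_generator(x_small, w_small),
    dot_numpy(x_small, w_small),
]
print(results)
```

All three values should match up to floating-point rounding, which is why the comparison uses a tolerance rather than exact equality.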

Comparison of Processing Speed of the Above Three Forms of Implementation
In [25]:
# Method-1: forloop
import time
t10 = time.time()
print(forloop(x,w))
t11 = time.time()
time_forloop = t11 - t10
print(time_forloop)

25005.565200480993
0.03804349899291992
In [26]:
# Method-2: ListComprehension Implementation
import time
t20 = time.time()
print(listcomprehension(x,w))
t21 = time.time()
time_listComp = t21 - t20
print(time_listComp)

25005.565200480993
0.02210259437561035

In [27]:
# Method-3: Vectorized Implementation
import time
t30 = time.time()
print(vectorized(x_vec,w_vec))
t31 = time.time()
time_vectorized = t31 - t30
print(time_vectorized)

25005.565200480753
0.0009987354278564453
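A single `time.time()` difference can be noisy, especially for the sub-millisecond vectorized call. One possible alternative, sketched here under the assumption that the standard-library `timeit` module is acceptable, is to repeat each measurement and keep the best run. The function names and the fixed random seed are illustrative choices, not part of the original notebook.

```python
import timeit
import numpy as np

rng = np.random.default_rng(0)  # fixed seed, an arbitrary choice for repeatability
x = rng.random(100_000)
w = rng.random(100_000)

def dot_loop(x, w):
    z = 0.0
    for i in range(len(x)):
        z += x[i] * w[i]
    return z

def dot_generator(x, w):
    return sum(x_i * w_i for x_i, w_i in zip(x, w))

candidates = {
    "for loop": lambda: dot_loop(x, w),
    "generator + zip": lambda: dot_generator(x, w),
    "np.dot": lambda: np.dot(x, w),
}

timings = {}
for name, fn in candidates.items():
    # run each candidate once per repeat, three repeats, and keep the fastest
    timings[name] = min(timeit.repeat(fn, number=1, repeat=3))
    print(f"{name:>16}: {timings[name] * 1e3:.3f} ms")
```

Taking the minimum over repeats reduces the influence of unrelated background activity on the reported time.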
In [28]:
import matplotlib.pyplot as plt
%matplotlib inline
# plt.style.use('ggplot')

# Use a separate name so the data array `x` is not overwritten
methods = ['Method-1', 'Method-2', 'Method-3']
Processing_Times = [time_forloop, time_listComp, time_vectorized]


x_pos = [i for i, _ in enumerate(methods)]

plt.bar(x_pos, Processing_Times, color='green')
plt.xlabel("<-------------Method-----------> ")
plt.ylabel("<----------Time-------------> ")
plt.title(" Time Vs Method ")

plt.xticks(x_pos, methods)

plt.show()
Example 2: Illustrative Example of Predicting House Sale Prices Using the Boston Housing Dataset
In [29]:
import numpy as np
import matplotlib.pyplot as plt
%matplotlib inline

# Note: load_boston was removed in scikit-learn 1.2,
# so this cell requires an older scikit-learn release.
from sklearn.datasets import load_boston
boston_data = load_boston()
print(boston_data['DESCR'])
.. _boston_dataset:

Boston house prices dataset

---------------------------

Data Set Characteristics:

:Number of Instances: 506

:Number of Attributes: 13 numeric/categorical predictive. Median Value (attribute 14) is usually the target.

:Attribute Information (in order):

- CRIM per capita crime rate by town
- ZN proportion of residential land zoned for lots over 25,000 sq.ft.
- INDUS proportion of non-retail business acres per town
- CHAS Charles River dummy variable (= 1 if tract bounds river; 0 otherwise)
- NOX nitric oxides concentration (parts per 10 million)
- RM average number of rooms per dwelling
- AGE proportion of owner-occupied units built prior to 1940
- DIS weighted distances to five Boston employment centres
- RAD index of accessibility to radial highways
- TAX full-value property-tax rate per $10,000
- PTRATIO pupil-teacher ratio by town
- B 1000(Bk - 0.63)^2 where Bk is the proportion of black people by town
- LSTAT % lower status of the population
- MEDV Median value of owner-occupied homes in $1000's

:Missing Attribute Values: None

:Creator: Harrison, D. and Rubinfeld, D.L.

This is a copy of UCI ML housing dataset.

https://archive.ics.uci.edu/ml/machine-learning-databases/housing/

This dataset was taken from the StatLib library which is maintained at Carnegie Mellon University.

The Boston house-price data of Harrison, D. and Rubinfeld, D.L. 'Hedonic

prices and the demand for clean air', J. Environ. Economics & Management,
vol.5, 81-102, 1978. Used in Belsley, Kuh & Welsch, 'Regression diagnostics
...', Wiley, 1980. N.B. Various transformations are used in the table on
pages 244-261 of the latter.

The Boston house-price data has been used in many machine learning papers that address regression
problems.

.. topic:: References

- Belsley, Kuh & Welsch, 'Regression diagnostics: Identifying Influential Data and Sources of Collinearity',
Wiley, 1980. 244-261.
- Quinlan,R. (1993). Combining Instance-Based and Model-Based Learning. In Proceedings on the Tenth International Conference of Machine Learning, 236-243, University of Massachusetts, Amherst. Morgan Kaufmann.
In [30]:
# take the boston data
data = boston_data['data']
# we will only work with two of the features: INDUS and RM
# x_input = data[:, [2,]] # for single feature of input data (INDUS)
x_input = data[:, [2,5]] # for two features of input data (INDUS and RM)
# x_input = data[:, [2,5,7]] # for three features of input data (INDUS, RM, and DIS)
# x_input = data[:, ] # All features of input data
y_target = boston_data['target']
# print(x_input.shape[1])
# print(x_input)
# print(y_target.shape[0])
# print(y_target)

# Individual plots for the two features:
plt.title('Industrialness vs Med House Price')
plt.scatter(x_input[:, 0], y_target)
plt.xlabel('Industrialness')
plt.ylabel('Med House Price')
plt.show()

plt.title('Avg Num Rooms vs Med House Price')
plt.scatter(x_input[:, 1], y_target)
plt.xlabel('Avg Num Rooms')
plt.ylabel('Med House Price')
plt.show()

# plt.title('Avg weighted distances vs Med House Price')
# plt.scatter(x_input[:, 2], y_target)
# plt.xlabel('Avg weighted distances ')
# plt.ylabel('Med House Price')
# plt.show()
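Define cost function: Non-vectorized form

$$\mathrm{cost}(y, t) = \frac{1}{N} \sum_{i=1}^{N} \left(y^{(i)} - t^{(i)}\right)^2$$

$$\mathrm{cost}(y, t) = \frac{1}{N} \sum_{i=1}^{N} \left(w_1 x_1^{(i)} + w_2 x_2^{(i)} + b - t^{(i)}\right)^2$$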
In [31]:
# Non-vectorized implementation
def cost(w1, w2, b, X, t):
    '''
    Evaluate the cost function in a non-vectorized manner for
    inputs `X` and targets `t`, at weights `w1`, `w2` and `b`.
    '''
    costs = 0
    for i in range(len(t)):
        # y_i = w1 * X[i, 0] + w2 * X[i, 0] + b # for single feature of input data
        y_i = w1 * X[i, 0] + w2 * X[i, 1] + b # for two features of input data
        # y_i = w1 * X[i] + w2 * X[i] + b # All features of input data
        t_i = t[i]
        costs += (y_i - t_i) ** 2
    return costs / len(t)

In [32]:
cost(3, 5, 1, x_input, y_target)

Out[32]: 2475.821173270752

In [33]:
cost(3, 5, 0, x_input, y_target)

Out[33]: 2390.2197701086957

Vectorizing the cost function:

$$\mathrm{cost}(y, t) = \frac{1}{N} \left\lVert \mathbf{X}\mathbf{w} + \mathbf{b} - \mathbf{t} \right\rVert^2$$
In [35]:
def cost_vectorized(w1, w2, b, X, t):
    '''
    Evaluate the cost function in a vectorized manner for
    inputs `X` and targets `t`, at weights `w1`, `w2` and `b`.
    '''
    N = len(t)  # number of samples, taken from the argument `t`
    w = np.array([w1, w2])
    # print(w)
    y = np.dot(X, w) + b * np.ones(N)
    cost_vect = np.sum((y - t)**2) / (N)
    return cost_vect

In [36]:
cost_vectorized(3, 5, 1, x_input, y_target)

Out[36]: 2475.821173270751

In [37]:
cost(3, 5, 0, x_input, y_target)

Out[37]: 2390.2197701086957

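Comparing Processing Speed of the Vectorized vs Non-vectorized Code

Vectorized code is typically faster than non-vectorized code. With only 506 samples, however, a single `time.time()` measurement is too coarse to show this reliably, and the two timings recorded below are nearly identical. The gap widens as the dataset grows, which is a good reason to vectorize your code whenever possible.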
In [38]:
import time
t40 = time.time()
print(cost(3, 5, 1, x_input, y_target))
t41 = time.time()
time_CostNonvect = t41 - t40
print(time_CostNonvect)

2475.821173270752
0.0009961128234863281

In [39]:
import time
t50 = time.time()
print(cost_vectorized(3, 5, 1, x_input, y_target))
t51 = time.time()
time_CostVect = t51 - t50
print(time_CostVect)

2475.821173270751
0.0009663105010986328
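The two timings above are almost equal, which may simply reflect that 506 samples finish within the resolution of a single `time.time()` reading. One way to probe this, sketched below with synthetic random data standing in for the Boston features (an assumption, since the original dataset is not loaded here), is to time both versions with `timeit` at several sample counts. The names `cost_loop` and `cost_numpy` are hypothetical re-implementations of the two cost functions.

```python
import timeit
import numpy as np

def cost_loop(w1, w2, b, X, t):
    # non-vectorized mean squared error for two features
    total = 0.0
    for i in range(len(t)):
        total += (w1 * X[i, 0] + w2 * X[i, 1] + b - t[i]) ** 2
    return total / len(t)

def cost_numpy(w1, w2, b, X, t):
    # vectorized mean squared error for two features
    y = X @ np.array([w1, w2]) + b
    return np.mean((y - t) ** 2)

rng = np.random.default_rng(0)  # synthetic data, not the Boston dataset
speedups = {}
for n in (506, 5_000, 50_000):
    X = rng.random((n, 2))
    t = rng.random(n)
    # both versions should agree up to floating-point rounding
    assert np.isclose(cost_loop(3, 5, 1, X, t), cost_numpy(3, 5, 1, X, t))
    t_loop = min(timeit.repeat(lambda: cost_loop(3, 5, 1, X, t), number=1, repeat=3))
    t_vec = min(timeit.repeat(lambda: cost_numpy(3, 5, 1, X, t), number=1, repeat=3))
    speedups[n] = t_loop / t_vec
    print(f"N={n:>6}: loop {t_loop * 1e3:.3f} ms, vectorized {t_vec * 1e3:.3f} ms")
```

The exact numbers depend on the machine, so none are quoted here; the point is only to compare how the two versions scale with the number of samples.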
In [40]:
import matplotlib.pyplot as plt
%matplotlib inline
# plt.style.use('ggplot')

methods = ['Cost_NonVectorized', 'Cost_Vectorized']
Processing_Times = [time_CostNonvect, time_CostVect]


x_pos = [i for i, _ in enumerate(methods)]

plt.bar(x_pos, Processing_Times, color='green')
plt.xlabel("<-------------Method-----------> ")
plt.ylabel("<----------Time-------------> ")
plt.title(" Time Vs Method ")

plt.xticks(x_pos, methods)

plt.show()