Python Programming Tutorial for Machine Learning Beginners Using Google Colab

By Khadijah Sa’ad Mohammed

Introduction to Google Colab

Google Colab is an online platform that lets you write and run Python code through the browser. It

is particularly useful for machine learning projects because it provides free access to GPUs (Graphics

Processing Units), which can speed up the computation needed for machine learning.

Getting Started with Google Colab:

1. Go to Google Colab: https://colab.research.google.com/
2. Sign in with your Google account.
3. Click File > New notebook to start a new project. This creates a notebook where you can enter Python code.

Session 1: Python Basics in Google Colab


1. Writing Your First Program:
● Task: Print "Hello, Machine Learning!" to the screen.
● How: Type the following Python code into a new cell in your Google Colab notebook:
print("Hello, Machine Learning!")
● Execute: Press Shift + Enter to run the code in the cell.

2. Understanding Comments

In Python, the hash symbol # is used to start a comment in the code. A comment is a line of text
in your program that is not executed as part of the program. Its primary purpose is to annotate
the code to help programmers understand the code's functionality or intent more easily.
Comments can also be used to temporarily disable parts of your code during testing without deleting them.



Here's how you use comments in Python:

Examples of Using Comments


1. Single-Line Comments
You can write a comment on its own line or at the end of a line of code:

# This is a single-line comment

print("Hello, world!") # This comment follows a line of code

3. Understanding Variables and Basic Data Types


● Variables: Think of variables as containers that store data values. You can name them
whatever you like.
● Data Types: Python has several data types including integers, floats (decimal
numbers), and strings (text).
age = 25 # Integer: Whole number
height = 5.9 # Float: Decimal number
name = "Alice" # String: Text
print(age, height, name)
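
If you are unsure what type a variable holds, Python's built-in type() function will tell you; a small optional check (not part of the original example):

print(type(age))     # <class 'int'>
print(type(height))  # <class 'float'>
print(type(name))    # <class 'str'>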



Session 2: Lists, Dictionaries, Arrays and Import
1. Working with Lists:
● Purpose: Lists store multiple items in a single variable.
● Example:

fruits = ["apple", "banana", "cherry"]


print(fruits)
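
To make the idea of "multiple items in a single variable" concrete, here is a short optional sketch of two common list operations, indexing and appending, reusing the fruits list above:

print(fruits[0])         # Access the first item: apple
fruits.append("orange")  # Add a new item to the end of the list
print(fruits)            # ['apple', 'banana', 'cherry', 'orange']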

2. Exploring Dictionaries:

● Purpose: Dictionaries hold data as key-value pairs, which is similar to how a real dictionary
works with word definitions.
● Example:

student = {"name": "John", "age": 22}


print(student)
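
As an optional follow-up, values in a dictionary are looked up by their keys, and new key-value pairs can be added at any time; a brief sketch reusing the student dictionary:

print(student["name"])  # Look up a value by its key: John
student["grade"] = "A"  # Add a new key-value pair
print(student)          # {'name': 'John', 'age': 22, 'grade': 'A'}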

What is an Array?
An array is a data structure that stores a collection of items. In programming, arrays are used to
organize data so that a related set of values can be easily sorted or searched. Unlike Python's
built-in list type, which can store items of different data types, arrays typically require all
elements to be of the same type, making them more efficient for certain operations.

Arrays in Python can be created and manipulated using the NumPy library, which provides a
high-performance array object that is central to doing numerical computations.
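
To illustrate the "same type" point, a NumPy array stores every element with a single data type (its dtype); a minimal optional sketch (NumPy comes pre-installed in Google Colab):

import numpy as np
arr = np.array([1, 2, 3])
print(arr.dtype)    # A single integer type (e.g. int64) shared by every element
mixed = np.array([1, 2.5, 3])
print(mixed.dtype)  # float64: the integers are converted so all elements match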

What is import?
In Python, import is a keyword used to include the code contained in another Python source
file. In other words, import allows you to access and use functions, classes, and variables defined



in other Python files. This is particularly useful for accessing Python libraries, which are
collections of modules that include pre-written code you can include in your projects.

For example:
import numpy
This line tells Python to load the NumPy library, making all of NumPy's functions and features
available in your script.
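
Besides importing a whole library, you can also import just one name from a module; a brief optional sketch using Python's built-in math module:

from math import sqrt  # Import only the sqrt function
print(sqrt(16))        # 4.0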

What does as do in Python?

The as keyword in Python is used in conjunction with the import statement to create an alias
for the imported module. This lets you refer to the module with a different name, usually a
shorter one, which is handy if you need to call it frequently in your code.

For example:
import numpy as np
Here, np is an alias for numpy. This means that instead of typing numpy.array() to create a
new array, you can use the shorter np.array().

What is NumPy?
NumPy, which stands for Numerical Python, is an open-source Python library that is widely used
in data science and scientific computing. It's known for its powerful array object, but it also
provides:

● Functions for performing complex mathematical and logical operations on arrays.


● Tools for integrating C, C++, and Fortran code.
● Useful features for linear algebra, Fourier transforms, and random number generation.
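
As a small optional illustration of the last point, the sketch below generates random numbers and computes the length of a vector (the specific functions used here are just examples of these features):

import numpy as np
random_values = np.random.rand(3)  # Three random numbers between 0 and 1
print(random_values)
vector = np.array([3.0, 4.0])
print(np.linalg.norm(vector))      # Euclidean length of the vector: 5.0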



NumPy arrays are faster and more compact than Python lists. An array consumes less memory
and is convenient to use for mathematical operations, particularly if you have to perform
operations on large data sets, which is typical in machine learning and data analysis tasks.

Example of Using NumPy


Here’s a simple example that demonstrates creating an array with NumPy and performing a
mathematical operation:

# Importing the NumPy library and giving it an alias 'np'


import numpy as np

# Creating a NumPy array


arr = np.array([1, 2, 3, 4, 5])

# Performing an element-wise addition


new_arr = arr + 5

print(new_arr) # Output: [6 7 8 9 10]

This example shows how to create a NumPy array and then add a number to every element in
the array using NumPy's ability to handle vectorized operations efficiently. Such capabilities
make NumPy an invaluable tool for data processing in Python.



Session 3: Data Handling with Pandas
1. Introduction to Pandas

● Purpose: Pandas is a library that simplifies data manipulation and analysis.


● Loading Data:

import pandas as pd  # Import the pandas library

url = 'https://raw.githubusercontent.com/datasciencedojo/datasets/master/titanic.csv'
data = pd.read_csv(url)
print(data.head())  # Displays the first 5 rows of the dataset

2. Basic Data Operations:

● View Data: See what your data looks like.

print(data.head()) # First 5 rows

● Statistics: Get a summary of the statistics pertaining to the DataFrame.

print(data.describe()) # Summary statistics
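
Two more optional operations that are often useful at this stage (a sketch, not part of the original list): selecting a single column and checking the overall size of the DataFrame:

print(data['Age'].head())  # First 5 values of the Age column
print(data.shape)          # (number of rows, number of columns)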

Further Explanation: Loading and Understanding the Titanic Survival Dataset

This dataset includes various passenger attributes such as age, sex, passenger class, and
whether the passenger survived the sinking of the Titanic.

Step-by-Step Instructions to Load and View the Titanic Dataset:

1. Importing Libraries:

import pandas as pd # Pandas library for data manipulation



2. Loading the Dataset from a URL:
The Titanic dataset is available on many platforms, but we'll use a version from GitHub that is clean and ready to use.

url = 'https://raw.githubusercontent.com/datasciencedojo/datasets/master/titanic.csv'

data = pd.read_csv(url)

3. Displaying the First Few Rows:

print(data.head()) # Displays the first 5 rows of the dataset

Here’s how the output might look, displaying key data for the first few passengers:

   PassengerId  Survived  Pclass                                               Name     Sex   Age  SibSp  Parch            Ticket     Fare Cabin Embarked
0            1         0       3                            Braund, Mr. Owen Harris    male  22.0      1      0         A/5 21171   7.2500   NaN        S
1            2         1       1  Cumings, Mrs. John Bradley (Florence Briggs Th...  female  38.0      1      0          PC 17599  71.2833   C85        C
2            3         1       3                             Heikkinen, Miss. Laina  female  26.0      0      0  STON/O2. 3101282   7.9250   NaN        S
3            4         1       1       Futrelle, Mrs. Jacques Heath (Lily May Peel)  female  35.0      1      0            113803  53.1000  C123        S
4            5         0       3                           Allen, Mr. William Henry    male  35.0      0      0            373450   8.0500   NaN        S



Key Columns in the Dataset:

● PassengerId: An identifier for the passenger.


● Survived: Whether the passenger survived (1) or not (0).
● Pclass: The passenger's class (1st, 2nd, or 3rd) which is a proxy for socio-economic
status.
● Name: The name of the passenger.
● Sex: The passenger's gender.
● Age: The passenger's age.
● SibSp: The number of siblings or spouses the passenger had aboard.
● Parch: The number of parents or children the passenger had aboard.
● Ticket: The passenger's ticket number.
● Fare: How much the passenger paid for the ticket.
● Cabin: The passenger's cabin number.
● Embarked: The port where the passenger embarked (C = Cherbourg; Q = Queenstown; S
= Southampton).
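
To get a feel for some of these columns, you can count how often each value occurs; an optional sketch using two of the columns described above:

print(data['Survived'].value_counts())  # Passengers who survived (1) vs. did not (0)
print(data['Embarked'].value_counts())  # Passengers boarding at each port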

Basic Data Examination

It's useful to get a quick overview of the dataset, including checking for missing values and
understanding the distribution of numerical values.

# Summary statistics for numerical features

print(data.describe())

# Count of missing values per column

print(data.isnull().sum())
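
Once you know which columns have missing values, a common next step (not part of the original code) is to fill them in; a minimal sketch that replaces missing ages with the median age:

data['Age'] = data['Age'].fillna(data['Age'].median())  # Fill missing ages with the median
print(data['Age'].isnull().sum())                       # Age now has 0 missing values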



Simple Visualization

Visualizing data can provide insights that are not obvious from raw numbers alone.

import seaborn as sns


import matplotlib.pyplot as plt
# Visualizing survival rates based on passenger class
sns.barplot(x='Pclass', y='Survived', data=data)
plt.title('Survival Rates by Passenger Class')
plt.show()

This approach with the Titanic dataset makes it easier for beginners to grasp basic data handling
and analysis operations in Python, providing a foundation to build on for more complex
machine learning tasks.



Session 4: Introduction to Machine Learning with Scikit-learn
1. Supervised Learning Example: Logistic Regression

● Purpose: Predict a category based on input variables.

● Preparing Data:
from sklearn.model_selection import train_test_split
# Using the Titanic data loaded earlier: the target is 'Survived';
# a few numeric columns serve as features, with missing values filled with 0
X = data[['Pclass', 'Age', 'SibSp', 'Parch', 'Fare']].fillna(0)  # Features
y = data['Survived']  # Target variable
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2)

● Building and Training the Model:


from sklearn.linear_model import LogisticRegression
model = LogisticRegression(max_iter=1000)  # More iterations so the solver converges on this data
model.fit(X_train, y_train)

● Making Predictions and Evaluating the Model:


predictions = model.predict(X_test)
from sklearn.metrics import accuracy_score
print("Accuracy:", accuracy_score(y_test, predictions))



2. Unsupervised Learning Example: K-means Clustering

● Purpose: Group data points into clusters based on feature similarity.


● Example:
from sklearn.cluster import KMeans
import pandas as pd  # Needed for pd.DataFrame below
# Example dataset
X = pd.DataFrame({
    "x": [1, 2, 3, 6, 7, 8],
    "y": [1, 1, 2, 6, 7, 8]
})
kmeans = KMeans(n_clusters=2)
kmeans.fit(X)  # Fit the model to the example data

Further explanation: K-means Clustering Explained

K-means clustering is an unsupervised learning algorithm that seeks to partition a set of data points into a specified number of clusters, K. The goal is to divide the data so that the sum of the squared distances between the data points and the centroid (mean) of their respective clusters is minimized.

1. Importing Libraries

import pandas as pd
from sklearn.cluster import KMeans
import matplotlib.pyplot as plt
● pandas is used for data manipulation and analysis.
● sklearn.cluster contains the K-means clustering algorithm.
● matplotlib.pyplot is used for creating visualizations to see the results.
2. Creating an Example Dataset

X = pd.DataFrame({
    "x": [1, 2, 3, 6, 7, 8],
    "y": [1, 1, 2, 6, 7, 8]
})

● This step initializes a DataFrame X with two features: x and y. These features represent coordinates in a 2D space where each point is a data point to be clustered.
3. Applying K-means Clustering

kmeans = KMeans(n_clusters=2)

kmeans.fit(X)

● KMeans(n_clusters=2) initializes the K-means algorithm to partition the


data into 2 clusters.
● .fit(X) fits the model to the dataset X. This method runs the K-means
clustering algorithm, which involves assigning initial centroids randomly,
then iteratively relocating them to minimize the within-cluster sum of
squares.
4. Storing Cluster Labels

labels = kmeans.labels_

X['Cluster'] = labels

● kmeans.labels_ retrieves the cluster labels for each data point in X. These
labels indicate the cluster to which each data point belongs.
● Adding these labels to the DataFrame X as a new column named Cluster
helps in tracking and visualizing which point belongs to which cluster.
5. Visualizing the Clusters

plt.scatter(X['x'], X['y'], c=X['Cluster'], cmap='viridis')


plt.title('K-means Clustering')
plt.xlabel('X coordinate')
plt.ylabel('Y coordinate')
plt.show()



● plt.scatter() creates a scatter plot of the x and y coordinates. The color
of each point (c=X['Cluster']) varies according to the cluster
assignment, which helps in visually distinguishing the clusters.
● cmap='viridis' specifies the color map used to color the data points.
Different color maps can be used depending on preference or visibility.
● plt.show() displays the plot. This visualization shows the final clustering
result, where the goal of minimizing the intra-cluster distances and
maximizing the inter-cluster distances is visually evident.
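
After fitting, you can also inspect the centroids and the within-cluster sum of squares mentioned in step 3; a short optional sketch using attributes of the fitted KMeans object:

print(kmeans.cluster_centers_)  # Coordinates of the two cluster centroids
print(kmeans.inertia_)          # Sum of squared distances from each point to its centroid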

Thank you for following this Python Programming Tutorial for Machine Learning Beginners.
If you have any questions or feedback, please feel free to reach out!

Khadijah Sa’ad Mohammed


[email protected]
www.culminatech.com

Happy coding!

