
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING


Basics Of Machine Learning and Applications
CS 106

SUBMITTED TO: Mr. Abhishek Sir


SUBMITTED BY: MUSKAN SONI
(24/A04/071)
Table of Contents

S. No.  Experiment Name                                            Date
1       Write down 10 basic libraries for Python implementation   01/01/2025
2       Data preprocessing step-by-step implementation            03/01/2025
3       Basic mathematical function operations using Python       05/01/2025
4       Python data structures: array, list, vector, matrix,      07/01/2025
        dictionary with basic operations
5       Linear regression on house price prediction using         09/01/2025
        gradient descent
6       Confusion matrix using logistic regression                11/01/2025
7       Optimised model on a given dataset                        13/01/2025
8       KNN and K-Means clustering                                15/01/2025
9       Visualizing relationships with Matplotlib and Seaborn     17/01/2025
Experiment 1:
Write down 10 basic libraries for Python
implementation:
1.1: NumPy – Numerical Computations
What is NumPy?
NumPy (Numerical Python) is a fundamental library for numerical computing in
Python. It provides support for handling large multi-dimensional arrays and
matrices, along with mathematical functions to operate on these arrays.
Key Features:
• Provides high-performance multidimensional arrays.
• Supports mathematical and statistical operations.
• More efficient than Python lists (uses less memory).
Applications:
• Scientific computing.
• Data analysis and machine learning.
• Image and signal processing.
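A minimal sketch of NumPy in action:

import numpy as np

a = np.array([1, 2, 3, 4])        # a 1D array
print(a * 2)                      # vectorized arithmetic: [2 4 6 8]
print(a.mean(), a.std())          # built-in statistics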
1.2: Pandas – Data Manipulation & Analysis
What is Pandas?
Pandas is a powerful library used for data manipulation and analysis. It provides
two main data structures:
• Series (1D labeled array)
• DataFrame (2D labeled table similar to a spreadsheet)
Key Features:
• Handles structured data efficiently (CSV, Excel, databases).
• Supports filtering, grouping, merging, and statistical analysis.
• Works well with NumPy and Matplotlib.
Applications:
• Data preprocessing in machine learning.
• Financial and economic data analysis.
• Handling missing data in datasets.
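A minimal sketch of the two core structures, using made-up values:

import pandas as pd

s = pd.Series([10, 20, 30], name="marks")        # 1D labeled array
df = pd.DataFrame({"name": ["A", "B", "C"],      # 2D labeled table
                   "marks": [85, None, 90]})
print(s.mean())                                  # 20.0
print(df["marks"].fillna(df["marks"].mean()))    # one way to handle missing data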

1.3: Matplotlib – Data Visualization
What is Matplotlib?
Matplotlib is a Python library used to create static, animated, and
interactive plots. It allows users to generate bar charts, line graphs,
scatter plots, histograms, and more.
Key Features:
• Highly customizable visualization library.
• Can generate plots in multiple formats (PNG, PDF, SVG).
• Works well with NumPy and Pandas.
Applications:
• Data visualization in research and analysis.
• Exploratory Data Analysis (EDA) in machine learning.
• Plotting real-time sensor data.
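A minimal sketch of a simple plot:

import matplotlib.pyplot as plt

x = [1, 2, 3, 4]
y = [1, 4, 9, 16]
plt.plot(x, y, marker="o")        # simple line plot
plt.xlabel("x")
plt.ylabel("x squared")
plt.savefig("plot.png")           # also supports PDF and SVG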


1.4: Seaborn – Statistical Data Visualization
What is Seaborn?
Seaborn is built on top of Matplotlib and provides statistical visualizations with
better aesthetics. It is widely used for data analysis and visual storytelling.
Key Features:
• Supports advanced plots like heatmaps, violin plots, and pair plots.
• Works seamlessly with Pandas DataFrames.
• Includes built-in datasets for practice.
Applications:
• Data science projects.
• Statistical modeling and analysis.
• Correlation analysis (e.g., heatmaps).
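A minimal sketch, assuming Seaborn's built-in tips dataset (downloaded on first use):

import seaborn as sns
import matplotlib.pyplot as plt

tips = sns.load_dataset("tips")                           # built-in practice dataset
corr = tips.select_dtypes("number").corr()                # numeric correlations
sns.heatmap(corr, annot=True)                             # correlation heatmap
plt.show()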

1.5: Scikit-learn – Machine Learning


What is Scikit-learn?
Scikit-learn is a popular Python library for machine learning and data mining. It
provides simple and efficient tools for classification, regression, clustering, and
dimensionality reduction.
Key Features:
• Built-in datasets for testing ML models.
• Supports supervised and unsupervised learning.
• Works well with NumPy, Pandas, and Matplotlib.
Applications:
• Predictive modeling (e.g., price predictions, medical diagnosis).
• Text classification (e.g., spam detection).
• Image recognition and face detection.
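A minimal sketch, assuming the built-in iris dataset:

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)                          # built-in dataset
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)
model = DecisionTreeClassifier().fit(X_train, y_train)     # supervised learning
print(model.score(X_test, y_test))                         # test accuracy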

1.6: TensorFlow – Deep Learning & AI


What is TensorFlow?
TensorFlow is an open-source deep learning framework developed by Google. It
is used for building and training neural networks in AI applications.
Key Features:
• Supports both CPU and GPU acceleration for fast computations.
• Provides tools for building neural networks (ANN, CNN, RNN).
• Works well with large datasets and complex models.
Applications:
• Image and speech recognition.
• Self-driving cars (object detection).
• Natural Language Processing (NLP) in chatbots and translators.
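A minimal sketch, assuming TensorFlow 2.x (layer sizes are arbitrary):

import tensorflow as tf

# a tiny feed-forward network (ANN) for 10-class classification
model = tf.keras.Sequential([
    tf.keras.Input(shape=(4,)),
    tf.keras.layers.Dense(32, activation="relu"),
    tf.keras.layers.Dense(10, activation="softmax"),
])
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy")
model.summary()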

1.7: Requests – HTTP Requests (Fetching Data from APIs)


What is Requests?
Requests is a library used to send HTTP requests in Python. It allows users to
fetch data from web services and APIs (Application Programming Interfaces).
Key Features:
• Supports GET, POST, PUT, DELETE requests.
• Handles authentication, cookies, and sessions.
• Can be used to scrape websites and fetch live data.
Applications:
• Fetching weather data from APIs.
• Automating web interactions.
• Downloading content from websites.
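A minimal sketch, using GitHub's public API as an example endpoint:

import requests

resp = requests.get("https://api.github.com")   # any JSON API works the same way
print(resp.status_code)                         # 200 means success
print(resp.json())                              # response body parsed as JSON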

1.8 : BeautifulSoup – Web Scraping


What is BeautifulSoup?
BeautifulSoup is a Python library used for web scraping. It helps extract specific
data from HTML and XML files.
Key Features:
• Parses HTML and XML files.
• Extracts data from web pages easily.
• Works well with Requests for scraping live websites.
Applications:
• Scraping job listings from websites.
• Extracting data from Wikipedia.
• Automating data collection from the web.
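A minimal sketch, fetching example.com and listing its links:

import requests
from bs4 import BeautifulSoup

html = requests.get("https://example.com").text
soup = BeautifulSoup(html, "html.parser")
print(soup.title.string)              # the page <title>
for link in soup.find_all("a"):       # every hyperlink on the page
    print(link.get("href"))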
1.9: Flask – Web Development
What is Flask?
Flask is a lightweight web framework used to develop web applications in
Python. It is simple, scalable, and widely used for API development.
Key Features:
• Provides an easy way to create RESTful APIs.
• Has built-in support for routing; databases are handled through extensions (e.g., Flask-SQLAlchemy).
• Lightweight and easy to integrate with front-end frameworks.
Applications:
• Developing web-based machine learning applications.
• Creating REST APIs for mobile apps.
• Backend for small-scale web applications.
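A minimal sketch, with a hypothetical /hello endpoint:

from flask import Flask, jsonify

app = Flask(__name__)

@app.route("/hello")                  # a minimal REST endpoint
def hello():
    return jsonify(message="Hello, world")

if __name__ == "__main__":
    app.run(debug=True)               # serves on http://127.0.0.1:5000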
1.10: OpenCV – Image Processing & Computer Vision
What is OpenCV?
OpenCV (Open Source Computer Vision) is a library used for image processing
and computer vision tasks. It can handle image recognition, object detection,
and video analysis.
Key Features:
• Supports face detection, edge detection, and image transformations.
• Works with real-time video processing.
• Can integrate with deep learning models.
Applications:
• Face recognition in security systems.
• Object detection in autonomous vehicles.
• Barcode and QR code scanning.
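A minimal sketch, assuming an image file (the name "photo.jpg" is a placeholder):

import cv2

img = cv2.imread("photo.jpg")                   # placeholder file name
assert img is not None, "image not found"
gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)    # grayscale conversion
edges = cv2.Canny(gray, 100, 200)               # Canny edge detection
cv2.imwrite("edges.png", edges)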


Summary Table

Library        Purpose
NumPy          Numerical computations, array handling
Pandas         Data manipulation and analysis
Matplotlib     Data visualization (charts, graphs)
Seaborn        Statistical data visualization
Scikit-learn   Machine learning models
TensorFlow     Deep learning and AI
Requests       Fetching data from web APIs
BeautifulSoup  Web scraping and data extraction
Flask          Web development (APIs, backend)
OpenCV         Image processing and computer vision

Experiment 2:
Data preprocessing step-by-step implementation:
Data preprocessing in machine learning (ML) is the process of transforming raw
data into a clean and structured format before feeding it into a machine learning
model. It's a crucial step because real-world data is often incomplete,
inconsistent, or noisy.
Why is it important?
Data preprocessing is essential because it prepares raw data for modeling by
cleaning, transforming, and organizing it. Most algorithms can’t handle missing
values, inconsistent formats, or non-numeric data, so preprocessing ensures the
data is usable. It improves model accuracy by helping the algorithm learn
patterns more effectively and reduces noise and bias for fairer predictions. It
also speeds up training and helps prevent overfitting or underfitting. Good
preprocessing leads to smarter, faster, and more reliable models.

Steps in Data Preprocessing:


2.1: Data Cleaning
1. Handling missing values by filling in with mean/median, or dropping
2. Removing duplicates

3. Fixing inconsistencies (e.g., typos or formatting issues)

2.2: Data Transformation
1. Encoding categorical data: Converting text labels into numbers

2. Log transformations or Box-Cox: Making data more normally distributed if needed

2.3: Feature Selection / Extraction

1. Selecting the most relevant variables (features) for the model
2. Creating new features (feature engineering)

2.4: Data Splitting


1. Dividing the dataset into training, validation, and test sets.

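A minimal sketch of the four steps above, using a made-up DataFrame:

import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import LabelEncoder

# made-up raw data with a missing value, a duplicate row, and a categorical column
df = pd.DataFrame({
    "age": [25, 30, None, 40, 40],
    "city": ["Delhi", "Mumbai", "Delhi", "Pune", "Pune"],
    "purchased": [0, 1, 1, 0, 0],
})

df = df.drop_duplicates()                               # 2.1: remove duplicates
df["age"] = df["age"].fillna(df["age"].mean())          # 2.1: fill missing with mean
df["city"] = LabelEncoder().fit_transform(df["city"])   # 2.2: encode categories
X, y = df[["age", "city"]], df["purchased"]             # 2.3: select features
X_train, X_test, y_train, y_test = train_test_split(    # 2.4: split the data
    X, y, test_size=0.25, random_state=0)
print(X_train)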

EXPERIMENT 3:
Basic mathematical function operations using Python:
3.1: Basic Math Operations in Python

These are basic operations like addition, subtraction, multiplication, exponentiation, etc.,
performed using Python.

3.1.1: Using Functions for Math
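A minimal sketch of user-defined math functions:

def add(a, b):
    return a + b

def power(base, exp):
    return base ** exp            # exponentiation with **

print(add(3, 5))                  # 8
print(power(2, 10))               # 1024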

3.1.2: Using the math Module
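A minimal sketch of the math module:

import math

print(math.sqrt(16))              # 4.0
print(math.pi)                    # 3.141592653589793
print(math.sin(math.pi / 2))      # 1.0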


3.2: More Complex Mathematical Functions in Python

3.2.1: Logarithmic and Exponential Functions
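A minimal sketch:

import math

print(math.log(100, 10))          # base-10 logarithm: 2.0
print(math.log(math.e))           # natural logarithm: 1.0
print(math.exp(1))                # e raised to the power 1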

3.2.2: Factorials and Combinations
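A minimal sketch (math.comb and math.perm need Python 3.8 or newer):

import math

print(math.factorial(5))          # 120
print(math.comb(5, 2))            # 10 ways to choose 2 items from 5
print(math.perm(5, 2))            # 20 ordered arrangements of 2 from 5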


3.3: Rounding, Floor, Ceil & Modf
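A minimal sketch:

import math

print(round(3.567, 2))            # 3.57
print(math.floor(3.7))            # 3
print(math.ceil(3.2))             # 4
print(math.modf(3.75))            # (0.75, 3.0): fractional and integer parts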

3.4: Complex Numbers (cmath module)
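A minimal sketch:

import cmath

z = 3 + 4j
print(abs(z))                     # magnitude: 5.0
print(cmath.phase(z))             # angle in radians
print(cmath.sqrt(-1))             # 1j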


3.5: Using NumPy for Arrays of Math
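A minimal sketch:

import numpy as np

a = np.array([1, 4, 9, 16])
print(np.sqrt(a))                 # element-wise square root
print(np.log(a))                  # element-wise natural log
print(a.sum(), a.mean())          # aggregate operations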
EXPERIMENT 4:
To implement the Python data structures array, list, vector,
matrix, and dictionary with basic operations
1. Array
An array is a fixed-type collection of elements stored in a contiguous memory
location. In Python, it's created using the array module and holds data of the
same type.
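A minimal sketch of basic array operations:

import array

arr = array.array("i", [10, 20, 30])   # "i" = signed integer type code
arr.append(40)                         # add an element
arr.remove(20)                         # remove by value
print(arr[0], len(arr), list(arr))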

2. List
A list is a built-in, ordered, and mutable collection that can hold
elements of different types. It supports dynamic resizing and is used
frequently in Python programming.
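A minimal sketch of basic list operations:

nums = [1, "two", 3.0]            # mixed types are allowed
nums.append(4)                    # dynamic resizing
nums[0] = 100                     # lists are mutable
print(nums, nums[-1], len(nums))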

3. Vector
A vector is essentially a 1D array, often implemented using NumPy for
mathematical operations.
It supports efficient numerical computation and broadcasting in Python.

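A minimal sketch of vector operations with NumPy:

import numpy as np

v1 = np.array([1, 2, 3])
v2 = np.array([4, 5, 6])
print(v1 + v2)                    # element-wise addition
print(v1 * 2)                     # broadcasting with a scalar
print(np.dot(v1, v2))             # dot product: 32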

4. Matrix
A matrix is a 2D array-like structure used to represent rows and columns of
data. In Python, it’s usually implemented using NumPy for linear algebra and
data manipulation.

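A minimal sketch of matrix operations with NumPy:

import numpy as np

m = np.array([[1, 2], [3, 4]])
print(m.T)                        # transpose
print(m @ m)                      # matrix multiplication
print(np.linalg.det(m))           # determinant (about -2.0)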
5. Dictionary
A dictionary is a key-value pair data structure used to store and retrieve data
efficiently. Keys must be unique and immutable, while values can be any data
type.
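A minimal sketch of dictionary operations (values are examples):

student = {"name": "Muskan", "roll": 71}    # key-value pairs
student["course"] = "CS 106"                # insert a new key
print(student.get("name"))                  # safe lookup
del student["roll"]                         # remove a key
print(list(student.keys()), list(student.values()))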
EXPERIMENT 5:
Implement the linear regression model on house price prediction
and calculate the weight and bias using gradient descent
To implement a linear regression model for house price prediction and calculate the weight
and bias using gradient descent, we'll follow these steps:

1. Create a Dataset: For simplicity, let's generate a synthetic dataset where the
independent variable is the number of rooms in a house and the dependent variable is the
house price.

2. Linear Regression Model: The model equation will be y = w * x + b, where y is the
house price, x is the number of rooms, w is the weight (slope), and b is the bias (intercept).

3. Gradient Descent: We will use gradient descent to minimize the cost function (Mean
Squared Error, MSE) and calculate the optimal weight w and bias b.
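A minimal sketch, using made-up rooms/price values:

import numpy as np

# made-up synthetic data: number of rooms vs. price (in thousands)
x = np.array([1, 2, 3, 4, 5], dtype=float)
y = np.array([150, 200, 250, 300, 350], dtype=float)

w, b = 0.0, 0.0        # weight and bias, initialised to zero
lr = 0.01              # learning rate
n = len(x)

for _ in range(10000):
    y_pred = w * x + b
    error = y_pred - y
    dw = (2 / n) * np.dot(error, x)   # d(MSE)/dw
    db = (2 / n) * error.sum()        # d(MSE)/db
    w -= lr * dw
    b -= lr * db

print(f"w = {w:.2f}, b = {b:.2f}")    # should approach w = 50, b = 100

The learning rate and iteration count here are arbitrary choices; too large a learning rate makes the updates diverge instead of converging.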
EXPERIMENT 6
Prediction model: making a confusion matrix using logistic regression
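A confusion matrix summarizes a classifier's predictions against the true labels. A minimal sketch, assuming scikit-learn's built-in breast cancer dataset (the file does not name one):

from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import confusion_matrix
from sklearn.model_selection import train_test_split

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)
clf = LogisticRegression(max_iter=5000).fit(X_train, y_train)
print(confusion_matrix(y_test, clf.predict(X_test)))   # rows = true labels, columns = predicted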
EXPERIMENT 7

Implement an optimised model on a given dataset.


EXPERIMENT 8

Implement KNN and K-Means Clustering


K-Nearest Neighbors (KNN) is a supervised learning algorithm used for classification. It works by finding the ‘k’
closest data points (neighbors) to a new point and assigning the most common label among them. It’s simple and
effective, especially for small datasets.

On the other hand, K-Means Clustering is an unsupervised algorithm used to group data into clusters based on
similarity. It starts by choosing ‘k’ cluster centers and then repeatedly assigns points to the nearest cluster and
updates the centers. KNN needs labeled data, while K-Means works without labels and helps discover patterns
in data.
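A minimal sketch of both algorithms, assuming scikit-learn's built-in iris dataset (the file does not name one):

from sklearn.cluster import KMeans
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

X, y = load_iris(return_X_y=True)

# KNN: supervised, needs the labels y
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)
knn = KNeighborsClassifier(n_neighbors=5).fit(X_train, y_train)
print("KNN accuracy:", knn.score(X_test, y_test))

# K-Means: unsupervised, never sees y
km = KMeans(n_clusters=3, n_init=10, random_state=0).fit(X)
print("Cluster labels:", km.labels_[:10])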
EXPERIMENT 9

Use matplotlib and seaborn to visualize relationships in a dataset.
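A minimal sketch, assuming Seaborn's built-in tips dataset:

import matplotlib.pyplot as plt
import seaborn as sns

tips = sns.load_dataset("tips")                                  # built-in sample dataset
sns.scatterplot(data=tips, x="total_bill", y="tip", hue="time")  # relationship between two variables
plt.title("Tip vs. total bill")
plt.show()

sns.pairplot(tips, hue="sex")                                    # all pairwise relationships at once
plt.show()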
