Skill Based Projects - Data Science (See List On Last Page)
Course Objectives
● To provide fundamental knowledge about Data Science
● To understand data pre-processing techniques
● To analyze Data Science techniques for solving real-world problems
—----------------------------------------------------------------------------------------------------------------------------
List of Experiments:
1. Study of data science libraries such as NumPy and pandas for numerical computations and
data manipulation.
2. Write a Python program to compute descriptive statistics: measures of central tendency
(mean, median and mode), measures of dispersion (variance, standard deviation), skewness
and kurtosis.
3. Write a program to create visualizations (box plots, scatter plots) to identify outliers and
relationships between variables.
4. Write a program to show various encoding methods used for ordinal data.
5. Write a program to normalize and standardize the data.
6. Build a linear regression model to predict a continuous target variable and evaluate model
performance using R-squared, MSE, and RMSE.
7. Implement logistic regression for binary classification and evaluate accuracy, sensitivity, and
specificity.
8. Construct a decision tree for multi-class classification using various feature selection techniques.
9. Implement the k-means clustering Algorithm for a given dataset.
10. Implement the DBSCAN algorithm for a given dataset and explain how it can be used for outlier
detection.
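As a starting point for Experiment 2, the following is a minimal sketch that uses only the Python standard library. The function name `describe` and the sample values are illustrative assumptions, not part of the syllabus. It uses the population (divide-by-n) formulas for variance, skewness and excess kurtosis; library routines such as those in pandas may default to sample-corrected formulas and give slightly different numbers.

```python
import statistics
from math import sqrt


def describe(values):
    """Return basic descriptive statistics for a list of numbers.

    Hypothetical helper for illustration; uses population formulas.
    """
    n = len(values)
    mean = statistics.fmean(values)
    # Population variance and standard deviation (divide by n).
    variance = statistics.pvariance(values, mu=mean)
    std_dev = sqrt(variance)
    # Third and fourth central moments for skewness and kurtosis.
    m3 = sum((x - mean) ** 3 for x in values) / n
    m4 = sum((x - mean) ** 4 for x in values) / n
    return {
        "mean": mean,
        "median": statistics.median(values),
        "mode": statistics.mode(values),
        "variance": variance,
        "std_dev": std_dev,
        "skewness": m3 / std_dev ** 3,
        "kurtosis": m4 / std_dev ** 4 - 3.0,  # excess kurtosis
    }


# Made-up sample data, used only to exercise the function.
sample = [2, 4, 4, 4, 5, 5, 7, 9]
stats = describe(sample)
for name, value in stats.items():
    print(f"{name}: {value}")
```

The sketch assumes the data are not all identical; a zero standard deviation would make skewness and kurtosis undefined.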
—------------------------------------------------------------------------------------------------------------------
Dr. Abhishek Bhatt, Faculty, Centre for AI, MITS-DU, Gwalior
MADHAV INSTITUTE OF TECHNOLOGY & SCIENCE, GWALIOR (M.P.), INDIA
Deemed to be University
(Declared under Distinct Category by Ministry of Education, Government of India)
NAAC ACCREDITED WITH A++ GRADE
Centre for Artificial Intelligence
9. Write a Python program to calculate the variance, standard deviation, skewness and
kurtosis.
10. Write a program to calculate Z-Score for any data.
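For the Z-score task, one possible sketch is shown here. It assumes the population standard deviation is the intended scale (the task does not say which), and the name `z_scores` and the sample values are made up for illustration. Each value x is mapped to (x - mean) / standard deviation.

```python
import statistics


def z_scores(values):
    """Return the Z-score of every value, using the population standard deviation."""
    mean = statistics.fmean(values)
    std_dev = statistics.pstdev(values)
    if std_dev == 0:
        # All values are identical, so Z-scores are undefined.
        raise ValueError("standard deviation is zero; Z-scores are undefined")
    return [(x - mean) / std_dev for x in values]


# Made-up sample data, used only to exercise the function.
scores = z_scores([2, 4, 4, 4, 5, 5, 7, 9])
print(scores)
```

Values with a large absolute Z-score (a common rule of thumb is above 3) are often treated as candidate outliers.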
Macro Projects:
1. Write a program to identify missing values in any dataset and show how to handle and
replace them.
2. Write a program to show one hot encoding in any dataset.
3. Write a program to show label encoding in any dataset.
4. Write a Python program to count the frequency of occurrence of each word (frequency
distribution) in a body of text.
5. Write a Python program to draw a correlation matrix.
6. Write a program to draw a residual plot for any data.
7. Write a program to show the distributions of the variables in any dataset.
8. Write a program to compute weighted averages in Python, either by defining your own
functions or by using NumPy.
9. Develop a machine learning model to detect fraudulent credit card transactions. Explore
anomaly detection techniques and evaluate model performance.
10. Build a model to detect fake news articles.
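Two of the smaller tasks above (word frequency and weighted averages) can be sketched with the standard library alone. The function names, the tokenizing rule (lowercase letters and apostrophes) and the sample inputs are assumptions made for illustration; a real solution might use NumPy's `average` with its `weights` argument instead of the hand-written function.

```python
import re
from collections import Counter


def word_frequencies(text):
    """Count how often each word occurs, ignoring case and punctuation."""
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(words)


def weighted_average(values, weights):
    """Return sum(value * weight) / sum(weight)."""
    if len(values) != len(weights):
        raise ValueError("values and weights must have the same length")
    total_weight = sum(weights)
    if total_weight == 0:
        raise ValueError("weights must not sum to zero")
    return sum(v * w for v, w in zip(values, weights)) / total_weight


# Made-up inputs, used only to exercise the functions.
freq = word_frequencies("The cat sat on the mat. The mat was flat.")
avg = weighted_average([80, 90, 70], [5, 3, 2])
print(freq.most_common(2))
print(avg)
```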
Mini Projects:
1. Use historical stock price data to predict future stock prices. Explore time series analysis and
machine learning algorithms.
2. Create a system that detects driver fatigue based on facial expressions or eye movements. Use
computer vision techniques and machine learning.
3. Analyze sentiments in text data.
4. Consider any dataset from an online repository to design and implement a price prediction
problem.
5. Consider any dataset from an online repository to design and implement a problem using Linear
Regression and Logistic Regression.
6. Consider any dataset from an online repository and demonstrate the working of various feature
selection and normalization techniques.
7. Design and implement a weather forecasting system.
8. Customer Segmentation: identify segments of customers to target the potential user base using
clustering (i.e. K-means clustering). Divide customers into groups according to common
characteristics like gender, age, interests and spending habits. Dataset: Mall_Customers dataset
9. Fake News Detection: fake news is sometimes spread over the internet by unauthorised
sources, which causes problems for the people targeted, creates panic, and can even lead to
violence. Dataset: fake-news (Kaggle)
10. Cab Pickups Analysis: analyze the distribution of cab pickups by time and day to find when
pickups happen regularly. Dataset: Uber-Pickups dataset
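For the customer segmentation project, the core of K-means (Lloyd's algorithm) can be sketched in plain Python as follows. This is only an illustration under stated assumptions: the function `k_means`, the choice of initial centroids, and the six (age, spending score) pairs are made up and are not taken from the Mall_Customers dataset. A real project would more likely load the dataset with pandas and use a library implementation.

```python
from math import dist


def k_means(points, centroids, max_iter=100):
    """Run Lloyd's algorithm from the given initial centroids.

    Returns the final centroids and the cluster label of each point.
    """
    centroids = [tuple(c) for c in centroids]
    labels = []
    for _ in range(max_iter):
        # Assignment step: attach each point to its nearest centroid.
        labels = [
            min(range(len(centroids)), key=lambda k: dist(p, centroids[k]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        new_centroids = []
        for k, old in enumerate(centroids):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:
                new_centroids.append(
                    tuple(sum(axis) / len(members) for axis in zip(*members))
                )
            else:
                new_centroids.append(old)  # keep the centroid of an empty cluster
        if new_centroids == centroids:
            break  # converged: no centroid moved
        centroids = new_centroids
    return centroids, labels


# Made-up (age, spending score) pairs, used only to exercise the function.
customers = [(22, 80), (25, 85), (24, 78), (50, 20), (55, 25), (52, 18)]
centroids, labels = k_means(customers, centroids=[customers[0], customers[3]])
print(labels)
print(centroids)
```

Because K-means depends on the starting centroids and on feature scale, it is usually worth normalizing the features first and trying several initializations.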
—----------------------------------------------------------------------------------------------------------------------------
Recommended Books:
1. "The Data Science Handbook" by Field Cady, Publisher: Wiley
2. "Data Science from Scratch" by Joel Grus, Publisher: O'Reilly
3. “An Introduction to Statistical Learning: With Applications in R” by Gareth M. James, Daniela
Witten, Trevor Hastie, Robert Tibshirani, Publisher: Springer
—----------------------------------------------------------------------------------------------------------------------------
Course Outcomes:
After completion of the course, students will be able to
CO1. define the concepts and importance of data science.
CO2. describe and investigate the data.
CO3. implement descriptive and inferential statistics approaches on real world data.
CO4. develop real world solutions using supervised and unsupervised learning methods.
CO5. evaluate the best performing algorithms based on performance metrics.
CO6. examine the stability of machine learning based models.
Project-10 Project-1
0901AI221076
0901AI233D03
0901AI221077
To
0901AI221078
0901AI233D06
0901AI233D01
0901MC221066
0901AI233D02