5.DataScience THEORY LESSON PLAN
5.DataScience THEORY LESSON PLAN
Course Learning
The purpose of learning this course is to:
Rationale (CLR):
CLR-1: Understand the basics of data
Course Outcomes
At the end of this course, learners will be able to:
(CO):
CO-1: Understand the relationship between data
CO-4: Constructs the graphs and plots to represent the data using python packages
CO-5: Apply the principles of the data science techniques to predict and forecast the outcome of real-world problem.
Assignments
Learning
Delivered as
Proposed resources (A)/Tests(T)
S. Unit Topics planned for the Teaching POs & planned(* to
Hour Date / that can
No No period(s) Aids used PSOs be filled after
Period be planned for
the class)
referred the
Topic
UNIT I-- Introduction to Data Science, Numpy and Pandas
Introduction to Data science: Black Board
1 I H-1 Facets of data, Data Science & Power LR1,3 PO5
Process Point Slides
Introduction to Numpy: Black Board
2 1 H-2 Numpy, creating array, & Power LR4,5,8 PO5
attributes Point Slides
Numpy Arrays objects:
Black Board
Creating Arrays, basic
3 I H-3 & Power LR4,5,8 PO5
operations (Array Join, split,
Point Slides
search, sort),
Black Board
4 I H-4 Indexing, Slicing and iterating & Power LR4,5,8 PO5
Point Slides CLA
Assignment
Black Board
Copying arrays, Arrays shape
5 I H-5 & Power LR4,5,8 PO5
manipulation, Identity array
Point Slides
Eye function Pandas:
Black Board
Exploring Data using
6 I H-6 & Power LR4,5,8 PO5
Series,Exploring Data using
Point Slides
DataFrames
Black Board
7 I H-7 Index objects, Re index & Power LR4,5,8 PO5
Point Slides
Drop Entry, Selecting Entries, Black Board
8 I H-8 Data Alignment, Rank and & Power LR4,5,8 PO5
Sort, Summary Statistics Point Slides
Index Hierarchy Data
Black Board
Acquisition: Gather
9 I H-9 & Power LR4,5,8 PO5
information from different
Point Slides
sources
Black Board
Web APIs, Open Data
10 I H-10 & Power LR4,5,8 PO5
Sources, Web Scrapping.
Point Slides
Total periods for Unit 1 = 10 periods
UNIT II - Data Wrangling, Data Cleaning and Preparation
Assignments
Learning
Delivered as
Proposed resources (A)/Tests(T)
S. Unit Topics planned for the Teaching POs & planned(* to
Hour Date / that can
No No period(s) Aids used PSOs be filled after
Period be planned for
the class)
referred the
Topic
Black Board
Data Handling: Problem faced
11 II H11 & Power LR1,2,3,6 PO5
when handling large data
Point Slides
Black Board
General techniques for
12 II H12 & Power LR1,2,3,6 PO5
handling large volume of data
Point Slides
Black Board
General programming tips for
13 II H13 & Power LR1,2,3,6 PO5
dealing large data sets
Point Slides
Black Board CLA
Data Wrangling: Clean,
14 II H14 & Power LR1,2,3,6 Assignment PO5
Transform, Merge
Point Slides
Reshape: Combining and Black Board
15 II H15 Merging Datasets, Merging on & Power LR1,2,3,6 PO5
Index Point Slides
Black Board
Concatenate, Combining with
16 II H16 & Power LR1,2,3,6 PO5
overlap, Reshaping
Point Slides
Pivoting Data Cleaning and Black Board
17 II H17 Preparation: Handling Missing & Power LR1,2,3,6 PO5
Data, Data Transformation Point Slides
Black Board
String Manipulation,
18 II H18 & Power LR1,2,3,6 PO5
summarizing
Point Slides
Black
Binning, classing and Board &
19 II H19 LR1,2,3,6 PO5
Standardization Power Point
Slides
Black Board
20 II H20 Outlier/Noise& Anomalies. & Power LR1,2,3,6 PO5
Point Slides
Total periods for Unit 2 = 10 periods
UNIT III -Visualization
Assignments
Learning
Delivered as
Proposed resources (A)/Tests(T)
S. Unit Topics planned for the Teaching POs & planned(* to
Hour Date / that can
No No period(s) Aids used PSOs be filled after
Period be planned for
the class)
referred the
Topic
Customizing Plots: Black Board
21 III H21 Introduction to Matplotlib, & Power LR5,7 PO5
Plots, making subplots, Point Slides
Controlling axes, Ticks, Labels Black Board
22 III H22 and legends, annotations and & Power LR5,7 PO5
drawing on subplots Point Slides
1. Grus, J. (2019). Data Science from Scratch, 2nd Edition. O'Reilly Media, Inc. 5. Vanderplas, J. T. (2017). Python data science handbook: Essential tools for working with
2. Jiawei Han, Micheline Kamber and Jian Pei (2012), Data Mining Concepts data. O'Reilly Media, Inc.
and Techniques, Third Edition, Elsevier. 6. Jeffrey S. Saltz and Jeffrey M. Stanton (2018), An Introduction to Data Science, Sage
Learning 3. Davy Cielen, Arno D. B. Meysman, and Mohamed Ali (2016), Introducing Data Publication.
Resources Science: Big data, machine learning, and more, using Python tools, Manning 7. Shai Vaingast (2014), “Beginning Python Visualization Crafting Visual Transformation
Publications. Scripts”, Second Edition, Apress.
4. McKinney, W. (2018). Python for data analysis: Data wrangling with pandas, 8. Wes Mc Kinney (2012). “Python for Data Analysis”, O'Reilly Media.
NumPy, and IPython. O'Reilly Media, Inc.
Learning Assessment
Continuous Learning Assessment (CLA)
Formative Life-Long Summative
Bloom’s CLA-1 Average of unit test Learning CLA-2 Final Examination
Level of (50%) (10%) (40% weightage)
Thinking Theory Practice Theory Practice Theory Practice
Level 1 Remember 40% - 20% - 40% -
Level 2 Understand 40% - 20% - 40% -
Level 3 Apply 10% - 20% - 10% -
Level 4 Analyze 10% - 20% - 10% -
Level 5 Evaluate - - 10% - - -
Level 6 Create - - 10% - - -
Total 100 % 100 % 100 %
Course Designers
Experts from Industry Experts from Higher Technical Institutions Internal Experts
Dr. Veeramanickam. M.R.M, Associate Professor 1. Mr. Snehith Allam Raju Senior Manager Advanced Analytics & 1. Dr.V.Kalpana, SRMIST
Chitkara University Institute of Engineering and Technology Architecture Envista Holdings Corporation, Hyderabad.
2. Dr.G.Vadivu, SRMIST