CDAP - Data Science - 05-07-24-2

II Year / III Semester

Columns: Part no. | Topics to be covered (syllabus topics) | Sub topics (what to be taught) | BT level

Unit 1: MODULE I, INTRODUCTION TO DATA SCIENCE
Outcome: CO1 - Describe data science process and analyse the datasets to build the model.
1 | Data Science | Introduction to data science | BTL2
1 | Need for Data Science, Definition, Benefits | Benefits and uses of data science and big data | BTL2
1 | Facets of Data | Structured, Unstructured, Natural Language, Machine Generated data, Graph based and Network data, Audio, image and video, streaming data | BTL2
1 | Data Science Process: An introduction | Overview of data science process | BTL2
2 | Setting research goal | Spend time for understanding goals and context of your research | BTL2
2 | Retrieving data, data cleansing | Start with data stored within the company, Cleansing the data, correct errors as early as possible, combining data from different sources, transforming data | BTL2
2 | Overview of exploratory data analysis, data exploration | Taking deep dive in exploring data analysis | BTL2
2 | Building the model | Model and variable selection, Model execution | BTL2
2 | Presenting and Automation | Presenting findings and building applications on top of them | BTL2

Unit 2: MODULE II, Describing Data And Relationships
Outcome: CO2 - Organize datasets and execute various operations using Python Libraries.
1 | Types of Data and Variables | Three types of data and types of variables | BTL2
1 | Describing Data with Tables | Frequency Distribution for quantitative data, Guidelines | BTL2
1 | Describing Data with Tables | Outliers, Relative Frequency Distributions | BTL2
1 | Describing Data with Tables | Graphs for Quantitative Data, typical shapes | BTL3
1 | Graphs and Averages | A Graph for Qualitative (Nominal) Data, Mode, Median, Mean, Which Average? | BTL3
2 | Graphs and Averages | Averages for Qualitative and Ranked Data | BTL1
2 | Describing Variability | Intuitive Approach | BTL1
2 | Describing Variability | Range, Variance | BTL2
2 | Normal Distributions and Standard scores (z) | The Normal Curve, z Scores, Standard Normal Curve | BTL2

Unit 3: Module III, Data Analysis
Outcome: CO3 - Analyse various datasets using machine learning Algorithms.
1 | Data Relations: Correlation | An Intuitive Approach, Scatterplots, A Correlation Coefficient for Quantitative Data: r | BTL2
1 | Regression | Two Rough Predictions | BTL2
1 | Regression Line | Placement of Line, Predictive Errors, Total Predictive error | BTL2
1 | Least Squares Regression Line | Need a mathematical solution | BTL2
2 | Least Squares Regression Line | Least Squares Regression Equation | BTL2
2 | Standard Error of Estimate | Finding the Standard Error of Estimate | BTL4
2 | Populations and Samples | Real Populations | BTL4
2 | Populations and Samples | Hypothetical populations | BTL3
2 | Populations and Samples | Optimal Sample Size | BTL3

Unit 4: Module IV, Data Munging with Python Libraries
Outcome: CO4 - Articulate different visualization plots for various datasets.
1 | Python NumPy arrays | Basics of NumPy Arrays, Array Indexing: Accessing Single Elements, Array Slicing: Accessing Subarrays, Array Concatenation and Splitting | BTL4
1 | Array computation | The Slowness of Loops, Introducing Ufuncs, Exploring NumPy's Ufuncs | BTL4
1 | Aggregations | Summing the values in Array, Minimum and Maximum | BTL4
1 | Comparisons, Masks, Boolean Logic, indexing | Example: Counting Rainy Days, Comparison Operators as ufuncs, Working with Boolean Arrays, Boolean Arrays as Masks, Combined indexing, Example: Binning Data | BTL4
1 | Introduction to Pandas | Installing and using pandas | BTL4
2 | Data manipulation, Data indexing and data selection | DataFrame object, The Pandas Index object, Data Selection in Series, Data Selection in DataFrame | BTL2
2 | Dealing with missing data | Trade-Offs in missing data conventions, Missing data in pandas, Operating on Null Values | BTL2
2 | Datasets combining, Aggregation | Concat and Append, Concatenation of NumPy Arrays, Simple Concatenation with pd.concat, Planets Data, Simple Aggregation in Pandas, GroupBy: Split, Apply, Combine | BTL4
2 | Pivot tables | Motivating pivot tables, Pivot tables by hand, pivot table syntax | BTL4

Unit 5: Module V, Data Visualization
Outcome: CO5 - Build Machine Learning Models for Real-Time applications.
1 | Importing Matplotlib | General Matplotlib Tips and importing, Setting styles, how to display plots, saving figures to file | BTL4
1 | Plots | Simple line plots, Adjusting the Plot: Line Colors and Styles, Adjusting the Plot: Axes Limits, Labeling plots, Scatter plots, Scatter plots with plt.plot and plt.scatter | BTL4
1 | Error Visualization | Basic Error bars, continuous errors | BTL4
1 | Density and contour plots | Visualizing a three-dimensional function | BTL4
1 | Creating histograms | Two dimensional histograms | BTL4
2 | Legends, Colors, Subplots | Customizing plot legends, choosing elements for legend, legend for size of points, multiple legends, customizing color bars, multiple subplots, plt.axes: Subplots by Hand | BTL4
2 | Text and annotation | Transforms and Text Position, Arrows and Annotation, Example: Effect of Holidays on US births | BTL4
2 | Customization | Plot Customization by Hand | BTL4
2 | Analysis | Outlier. Changing the Defaults: rcParams | BTL4

CGB1204 DATA SCIENCE

Columns: Part | Topics to be covered (syllabus topics) | Sub topics (what to be taught) | BT level

Unit 1: MODULE I, INTRODUCTION TO DATA SCIENCE
Outcome: CO1 - Describe Data science process and analyse the datasets to build the model.
1 | Data Science | Introduction to Data science | BTL2
1 | Need for Data Science, Definition, Benefits | Benefits and uses of datascience and big data | BTL2
1 | Facets of Data | Structured, Unstructured, Natural Language, | BTL3
2 | Data Science Process-An introduction | Overview of datascience process | BTL2
2 | Setting research goal, | Spend time for understanding goals and | BTL2
2 | retrieving data, data cleansing | Start with data stored within the company, | BTL4
2 | Overview of exploratory data Analysis, data | Taking deep dive in exploring data analysis | BTL3
2 | Building the model | Model and variable selection, Model execution, | BTL3
2 | Presenting and Automation | presenting findings and building applications | BTL3

Unit 2: MODULE II, Describing Data And Relationships
Outcome: CO2 - Organize datasets and execute various operations using Python Libraries
1 | Types of Data and Variables | Three types of data and types of variables | BTL2
1 | Describing Data with Tables, Graphs and | Frequency Distribution for quantitative data, | BTL3
1 | Describing Variability | Intuitive Approach, Range, Variance | BTL3
2 | Normal Distributions and Standard scores(z) | The Normal Curve, Z Scores, Standard | BTL3

Unit 3: Module III, Data Analysis
Outcome: CO3 - Analyse various datasets using machine learning Algorithms.
1 | Data Relations: Correlation | An Intuitive Approach, Scatterplots, A | BTL4
1 | Regression | Two Rough Predictions | BTL4
1 | Regression Line | Placement of Line, Predictive Errors, Total | BTL4
1 | Least Square Regression Line | Need a mathematical solution, Least Square | BTL4
2 | Standard Error of Estimate | Finding the standard Error of Estimate | BTL4
2 | Populations and Samples | Real Populations, Hypothetical populations, | BTL3

Unit 4: Module IV, Data Munging with Python Libraries
Outcome: CO4 - Articulate different visualization plots for various datasets.
1 | Python NumPy arrays | Basics of Numpy Arrays, Array | BTL2
1 | Array computation, | The Slowness of Loops, Introducing Ufuncs, | BTL2
1 | Aggregations | Summing the values in Array, Minimum and | BTL3
1 | Comparisons, Masks, Boolean Logic | Example: Counting Rainy Days, Comparison | BTL3
1 | Indexing- | Combined indexing, Example Binning Data | BTL4
2 | Introduction to Pandas | Installing and using pandas | BTL4
2 | Data manipulation, Data indexing and data | Dataframe object, The Pandas Index object, Data | BTL4
2 | Dealing with missing data | Trade-Offs in missing data conventions, | BTL4
2 | Datasets combining | Concat and Append, Concatenation of Numpy | BTL3
2 | aggregation | Planets Data, Simple Aggregation in Pandas, | BTL4
2 | Pivot tables | Motivating pivot tables, Pivot tables by hand, | BTL4

Unit 5: Module V, Data Visualization
Outcome: CO5 - Build Machine Learning Models for Real-Time applications.
1 | Importing Matplotlib | General Matplotlib Tips and importing, Setting styles, how to display plots, saving figures to file | BTL4
1 | Plots | Simple line plots, Adjusting the Plot: Line Colors | BTL3
1 | Error Visualization | Basic Error bars, continuous errors | BTL4
1 | Density and contour plots | Visualizing a three-dimensional function | BTL4
2 | Creating histograms | Two dimensional histograms | BTL3
2 | Legends, Colors, Subplots | Customizing plot legends, choosing elements | BTL3
2 | Text and annotation | Transforms and Text Position, Arrows and | BTL3
2 | Customization | Plot Customization by Hand | BTL4
2 | Analysis | Outlier. Changing the Defaults: rcparams | BTL4

Total BTL: 39
Red: 4 (10.3%)
Green: 8 (20.5%)
Blue: 8 (20.5%)
Block: 19 (48.7%)
Total: 100.0%
M.Kumarasamy College of Engineering, Karur
ACADEMIC YEAR 2024-2025 (ODD SEMESTER)
CDAP : CGB1204 – Data Science
II Year / III Semester

Columns: Content type | Part no. | Topics to be covered (syllabus topics) | Sub topics (what to be taught) | BT level
Planned date / Delivered date (III A Section and B Section): not filled in.

Unit 1: MODULE I, INTRODUCTION TO DATA SCIENCE
Outcome: CO1 - Describe data science process and analyse the datasets to build the model.
THEORY | 1 | Data Science | Introduction to data science | BTL2
THEORY | 1 | Need for Data Science, Definition, Benefits | Benefits and uses of data science and big data | BTL2
THEORY | 1 | Facets of Data | Structured, Unstructured, Natural Language, Machine Generated data, Graph based and Network data, Audio, image and video, streaming data | BTL2
THEORY | 1 | Data Science Process: An introduction | Overview of data science process | BTL2
THEORY | 2 | Setting research goal | Spend time for understanding goals and context of your research | BTL2
THEORY | 2 | Retrieving data, data cleansing | Start with data stored within the company, Cleansing the data, correct errors as early as possible, combining data from different sources, transforming data | BTL2
THEORY | 2 | Overview of exploratory data analysis, data exploration | Taking deep dive in exploring data analysis | BTL2
THEORY | 2 | Building the model | Model and variable selection, Model execution | BTL2
THEORY | 2 | Presenting and Automation | Presenting findings and building applications on top of them | BTL2

Unit 2: MODULE II, Describing Data And Relationships
Outcome: CO2 - Organize datasets and execute various operations using Python Libraries.
THEORY | 1 | Types of Data and Variables | Three types of data and types of variables | BTL2
THEORY | 1 | Describing Data with Tables | Frequency Distribution for quantitative data, Guidelines | BTL2
THEORY | 1 | Describing Data with Tables | Outliers, Relative Frequency Distributions | BTL2
THEORY / SIMPLE PROBLEMS / USE CASES | 1 | Describing Data with Tables | Graphs for Quantitative Data, typical shapes | BTL3
THEORY / SIMPLE PROBLEMS / USE CASES | 1 | Graphs and Averages | A Graph for Qualitative (Nominal) Data, Mode, Median, Mean, Which Average? | BTL3
THEORY | 2 | Graphs and Averages | Averages for Qualitative and Ranked Data | BTL1
THEORY | 2 | Describing Variability | Intuitive Approach | BTL1
THEORY | 2 | Describing Variability | Range, Variance | BTL2
THEORY | 2 | Normal Distributions and Standard scores (z) | The Normal Curve, z Scores, Standard Normal Curve | BTL2

Unit 3: Module III, Data Analysis
Outcome: CO3 - Analyse various datasets using machine learning Algorithms.
THEORY | 1 | Data Relations: Correlation | An Intuitive Approach, Scatterplots, A Correlation Coefficient for Quantitative Data: r | BTL2
THEORY | 1 | Regression | Two Rough Predictions | BTL2
THEORY | 1 | Regression Line | Placement of Line, Predictive Errors, Total Predictive error | BTL2
THEORY | 1 | Least Squares Regression Line | Need a mathematical solution | BTL2
THEORY | 2 | Least Squares Regression Line | Least Squares Regression Equation | BTL2
COMPLEX PROBLEMS / USE CASES / THEORY | 2 | Standard Error of Estimate | Finding the Standard Error of Estimate | BTL4
COMPLEX PROBLEMS / USE CASES / THEORY | 2 | Populations and Samples | Real Populations | BTL4
THEORY / SIMPLE PROBLEMS / USE CASES | 2 | Populations and Samples | Hypothetical populations | BTL3
THEORY / SIMPLE PROBLEMS / USE CASES | 2 | Populations and Samples | Optimal Sample Size | BTL3

Unit 4: Module IV, Data Munging with Python Libraries
Outcome: CO4 - Articulate different visualization plots for various datasets.
COMPLEX PROBLEMS / USE CASES / THEORY | 1 | Python NumPy arrays | Basics of NumPy Arrays, Array Indexing: Accessing Single Elements, Array Slicing: Accessing Subarrays, Array Concatenation and Splitting | BTL4
COMPLEX PROBLEMS / USE CASES / THEORY | 1 | Array computation | The Slowness of Loops, Introducing Ufuncs, Exploring NumPy's Ufuncs | BTL4
COMPLEX PROBLEMS / USE CASES / THEORY | 1 | Aggregations | Summing the values in Array, Minimum and Maximum | BTL4
COMPLEX PROBLEMS / USE CASES / THEORY | 1 | Comparisons, Masks, Boolean Logic, indexing | Example: Counting Rainy Days, Comparison Operators as ufuncs, Working with Boolean Arrays, Boolean Arrays as Masks, Combined indexing, Example: Binning Data | BTL4
COMPLEX PROBLEMS / USE CASES / THEORY | 1 | Introduction to Pandas | Installing and using pandas | BTL4
THEORY | 2 | Data manipulation, Data indexing and data selection | DataFrame object, The Pandas Index object, Data Selection in Series, Data Selection in DataFrame | BTL2
THEORY | 2 | Dealing with missing data | Trade-Offs in missing data conventions, Missing data in pandas, Operating on Null Values | BTL2
COMPLEX PROBLEMS / USE CASES / THEORY | 2 | Datasets combining, Aggregation | Concat and Append, Concatenation of NumPy Arrays, Simple Concatenation with pd.concat, Planets Data, Simple Aggregation in Pandas, GroupBy: Split, Apply, Combine | BTL4
COMPLEX PROBLEMS / USE CASES / THEORY | 2 | Pivot tables | Motivating pivot tables, Pivot tables by hand, pivot table syntax | BTL4

Unit 5: Module V, Data Visualization
Outcome: CO5 - Build Machine Learning Models for Real-Time applications.
COMPLEX PROBLEMS / USE CASES / THEORY | 1 | Importing Matplotlib | General Matplotlib Tips and importing, Setting styles, how to display plots, saving figures to file | BTL4
COMPLEX PROBLEMS / USE CASES / THEORY | 1 | Plots | Simple line plots, Adjusting the Plot: Line Colors and Styles, Adjusting the Plot: Axes Limits, Labeling plots, Scatter plots, Scatter plots with plt.plot and plt.scatter | BTL4
COMPLEX PROBLEMS / USE CASES / THEORY | 1 | Error Visualization | Basic Error bars, continuous errors | BTL4
COMPLEX PROBLEMS / USE CASES / THEORY | 1 | Density and contour plots | Visualizing a three-dimensional function | BTL4
COMPLEX PROBLEMS / USE CASES / THEORY | 1 | Creating histograms | Two dimensional histograms | BTL4
COMPLEX PROBLEMS / USE CASES / THEORY | 2 | Legends, Colors, Subplots | Customizing plot legends, choosing elements for legend, legend for size of points, multiple legends, customizing color bars, multiple subplots, plt.axes: Subplots by Hand | BTL4
COMPLEX PROBLEMS / USE CASES / THEORY | 2 | Text and annotation | Transforms and Text Position, Arrows and Annotation, Example: Effect of Holidays on US births | BTL4
COMPLEX PROBLEMS / USE CASES / THEORY | 2 | Customization | Plot Customization by Hand | BTL4
COMPLEX PROBLEMS / USE CASES / THEORY | 2 | Analysis | Outlier. Changing the Defaults: rcParams | BTL4

Course Coordinator DCC Head HOD / IT
