0% found this document useful (0 votes)
76 views4 pages

Fundementalsof Data Science

This document outlines a course on fundamentals of data science. The course aims to provide students with an overview of data science concepts and technologies. It covers topics like data importing, preprocessing, analysis, model development and evaluation using Python. The course is divided into 5 units that teach these concepts through practical examples and a hands-on project. Upon completing the course, students will understand core data science concepts and be able to apply machine learning and data analysis techniques.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
76 views4 pages

Fundementalsof Data Science

This document outlines a course on fundamentals of data science. The course aims to provide students with an overview of data science concepts and technologies. It covers topics like data importing, preprocessing, analysis, model development and evaluation using Python. The course is divided into 5 units that teach these concepts through practical examples and a hands-on project. Upon completing the course, students will understand core data science concepts and be able to apply machine learning and data analysis techniques.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 4

Department of Computer Science and Engineering, GITAM Deemed to be University

CoureCode Fundamentals of Data Science L T P S J C


SDG No. 2 0 0 0 0 2

Course Description:

Pre-requisites : None
Alternate Exposure : None
This course is designed as a window to the field of Data Science. Students will get a bird’s
eye view of the technology and process involved in using Data Science for meaningful
inferences. They can also gain enough knowledge to understand the machine learning
process for data science and complete a hands-on project using standard data sets.

Course Objectives:

● Provide a basic foundation for data science and application areas related to it.

● Understand the underlying core concepts and emerging technologies in data science.

● Learn the process of working with data on a large scale.

● Explore the concepts of data preprocessing, model development and evaluation and
tuning.

UNIT 1 Introduction 5 hours

What is Data Science?, Fundamentals of Data Science, The many paths to Data Science,
Data Science Topics and Algorithms, Cloud for Data Science; Foundations of Big Data,
What is Hadoop?, How Big Data is driving Digital Transformation, Data Science Skills and
Big Data, Neural Networks and Deep Learning, Applications of Machine Learning; How
should Companies Get Started in Data Science?, Applications of Data Science, How can
someone become a Data Scientist?, Recruiting for Data Science.

Learning Outcomes:

After completion of this unit, the student will be able to:

• learn the definition and importance of Data Science (L2).

• describe how Big Data affects Digital Transformation (L2).

• Identify areas where Machine Learning can be applied (L3).

B Tech. Computer Science and Engineering w.e.f. 2021-22 admitted batch


Department of Computer Science and Engineering, GITAM Deemed to be University

• Identify areas where Data Science can be applied (L3).

UNIT 2 Importing Datasets and Data Wrangling with Python 6 hours

Understanding Data, Python Packages for Data Science, Importing and Exporting Data in
Python, Analyzing Data with Python, Accessing Databases with Python.

Learning Outcomes:

After completion of this unit, the student will be able to:

• Identify and import Python Packages for Data Science (L2).

•Import and Export Data using Python (L2).

• Analyze data using Python (L4).

• Access data from databases using Python (L3).

UNIT 3 Data Preprocessing and Analysis with Python 6 hours

Python for - Pre-processing Data, Dealing with Missing Values, Data Formatting, Data
Normalization, Binning, Turning categorical variables into quantitative variables;
Exploratory Data Analysis, Descriptive Statistics, Groupby in Python, Correlation,
Correlation - Statistics, Association between two categorical variables: Chi-Square.

Learning Outcomes:

After completion of this unit, the student will be able to:

• Pre-process data using Python (L3).

• Perform exploratory data analysis (L3).

• Define correlation and find correlation using Python (L2)

• Find Association using Python (L2).

UNIT 4 Model Development and Evaluation 6 hours

Model Development, Linear Regression and Multiple Linear Regression, Model Evaluation

B Tech. Computer Science and Engineering w.e.f. 2021-22 admitted batch


Department of Computer Science and Engineering, GITAM Deemed to be University

using Visualization, Polynomial Regression and Pipelines, Measures for In-Sample


Evaluation, Prediction and Decision Making

Learning Outcomes:

After completion of this unit, the student will be able to:

• define linear and multiple regression (L2).

• evaluate models using visualization (L3).

• apply regression for in-sample evaluation (L4).

• apply regression fro prediction and decision making (L4)

UNIT 5 Model Refinement and Tuning 5 hours

Model Evaluation and Refinement, Overfitting, Underfitting and Model Selection, Ridge
Regression Introduction, Grid Search

Learning Outcomes:

After completion of this unit, the student will be able to:

• refine models (L3).

• check the model design for overfitting and underfitting (L4).

• perform ridge regression for model tuning (L4).

• use grid search for model tuning (L4)

Textbooks:

1. Introducing Data Science, Davy Cielen, Arno D. B. Meysman and Mohamed Ali,
Manning Publications,2016.
2. Think Like a Data Scientist, Brian Godsey, Manning Publications, 2017.

References:

B Tech. Computer Science and Engineering w.e.f. 2021-22 admitted batch


Department of Computer Science and Engineering, GITAM Deemed to be University

1. https://www.coursera.org/learn/what-is-datascience#about
2. https://www.coursera.org/learn/data-analysis-with-python/home/info
3. Data Science from Scratch: First Principles with Python, Joel Grus, O’Reilly, 1st
edition, 2015.
4. Doing Data Science, Straight Talk from the Frontline, Cathy O'Neil, Rachel Schutt,
O’ Reilly, 1st edition, 2013

Course Outcomes:
Upon successful completion of the course, students will be able to:
● Understand the fundamental concepts of data science.
● Evaluate the data analysis techniques for applications handling large data
● Experiment with the data science process.
● Apply the concept of machine learning in the data science process.
● Visualize and present model inference using various tools

APPROVED IN:

BOS :<< date >> ACADEMIC COUNCIL: <<date>>

SDG No. & Statement:

SDG Justification:

B Tech. Computer Science and Engineering w.e.f. 2021-22 admitted batch

You might also like