0% found this document useful (0 votes)
96 views5 pages

Course Outline DPA

This document provides a course outline for a 3-credit course on data preprocessing and visualization. The course aims to help students utilize data visualization tools to uncover insights, prepare data for analytics tools and techniques, and solve analytics problems using Python programming. Key topics covered include data cleaning, feature selection, normalization, data imports in Python, data manipulation, basic data visualizations, and a case study on retail data preprocessing. Assessments include group assignments, quizzes, a midterm, and final exam. Required resources include textbooks on Python data analysis and machine learning cookbooks.

Uploaded by

Karan
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
96 views5 pages

Course Outline DPA

This document provides a course outline for a 3-credit course on data preprocessing and visualization. The course aims to help students utilize data visualization tools to uncover insights, prepare data for analytics tools and techniques, and solve analytics problems using Python programming. Key topics covered include data cleaning, feature selection, normalization, data imports in Python, data manipulation, basic data visualizations, and a case study on retail data preprocessing. Assessments include group assignments, quizzes, a midterm, and final exam. Required resources include textbooks on Python data analysis and machine learning cookbooks.

Uploaded by

Karan
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 5

Course Outline: Data Preprocessing and Visualization

Course code: IM510


Credit:3 , Core Course
Area: Information Management
Program: PGDM-B&FS
Term III Academic Year 2020-21
_______________________________________________________________________
Instructor(s) Name Purva Grover Click or tap here to
enter text.
Room no. 408 Click or tap here to
enter text.
Email [email protected] Click or tap here to
enter text.
Phone (Extn no.) Click or tap here to enter Click or tap here to
text. enter text.
Meeting Hours By Appointment Click or tap here to
enter text.
______________________________________________________________________
1. COURSE DESCRIPTION

Data pre-processing is an important step in analytics. Data gathered from various sources in an organization
are often loosely controlled, resulting in out-of-range values, impossible data combinations, missing values,
etc. There may be data which may be irrelevant and redundant or noisy and unreliable. Analyzing such data
can produce misleading results. Thus, the representation and quality of data is first and foremost before
applying any machine learning technique.

Data pre-processing includes cleaning, instance selection, normalization, transformation, feature extraction
and selection, etc. The product of data pre-processing is the final training set. Data pre-processing may
affect the way in which outcomes of the final data processing can be interpreted.

There are different methods used to identify trends and patterns in the processed data. One of the most
valuable tools is data visualization. Regardless of industry or size, data visualization is emerging as an
important concept in all types of businesses to help make sense of their data. Thus, data visualization is
seen as an important skill for all managers.

COs >>
COs and POs >>
2. COURSE OUTCOMES (COs) COs, POs and CECs >>

After going through the course, the students would be able to:

Course Outcome 1 (CO1): utilize data visualization tools to uncover insights.

Course Outcome 2 (CO2): prepare data for applying Analytics tools and techniques.
Course Outcome 3 (CO3): solve analytics problems using Python Programming Language

COs >>
COs and POs >>
COs, POs and CECs >>

3. MAPPING OF THE PROGRAM OUTCOMES (POs) AND COURSE OUTCOME (COs)

Program Outcomes (POs)


CO1 CO2 CO3
PO1: Student should be able to write well organized and grammatically correct
business reports and letters.

PO2: Student should be able to make effective oral presentations.

PO3: Student should be able to demonstrate critical thinking skills by 3 3


understanding the issues with a strong focus on banking and financial services,
evaluating alternatives on the basis of multiple perspectives and presenting a
solution including conclusions and implications

PO4: Student should be to demonstrate problem solving skills by understanding 3


and defining the problem related particularly to banking and financial services,
analyzing it and solving it by applying appropriate theories, tools and
techniques.

PO5: Student should be able to illustrate the role of responsible leadership in


management.

PO6: Student should be able to identify social concerns and ethical issues in
management.

PO7: Students should be able to identify challenges faced by banks and


financial institutions in the global business environment.

PO8: Student should be able to take decisions in the global business


environment with special focus on banking and financial institutions.

4. PEDAGOGY

The course will be taught with a blend of presentations, interactive lectures and discussions. It
will be supplemented by assignments, and practical exercises.
Software: Python, Tableau
5. COURSE EVALUATION COMPONENTS (CECs)
Group Assignment/ Presentation/ Exercise : 35 % (17.5%- Tableau + 17.5%- Python)
Quiz : 10% (5%- Tableau + 5%- Python)
Mid Term : 25 %
End Term exam : 30 %

COs >>
6. MAPPING BETWEEN COs, POs and CECs COs and POs >>
COs, POs and CECs >>

Course Evaluation Components (CECs)


Group Quiz Mid Term End Term exam
COs POs Assignment/
Presentation/
Exercise
CO1 PO3 
CO2 PO3  
CO3 PO4 

7. RESOURCES
7.1 Textbook

Pandas for Everyone: Python Data Analysis by Daniel Y. Chen (Author), Pearson Education;

7.2 Reference Books

Data Analysis and Visualization Using Python by Dr. Ossama Embarak (Apress)
Machine Learning with Python Cookbook by Chris Albon (Published by O’Reilly)
Communicating Data with Tableau by Ben Jones (Published by O’Reilly)

7.3 Online References (if any)

Anaconda (Data Science Platform). URL - https://www.anaconda.com/


Python 3.7 documentation (https://docs.python.org/3.7/)
The Python Standard Library (https://docs.python.org/3.6/py-modindex.html)
https://www.codecademy.com/learn/learn-python-3(Learning Python
https://www.w3schools.com/python/default.asp(Learning Python 3)
Tableau. https://www.tableau.com/

8. SESSION PLAN

Sessio Topics Readings/Cases


n No.
1-2 Introduction • Intro to Python
to Python • Downloading and Installing
(Appendixes • IDLE User Interface, Running Program
: Installation • Reading Code, Printing Comments
& Command • User Input, Logic in Python
Line) • Using Interactive Help
• Variables and Naming, Numbers and functions, Strings and Text,
Date-Time

Additional Readings: Davenport, T. H., & Patil, D. J. (2012). Data


scientist: The Sexiest Job of the 21st Century. Harvard Business Review,
90(5), 70-76.
3-5 Data Types, • Conditional Statements & Loops
Data Import • Iterative Data Types
in Python • Introduction to Functions
(Text Book: • Data type conversion
Chapter 1,2 • Sorting data
& 4) • Merging, Subsetting datasets
• Importing data from various formats:
• CSV, Delimited text files, Excel etc.
6-8 Data Pre- • Need for Pre-processing
processing & • Importing Libraries
Data • Handling Missing Data
Manipulatio • Handling Categorical Data
n • Splitting the dataset into training and testing datasets
(Text Book: • Standardizing Data
Chapter 5, 6, • Feature Scaling
7 & 10)
9 Data • Basic Plots & Charts like Scatter Plot, Line Chart, Histogram, etc.
Visualization
in Python
(Text Book:
Chapter 3)
10 Data Pre- • Dataset used: Retail Dataset
processing • Group Exercise
Use Case
11 Story Telling • What is visualization
Using Data • What is story telling
• Understanding the context
• Choosing an appropriate visual

Additional Readings: Berinato, S. (2016). Visualizations that really


work. Harvard Business Review, 94(6).
Additional Readings: Duarte, N. (2014). The quick and dirty on data
visualization. Harvard Business Review. Product #: H00RKA-PDF-ENG
12 Connecting • Navigating the Tableau User Interface
your Data & • Understanding data sources
Building • Sourcing and Connecting to Data
your first • Joining Data
Visualization • Understanding data types
using
Tableau
13-14 Concepts of • Use of Filters and Formatting
Business • Understanding types of visualization (Piecharts, Barcharts, Treemaps,
Intelligence Scatterplots, Geo-maps, Storylines)
& Data • Understanding Marks
Visualization • Understanding Calculations
using • Understanding KPIs
Tableau
HBR Case Study –Data Analytics at DBS Group Audit: The Future of
Auditing is Auditing the Future. Product #: SMU265-PDF-ENG. URL:
https://hbsp.harvard.edu/product/SMU265-PDF-
ENG?Ntt=Data+Analytics+at+DBS+Group&itemFindingMethod=Searc
h
15 Building • Understanding Maps
dashboards • Understanding Page Shelf
using • Understanding Stories
Tableau
16 Publishing • Publishing your Visualization using Tableau
your • Understanding Tableau Public
Visualization • Avoiding Common Pitfalls (Tips, Tricks and Timesavers)
17-18- Framing Relating industry cases/ problems about how people are deploying and
19 Business using Tableau to turn data into understandable information- from
Problems & identification of business problem to building KPIs and dashboards from
Use Cases scratch:
for Visual • Session 17: Sales & Marketing Data – i) Retail Store Dataset and/or
Analytics ii) Video Games Sale Dataset
• Session 18: Visual Analytics for Financial Data- i) Economic
Indicators for Indian Economy and/or ii) Foreign Exchange Rate
Dataset
• Session 19: Handling Unstructured Data- i) Analyzing Tweets (or data
from any other social source)

HBR Case Study – Private Banking Advisers at BCB Edmonton (A), (B)
& (C), Product #: UV7686-PDF-ENG, UV7690-PDF-ENG, UV7692-
PDF-ENG
20 Group • Overview of concepts
Presentation • Group Assignment

9. ACADEMIC INTEGRITY & CLASS RULES

a) Plagiarism is the use of or presentation of ideas, works that are not one’s own and which are not
common knowledge, without granting credit to the originator. Plagiarism is unacceptable in IMI
and will invite penalty. Type and extent of penalty will be at the discretion of the concerned faculty.

b) Cheating means using written, verbal or electronic sources of aid during an examination/ quiz/
assignment or providing such assistance to other students (except in cases where it is expressly
permitted by the faculty). It also includes providing false data or references/list of sources which
either do not exist or have not been used, having another individual write your paper or assignment
or purchasing a paper for one’s own submission. Cheating is strictly prohibited at IMI and will
invite penalty as per policies of the Institute.

You might also like