Course Outline DPA
Course Outline DPA
Data pre-processing is an important step in analytics. Data gathered from various sources in an organization
are often loosely controlled, resulting in out-of-range values, impossible data combinations, missing values,
etc. There may be data which may be irrelevant and redundant or noisy and unreliable. Analyzing such data
can produce misleading results. Thus, the representation and quality of data is first and foremost before
applying any machine learning technique.
Data pre-processing includes cleaning, instance selection, normalization, transformation, feature extraction
and selection, etc. The product of data pre-processing is the final training set. Data pre-processing may
affect the way in which outcomes of the final data processing can be interpreted.
There are different methods used to identify trends and patterns in the processed data. One of the most
valuable tools is data visualization. Regardless of industry or size, data visualization is emerging as an
important concept in all types of businesses to help make sense of their data. Thus, data visualization is
seen as an important skill for all managers.
COs >>
COs and POs >>
2. COURSE OUTCOMES (COs) COs, POs and CECs >>
After going through the course, the students would be able to:
Course Outcome 2 (CO2): prepare data for applying Analytics tools and techniques.
Course Outcome 3 (CO3): solve analytics problems using Python Programming Language
COs >>
COs and POs >>
COs, POs and CECs >>
PO6: Student should be able to identify social concerns and ethical issues in
management.
4. PEDAGOGY
The course will be taught with a blend of presentations, interactive lectures and discussions. It
will be supplemented by assignments, and practical exercises.
Software: Python, Tableau
5. COURSE EVALUATION COMPONENTS (CECs)
Group Assignment/ Presentation/ Exercise : 35 % (17.5%- Tableau + 17.5%- Python)
Quiz : 10% (5%- Tableau + 5%- Python)
Mid Term : 25 %
End Term exam : 30 %
COs >>
6. MAPPING BETWEEN COs, POs and CECs COs and POs >>
COs, POs and CECs >>
7. RESOURCES
7.1 Textbook
Pandas for Everyone: Python Data Analysis by Daniel Y. Chen (Author), Pearson Education;
Data Analysis and Visualization Using Python by Dr. Ossama Embarak (Apress)
Machine Learning with Python Cookbook by Chris Albon (Published by O’Reilly)
Communicating Data with Tableau by Ben Jones (Published by O’Reilly)
8. SESSION PLAN
HBR Case Study – Private Banking Advisers at BCB Edmonton (A), (B)
& (C), Product #: UV7686-PDF-ENG, UV7690-PDF-ENG, UV7692-
PDF-ENG
20 Group • Overview of concepts
Presentation • Group Assignment
a) Plagiarism is the use of or presentation of ideas, works that are not one’s own and which are not
common knowledge, without granting credit to the originator. Plagiarism is unacceptable in IMI
and will invite penalty. Type and extent of penalty will be at the discretion of the concerned faculty.
b) Cheating means using written, verbal or electronic sources of aid during an examination/ quiz/
assignment or providing such assistance to other students (except in cases where it is expressly
permitted by the faculty). It also includes providing false data or references/list of sources which
either do not exist or have not been used, having another individual write your paper or assignment
or purchasing a paper for one’s own submission. Cheating is strictly prohibited at IMI and will
invite penalty as per policies of the Institute.