
Data Science

Data science involves extracting meaning from data to tell easily understood stories. General tasks for data scientists include gaining domain understanding, defining the problem, preprocessing data, visualizing data, identifying appropriate modeling techniques, analyzing results, and communicating outputs. Data science is used in many fields, such as pharmaceuticals, healthcare, banking, travel, and government, for applications such as fraud detection, forecasting, recommendation systems, and policy planning. Building a career in data science requires skills in statistics, machine learning, and communication, along with an understanding of customer domains and the ability to ask the right questions of data.

Introduction to Data Science
What is it?

• “Goal is in extracting meaning from data and creating data products and seeks to use all available and relevant data to effectively tell a story that can be easily understood by non-practitioners.”
  - From Wikipedia (no longer in the latest version of the page)
What is it? (Contd.)
General Tasks of a Data Scientist
• Get a little domain understanding
• Define the problem statement well
• Pre-process data to fix data issues like duplicates, missing values, etc.
• Visualize data to the extent possible for better understanding and to see basic patterns
• Identify what kind of problem it is (Prediction/Forecasting, Classification, Optimization and/or Managing Big Data)
• Identify appropriate modeling techniques and build models
• Analyze results and iterate, as needed; DO NOT trust software outputs blindly
• Remember: Garbage In, Garbage Out
• Visualize outputs and communicate
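
The pre-processing task listed above (fixing duplicates and missing values) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and does not come from the slides: the `preprocess` helper, the `customer` and `spend` field names, the sample records, and the choice to fill gaps with the mean. Other fill strategies may suit a given problem better.

```python
from statistics import mean


def preprocess(rows, numeric_field):
    """Drop exact duplicate rows, then fill missing numeric values with the mean.

    Hypothetical helper for illustration only.
    """
    seen = set()
    unique_rows = []
    for row in rows:
        # A row is a duplicate if every field matches a row seen earlier.
        key = tuple(sorted(row.items()))
        if key not in seen:
            seen.add(key)
            unique_rows.append(dict(row))

    # Mean of the values that are present (None marks a missing value).
    known = [r[numeric_field] for r in unique_rows if r[numeric_field] is not None]
    fill = mean(known) if known else None

    for r in unique_rows:
        if r[numeric_field] is None:
            r[numeric_field] = fill
    return unique_rows


# Made-up sample data.
records = [
    {"customer": "A", "spend": 120.0},
    {"customer": "A", "spend": 120.0},  # exact duplicate
    {"customer": "B", "spend": None},   # missing value
    {"customer": "C", "spend": 80.0},
]
cleaned = preprocess(records, "spend")
```

With this sample, the duplicate row for customer A is dropped and the missing spend for customer B is filled with the mean of the remaining known values. This ties back to "Garbage In, Garbage Out": whatever fill rule is chosen shapes every model built afterwards, so it should be a deliberate decision.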
General Tasks (Contd.)
Classification: Supervised and Unsupervised
Prediction/Forecasting is finding the line/plane closest to all points
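
One common way to make "closest to all points" precise is ordinary least squares, which picks the line that minimizes the sum of squared vertical distances to the points. The sketch below assumes that interpretation and a single input variable; the `fit_line` helper and the monthly sales figures are made up for illustration.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the ordinary least-squares line through the points."""
    n = len(xs)
    if n < 2 or n != len(ys):
        raise ValueError("need at least two points and equal-length inputs")

    x_bar = sum(xs) / n
    y_bar = sum(ys) / n

    # Spread of x around its mean, and co-variation of x with y.
    sxx = sum((x - x_bar) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all x values are identical; the slope is undefined")
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))

    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return slope, intercept


# Made-up data: sales for months 1 to 5.
months = [1, 2, 3, 4, 5]
sales = [12.0, 14.0, 16.0, 18.0, 20.0]

slope, intercept = fit_line(months, sales)
forecast_month_6 = slope * 6 + intercept  # extend the fitted line one step ahead
```

Forecasting then amounts to evaluating the fitted line at a future input, as in the last line. With several input variables the same idea fits a plane rather than a line.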
Forecasting
Optimization
Analysis
Churn can be heartbreaking
Retail Analytics
• How your shopping habits reveal even the most powerful and private information
Recommendation Engine
Text Mining
• Natural Language Processing
• Sentiment Analysis
• Information Retrieval Systems
Other Important Applications
• Pharmaceutical - Fraud detection in clinical trials; Drug development process
• Healthcare - Non-compliance in taking prescription drugs
• Banking and Insurance - Fraud detection; Credit scoring; Cross-selling and upselling products; Detecting money laundering; Forecasting stock prices
• Travel and Hospitality - Improve customer experience
• Politics - Predict winners; Identify fence-sitters
Other Important Applications (Contd.)
• Retail and Telecommunications - Customer retention; Enhancing supply chain efficiency; Improving customer service quality; Planning store locations; Cross-selling and upselling; Recommendation systems; Sales forecasting
• Government - Policy planning; Effective use of resources; Security against terrorist attacks; Effective policing by understanding crime patterns; Weather predictions; Calamity predictions
How to build a career in Data Science
• Statistics
• Machine Learning
• Communication
• Understanding the customer domain - Be inquisitive
• Asking the right questions of the data
Finally
• Data is everywhere; you can’t escape it.
• You can make better decisions using Data Science / Big Data Analytics
Thank you
