SlideShare a Scribd company logo
2
Most read
4
Most read
10
Most read
Hello…
Welcome
To the Talk on
Data Science Applications and Use cases
Agenda…
• What is Data Science?
• Big Data Challenges
• Data Science vs Software Engineering
• Data Science Applications & Use cases
• Conclusion
What is Data Science?
Data Science is the science which uses computer science, statistics and machine
learning, visualization and human-computer interactions to collect, clean, integrate,
analyze, visualize, interact with data to create data products.
“Using data to make better decisions, optimize processes and improve products
and services.”
“What distinguishes data science itself from the tools and techniques
is the central goal of deploying effective decision-making models to a
production environment. “
– John Mount & Nina Zumel, Practical Data Science with R
Big Data Challenges
• Dealing with Data Growth
• Generating insights in a timely manner
• Integrating disparate data sources
• Validating Data
• Securing Bigdata
• Organizational resistance
Data science applications and usecases
‘Data science’ is “Data-Driven Decision” making, to help the business to
make good choices, whereas software engineering is the methodology
for software product development without any confusions about the
requirements.
Data Science vs Software Engineering
Data Science Competence Groups - Research
Data Science Competence includes 5
areas/groups
• Data Analytics
• Data Science Engineering
• Domain Expertise
• Data Management
• Scientific Methods (or Business Process
Management)
Scientific Methods
• Design Experiment
• Collect Data
• Analyse Data
• Identify Patterns
• Hypothesise Explanation
• Test Hypothesis
Business Operations
• Operations Strategy
• Plan
• Design & Deploy
• Monitor & Control
• Improve & Re-design
Data Science Competence includes 5
areas/groups
• Data Analytics
• Data Science Engineering
• Domain Expertise
• Data Management
• Scientific Methods (or Business Process
Management)
Scientific Methods
• Design Experiment
• Collect Data
• Analyse Data
• Identify Patterns
• Hypothesise Explanation
• Test Hypothesis
Business Process
Operations/Stages
• Design
• Model/Plan
• Deploy & Execute
• Monitor & Control
• Optimise & Re-design
Data Science Competences Groups – Business
Design
Modelling
Execution
Monitoring
Optimisation
RESEARCH
DATA
ANALYTICS
ALGORITHMSANALYTIC
SYSTEMS
ENGINEERING
COMPETENCES
DOMAIN
EXPERTISE DATA
SCIENCE
Data
Management
Scientific
Methods
Business Process
Management
Data Science Applications & Use cases
• RECOMMENDER SYSTEMS
• CREDIT SCORING
• DYNAMIC PRICING
• CUSTOMER CHURN
• FRAUD DETECTION
RECOMMENDER SYSTEMS
WHAT IS A RECOMMENDER SYSTEM?
A model that filters information to present users with a curated subset
of options they’re likely to find appealing
HOW DOES IT WORK?
Generally via a collaborative approach (considering user’s previous
behavior) or content-based approach (based on discrete assigned
characteristics)
WHAT IS A REAL USE CASE?
Tendril uses recommendation models to match eligible customers with
new or existing energy products
CREDIT SCORING
WHAT IS CREDIT SCORING?
A model that determines an applicant’s creditworthiness for a mortgage,
loan or credit card
HOW DOES IT WORK?
A set of decision management rules evaluates how likely an applicant is to
repay debts
WHAT IS A REAL USE CASE?
Ferratum Bank uses machine learning models to reach prospective
customers that may have been overlooked by traditional banking
institutions
DYNAMIC PRICING
WHAT IS DYNAMIC PRICING?
Modeling price as a function of supply, demand, competitor pricing and
exogenous factors
HOW DOES IT WORK?
Generalized linear models and classification trees are popular
techniques for estimating the “right” price to maximize expected
revenue.
WHAT IS A REAL USE CASE?
Turo uses dynamic pricing models to suggest prices to the people who
list and rent out cars
CUSTOMER CHURN
WHAT IS CUSTOMER CHURN?
Predicting which customers are going to abandon a product or service
HOW DOES IT WORK?
Data scientists may consider using support vector machines, random
forest or k-nearest-neighbors algorithms
WHAT IS A REAL USE CASE?
EAB combines data from transcripts, standardized test scores,
demographics and more to identify students at risk of not graduating.
FRAUD DETECTION
WHAT IS FRAUD DETECTION?
Detecting and preventing fraudulent financial transactions from being
processed
HOW DOES IT WORK?
Fraud detection is a binary classification problem: “is this transaction
legitimate or not?”
WHAT IS A REAL USE CASE?
Via SMS Group uses a combination of complex data lookups and
decision algorithms written in R and implemented in PHP to assess
whether a loan applicant is fraudulent
Works Cited
• https://www.yhat.com/whitepapers/data-science-in-practice
• http://wikibon.org/blog/role-of-the-data-scientist/
• https://www.cyfronet.krakow.pl/cgw16/presentations/S8_02_present
ation-Edison-CGW-26-10-2016.pdf
Thank You
Sreenatha Reddy K R
krsreenatha@gmail.com
https://in.linkedin.com/in/sreenathaa

More Related Content

PPTX
Introduction to Data Science
PPTX
Introduction to Data Science.pptx
PPT
PN JUNCTION
PPTX
Introduction to data science
PPTX
Data science
PPTX
Group discussion
PPTX
Wireless Sensor Networks ppt
PPTX
Introduction to C programming
Introduction to Data Science
Introduction to Data Science.pptx
PN JUNCTION
Introduction to data science
Data science
Group discussion
Wireless Sensor Networks ppt
Introduction to C programming

What's hot (20)

PPTX
Big Data Analytics
ODP
Machine Learning with Decision trees
PDF
Data science presentation
PPT
01 Data Mining: Concepts and Techniques, 2nd ed.
PPTX
introduction to data science
PPTX
Supervised learning and Unsupervised learning
PPTX
Introduction to Data Science
PDF
Exploratory data analysis data visualization
PPTX
Introduction to data science.pptx
PDF
Linear regression
PPTX
Ppt on data science
PPT
Data mining slides
 
PPTX
What Is Data Science? | Introduction to Data Science | Data Science For Begin...
PPTX
Lecture #01
PPTX
Machine learning ppt
PPTX
Data Science
PDF
Introduction to data analytics
PPT
Machine learning
PDF
Supervised and Unsupervised Machine Learning
PPTX
Data science
Big Data Analytics
Machine Learning with Decision trees
Data science presentation
01 Data Mining: Concepts and Techniques, 2nd ed.
introduction to data science
Supervised learning and Unsupervised learning
Introduction to Data Science
Exploratory data analysis data visualization
Introduction to data science.pptx
Linear regression
Ppt on data science
Data mining slides
 
What Is Data Science? | Introduction to Data Science | Data Science For Begin...
Lecture #01
Machine learning ppt
Data Science
Introduction to data analytics
Machine learning
Supervised and Unsupervised Machine Learning
Data science
Ad

Similar to Data science applications and usecases (20)

PPTX
Data Science Training in Chandigarh h
PPTX
Data science and business analytics
PPT
PPT
PDF
Data driven decision making
PDF
Operationalize analytics through modern data strategy
PPTX
Big Data Analytics information And Tools
PPT
datamining.ppt
PPT
datamining.ppt
PPTX
datamining management slyabbus and ppt.pptx
PPTX
Data Science Mastery Course in Pitampura
PPTX
Data Mining Presentation for College Harsh.pptx
PPTX
Join Axtria - Ingenious Insights
PPTX
Unit 1.pptx Anna University Business Analytics
PPTX
Big Data & Data Science Pengantar Imu Komputer_C5.pptx
PDF
Data Analysis and Analytics.pdf
PPTX
Exploratory data analysis for business MODULE 1.pptx
PDF
Data Mining and Business Analytics by Seyed Ziae Mousavi Mojab
PPTX
Tools and techniques for predictive analytics
PPTX
Big Data Analysis: Transforming Industries and Unlocking Potential​
Data Science Training in Chandigarh h
Data science and business analytics
Data driven decision making
Operationalize analytics through modern data strategy
Big Data Analytics information And Tools
datamining.ppt
datamining.ppt
datamining management slyabbus and ppt.pptx
Data Science Mastery Course in Pitampura
Data Mining Presentation for College Harsh.pptx
Join Axtria - Ingenious Insights
Unit 1.pptx Anna University Business Analytics
Big Data & Data Science Pengantar Imu Komputer_C5.pptx
Data Analysis and Analytics.pdf
Exploratory data analysis for business MODULE 1.pptx
Data Mining and Business Analytics by Seyed Ziae Mousavi Mojab
Tools and techniques for predictive analytics
Big Data Analysis: Transforming Industries and Unlocking Potential​
Ad

More from Sreenatha Reddy K R (10)

PPT
Linux security firewall and SELinux
PPT
Mail server setup
PPT
Linux System Administration - Web Server and squid setup
PPTX
Linux System Administration - NFS Server
PPTX
Linux System Administration - DNS
PPTX
DHCP and NIS
PPT
Linux commands and file structure
PPTX
Linux booting process - Linux System Administration
PPTX
Introduction to tcp ip linux networking
PPTX
Access control list acl - permissions in linux
Linux security firewall and SELinux
Mail server setup
Linux System Administration - Web Server and squid setup
Linux System Administration - NFS Server
Linux System Administration - DNS
DHCP and NIS
Linux commands and file structure
Linux booting process - Linux System Administration
Introduction to tcp ip linux networking
Access control list acl - permissions in linux

Recently uploaded (20)

PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPTX
Challenges and opportunities in feeding a growing population
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PPT
Chapter 3 METAL JOINING.pptnnnnnnnnnnnnn
PPTX
1intro to AI.pptx AI components & composition
PPTX
Data-Driven-Credit-Card-Launch-A-Wells-Fargo-Case-Study.pptx
PPTX
Moving the Public Sector (Government) to a Digital Adoption
PDF
Taxes Foundatisdcsdcsdon Certificate.pdf
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PPTX
LESSON-1-NATURE-OF-MATHEMATICS.pptx patterns
PDF
Master Databricks SQL with AccentFuture – The Future of Data Warehousing
PPTX
Computer network topology notes for revision
PPTX
咨询新西兰毕业证(UCOL毕业证书)联合理工学院毕业证国外毕业证
PPTX
artificial intelligence deeplearning-200712115616.pptx
PPTX
Purple and Violet Modern Marketing Presentation (1).pptx
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PPT
Miokarditis (Inflamasi pada Otot Jantung)
PDF
Foundation of Data Science unit number two notes
PDF
Chad Readey - An Independent Thinker
PPTX
Business Acumen Training GuidePresentation.pptx
STUDY DESIGN details- Lt Col Maksud (21).pptx
Challenges and opportunities in feeding a growing population
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
Chapter 3 METAL JOINING.pptnnnnnnnnnnnnn
1intro to AI.pptx AI components & composition
Data-Driven-Credit-Card-Launch-A-Wells-Fargo-Case-Study.pptx
Moving the Public Sector (Government) to a Digital Adoption
Taxes Foundatisdcsdcsdon Certificate.pdf
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
LESSON-1-NATURE-OF-MATHEMATICS.pptx patterns
Master Databricks SQL with AccentFuture – The Future of Data Warehousing
Computer network topology notes for revision
咨询新西兰毕业证(UCOL毕业证书)联合理工学院毕业证国外毕业证
artificial intelligence deeplearning-200712115616.pptx
Purple and Violet Modern Marketing Presentation (1).pptx
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
Miokarditis (Inflamasi pada Otot Jantung)
Foundation of Data Science unit number two notes
Chad Readey - An Independent Thinker
Business Acumen Training GuidePresentation.pptx

Data science applications and usecases

  • 1. Hello… Welcome To the Talk on Data Science Applications and Use cases
  • 2. Agenda… • What is Data Science? • Big Data Challenges • Data Science vs Software Engineering • Data Science Applications & Use cases • Conclusion
  • 3. What is Data Science? Data Science is the science which uses computer science, statistics and machine learning, visualization and human-computer interactions to collect, clean, integrate, analyze, visualize, interact with data to create data products. “Using data to make better decisions, optimize processes and improve products and services.” “What distinguishes data science itself from the tools and techniques is the central goal of deploying effective decision-making models to a production environment. “ – John Mount & Nina Zumel, Practical Data Science with R
  • 4. Big Data Challenges • Dealing with Data Growth • Generating insights in a timely manner • Integrating disparate data sources • Validating Data • Securing Bigdata • Organizational resistance
  • 6. ‘Data science’ is “Data-Driven Decision” making, to help the business to make good choices, whereas software engineering is the methodology for software product development without any confusions about the requirements. Data Science vs Software Engineering
  • 7. Data Science Competence Groups - Research Data Science Competence includes 5 areas/groups • Data Analytics • Data Science Engineering • Domain Expertise • Data Management • Scientific Methods (or Business Process Management) Scientific Methods • Design Experiment • Collect Data • Analyse Data • Identify Patterns • Hypothesise Explanation • Test Hypothesis Business Operations • Operations Strategy • Plan • Design & Deploy • Monitor & Control • Improve & Re-design
  • 8. Data Science Competence includes 5 areas/groups • Data Analytics • Data Science Engineering • Domain Expertise • Data Management • Scientific Methods (or Business Process Management) Scientific Methods • Design Experiment • Collect Data • Analyse Data • Identify Patterns • Hypothesise Explanation • Test Hypothesis Business Process Operations/Stages • Design • Model/Plan • Deploy & Execute • Monitor & Control • Optimise & Re-design Data Science Competences Groups – Business Design Modelling Execution Monitoring Optimisation RESEARCH DATA ANALYTICS ALGORITHMSANALYTIC SYSTEMS ENGINEERING COMPETENCES DOMAIN EXPERTISE DATA SCIENCE Data Management Scientific Methods Business Process Management
  • 9. Data Science Applications & Use cases • RECOMMENDER SYSTEMS • CREDIT SCORING • DYNAMIC PRICING • CUSTOMER CHURN • FRAUD DETECTION
  • 10. RECOMMENDER SYSTEMS WHAT IS A RECOMMENDER SYSTEM? A model that filters information to present users with a curated subset of options they’re likely to find appealing HOW DOES IT WORK? Generally via a collaborative approach (considering user’s previous behavior) or content-based approach (based on discrete assigned characteristics) WHAT IS A REAL USE CASE? Tendril uses recommendation models to match eligible customers with new or existing energy products
  • 11. CREDIT SCORING WHAT IS CREDIT SCORING? A model that determines an applicant’s creditworthiness for a mortgage, loan or credit card HOW DOES IT WORK? A set of decision management rules evaluates how likely an applicant is to repay debts WHAT IS A REAL USE CASE? Ferratum Bank uses machine learning models to reach prospective customers that may have been overlooked by traditional banking institutions
  • 12. DYNAMIC PRICING WHAT IS DYNAMIC PRICING? Modeling price as a function of supply, demand, competitor pricing and exogenous factors HOW DOES IT WORK? Generalized linear models and classification trees are popular techniques for estimating the “right” price to maximize expected revenue. WHAT IS A REAL USE CASE? Turo uses dynamic pricing models to suggest prices to the people who list and rent out cars
  • 13. CUSTOMER CHURN WHAT IS CUSTOMER CHURN? Predicting which customers are going to abandon a product or service HOW DOES IT WORK? Data scientists may consider using support vector machines, random forest or k-nearest-neighbors algorithms WHAT IS A REAL USE CASE? EAB combines data from transcripts, standardized test scores, demographics and more to identify students at risk of not graduating.
  • 14. FRAUD DETECTION WHAT IS FRAUD DETECTION? Detecting and preventing fraudulent financial transactions from being processed HOW DOES IT WORK? Fraud detection is a binary classification problem: “is this transaction legitimate or not?” WHAT IS A REAL USE CASE? Via SMS Group uses a combination of complex data lookups and decision algorithms written in R and implemented in PHP to assess whether a loan applicant is fraudulent
  • 15. Works Cited • https://www.yhat.com/whitepapers/data-science-in-practice • http://wikibon.org/blog/role-of-the-data-scientist/ • https://www.cyfronet.krakow.pl/cgw16/presentations/S8_02_present ation-Edison-CGW-26-10-2016.pdf
  • 16. Thank You Sreenatha Reddy K R [email protected] https://in.linkedin.com/in/sreenathaa

Editor's Notes

  • #14: Churn rate describes the rate at which customers abandon a product or service. Understanding customers’ likelihood to churn is particularly important for subscription-based models, everything ranging from traditional cable or gym memberships to recently popularized monthly subscription boxes.