John B. Rollins, Ph.D.
IBM Analytics | IBM Corporation
Foundational Data Science Methodology
2015 IBM Corporation
Introduction
Why we are interested in data science
- Solve problems and answer questions
- Gain useful insights through modeling to predict outcomes or discover
underlying patterns
Rapidly evolving technologies
- Platform growth
- In-database analytics
- Text analysis
- Automation
2015 IBM Corporation
Data science methodology
Why?
- To provide a guiding strategy
What?
- General strategy that guides the processes and activities within a given
domain
- Does not depend on particular technologies or tools
- Not a set of techniques or recipes
- Provides the data scientist with a framework for how to proceed to obtain
answers
2015 IBM Corporation
Methodology diagram
Business
Understanding
Analytic
Approach
Data
Requirements
Feedback
Data Collection
Deployment
Data
Understanding
Evaluation
Modeling
Data
Preparation
2015 IBM Corporation