Xii Analytical Approach

The document outlines a five-part analytical approach in data science methodology, including stages from problem definition to feedback. Each part emphasizes the importance of business understanding, data collection, preparation, modeling, evaluation, deployment, and iterative feedback. The process is designed to ensure that data scientists effectively address business problems using appropriate statistical and machine learning techniques.

Uploaded by

priyansh23fe

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

15 views3 pages

Xii Analytical Approach

Uploaded by

priyansh23fe

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

ANALYTICAL APPROACH (DATA SCIENCE METHODOLOGY)

There are five parts, each of which contains more steps:

1. From Problem to Approach
2. From Requirements to Collection
3. From Understanding to Preparation
4. From Modeling to Evaluation
5. From Deployment to Feedback

1. From Problem to Approach

Business Understanding-Every project, regardless of its size, starts with business

understanding, which lays the foundation for successful resolution of the business problem. The
business sponsors needing the analytic solution play the critical role in this stage by defining the
problem, project objectives and solution requirements from a business perspective. And, believe
it or not—even with nine stages still to go—this first stage is the hardest.

After clearly stating a business problem, the data scientist can define the analytic approach to
solving it. Doing so involves expressing the problem in the context of statistical and machine
learning techniques so that the data scientist can identify techniques suitable for achieving the
desired outcome. Selecting the right analytic approach depends on the question being asked.
Once the problem to be addressed is defined, the appropriate analytic approach for the problem is
selected in the context of the business requirements.

2. From Requirements to Collection

Data Requirements is the stage where we identify the necessary data content, formats, and
sources for initial data collection. This includes 5W1H approach.

 In the Data Collection Stage, data scientists identify the available data resources relevant to
the problem domain.


3. From Understanding to Preparation

 Now that the data collection stage is complete, data scientists use descriptive statistics and
visualization techniques to understand data better. Data scientists, explore the dataset to
understand its content, , quality, and initial insights about the data. Gaps in data will be
identified and plans to either fill or make substitutions will have to be made. They determine
if additional data is necessary to fill any gaps but also to verify the quality of the data.

 In the Data Preparation stage, data scientists prepare data for modeling, by cleaning the
data and make it error free for use during modelling.

From Modeling to Evaluation

Once data are prepared for the chosen machine learning algorithm, we are ready for modeling.

 Modeling focuses on developing models that are either descriptive or predictive, and these
models are based on the analytic approach that was taken statistically or through machine
learning. Descriptive modeling is a mathematical process that describes real-world events
and the relationships between factors responsible for them, for example, a descriptive model
might examine things like: if a person did this, then they’re likely to prefer that. Predictive
modeling is a process that uses data mining and probability to forecast outcomes; for
example, a predictive model might be used to determine whether an email is a spam or not.
For predictive modeling, data scientists use a training set that is a set of historical data in
which the outcomes are already known. This step can be repeated more times until the model
understands the question and answer to it.
 In the Model Evaluation stage, data scientists can evaluate the model in two ways: Hold-Out
and Cross-Validation. In the Hold-Out method, the dataset is divided into three subsets:
a training set as we said in the modeling stage; a validation set that is a subset used to
assess the performance of the model built in the training phase; a test set is a subset to
evaluate the likely future performance of a model.

From Deployment to Feedback

 The Deployment stage depends on the purpose of the model, and it may be rolled out to a
limited group of users or in a test environment.

 The Feedback stage is usually made the most from the customer. Customers after the
deployment stage can say if the model works for their purposes or not. Data scientists take
this feedback and decide if they should improve the model; that’s because the process from
modeling to feedback is highly iterative.

Rapid Miner Cheat Doc
67% (6)
Rapid Miner Cheat Doc
14 pages
DL Unit-2
No ratings yet
DL Unit-2
24 pages
Data Science
100% (2)
Data Science
33 pages
6 - Data Science Methodology
No ratings yet
6 - Data Science Methodology
20 pages
Data Science Process
No ratings yet
Data Science Process
101 pages
DTS Modul Data Science Methodology
100% (1)
DTS Modul Data Science Methodology
56 pages
Part1 Ds ML Introduction
No ratings yet
Part1 Ds ML Introduction
61 pages
Lesson 6 Data Life Cycle Part 2
No ratings yet
Lesson 6 Data Life Cycle Part 2
30 pages
Unit2-Data Science
No ratings yet
Unit2-Data Science
20 pages
BSR-Data Science
No ratings yet
BSR-Data Science
308 pages
M1 - FDS
No ratings yet
M1 - FDS
19 pages
FILE Ai
No ratings yet
FILE Ai
10 pages
IML-IITKGP - Assignment 1 Solution
No ratings yet
IML-IITKGP - Assignment 1 Solution
7 pages
Data Science: Lesson 5
No ratings yet
Data Science: Lesson 5
6 pages
3 - The Data Science Method
No ratings yet
3 - The Data Science Method
8 pages
Data Science Methodology
No ratings yet
Data Science Methodology
26 pages
Artificial Intelligence - (Unit - 1)
No ratings yet
Artificial Intelligence - (Unit - 1)
47 pages
IBM Q1 Technical Marketing ASSET2 - Data Science Methodology-Best Practices For Successful Implementations Ov37176 PDF
No ratings yet
IBM Q1 Technical Marketing ASSET2 - Data Science Methodology-Best Practices For Successful Implementations Ov37176 PDF
6 pages
W3 - DA Life Cycle
No ratings yet
W3 - DA Life Cycle
49 pages
BMW M-4
No ratings yet
BMW M-4
108 pages
Data Science Through R Lesson-2 Data Science in Action: Prof - Dr. A. B. Chowdhury, HOD, CA
No ratings yet
Data Science Through R Lesson-2 Data Science in Action: Prof - Dr. A. B. Chowdhury, HOD, CA
39 pages
Crop Recommendation Using Machine Learning Techniques IJERTCONV10IS11044
No ratings yet
Crop Recommendation Using Machine Learning Techniques IJERTCONV10IS11044
3 pages
Data Science Methodology
No ratings yet
Data Science Methodology
4 pages
Life Cycle of Data Science - Complete Step-By-step Guide
No ratings yet
Life Cycle of Data Science - Complete Step-By-step Guide
3 pages
Week 3
No ratings yet
Week 3
3 pages
Life Cycle of DS Project
No ratings yet
Life Cycle of DS Project
9 pages
Activity 3. Mind Map. Data Science Methodology
No ratings yet
Activity 3. Mind Map. Data Science Methodology
4 pages
Module 5 - Data Science Methodologies
No ratings yet
Module 5 - Data Science Methodologies
9 pages
Unit 1: Capstone Project
No ratings yet
Unit 1: Capstone Project
21 pages
ML
No ratings yet
ML
49 pages
Data Science Lifecycle
No ratings yet
Data Science Lifecycle
3 pages
Predicting Material Properties From 3D Printer Settings Using Machine Learning Techniques
100% (1)
Predicting Material Properties From 3D Printer Settings Using Machine Learning Techniques
19 pages
Module I (Introduction Data Analytics Life Cycle) Part II
No ratings yet
Module I (Introduction Data Analytics Life Cycle) Part II
103 pages
Module1 Data Science
No ratings yet
Module1 Data Science
15 pages
Breaking The Trilemma of Privacy, Utility, Efficiency Via Controllable Machine Unlearning
No ratings yet
Breaking The Trilemma of Privacy, Utility, Efficiency Via Controllable Machine Unlearning
12 pages
Big Data
No ratings yet
Big Data
4 pages
Bjerre Et Al. - 2022 - Assessing Spatial Transferability of A Random Fore
No ratings yet
Bjerre Et Al. - 2022 - Assessing Spatial Transferability of A Random Fore
11 pages
Data Science
No ratings yet
Data Science
5 pages
Helmet Detection
No ratings yet
Helmet Detection
7 pages
Robust Manga Page Colorization Via Coloring Latent Space
No ratings yet
Robust Manga Page Colorization Via Coloring Latent Space
17 pages
Chapter 1 - Intr To DS and Business Understanding
No ratings yet
Chapter 1 - Intr To DS and Business Understanding
35 pages
Introduction Data Science Edited
No ratings yet
Introduction Data Science Edited
33 pages
Electronics 13 02465
No ratings yet
Electronics 13 02465
28 pages
Customer Classification by Past Purchase Data Analysis
No ratings yet
Customer Classification by Past Purchase Data Analysis
4 pages
Deep Learning Based Brain Tumor Detection and Classification
No ratings yet
Deep Learning Based Brain Tumor Detection and Classification
6 pages
5 Data Science Project Lifecycle
No ratings yet
5 Data Science Project Lifecycle
33 pages
EBook - Data Science 4
No ratings yet
EBook - Data Science 4
14 pages
对冲基金收益预测与选择的横断面机器学习方法
No ratings yet
对冲基金收益预测与选择的横断面机器学习方法
25 pages
Team1 - Data Science Methodology
No ratings yet
Team1 - Data Science Methodology
39 pages
Data Analytics I Unit Notes
No ratings yet
Data Analytics I Unit Notes
8 pages
Icpram 2025
No ratings yet
Icpram 2025
15 pages
Business Analytics Unit I
No ratings yet
Business Analytics Unit I
45 pages
Rajpreet Finalized Dissertation
No ratings yet
Rajpreet Finalized Dissertation
110 pages
Data Science Methodology
No ratings yet
Data Science Methodology
21 pages
CSCI946 w3 - DataPrep
No ratings yet
CSCI946 w3 - DataPrep
58 pages
Heart Disease Prediction Project Documentation
No ratings yet
Heart Disease Prediction Project Documentation
22 pages
AndroPack A Hybrid Method To Detect Packed Android Malware With Ensemble Learning
No ratings yet
AndroPack A Hybrid Method To Detect Packed Android Malware With Ensemble Learning
4 pages
Unit2 DATA SCIENCE
No ratings yet
Unit2 DATA SCIENCE
8 pages
Data Science Methodology
No ratings yet
Data Science Methodology
3 pages
Life Cycle
No ratings yet
Life Cycle
35 pages
CCCS CIC AndMal 2020
No ratings yet
CCCS CIC AndMal 2020
6 pages
PA DL Consolidated
No ratings yet
PA DL Consolidated
94 pages
Data Science
No ratings yet
Data Science
3 pages
Unit 2 - DS - 1st Year
No ratings yet
Unit 2 - DS - 1st Year
7 pages
Project Report Template AICTE Internship 2025
No ratings yet
Project Report Template AICTE Internship 2025
21 pages
Aastha Jain
No ratings yet
Aastha Jain
1 page
Majeed MV-Soccer Motion-Vector Augmented Instance Segmentation For Soccer Player Tracking CVPRW 2024 Paper
No ratings yet
Majeed MV-Soccer Motion-Vector Augmented Instance Segmentation For Soccer Player Tracking CVPRW 2024 Paper
11 pages
Dsur Ea2352001010391 W3
No ratings yet
Dsur Ea2352001010391 W3
3 pages
Workflow of Supervised Learning
No ratings yet
Workflow of Supervised Learning
2 pages
Data Science Process Stages Lecture 2
No ratings yet
Data Science Process Stages Lecture 2
4 pages
AL3451 Assignment Question1
No ratings yet
AL3451 Assignment Question1
3 pages
Module 1B
No ratings yet
Module 1B
65 pages
Unit 4 - Question Bank and Answers
No ratings yet
Unit 4 - Question Bank and Answers
23 pages
Autods: Towards Human-Centered Automation of Data Science: Dakuo Wang Josh Andres Justin Weisz
No ratings yet
Autods: Towards Human-Centered Automation of Data Science: Dakuo Wang Josh Andres Justin Weisz
12 pages
Datas Unit1
No ratings yet
Datas Unit1
20 pages
HTTTTC - Final Exam
No ratings yet
HTTTTC - Final Exam
4 pages
21bce5801 53620
No ratings yet
21bce5801 53620
49 pages
Unit 3 (DS)
No ratings yet
Unit 3 (DS)
32 pages
CH 2
No ratings yet
CH 2
26 pages
ML-UNIT - I - Part A
No ratings yet
ML-UNIT - I - Part A
88 pages
Introduction To Data Science Methodology
No ratings yet
Introduction To Data Science Methodology
45 pages
Capstone Project
No ratings yet
Capstone Project
28 pages
Unit 2 - Data Science Methodology Notes
No ratings yet
Unit 2 - Data Science Methodology Notes
26 pages
AI Student HandbookXII
No ratings yet
AI Student HandbookXII
48 pages
Capstone Project - Unit2
No ratings yet
Capstone Project - Unit2
81 pages
PM Unit 1
No ratings yet
PM Unit 1
41 pages
Liceria Tech
No ratings yet
Liceria Tech
12 pages
Ds 3
No ratings yet
Ds 3
9 pages

Xii Analytical Approach

Uploaded by

Xii Analytical Approach

Uploaded by

ANALYTICAL APPROACH (DATA SCIENCE METHODOLOGY)

There are five parts, each of which contains more steps:

1. From Problem to Approach

Business Understanding-Every project, regardless of its size, starts with business

2. From Requirements to Collection

3. From Understanding to Preparation

From Modeling to Evaluation

From Deployment to Feedback

You might also like