
Data Analytics Lifecycle

The Data Analytics Lifecycle outlines the systematic process of generating, collecting, processing, and analyzing data to achieve business goals. It consists of six phases: Discovery, Data Preparation, Model Planning, Model Building, Communicate Results, and Operationalize, each with specific tasks and tools. This lifecycle allows data professionals to manage data effectively and extract valuable insights for organizational success.

Uploaded by

sanag0404


Big Data (Velammal Engineering College)


The Data Analytics Lifecycle defines the roadmap for how data is
generated, collected, processed, used, and analyzed to achieve
business goals. It offers a systematic way to manage data and
convert it into information that serves organizational and
project goals. The process provides the direction and methods
for extracting information from the data and keeps the work
moving toward those goals.

Data professionals use the lifecycle’s circular form to proceed with
data analytics in either a forward or a backward direction.
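
As a minimal sketch of this circular movement, the six phases named in this document can be modelled as an enumeration with a helper that steps forward or backward. The names `Phase` and `step` are hypothetical, and wrapping from the last phase back to the first is an assumption drawn from the word "circular", not something the notes spell out.

```python
from enum import Enum


class Phase(Enum):
    """The six lifecycle phases, in their usual order."""
    DISCOVERY = 1
    DATA_PREPARATION = 2
    MODEL_PLANNING = 3
    MODEL_BUILDING = 4
    COMMUNICATE_RESULTS = 5
    OPERATIONALIZE = 6


def step(phase: Phase, direction: int = 1) -> Phase:
    """Move one phase forward (+1) or backward (-1).

    Assumption: the lifecycle wraps around, so stepping forward from the
    last phase returns to the first, and vice versa.
    """
    if direction not in (1, -1):
        raise ValueError("direction must be +1 (forward) or -1 (backward)")
    phases = list(Phase)
    index = (phases.index(phase) + direction) % len(phases)
    return phases[index]


if __name__ == "__main__":
    current = Phase.DATA_PREPARATION
    print(step(current).name)      # one phase forward
    print(step(current, -1).name)  # one phase backward
```

In practice a team may jump back more than one phase at a time; the single-step helper only illustrates that movement is allowed in both directions.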

DATA ANALYTICS LIFECYCLE PHASES

The phases of the Data Analytics Lifecycle have no fixed
structure, so the steps are not uniform across teams. Some data
professionals follow additional steps, while others skip some
stages altogether or work on several phases simultaneously.

Phase 1: Discovery –
• The data science team learns about and investigates the problem.
• The team develops context and understanding.
• The team identifies the data sources needed and available for
the project.
• The team formulates initial hypotheses that can later be tested
with data.
Phase 2: Data Preparation –
• The team explores, preprocesses, and conditions data prior to modeling
and analysis.
• This phase requires an analytic sandbox; the team performs extract,
load, and transform (ELT) steps to get data into the sandbox.
• Data preparation tasks are likely to be performed multiple times and
not in a predefined order.
• Tools commonly used in this phase include Hadoop, Alpine Miner,
and OpenRefine.
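
The load-then-transform pattern can be sketched with Python's built-in `sqlite3` module, using an in-memory database as a stand-in for the analytic sandbox. The table names, column names, and sample rows below are hypothetical; a real project would more likely use one of the tools listed above.

```python
import sqlite3

# Hypothetical raw records, standing in for an extract from a source system.
RAW_ROWS = [
    ("1", " Alice ", "120.50"),
    ("2", "bob", ""),        # missing amount
    ("3", "Carol", "87"),
]


def load_raw(conn, rows):
    """Load: copy the extracted rows into the sandbox unchanged."""
    conn.execute("CREATE TABLE raw_sales (id TEXT, customer TEXT, amount TEXT)")
    conn.executemany("INSERT INTO raw_sales VALUES (?, ?, ?)", rows)


def transform(conn):
    """Transform: condition the data inside the sandbox, after loading."""
    conn.execute(
        """
        CREATE TABLE clean_sales AS
        SELECT CAST(id AS INTEGER)  AS id,
               TRIM(customer)       AS customer,
               CAST(amount AS REAL) AS amount
        FROM raw_sales
        WHERE amount <> ''
        """
    )


def prepare(rows):
    """Run load and transform, then return the conditioned rows."""
    conn = sqlite3.connect(":memory:")  # in-memory database as the sandbox
    try:
        load_raw(conn, rows)
        transform(conn)
        query = "SELECT id, customer, amount FROM clean_sales ORDER BY id"
        return conn.execute(query).fetchall()
    finally:
        conn.close()


if __name__ == "__main__":
    for row in prepare(RAW_ROWS):
        print(row)
```

Keeping the raw table untouched and writing cleaned rows to a second table makes it cheap to repeat the transform step, which fits the point that preparation tasks are often performed several times.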


Phase 3: Model Planning –
• The team explores the data to learn about relationships between
variables, then selects the key variables and the most
suitable models.
• The models chosen here are built and executed in the next
phase, based on this planning work.
• Tools commonly used in this phase include MATLAB and
STATISTICA.
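
One simple way to explore relationships between variables is to compute each candidate variable's correlation with the target and keep the strongly related ones. The sketch below assumes Pearson correlation and a cut-off of 0.5; both are illustrative choices rather than part of the lifecycle, and the variable names and data are hypothetical.

```python
from math import sqrt

# Hypothetical data: two candidate variables and the target to be explained.
FEATURES = {
    "ad_spend": [1, 2, 3, 4, 5],
    "store_id": [3, 1, 4, 1, 5],
}
TARGET = [2.1, 3.9, 6.2, 8.1, 9.8]


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def select_key_variables(features, target, threshold=0.5):
    """Keep variables whose absolute correlation with the target meets the threshold."""
    scores = {name: pearson(values, target) for name, values in features.items()}
    return {name: r for name, r in scores.items() if abs(r) >= threshold}


if __name__ == "__main__":
    for name, r in select_key_variables(FEATURES, TARGET).items():
        print(f"{name}: r = {r:.3f}")
```

Correlation only captures linear relationships, so a variable dropped here may still matter to a non-linear model; the threshold is a starting point for discussion, not a rule.
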
Phase 4: Model Building –
• The team develops datasets for testing, training, and production
purposes.
• The team builds and executes models based on the work done in
the model planning phase.
• The team also considers whether its existing tools will suffice for
running the models or whether it needs a more robust environment
for executing them.
• Free or open-source tools – R and PL/R, Octave, WEKA.
• Commercial tools – MATLAB, STATISTICA.
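
The split into training and test data, followed by building and executing a model, can be sketched in plain Python. This example assumes a one-variable least-squares line as the model and a 75/25 split; the data is hypothetical and noise-free so the result is easy to check, and the production dataset is left out.

```python
import random

# Hypothetical data that lies exactly on the line y = 2x + 1.
DATA = [(x, 2 * x + 1) for x in range(8)]


def split(rows, test_fraction=0.25, seed=0):
    """Shuffle a copy of the rows and split it into (training, test) sets."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    n_test = max(1, int(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


def fit_line(rows):
    """Ordinary least squares fit of y = slope * x + intercept."""
    n = len(rows)
    mean_x = sum(x for x, _ in rows) / n
    mean_y = sum(y for _, y in rows) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in rows)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in rows)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def mean_abs_error(model, rows):
    """Average absolute prediction error of the fitted line on the given rows."""
    slope, intercept = model
    return sum(abs(y - (slope * x + intercept)) for x, y in rows) / len(rows)


if __name__ == "__main__":
    train, test = split(DATA)
    model = fit_line(train)            # build the model on training data
    print("slope, intercept:", model)
    print("test error:", mean_abs_error(model, test))  # execute on held-out data
```

Evaluating on rows the model never saw during fitting is what makes the test error a fair estimate of how the model would behave on new data.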

Phase 5: Communicate Results –
• After executing the model, the team compares the outcomes of modeling
to the criteria established for success and failure.
• The team considers how best to articulate findings and outcomes to the
various team members and stakeholders, taking into account caveats and
assumptions.
• The team should identify key findings, quantify the business value, and
develop a narrative to summarize and convey the findings to stakeholders.
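
Comparing model outcomes against pre-agreed success criteria can be sketched as a small check. The `Criterion` class, the metric names, and the threshold values below are all hypothetical; real criteria would come from the Discovery phase.

```python
from dataclasses import dataclass


@dataclass
class Criterion:
    """A success criterion: a metric, a threshold, and which direction is good."""
    metric: str
    threshold: float
    higher_is_better: bool = True


def evaluate(results, criteria):
    """Return {metric: passed} for each success criterion."""
    report = {}
    for c in criteria:
        value = results[c.metric]
        if c.higher_is_better:
            report[c.metric] = value >= c.threshold
        else:
            report[c.metric] = value <= c.threshold
    return report


# Hypothetical criteria and model results.
CRITERIA = [
    Criterion("accuracy", 0.80),
    Criterion("mean_abs_error", 5.0, higher_is_better=False),
]
RESULTS = {"accuracy": 0.84, "mean_abs_error": 6.2}

if __name__ == "__main__":
    for metric, passed in evaluate(RESULTS, CRITERIA).items():
        print(f"{metric}: {'met' if passed else 'not met'}")
```

A pass/fail table like this is only the starting point for the narrative; the caveats and assumptions behind each number still have to be explained to stakeholders.
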
Phase 6: Operationalize –
• The team communicates the benefits of the project more broadly and sets
up a pilot project to deploy the work in a controlled way before broadening
it to the full enterprise of users.
• This approach enables the team to learn about the performance and related
constraints of the model in a production environment on a small scale, and
to make adjustments before full deployment.
• The team delivers final reports, briefings, and code.
• Free or open-source tools – Octave, WEKA, SQL, MADlib.
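
One possible way to run a pilot on a small scale is to expose the model to a fixed percentage of users and raise that percentage over time. The sketch below assumes hash-based bucketing of user IDs, a common rollout technique that these notes do not prescribe; the function name `in_pilot` and the user IDs are hypothetical.

```python
import hashlib


def in_pilot(user_id: str, pilot_percent: int) -> bool:
    """Deterministically decide whether a user belongs to the pilot group.

    Each user ID is hashed into one of 100 buckets; users whose bucket
    number is below pilot_percent see the new model.
    """
    if not 0 <= pilot_percent <= 100:
        raise ValueError("pilot_percent must be between 0 and 100")
    digest = hashlib.sha256(user_id.encode("utf-8")).digest()
    bucket = int.from_bytes(digest[:4], "big") % 100
    return bucket < pilot_percent


if __name__ == "__main__":
    users = [f"user-{i}" for i in range(1000)]  # hypothetical user IDs
    pilot_users = [u for u in users if in_pilot(u, 10)]
    print(f"{len(pilot_users)} of {len(users)} users are in the pilot")
```

Because the assignment depends only on the user ID, a user who is in the pilot at 10 percent stays in it when the rollout widens to 50 or 100 percent, which keeps their experience consistent while the team makes adjustments.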
