Data Analytics
Data Analytics
“The constant increase in data processing speeds and bandwidth, the nonstop invention
of new tools for creating, sharing, and consuming data, and the steady addition of new
data creators and consumers around the world, ensure that data growth continues
unabated. Data begets more data in a constant virtuous cycle.” – Forbes 2020 Report
Modern Data Ecosystem involves an interconnected, independent, and continually
evolving entities.
It includes:
Data integrated from disparate sources
Different types of analysis and skills to generate insights
Active stakeholders to collaborate and act on insights generated
Tools, applications, and infrastructure to store, process, and disseminate data as
required.
Data Sources
Structured
Unstructured
Steps:
1. Pull a copy of the data from the original sources into a data repository. (acquiring the
data you need).
- Challenges: reliability, security, and integrity of data
2. Raw data needs to get organized, cleaned up, optimized for access, and conform to
compliances and standards enforced in the organization.
- Challenges: data management, repositories that provide high availability,
flexibility, accessibility, and security
3. Users pulling data from the enterprise repository
Business stakeholders
APPs
Programmers Analysts
Data Science use cases
- Challenges: Interfaces, APIs, Application (if it is inline with their needs)