1 Introduction to Data Analytics
1 Introduction to Data Analytics
Control
The group adds
controls to the process
Have the number of views gone up? Are sales stronger this
month than last?
2. Diagnostic analytics: This focuses more on why something
happened. It involves more diverse data inputs and a bit of
hypothesizing. Did the weather affect beer sales? Did that latest
marketing campaign impact sales?
3. Predictive analytics: This moves to what is likely going to
happen in the near term. What happened to sales the last time
we had a hot summer? How many weather models predict a hot
summer this year?
4. Prescriptive analytics: This suggests a course of action. For
example, we should add an evening shift to the brewery and
rent an additional tank to increase output if the likelihood of a
hot summer is measured as an average of these five weather
models and the average is above 58%,
Data analytics underpins many quality control systems in the
financial world, including the ever-popular Six Sigma program.
2. Data Management
Data requires a database to contain, manage, and provide access to
the information tht has been gathered. The next step in data analytics
is therefore the creation of such a database to manage the
information.
While some people or organizations may store data in Microsoft Excel
spreadsheets, Excel is limited for this purpose and is more a tool for
basic analysis and calculations such as in finance. Relational
databases are a much better option than Excel for data storage. They
allow for the storage of much greater volumes of data, and allow for
efficient access. The relational structure allows for tables to easily be
used together. Structured Query Language, known by its initials SQL,
is the computer language used to work on and query from relational
databases. Created in 1979, SQL allows for easy interaction with
relational databases enabling datasets to be queried, built, and
analysized.3
3. Statistical Analysis
The third step is statistical analysis. It involves the interpretation of
the gathered and stored data into models that will hopefully reveal
trends that can be used to interpret future data. This is achieved
through open-source programming languages such as Python. More
specific tools for data analytics, like R, can be used for statistical
analysis or graphical modeling.
4. Data Presentation
The results of the data analytics process are meant to be shared. The
final step is formatting the data so it’s accessible to and
understandable by others, particularly those individuals within a
company who are responsible for growth, analysis, efficiency, and
operations. Having access can be beneficial to shareholders as well.
1. Descriptive Analytics
Descriptive analytics is a branch of data analytics that focuses on
summarizing historical data to gain insights into past events or
phenomena. It involves organizing and presenting data in a
meaningful way through visualization techniques, such as charts,
graphs, and dashboards. Descriptive analytics aims to provide a clear
and concise snapshot of what has happened, allowing organizations to
understand patterns, trends, and key metrics. By analyzing
descriptive analytics, businesses can gain valuable insights into their
past performance, customer behavior, and market trends, which can
inform decision-making and help drive future strategies.
2. Diagnostic Analytics:
Diagnostic analytics is a form of data analytics that delves deeper into
understanding the root causes and reasons behind specific events or
outcomes. It goes beyond descriptive analytics by investigating the
relationships between variables to uncover insights and explanations.
Diagnostic analytics involves conducting exploratory analysis and
applying statistical techniques to identify patterns, correlations, and
anomalies within the data. By analyzing diagnostic analytics,
organizations can gain a comprehensive understanding of why certain
events occurred, enabling them to address underlying issues, optimize
processes, and make informed decisions to prevent similar
occurrences in the future.
3. Predictive Analytics:
Predictive analytics is a field within data analytics that employs
historical data and statistical modeling methods to predict future
outcomes or trends. Its objective is to make well-informed forecasts
and estimations based on the analysis of patterns, correlations, and
connections present in the data. By utilizing a range of statistical and
machine learning algorithms, predictive analytics creates predictive
models that enable organizations to anticipate customer behavior,
market trends, demand patterns, and other important factors. This
capability empowers businesses to proactively make decisions,
optimize the allocation of resources, manage risks effectively, and
gain a competitive edge by leveraging data-derived insights to shape
their future strategies and actions.
4. Prescriptive Analytics:
Prescriptive analytics is an advanced field in data analytics that
employs historical data, mathematical models, optimization
algorithms, and simulation methods to offer guidance on the best
actions or decisions to attain desired outcomes. Unlike descriptive
and predictive analytics, which concentrate on understanding past
occurrences and forecasting future trends, prescriptive analytics
takes an additional step by proposing precise courses of action. By
considering various limitations, goals, and potential situations,
prescriptive analytics enables organizations to make data-informed
choices that maximize the achievement of desired results. It supports
businesses to enhance operational efficiency, allocate resources
effectively, manage risks, and make well-informed decisions that drive
improved performance and a competitive edge.
5. Cognitive Analytics:
Cognitive analytics refers to the application of advanced technologies
and techniques that enable systems and machines to mimic human
cognitive abilities, such as perception, learning, reasoning, and
problem-solving. It combines elements of artificial intelligence,
machine learning, natural language processing, and other cognitive
computing technologies to analyze and interpret complex data sets.
Cognitive analytics goes beyond traditional data analysis by
incorporating context, understanding, and inference capabilities to
derive deeper insights and make more informed decisions. It enables
organizations to unveil patterns, trends, and relationships within large
volumes of structured and unstructured data, leading to improved
decision-making, enhanced customer experiences, and the discovery
of valuable business opportunities.
These five types of analytics frameworks are not mutually exclusive
and can often be combined to create more comprehensive analysis
approaches. Organizations can select and tailor a framework based on
their specific goals, data availability, and analytical capabilities.
In addition to these core frameworks, there are other considerations
within a data analytics framework, such as data collection, data
preprocessing, data exploration, and modeling techniques. These
elements ensure that the data is appropriately prepared, outliers and
missing values are addressed, and the most suitable analytical
methods are applied.
Data and analytics framework: tools and techniques
Key Tools:
Data Visualization Tools:
Tableau, Power BI, which allow for interactive dashboards and
graphical representation of data to identify patterns and trends.
Programming Languages:
Python, with libraries like Pandas and Matplotlib, for data
manipulation, analysis, and visualization.
Statistical Software:
SAS, for advanced statistical analysis and predictive modeling.
Spreadsheet Applications:
Microsoft Excel, for basic data cleaning, sorting, filtering, and
pivot tables.
Key Techniques:
Descriptive Analytics:
Summarizing key metrics and trends from data using measures
like averages, counts, and frequencies.
Diagnostic Analytics:
Exploring the "why" behind observed patterns by drilling down
into data to identify root causes.
Predictive Analytics:
Using statistical models to forecast future trends and outcomes
based on historical data.
Prescriptive Analytics:
Providing recommendations or actionable insights based on
predictive analysis to optimize decisions.
Data Mining:
Discovering hidden patterns and relationships within large
datasets using techniques like clustering, association rule mining,
and decision trees.
Time Series Analysis:
Studying trends and seasonality in data that is collected over
time.
Regression Analysis:
Identifying relationships between variables to predict changes in
one variable based on another.
Natural Language Processing (NLP):
Analyzing unstructured text data like customer reviews to extract
sentiment and insights.