Week-1-Lecture
Week-1-Lecture
DATA ANALYSIS
WEEK 1 LECTURE
Data Analytics
Data Analysis
Data analytics is the science of
Data analysis is a process of analyzing raw data to make
inspecting, cleansing, transforming, conclusions about that information.
and modelling data with the goal of Many of the techniques and processes
discovering useful information, of data analytics have been
informing conclusions, and automated into mechanical processes
supporting decision-making. and algorithms that work over raw
data for human consumption
Disciplines Under Data Science
Data and Information Visualization Machine Learning
Data and information visualization is Machine learning (ML) is the
an interdisciplinary field that deals subset of artificial intelligence
with the graphic representation of (AI) that focuses on building
data and information. It is a systems that learn—or improve
particularly efficient way of performance—based on the data
communicating when the data or they consume.
information is numerous as for
example a time series
Disciplines Under Data Science
Artificial Intelligence
Data Mining
Artificial intelligence (AI)
Data mining is an essential process for many is intelligence demonstrated
data analytics tasks. This involves extracting by machines, as opposed to the natural
data from unstructured data sources. These intelligence displayed
may include written text, large complex by animals including humans. AI
databases, or raw sensor data. The key steps research has been defined as the field of
in this process are to extract, transform, and study of intelligent agents, which refers
to any system that perceives its
load data (often called ETL.)
environment and takes actions that
maximize its chance of achieving its
goals
Disciplines Under Data Science
Statistics Natural Language Processing
Statistics is the discipline that concerns the Natural language processing (NLP) is a subfield
collection, organization, analysis, of linguistics, computer science, and artificial
intelligence concerned with the interactions
interpretation, and presentation of data.[ In between computers and human language, in
applying statistics to a scientific, industrial, or particular how to program computers to process
social problem, it is conventional to begin and analyze large amounts of natural
with a statistical population or a statistical language data. The goal is a computer capable of
model to be studied "understanding" the contents of documents,
including the contextual nuances of the language
within them.
Data Analysis
Data Cleaning: Not all of the data you collect will be useful, so it’s
time to clean it up. This process is where you remove white
spaces, duplicate records, and basic errors. Data cleaning is
mandatory before sending the information on for analysis.
Data Analysis: Here is where you use data analysis software and
other tools to help you interpret and understand the data and
arrive at conclusions. Data analysis tools include Excel, Python,
R, Looker, Rapid Miner, Chartio, Metabase, Redash, and Microsoft
Power BI.
Data Analysis Process
Data Interpretation: Now that you have your results, you need to
interpret them and come up with the best courses of action, based on
your findings.
2. Companies can use the insights they gain from data analysis
to inform their decisions, leading to better outcomes.
After asking the main question of what happened, the next step is
to dive deeper and ask why did it happen. This is where diagnostic
analysis comes in. it takes the insights found from descriptive
analytics and drive down to find the causes of those outcomes
Business Applications of Diagnostic include:
A freight company investigating the cause of slow shipments in a
certain region
A Saas company drilling down to determine which marketing
activities increased trials.
Predictive Analysis
It attempts to answer the question what is likely to happen . This type of
analytics utilizes previous data to make predictions about future outcomes . It
uses summarized data to make logical predictions of the outcomes of events. It
relies on statistical modelling which requires added technology and manpower
to forecast.
Business Applications of Predictive Analysis include:
Risk Assessment
Sales Forecasting
Using Customer Segmentation to determine which leads have the best chance
of converting
Predictive Analytics in Customer success teams
Prescriptive Analysis
Spreadsheets
Microsoft Excel Google Sheets
Programming Language
Python R
Data Visualization
Tableau
Power BI
Querying Language
NO SQL
SQL
Data Cleaning Critical Thinking
Statistics