Lecture
Lecture
ANALYTICS
Prepared By:
Lecturer: Muhammad
Bilal
DATA ANALYTICS
1. What is data analytics?
Most companies are collecting loads of data all the time—but, in
its raw form, this data doesn’t really mean anything. This is where
data analytics comes in. Data analytics is the process of analyzing
raw data in order to draw out meaningful, actionable insights,
which are then used to inform and drive smart business decisions.
HOW DATA ANALYTICS HELP US
IT’S ALL ABOUT FINDING PATTERNS IN A DATASET WHICH CAN TELL YOU SOMETHING
USEFUL AND RELEVANT ABOUT A PARTICULAR AREA OF THE BUSINESS
DATA ANALYTICS HELPS YOU TO MAKE SENSE OF THE PAST AND TO PREDICT FUTURE
TRENDS AND BEHAVIORS
ARMED WITH THE INSIGHTS DRAWN FROM THE DATA, BUSINESSES AND
ORGANIZATIONS ARE ABLE TO DEVELOP A MUCH DEEPER UNDERSTANDING OF THEIR
AUDIENCE, THEIR INDUSTRY, AND THEIR COMPANY AS A WHOLE—AND, AS A RESULT,
ARE MUCH BETTER EQUIPPED TO MAKE DECISIONS AND PLAN AHEAD.
HISTORY OF DATA ANALYTICS
The use of Data Analytics by business can be found as far back as the 19th century,
when Frederick Winslow Taylor initiated time management exercises. Another example is
when Henry Ford measured the speed of assembly lines. In the late 1960s, analytics
began receiving more attention as computers became decision-making support systems.
•Predictive Analytics
•Big Data Analytics
•Cognitive Analytics
•Prescriptive Analytics
•Descriptive Analytics
•Enterprise Decision Management
•Retail Analytics
•Augmented Analytics
•Web Analytics
•Call Analytics
2. What’s the difference between data
analytics and data science?
You’ll find that the terms “data science” and “data analytics” tend
to be used interchangeably. However, they are two different fields
and denote two distinct career paths. What’s more, they each have
a very different impact on the business or organization.
data analysts tackle and solve discrete questions about data,
often on request, revealing insights that can be acted upon by
other stakeholders,
•Single character
•Boolean (true or false)
•Text (string)
•Number (integer or floating-point)
•Picture
•Sound
•Video
WHAT IS META DATA?
• Data that provide information about other data.
• Structured data: This data is basically an organized data. It generally refers to data that has
defined the length and format of data.
• Semi- Structured data: This data is basically a semi-organised data. It is generally a form of
data that do not conform to the formal structure of data. Log files are the examples of this
type of data.
• Unstructured data: This data basically refers to unorganized data. It generally refers to data
that doesn’t fit neatly into the traditional row and column structure of the relational
database. Texts, pictures, videos etc. are the examples of unstructured data which can’t be
stored in the form of rows and columns.
VARIETY:
WHAT ARE THE 5 V'S OF BIG DATA?
Velocity:
Value:
•After having the 4 V’s into account there comes one more V which stands
for Value! The bulk of Data having no Value is of no good to the company,
unless you turn it into something useful.
Variability:
•How fast or available data that extent is the structure of your data is
changing?
•How often does the meaning or shape of your data change?
•Example: if you are eating same ice-cream daily and the taste just keep
changing.
STRUCTURED AND UNSTRUCTURED
Facebook Friends Undergraduate Degree
Home Town
Housemates
Archery Clubs
Work Placement
Graduate Mixer
Facebook Friends
Technical Communication
Lu Xin Damien Lafferty Niall Turbitt
Dublin Cork
TEXT
•Forms the majority of unstructured data
•Nearly one million bits of content shared on
Facebook every minute
•Over 100,000 tweets per minute
TEXT MINING EXAMPLE
• People’s mood on coffee, wine, beer and soda from Twitter
• Compare tweets to database of positive and negative words
• Calculate a sentiment score:
Score = # of Positive Words - # of Negative Words
Qualitative data can answer questions such as “how this has happened” or and
“why this has happened”.
3. Nominal data
Nominal data is used just for labeling variables, without any type of quantitative
value. The name ‘nominal’ comes from the Latin word “nomen” which means
‘name’.
The nominal data just name a thing without applying it to order. Actually, the
nominal data could just be called “labels.”
Ordinal data shows where a number is in order. This is the crucial difference from nominal types of data.
Ordinal data is data which is placed into some kind of order by their position on a scale. Ordinal data may
indicate superiority.
However, you cannot do arithmetic with ordinal numbers because they only show sequence.
Ordinal variables are considered as “in between” qualitative and quantitative variables.
In other words, the ordinal data is qualitative data for which the values are ordered.
In comparison with nominal data, the second one is qualitative data for which the values cannot be placed
in an ordered.
We can also assign numbers to ordinal data to show their relative position. But we cannot do math with
those numbers. For example: “first, second, third…etc.”
As we mentioned above discrete and continuous data are the two key types of
quantitative data.
In statistics, marketing research, and data science, many decisions depend on
whether the basic data is discrete or continuous.
5. Discrete data
Discrete data is a count that involves only integers. The discrete values cannot be
subdivided into parts.
For example, the number of children in a class is discrete data. You can count whole
individuals. You can’t count 1.5 kids.
To put in other words, discrete data can take only certain values. The data variables
cannot be divided into smaller parts.
It has a limited number of possible values e.g. days of the month.