Introduction
Introduction
So basically: ‘EVERYTHING’ can be Data- from the food we eat, to the hair we make, to the clothes we
wear, to the music we listen to….EVERYTHING!
Types of Data
There are 3 main types of data namely;
1. Structured Data; This is data that is arranged in a rigid tabular format. E.g; Exam result Data,
Attendance List Data…
2. Semi-structured Data; This is data that is mixed, it is not in a rigid format, but it has consistent
character like is seen in structured Data. E.g; arranging market list in a csv(comma separated
values) format. *[bread, Fish, Meat, Milk]
3. Unstructured Data; This is data that is complex, it is impossible to arrange in a rigid tabular
format. E.g; videos, images, text messages
*NOTE: Data analysts usually work with structured and semi-structured data because they are far more
easier to analyze, analysis of unstructured data can be done by more advanced big-data scientists.
-Data analysis involves extracting meaning from data in a way that’s useful to a decision-maker. Data
analytics is actually broader in scope, Data analytics refers to the process of using data and analytical
tools and techniques to find new insights and make new predictions which are usually for the benefit of
the organization.
-PLAINLY; Data analytics involves the broad-field of using special analytical tools and techniques to
generate useful insights from data that will therefore help organizations to make correct data-driven
decisions, whereas data analysis involves the specific process of analyzing raw data.
*Many people in MANY different fields can apply the data analysis process for the purpose of analyzing
a particular dataset, but only a true ‘Data Analyst’ has ALL the required skills and knowledge bank to
work in a firm or organization and actively help them to work with their data, so as to help them make
very informed and accurate decisions that can help take their company or business farther.
SUMMARY; So the real thing we’re actually studying right now is ‘Data analytics’, because ‘Data analysis’
as a discipline basically focuses on the process of actually analyzing data to get insights, whereas ‘Data
analytics’ as a discipline focuses on both the process of actually analyzing data and its direct Real-world
application in businesses and organizations.
*The best use case of the data analysis process is in businesses and specific organizations (like
governmental or even non-governmental) because you can’t just analyze data without any aim at heart,
you analyze with the aim of improving something- such as making a governmental organization much
better, or helping a business to move to the next level.
[FOR THIS COURSE THO, WE’LL BE USING THE NAME ‘DATA ANALYSIS’]
*ALL Data Scientists can be called ‘Data Analysts’, and can actively ‘analyze data’, whereas, not all Data
Analyst can be called Data scientists.
-Whereas Data analysis is mainly focused on just finding reasonable and useful information(insights),
Data Science is focused on discovering hidden patterns and relationships, that can therefore help a
business or co-operation to be far ahead of their competitors or contemporaries.
-Data Scientists use more advanced statistical and analytical tools and techniques because they are
more focused on making predictions of the ‘Future’ and some of the normal Data analysts tools and
techniques are not best soothed for that
-Data Science also involves analyzing much larger datasets because they are usually doing far more
advanced work than a regular data analysts work scheme.
KEY SUMMARY
1. D.Sc is the Senior brother of Data analysis- ALL data scientist are data analyst
2. D.Sc aims to find hidden patterns and relationships unlike D.A that just wants to find insights
3. D.Sc is usually focused on ‘Making Predictions’ for the future, while DA just wants to check and
understand what happened in the ‘Past’
4. D.Scientists usually analyze much larger datasets as they already have more advanced skillsets