0% found this document useful (0 votes)
9 views14 pages

Types of Big Data

Uploaded by

Chinmay Patil
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
9 views14 pages

Types of Big Data

Uploaded by

Chinmay Patil
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 14

Types of Big Data

Big data can be classified into 3-types

1. Structured:
2. Semi-Structured:
3. Unstructured:
1. Structured data
Structured Data is used to refer to the data which is already

stored in databases, in an ordered manner.


Here,data is present in a structured schema, along with all

the required columns. It is in a structured or tabular format.


Data that is stored in a relational database management
system is an example of structured data
Any data that can be stored, accessed and processed in the

form of fixed format is termed as a 'structured' data.


It accounts for about 20% of the total existing data and is

used the most in programming and computer-related


activities.
There are two sources of structured data- machines and

humans.
All the data received from sensors, weblogs, and financial
systems are classified under machine-generated data.
These include medical devices, GPS data, data of usage

statistics captured by servers and applications and the huge


amount of data that usually move through trading platforms,
to name a few
Human-generated structured data mainly includes all the

data a human input into a computer, such as his name and


other personal details.
 When a person clicks a link on the internet, or even makes

a move in a game, data is created- this can be used by


companies to figure out their customer behavior and make
the appropriate decisions and modifications.
Example Of Structured Data

An 'Employee' table in a database is an example of Structured


Data

Employee_ID Employee_Name Gender Department Salary_In_lacs

2365 Rajesh Kulkarni Male Finance 650000

3398 Pratibha Joshi Female Admin 650000

7465 Shushil Roy Male Admin 500000

7500 Shubhojit Das Male Finance 500000

7699 Priya Sane Female Finance 550000


2. Unstructured data:
Any data with unknown form or the structure is classified
as unstructured data.
Any data which has an unfamiliar model or structure is

categorized as unstructured data.


About 80% of the total account for unstructured big data.
In addition to the size being huge, un-structured data poses

multiple challenges in terms of its processing for deriving


value out of it.
Unstructured data is also classified based on its source,

into machine-generated or human-generated.


Machine-generated data accounts for all the satellite

images, the scientific data from various experiments and


radar data captured by various facets of technology.
Human-generated unstructured data is found in abundance
across the internet since it includes social media data,
mobile data, and website content.
This means that the pictures we upload to Facebook or

Instagram handle, the videos we watch on YouTube and


even the text messages we send all contribute to the gigantic
heap that is unstructured data.
Examples of unstructured data include text, video, audio,

mobile activity, social media activity, satellite imagery,


surveillance imagery – the list goes on and on.
The Unstructured data is further divided into –
a. Captured data:
It is the data based on the user’s behavior. The best example
to understand it is GPS via smartphones which help the user
each and every moment and provides a real-time output.
b. User-generated data:
It is the kind of unstructured data where the user itself will
put data on the internet every movement. For example,
Tweets and Re-tweets, Likes, Shares, Comments, on
Youtube, Facebook, etc.
Now day organizations have wealth of data available with
them but unfortunately, they don't know how to derive value
out of it since this data is in its raw form or unstructured
format.
Examples Of Un-structured Data
The output returned by 'Google Search‘
3. Semi-structured
Semi-structured data can contain both the forms of data.
 Information that is not in the traditional database format as

structured data, but contains some organizational properties which


make it easier to process, are included in semi-structured data.
. For example, NoSQL documents are considered to be semi-

structured, since they contain keywords that can be used to process


the document easily.

We can see semi-structured data as a structured in form but it is


actually not defined with e.g. a table definition in relational
DBMS.
 Example of semi-structured data is a data represented in an XML

file.
The line between unstructured data and semi-structured data has

always been unclear since most of the semi-structured data appear


to be unstructured at a glance
Example Of Semi-structured Data
Personal data stored in an XML file-
<rec><name>PrashantRao</name><sex>Male</sex><age>35</age></rec>
<rec><name>Seema R.</name><sex>Female</sex><age>41</age></rec>
<rec><name>Satish Mane</name><sex>Male</sex><age>29</age></rec>
<rec><name>Subrato Roy</name><sex>Male</sex><age>26</age></rec>
<rec><name>Jeremiah J.</name><sex>Male</sex><age>35</age></rec>
Data Growth over the years
Difference between Structured, Semi-structured
and Unstructured data
Semi-structured Unstructured
Factors Structured data
data data

It is more flexible than


It is flexible in nature
It is dependent and less structured data but less
Flexibility and there is an
flexible than flexible than
absence of a schema
unstructured data
Matured transaction and The transaction is No transaction
Transaction
various concurrency adapted from DBMS management and no
Management
technique not matured concurrency
Queries over
Query Structured query allow An only textual
anonymous nodes are
performance complex joining query is possible
possible
This is based on
It is based on the It is based on RDF and
Technology character and library
relational database table XML
data
Thank You!!!

You might also like