
Types of Bigdata: Advance Database Techniques (CP402)

This document discusses the different types of big data: structured, unstructured, and semi-structured data. Structured data has a fixed format and resides in defined fields; examples include machine-generated and human-generated data stored in databases. Unstructured data has no clear format and includes text, images, and videos from social media and sensors. Semi-structured data contains both structured and unstructured elements, like data stored in XML files. The types differ in their flexibility and the technologies used to manage them.

Uploaded by

Darshan Tank
Copyright
© All Rights Reserved

Advance Database Techniques (CP402)

Types of Bigdata

Khushali Jivani (18CP310)


Snowy Machhar (18CP312)
What is Bigdata?
 Big data is still data, but of a huge size.
 It is a collection of data that is huge in volume and yet keeps growing exponentially with time.
 Examples-
Types Of Bigdata

Structured data

Unstructured data

Semi-Structured data
Structured data
 Data that can be stored, accessed and processed in a fixed format
 Data that resides in fixed fields within a record or file
 Sources of structured data:
 1) Machine-generated data:
This includes data from medical devices, GPS data, usage statistics captured by servers and applications, and the
huge amount of data that usually moves through trading platforms.

 2) Human-generated data:
This includes all the data a human inputs into a computer.
Companies can use it to understand their customers' behavior and make appropriate decisions and
modifications.
Example

[Employee table that contains information about employees]
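Since the original table image is not available, the idea can be illustrated with a minimal sketch using Python's built-in sqlite3 module. The table name, column names and rows below are invented for illustration and are not taken from the slide.

```python
import sqlite3

# Hypothetical employee table: the columns and rows are assumptions made
# for this sketch, not the contents of the original slide.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE employee ("
    "id INTEGER PRIMARY KEY, "
    "name TEXT NOT NULL, "
    "department TEXT NOT NULL, "
    "salary INTEGER NOT NULL)"
)
rows = [
    (1, "Asha", "Finance", 52000),
    (2, "Ravi", "Admin", 41000),
    (3, "Meera", "Finance", 61000),
]
conn.executemany("INSERT INTO employee VALUES (?, ?, ?, ?)", rows)

# Every record has the same fixed fields, so a query can address them by name.
finance_names = [
    name
    for (name,) in conn.execute(
        "SELECT name FROM employee WHERE department = ? ORDER BY id",
        ("Finance",),
    )
]
print(finance_names)
```

The fixed schema is what makes the data structured: a row that lacks a required field, such as the department, is rejected by the database rather than stored.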


Unstructured data
 It has no clear format in storage.
 An example of unstructured data is a heterogeneous data source containing a combination of
simple text files, images, videos, etc.

 Divided into two parts:

• 1) Captured data: data based on the user's behavior. Ex. GPS helps the user at every moment
and provides real-time output.
• 2) User-generated data: data that users themselves put on the internet every moment. Ex. tweets
and retweets, likes, shares and comments on YouTube, Facebook, etc.
 Sources of unstructured data:
 Human-generated unstructured data: social media data, mobile data, and website content
 Machine-generated unstructured data: satellite images, scientific data from various
experiments, and radar data

Example

[The output returned by 'Google Search']
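To make the lack of a fixed format concrete, here is a small sketch in Python. The posts are invented sample text standing in for user-generated content, and the hashtags helper is a hypothetical name introduced only for this example. With no fields to query, the program has to search the raw text for patterns.

```python
import re
from collections import Counter

# Invented free-text posts; there are no columns or fields to rely on.
posts = [
    "Loved the new phone camera! #photography",
    "Traffic near the station is terrible today",
    "Sharing my trip pictures soon #travel #photography",
]


def hashtags(text):
    """Return the hashtags found in a piece of free text, lowercased."""
    return re.findall(r"#(\w+)", text.lower())


# Count how often each hashtag occurs across all posts.
tag_counts = Counter(tag for post in posts for tag in hashtags(post))
print(tag_counts.most_common())
```

Pattern matching like this is only one assumed way of pulling information out of text; images and videos need entirely different techniques.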


Semi-structured data
 Semi-structured data can contain both forms of data.
 Semi-structured data looks structured in form, but its structure is not actually defined by a fixed schema.
 An example of semi-structured data is data represented in an XML file.
Example

[Personal data stored in an XML file]
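The original XML image is not available, so the following is a minimal sketch with invented records, parsed with Python's standard xml.etree.ElementTree module. The element names (people, person, name, sex, age) are assumptions chosen for illustration.

```python
import xml.etree.ElementTree as ET

# Invented personal records. The tags give each value a label, but nothing
# forces every record to carry the same set of fields.
xml_text = """
<people>
  <person><name>Asha</name><sex>Female</sex><age>35</age></person>
  <person><name>Ravi</name><sex>Male</sex></person>
</people>
"""

root = ET.fromstring(xml_text.strip())

# Turn each <person> element into a dictionary of its child tags and text.
records = [
    {child.tag: child.text for child in person}
    for person in root.findall("person")
]
print(records)
```

The second record has no age element and is still accepted, which is the sense in which the data is organized by tags but not bound to a fixed schema.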


Difference between Structured, Unstructured and Semi-structured data

Factors | Structured data | Unstructured data | Semi-structured data
Flexibility | It is schema-dependent and less flexible | It is flexible in nature and there is an absence of a schema | It is more flexible than structured data but less flexible than unstructured data
Technology | It is based on the relational database table | It is based on character and binary data | It is based on XML
References
 https://www.knowledgehut.com/blog/big-data/types-of-big-data
 https://www.guru99.com/what-is-big-data.html
 https://www.upgrad.com/blog/what-is-big-data-types-characteristics-benefits-and-examples/
Thank You
