0% found this document useful (0 votes)
2 views20 pages

C-1.1 Types of Digital Data

The document discusses structured, unstructured, and semi-structured data, highlighting their definitions, pros and cons, tools, and use cases. Structured data is organized and easily processed, while unstructured data is more complex and requires specialized expertise. Big data encompasses all three types and has transformative applications across various industries, including education, insurance, banking, and government.

Uploaded by

seceh93562
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
2 views20 pages

C-1.1 Types of Digital Data

The document discusses structured, unstructured, and semi-structured data, highlighting their definitions, pros and cons, tools, and use cases. Structured data is organized and easily processed, while unstructured data is more complex and requires specialized expertise. Big data encompasses all three types and has transformative applications across various industries, including education, insurance, banking, and government.

Uploaded by

seceh93562
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 20

STRUCTURED DATA

Any data that can be stored, accessed and processed in the


form of fixed format is termed as a 'structured' data.

Examples Of Structured Data :TABLES


• Structured data is typically categorized as
quantitative data which is highly organized and
easily decipherable by machine learning algorithms.
• It was developed by IBM® in 1974, structured query
language (SQL) is the programming language used
to manage structured data.
• By using a relational (SQL) database, business users
can quickly input, search and manipulate structured
data.
Pros
• Easily used by machine learning (ML) algorithms: The specific and organized
architecture of structured data eases the manipulation and querying of ML data.
• Easily used by business users: Structured data do not require an in-depth
understanding of different types of data and how they function. With a basic
understanding of the topic relative to the data, users can easily access and interpret
the data.
• Accessible by more tools: Since structured data predates unstructured data, there are
more tools available for using and analyzing structured data.
Cons
• Limited usage: Data with a predefined structure can only be used for its intended
purpose, which limits its flexibility and usability.
• Limited storage options: Structured data are usually stored in data storage systems
with rigid schemas (for example, “data warehouses”). Therefore, changes in data
requirements necessitate an update of all structured data, which leads to a
massive expenditure of time and resources.
Structured data tools
• OLAP: Performs high-speed,
multidimensional data analysis from
unified, centralized data stores.
• SQLite: (link resides outside
ibm.com) Implements a self-contained,
serverless, zero-configuration, transactional
relational database engine.
• MySQL: Embeds data into mass-deployed
software, particularly mission-critical,
heavy-load production system.
• PostgreSQL: Supports SQL and JSON
querying as well as high-tier programming
languages (C/C+, Java, Python, among
others.).
Use cases for structured data
• Customer relationship management (CRM): CRM software runs structured
data through analytical tools to create datasets that reveal customer
behavior patterns and trends.
• Online booking: Hotel and ticket reservation data (for example, dates,
prices, destinations, among others.) fits the “rows and columns” format
indicative of the pre-defined data model.
• Accounting: Accounting firms or departments use structured data to process
and record financial transactions.
Unstructured Data

•Any data with unknown form is classified as unstructured


data. In addition to the size being huge, un-structured data
poses multiple challenges in terms of its processing for
deriving value out of it.

•Examples Of UnStructured Data : Output of Google search


• Unstructured data is typically categorized as qualitative data,
cannot be processed and analyzed through conventional data
tools and methods. Since unstructured data does not have a
predefined data model, it is best managed in
non-relational (NoSQL) databases. Another way to manage
unstructured data is to use data lakes to preserve it in raw
form.

• The importance of unstructured data is rapidly increasing.


Pros
• Native format: Unstructured data is stored in its native format, remains undefined until
needed. It enables data scientists to prepare and analyze only the data they need.
• Fast accumulation rates: Since there is no need to predefine the data, it can be collected
quickly and easily.
• Data lake storage: Allows for massive storage and pay-as-you-use pricing, which cuts costs
and eases scalability.

Cons
• Requires expertise: Due to its undefined or non-formatted nature, data science expertise is
required to prepare and analyze unstructured data. This is beneficial to data analysts but
alienates unspecialized business users who might not fully understand specialized data
topics or how to utilize their data.
• Specialized tools: Specialized tools are required to manipulate unstructured data, which
limits product choices for data managers.
Unstructured data tools
• MongoDB: Uses flexible documents to process data for cross-platform
applications and services.
• DynamoDB: Delivers single-digit millisecond performance at any scale
through built-in security, in-memory caching and backup and restore.
• Hadoop: Provides distributed processing of large data sets using simple
programming models and no formatting requirements.
• Azure: Enables agile cloud computing for creating and managing apps
through Microsoft’s data centers.
Use cases for unstructured data
• Data mining: Enables businesses to use unstructured data to identify
consumer behavior, product sentiment and purchasing patterns to better
accommodate their customer base.
• Predictive data analytics: Alert businesses of important activity ahead of
time so they can properly plan and accordingly adjust to significant market
shifts.
• Chatbots: Perform text analysis to route customer questions to the
appropriate answer sources.
S.NO Structured Data Unstructured
Data

1 Self-service access Requires data


science
expertise

2 Only select data types Many varied


types

3 Commonly stored in data Commonly


warehouses stored in data
lakes

4 Predefined format Native format


Semi-structured Data

•Semi-structured data can contain both the forms of data.


•Semi-structured data (for example, JSON, CSV, XML) is the “bridge”
between structured and unstructured data. It does not have a
predefined data model and is more complex than structured data, yet
easier to store than unstructured data.

•Examples Of Semi-Structured Data :


•Personal data stored in an XML file-
•<rec><name>PrashantRao</name><sex>Male</sex><age>35</age></rec>
• <rec><name>Seema R.</name><sex>Female</sex><age>41</age></rec>
<rec><name>Satish Mane</name><sex>Male</sex><age>29</age></rec>
Big data is a combination of structured,
semistructured and unstructured data
collected by organizations that can be
mined for information and used in
machine learning projects, predictive
modeling and other advanced analytics
applications.
Applications of
Big Data
• Government
• Manufacturing
• Healthcare
• Marketing
• Telecommunication and Media
• Education
• Financial and Banking System
• Insurance
• Internet of Things (IoT)
• Information technology
Big Data in Education
industry
•Following are some of the fields in education industry that have been
transformed by big data motivated changes
• Customized and dynamic learning programs
• Reframing course material
• Grading Systems
• Career prediction
Big Data in Insurance
industry
•Following are some of the fields in Insurance industry that have been
transformed by big data motivated changes

• Collecting information
• Gaining customer insight
• Fraud detection
• Threat mapping
Big Data in Banking Sector

•Study and analysis of big data can help detect –

• The misuse of credit cards


• Misuse of debit cards
• Business clarity
• Customer statistics alteration
• Money laundering
• Risk Mitigation

Big data in Government
industry
• Big data in government can have an enormous impact — local, national and global.

With so many complex issues on the table today, governments have their work cut out
trying to make sense of all the information they receive and make vital decisions that
affect millions of people.

You might also like