WOLAITA SODO UNIVERSITY
SCHOOLS OF INFORMATICS
DEPARTMENT OF INFORMATION SYSSTEM
SEMINAR REPORT ON
BIG DATA
Date 30,06,2014 e.c
INTRODUCTION
Big Data is a collection of data that is huge in volume, yet growing
exponentially with time.
it is a data with so large size and complexity that none of traditional data
management tools can store it or process it efficiently.
Big data is also a data but with huge size.
Big data is a concept used to describe
a large volume of data, which are both structured and unstructured.
it increased day to day by any system or business
is collection of data sets large and complex to process using on-hand
database management tools or traditional data processing application.
TYPES OF BIG DATA
Structured: Any data that can be stored, accessed and processed in the
form of fixed format is termed as a ‘structured’ data.
Unstructured: Any data with unknown form or the structure is classified as
unstructured data.
Semi-structured: Semi-structured data can contain both the forms of data.
THE 3V’S OF BIG DATA
Volume: Organizations and firms gather as well as pull together
different data from different sources so this loads our volume so we
can handle by using technology like Apache spark.
Velocity: Data is now streaming at an exceptional speed, which has
to be dealt with suitably.
Variety: The releases of data from various systems have diverse
HOW IT WORKS
A big data architecture is designed to handle:-
Ingestion
processing
analysis of data that is too large or complex for traditional database systems.
Advantage of big data
Businesses can utilize outside intelligence while taking decisions Access
to social data from search engines and sites like Facebook, twitter are
enabling organizations to fine tune their business strategies.
Improved customer service.
Early identification of risk to the product/services, if any.
Better operational efficiency.
Disadvantage of big data
Rapid Data Growth: The growth velocity at such a high rate creates a
problem to look for insights using it. There no 100% efficient way to filter
out relevant data.
Storage: The generation of such a massive amount of data needs space
for storage, and organizations face challenges to handle such extensive
data without suitable tools and technologies.
Unreliable Data: It cannot be guaranteed that the big data collected and
analyzed are totally (100%) accurate. Redundant data, contradicting data, or
incomplete data are challenges that remain within it.
Data Security: Firms and organizations storing such massive data (of users) can
be a target of cybercriminals, and there is a risk of data getting stolen. Hence,
encrypting such colossal data is also a challenge for firms and organizations.
Application area of big data
Health care
to personalized medicine and prescriptive analytics due to the role of
big data systems.
Media and entertainment
To creating, advertising, and distributing their content using new
business models.
the customer requirements to view digital content from any location
and at any time.
Customer use different media like Facebook, YouTube, telegram etc.…
IoT(INTERNET OF THING)
IoT devices generate continuous data and send them to a server on a
daily basis.
These data are mined to provide the interconnectivity of devices.
ANY QUESTION ?