Besufekad BIG DATA 1
SCHOOL OF INFORMATICS
DEPARTMENT OF INFORMATION SYSTEM
A SEMINAR REPORT
ON BIG DATA
PREPARED BY STUDENT
BESUFEKAD ALIYU
Table of Contents
INTRODUCTION
WHAT IS BIG DATA
How big data was developed and who are the contributors
How big data works
Advantages of Big Data
Disadvantages of Big Data
Applications of Big Data
CONCLUSION
Reference
BIG DATA
SEMINAR REPORT
INTRODUCTION
Big Data (BD) is one of the most prominent ideas in technology today.
WHAT IS BIG DATA
Definition: The term "Big Data" refers to the evolution and use of technologies
that provide the right user at the right time with the right information from a mass of
data that has been growing exponentially for a long time in our society. The
challenge is not only to deal with rapidly increasing volumes of data but also to
manage increasingly heterogeneous formats as well as increasingly
complex and interconnected data. Because Big Data is a complex, polymorphic object, its
definition varies according to the communities that are interested in it as users or
providers of services. Invented by the giants of the web, Big Data presents itself
as a solution designed to give everyone real-time access to giant databases.
Big Data is a very difficult concept to define precisely, since the very notion of "big"
in terms of data volume varies from one area to another. It is not defined by a set
of technologies; on the contrary, it defines a category of techniques and
technologies. This is an emerging field, and as we seek to learn how to implement
this new paradigm and harness its value, the definition is changing.
How big data was developed and who are the contributors
The term ‘Big Data’ has been in use since the early 1990s. Although it is not exactly
known who first used the term, most people credit John R. Mashey (who at the time
worked at Silicon Graphics) for making the term popular. In essence, Big Data
is not completely new, nor is it confined to the last two decades. Over the course
of centuries, people have been trying to use data analysis and analytics techniques to
support their decision-making process. To understand the context of Big Data
today, it is important to understand how each phase of this history contributed to the
contemporary meaning of Big Data.
How big data works
Big data analytics refers to collecting, processing, cleaning, and analyzing large datasets to help
organizations operationalize their big data.
Collect Data
Data collection looks different for every organization. With today’s technology, organizations
can gather both structured and unstructured data from a variety of sources — from cloud
storage to mobile applications to in-store IoT sensors and beyond.
Process Data
Once data is collected and stored, it must be organized properly to get accurate results on
analytical queries, especially when it’s large and unstructured. One processing option is batch
processing, which looks at large data blocks over time. Batch processing is useful when there
is a longer turnaround time between collecting and analyzing data. Stream processing looks at
small batches of data at once, shortening the delay time between collection and analysis for
quicker decision-making.
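The difference between the two processing options can be illustrated with a minimal Python sketch. The event data, the function names `batch_totals` and `stream_totals`, and the `batch_size` parameter are all hypothetical and chosen only for illustration; real systems would rely on dedicated processing frameworks rather than plain loops.

```python
from collections import defaultdict

# Hypothetical sample events (store_id, sale_amount); not taken from the report.
EVENTS = [("A", 10.0), ("B", 5.0), ("A", 7.5), ("C", 2.0), ("B", 1.0)]


def batch_totals(events):
    """Batch processing: wait for the whole dataset, then compute once."""
    totals = defaultdict(float)
    for store, amount in events:
        totals[store] += amount
    return dict(totals)


def stream_totals(events, batch_size=2):
    """Stream processing: update running totals after each small batch.

    Yields a snapshot of the totals after every `batch_size` events, so a
    result is available shortly after the data arrives.
    """
    totals = defaultdict(float)
    buffer = []
    for event in events:
        buffer.append(event)
        if len(buffer) == batch_size:
            for store, amount in buffer:
                totals[store] += amount
            buffer.clear()
            yield dict(totals)
    if buffer:  # flush any events left over at the end
        for store, amount in buffer:
            totals[store] += amount
        yield dict(totals)


if __name__ == "__main__":
    print("batch result:", batch_totals(EVENTS))
    for snapshot in stream_totals(EVENTS):
        print("stream snapshot:", snapshot)
```

Both approaches arrive at the same final totals. The streaming version simply makes intermediate results available earlier, which is the shorter delay between collection and analysis described above.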
Clean Data
Data, big or small, requires scrubbing to improve data quality and get stronger results; all data
must be formatted correctly, and any duplicate data must be removed. Dirty data can obscure and
mislead, creating flawed insights.
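As a rough illustration of the cleaning step, the sketch below normalizes formatting and removes duplicates from a small list of records. The record fields (`name`, `email`), the sample data, and the choice to treat the normalized email as the duplicate key are assumptions made for this example only.

```python
# Hypothetical raw records with inconsistent formatting and one duplicate.
RAW_RECORDS = [
    {"name": "  alice  smith", "email": "Alice@Example.com "},
    {"name": "Alice Smith", "email": "alice@example.com"},
    {"name": "carol  diaz", "email": "carol@example.com"},
]


def clean_records(records):
    """Format each record consistently and keep only the first of any duplicates."""
    seen_emails = set()
    cleaned = []
    for record in records:
        # Collapse repeated whitespace and use consistent capitalization.
        name = " ".join(record["name"].split()).title()
        email = record["email"].strip().lower()
        # Assumption: two records with the same normalized email are duplicates.
        if email in seen_emails:
            continue
        seen_emails.add(email)
        cleaned.append({"name": name, "email": email})
    return cleaned


if __name__ == "__main__":
    for row in clean_records(RAW_RECORDS):
        print(row)
```

Formatting is normalized before the duplicate check on purpose: the first two records only become recognizable as duplicates once their emails are written the same way.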
Analyze Data
Getting big data into a usable state takes time. Once it’s ready, advanced analytics processes
can turn big data into big insights. Some of these big data analysis methods include:
Data mining sorts through large datasets to identify patterns and relationships by identifying
anomalies and creating data clusters.
Predictive analytics uses an organization’s historical data to make predictions about the future,
identifying upcoming risks and opportunities.
Deep learning imitates human learning patterns by using artificial intelligence and machine
learning to layer algorithms and find patterns in the most complex and abstract data. (What Is
Big Data? How Does Big Data Work? | Built In n.d.)
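Two of the analysis methods listed above can be sketched with very simple stand-ins. The example below flags anomalies with a z-score rule (one basic form of anomaly identification in data mining) and forecasts the next value with a least-squares trend line (one basic form of predictive analytics). The function names, the threshold of 2.0, and the sample figures are assumptions for illustration; production analytics would use far richer models.

```python
from statistics import mean, pstdev


def find_anomalies(values, threshold=2.0):
    """Return values lying more than `threshold` standard deviations from the mean."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:  # all values identical, so nothing stands out
        return []
    return [v for v in values if abs(v - mu) / sigma > threshold]


def forecast_next(history):
    """Fit a straight line through (0, y0), (1, y1), ... and extrapolate one step."""
    n = len(history)
    if n < 2:
        raise ValueError("need at least two historical points")
    xs = range(n)
    x_bar = mean(xs)
    y_bar = mean(history)
    slope = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, history)) / sum(
        (x - x_bar) ** 2 for x in xs
    )
    intercept = y_bar - slope * x_bar
    return intercept + slope * n


if __name__ == "__main__":
    # Hypothetical daily transaction counts with one unusual day.
    print(find_anomalies([10, 11, 9, 10, 12, 10, 11, 50]))
    # Hypothetical monthly sales following a steady upward trend.
    print(forecast_next([100, 110, 120, 130]))
```

The z-score rule assumes values cluster around a single mean, and the trend line assumes roughly linear growth. Both assumptions are deliberately simple and would need checking against real historical data.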
Advantages of Big Data
Enhanced decision-making: Big Data provides organizations with access to a vast amount of
information from various sources, enabling them to make data-driven decisions. By analyzing
patterns, trends, and correlations within the data, businesses can gain valuable insights that
guide their strategies.
Improved operational efficiency: Big Data analytics helps organizations optimize their
operations by identifying inefficiencies, bottlenecks, and areas for improvement. By
streamlining processes and workflows, businesses can enhance productivity, reduce costs, and
deliver better results.
Personalization and customer insights: Big Data enables organizations to understand their
customers at a deeper level. By analyzing customer data, businesses can identify preferences,
behaviors, and needs, allowing them to personalize products, services, and marketing
campaigns to enhance the customer experience.
Cost savings: Big Data analytics can uncover cost-saving opportunities for organizations. By
analyzing data related to operations, supply chains, and resource allocation, businesses can
identify areas where costs can be minimized, such as optimizing inventory management or
reducing energy consumption.
Competitive advantage: Leveraging Big Data effectively can provide organizations with a
competitive edge. By analyzing market trends, consumer behavior, and competitor data,
businesses can identify new opportunities, innovate, and stay ahead of the competition.
Improved risk management: Big Data analytics helps organizations identify and mitigate
risks. By analyzing historical and real-time data, businesses can anticipate potential risks, such
as fraud, cybersecurity threats, or supply chain disruptions, and implement proactive measures
to address them.
Innovation and product development: Big Data fuels innovation by providing insights into
market demands, customer preferences, and emerging trends. Organizations can leverage these
insights to develop new products, enhance existing ones, and align their offerings with evolving
customer needs. (TheKnowledgeAcademy n.d.)
Disadvantages of Big Data
Data quality and reliability: Big Data poses challenges related to data quality and reliability.
The volume and variety of data sources can lead to issues such as incomplete or
inconsistent data, which may affect the accuracy and validity of analysis.
Data overload and complexity: Dealing with massive volumes of data can be overwhelming.
Managing and processing large datasets requires advanced infrastructure, tools, and expertise.
The complexity of Big Data can make it challenging to extract meaningful insights efficiently.
High implementation and maintenance costs: Implementing Big Data solutions can be
costly for organizations. Investments in infrastructure, technology, and skilled personnel are
required, which may pose challenges, especially for small and medium-sized businesses.
Ethical considerations: Using Big Data raises ethical concerns regarding privacy, consent,
and the potential misuse of personal information. Organizations must establish ethical
frameworks, transparency, and governance practices to ensure responsible data usage.
Data governance and compliance: Big Data introduces challenges in terms of data
governance and compliance with regulations. Organizations need to navigate complex data
governance frameworks and make sure that they comply with data protection laws and
industry-specific regulations.
Applications of Big Data
Tracking Customer Spending Habits and Shopping Behavior: In big retail stores (like Amazon,
Walmart, Big Bazar, etc.), the management team has to keep data on customers' spending habits
(which products customers spend on, which brands they prefer, how frequently they spend),
shopping behavior, and most-liked products (so that the store can keep those products in
stock).
Recommendation: By tracking customers' spending habits and shopping behavior, big retail stores
provide recommendations to the customer. E-commerce sites like Amazon, Walmart, and
Flipkart do product recommendation. They track what products a customer is searching for and,
based on that data, recommend that type of product to that customer.
Smart Traffic System: Data about traffic conditions on different roads is collected
through cameras placed beside the road and at the entry and exit points of the city, and
through GPS devices placed in vehicles (Ola, Uber cabs, etc.).
Secure Air Traffic System: Sensors are present at various parts of an aircraft (such as the
propeller).
Auto Driving Car: Big data analysis helps drive a car without human intervention. Cameras
and sensors placed at various spots on the car gather data such as the size of surrounding cars
and obstacles, the distance from them, etc. (Applications of Big Data 2019)
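The recommendation application described above (tracking what a customer searches for and suggesting that type of product) can be sketched in a few lines. This is only a toy illustration: the catalog, the category labels, and the `recommend` function are hypothetical, and real e-commerce recommenders are considerably more sophisticated.

```python
from collections import Counter

# Hypothetical product catalog mapping each product to its type (category).
CATALOG = {
    "running shoes": "footwear",
    "trail shoes": "footwear",
    "sandals": "footwear",
    "yoga mat": "fitness",
    "dumbbells": "fitness",
}


def recommend(searches, catalog, limit=3):
    """Suggest unsearched products from the category the customer searched most."""
    categories = Counter(catalog[p] for p in searches if p in catalog)
    if not categories:
        return []  # no known products in the search history
    top_category = categories.most_common(1)[0][0]
    searched = set(searches)
    matches = [
        product
        for product, category in catalog.items()
        if category == top_category and product not in searched
    ]
    return matches[:limit]


if __name__ == "__main__":
    history = ["running shoes", "yoga mat", "trail shoes"]
    print(recommend(history, CATALOG))
```

Here the customer searched for footwear twice and fitness gear once, so the sketch suggests the remaining footwear item.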
CONCLUSION
The availability of Big Data, low-cost commodity hardware, and new information
management and analytic software have produced a unique moment in the history of
data analysis. The convergence of these trends means that we have the capabilities
required to analyze astonishing data sets quickly and cost-effectively for the first time in
history. These capabilities are neither theoretical nor trivial. They represent a genuine
leap forward and a clear opportunity to realize enormous gains in terms of efficiency,
productivity, revenue, and profitability. The Age of Big Data is here, and these are
truly revolutionary times if both business and technology professionals continue to
work together and deliver on the promise. (Conclusion - Big Data, Big Analytics n.d.)
Reference
“Applications of Big Data.” 2019. GeeksforGeeks. https://www.geeksforgeeks.org/applications-of-big-data/ (February 22, 2024).
“Conclusion - Big Data, Big Analytics: Emerging Business Intelligence and Analytic Trends for Today’s Businesses [Book].” https://www.oreilly.com/library/view/big-data-big/9781118239155/xhtml/Conclusion.html (February 22, 2024).
“What Is Big Data? How Does Big Data Work? | Built In.” https://builtin.com/big-data (February 22, 2024).