Big Data
Big Data
Big Data
Big data is data or datasets that are so large or complex that traditional data processing
applications are not sufficient and distributed databases are required. Companies like Google,
eBay, LinkedIn and Facebook were all built around big data from the start. Industry 4.0 is the
fusion of the real world with the virtual world. This digital revolution is marked by technology
that takes advantage of Big Data and Artificial Intelligence (AI) to nurture automatic learning
systems. It is a collection of huge and complex datasets, including huge amounts of data, social
media analysis, data management functions, real-time data and more. Big Data is a data whose
complexity requires new techniques, algorithms, and analytics to manage it and extract value and
hidden knowledge from it. We need a different platform named Hadoop as the core platform for
structuring Big Data, and solves the problem of making it useful for analytics purposes.
Challenges include sensor design, data collection, storage, sharing, archiving, analytics,
visualization, privacy of information, and more. with traditional and technical tools. Modern
factories are becoming increasingly complex and connected, bringing new challenges that can be
addressed through big data-driven automation. The big data volumes resulting from the IoT and
modern computing systems have enabled smart factories to rapidly increase their efficiency and
achieve significant benefits in terms of longer uptime, faster production and fewer errors. The
networking possibilities made possible by the IoT have created very large and complex
information networks. Without Big Data, 4.0 technologies would not be able to decode all this
information and extract value from it to learn, generate models for predictive analytics and act
autonomously and accurately. Without Big Data there would be no Industry 4.0 and the smart
technologies that support it.
1.0 Introduction
Big data has become an integral part of our digital world, transforming the way we collect,
process, and utilize information. Its sheer volume, velocity, and variety demand innovative
approaches to storage, analysis, and decision-making. By effectively harnessing the power of big
data through advanced technologies, organizations and researchers can uncover valuable
insights, drive innovation, and stay competitive in today's data-driven landscape.
1.1 Big Data
Big data is a term that refers to large sets of complex data that can be used to analyses patterns,
trends, and relationships. It is used to improve decision-making, identify customer behavior, and
drive business strategies. Companies are increasingly relying on big data to gain insights into
their customers, products, and processes. This paper will examine how companies can use big
data to gain insights into their customer base, product offerings, and business strategies.
• Data overload: big data analysis often involves large amounts of data that are difficult to
process and manage. This can lead to data overload, which in turn can create confusion
and difficulty in data analysis.
• Data quality: Big data analysis is highly dependent on the quality of the data. If the data
is of poor quality or contains errors, this may lead to incorrect conclusions being drawn
from the analysis.
• Security and Privacy: Big data analysis often requires the collection and storage of
large amounts of personal data. As a result, security and privacy issues need to be
addressed to ensure that personal data is not misused or exposed.
• Cost: Big data analytics can be expensive due to the hardware and software required to
process and analyze the data.
• Complexity: Big data analytics can be complex and requires specialized knowledge and
skills, which can be difficult to achieve.
The research objective of big data can vary based on the specific context and goals of the study.
However, here are a few common research objectives in the field of big data:
• To examine the impact of big data in the context of Industrial Revolution 4.0, specifically
in e-business areas.
• To explore the concept of big data and its relevance to Industrial Revolution 4.0 in the
context of e-business areas.
1.7 Methodology
All the works were done by the following way:
Planning
1.7.1 Planning: At first, we made a plan about how we will work for it.
1.7.2 Data collection: We start collecting data after planning and fulfilling requirements.
There is no primary data. We took the references from various website. All the
information provided on this term paper is from secondary sources, such as various links,
web pages, and some books
1.7.3 Data analyzing: After collecting data we start analyzing those data. We eliminate
unnecessary data and process those data for analyzing.
1.7.4 Result: After proper data collecting and analyzing we got the result and started making
our term paper.
1.8 Limitations
There were some limitations while making this study. The limitations are given below:
1. Lack of reliable sources: Relevant data was missing in the internet. Fake data were more in
online.
Book:
❖ Viktor Mayer-Schönberger and Kenneth Neil Cukier in their book “Big Data: a
revolution that will transform how we live, work, and think” talked about some facts of
big data.
❖ Sholom M. Weiss, Nitin Indurkhya wrote about Big Data in future days in their book
“Predictive Data Mining”
❖ “Cloud is the digital wonderland of Internet of Things, powered by Artificial
Intelligence and Big Data”- Enamul Haque, Digital Transformation Through Cloud
Computing: Developing a sustainable business strategy to eschew extinction.
Journal:
❖ “Every company has big data in its future and every company will eventually be in the
data business.” – Thomas H. Davenport, President’s Distinguished Professor in IT and
Management.
❖ “Information has gone from scarce to superabundant. That brings huge new benefits”,
says Kenneth Cukier (American Journalist).
1) Structured data consists of numbers and text that can be easily classified and studied.
These data are generated by network sensors embedded in electronic devices,
smartphones, and global positioning system (GPS) devices, among other things.
Structured data also includes sales numbers, account balances, and transaction
information.
2) Unstructured data includes more sophisticated information like comments on social
networking sites, images and other multimedia, and customer evaluations from
commercial websites. These data are difficult to categorize or analyze mathematically.
3) Semi-structured data can contain both the forms of data. We can see semi-structured
data as a structured in form but it is actually not defined with e.g. a table definition in
relational DBMS. Example of semi-structured data is a data represented in an XML file.
3) Velocity: Data in motion. The speed at which data is created, processed and
analyzed continues to accelerate.
3.1.3 Importance of Big Data
Reduction in Cost: Big data can significantly reduce costs for businesses in various ways. By
analyzing large volumes of data, organizations can identify areas of inefficiency, streamline
operations, and optimize resource allocation.
Time Consumption: Big data analytics can help save time by automating and accelerating
various processes. With the ability to process and analyze vast amounts of data quickly,
organizations can gain actionable insights in real time. This allows for faster decision-making
and problem-solving
Decision Making: Big data provides organizations with valuable insights that drive informed
decision-making. By analyzing large and diverse datasets, organizations can uncover patterns,
trends, and correlations that were previously hidden. These insights can guide strategic planning,
product development, marketing campaigns, and customer segmentation.
New Product Development in the Market: Big data plays a crucial role in the development of
new products and services. By analyzing customer behavior, market trends, and competitor data,
organizations can identify unmet needs and market gaps. This information can inform the design
and development of innovative products and services that align with customer preferences.
Real-Time Failure Measurement: Big data enables real-time monitoring and measurement of
systems and processes, allowing organizations to detect failures or anomalies immediately. By
analyzing data from various sources, such as sensors or log files, organizations can identify
potential failures before they occur or escalate.
Risk Management: Big data analytics plays a vital role in risk management. By analyzing large
datasets, organizations can identify and assess potential risks, such as fraud, cybersecurity
threats, or supply chain disruptions.
Business intelligence: Big data analytics can assist companies in learning about industry trends,
rivalry, and commercial prospects. This can aid businesses in making wise decisions and
enhancing their competitiveness.
Operational Efficiency: Big data analytics can help businesses streamline their processes and
increase productivity. Increased productivity and cost savings may result from this.
Innovation: Big data analytics can provide light on cutting-edge products and fresh commercial
strategies. This can assist businesses in innovating and developing new items.
2. Time Management: Data sets that could take hours to calculate are calculated in seconds or
minutes by Macy's merchandise pricing optimization software.
3. Assistance with Corporate Decisions: Big data's major goal is to support internal business
choices like what kinds of new items should be made available to consumers. How much stock
ought to be held back? And how much must our item cost.?
4. Creating Fresh Big Data-Based Products: Big data must be utilized to develop new goods
and services. The best example is LinkedIn, which has developed goods and solutions such as
jobs you may be interested in, who has viewed my profile, individuals you may know, and many
others using big data. People have been drawn to LinkedIn by these concepts.
• This is one of the biggest and most publicized areas of big data use today.
• Here, big data is used to better understand customers and their behaviors and preferences.
• Companies are keen to expand their traditional data sets with social media data, browser
logs as well as text analytics and sensor data to get a more complete picture of their
customers.
• The big objective, in many cases, is to create predictive models. Using big data, Telecom
companies can now better predict customer churn; Wal-Mart can predict what products
will sell, and car insurance companies understand how well their customers actually
drive.
• The clinical trials of the future won’t be limited by small sample sizes but could
potentially include everyone.
• Big data techniques are already being used to monitor babies in a specialist premature
and sick baby unit.
• By recording and analyzing every heart beat and breathing pattern of every baby, the unit
was able to develop algorithms that can now predict infections 24 hours before any
physical symptoms appear.
• Big data analytics help machines and devices become smarter and more autonomous.
• For example, big data tools are used to operate Google’s self-driving car. The Toyota
Prius is fitted with cameras, GPS as well as powerful computers and sensors to safely
drive on the road without the intervention of human beings.
• Big data is applied heavily in improving security and enabling law enforcement.
• Others use big data techniques to detect and prevent cyber-attacks.
• Police forces use big data tools to catch criminals and even predict criminal activity and
credit card companies use big data use it to detect fraudulent transactions.
• Big data is used to improve many aspects of our cities and countries.
• For example, it allows cities to optimize traffic flows based on real time traffic
information as well as social media and weather data.
• A number of cities are currently piloting big data analytics with the aim of turning
themselves into Smart Cities, where the transport infrastructure and utility processes are
all joined up, where a bus would wait for a delayed train and where traffic signals predict
traffic volumes and operate to minimize jams.
Rapid Growth of Big Data: The volume, velocity, and variety of data are growing at an
unprecedented rate. This rapid growth poses challenges in terms of storage, processing, and
analysis. Scaling up the computational resources and implementing distributed processing
frameworks become essential to cope with the rapid growth of big data.
Data storage: Keeping big amounts of data in storage can be expensive and need a lot of storage
space. To manage massive data, organizations must invest in storage infrastructure.
Unreliable data: Big data encompasses data from various sources, including structured and
unstructured data, social media feeds, user-generated content, and more. This diversity
introduces challenges related to data quality and reliability. Unreliable data can arise due to
errors, inconsistencies, biases, or incomplete information.
Data quality: It can be difficult to confirm the reliability and validity of huge data. Problems
with data quality might result in inaccurate conclusions and judgments.
Data security: Big data frequently contains sensitive and private information. Big data
protection from hacks and breaches is a difficult task for businesses.
Customer Insights: Big data analytics can help organizations gain insights into customer
behavior, preferences, and needs. By analyzing customer data, organizations can identify
patterns and trends that can help them personalize their products and services to better meet
customer needs.
Market Trends: Big data analytics can help organizations gain insights into market trends,
including changes in consumer behavior, emerging technologies, and new business models. This
can help organizations stay ahead of the competition and identify new business opportunities.
Operational Efficiency: Big data analytics can help organizations optimize their operations and
improve efficiency. By analyzing operational data, organizations can identify inefficiencies and
areas for improvement, which can lead to cost savings and increased productivity.
Fraud Detection: Big data analytics can help organizations detect and prevent fraud. By
analyzing large volumes of data, organizations can identify patterns and anomalies that may
indicate fraudulent activity.
Predictive Analytics: Big data analytics can help organizations predict future trends and
outcomes. By analyzing historical data, organizations can identify patterns and make predictions
about future events, such as customer behavior or market trends.
➨ Big data analysis is not useful in short run. It needs to be analyzed for longer duration to
leverage its benefits.
• Improved efficiency and productivity in e-business areas through big data analytics.
• Enhanced decision-making capabilities by utilizing data-driven insights.
• Personalization and customization of products and services based on customer
preferences.
• Improved customer experience and satisfaction through targeted marketing and
personalized recommendations.
• Enhanced supply chain management through real-time tracking and predictive analytics.
• Efficient resource allocation and optimization of business processes.
• Better risk management and identification of market trends.
• Increased competitiveness and innovation in the e-business sector.
Industrial Big Data analytics use cases: Leveraging machine learning and predictive analytics
algorithms, industrial big data can help to create value in various use case scenarios like
predictive maintenance, product quality prediction in early stages of the production process and
product quality optimization, prediction and prevention of critical situation in continuous
production processes, prediction of product lifetime, assembly plan prediction for new 3D
product designs, energy demand prediction, demand forecasting, price forecasting, and many
other use cases.
The way Big Data did this lead to the fourth industrial revolution: The interconnection
between systems and computers and the ability to analyze large amounts of data have made
possible the existence of intelligent machines that can make informed decisions without any
human involvement. The Internet of Things (IoT) has been connecting elements for many years,
but the value extracted from the data through the Big Data has taken the term to a new level: The
Internet of Systems.
These are just some of the benefits of Big Data analysis in production systems, but there are
many more; improved security, load optimization, supply chain management, non-conformity
analysis etc.
5.2 Recommendation
While the study goes through many different aspects of big data, like the three elements of big
data volume, variety, velocity and it also offers four principal recommendations for organizations
to progress their big data strategy and maximize the business value from enterprise data. Those
four recommendations include:
1. Implement robust data governance frameworks to ensure ethical and responsible use of big
data in e-business areas.
2. Explore advanced analytics techniques such as machine learning and predictive modeling to
gain actionable insights and optimize e-business operations.
3. Develop data-driven business strategies that leverage big data to drive innovation, improve
customer experiences, and achieve competitive advantage.
4. Conduct regular evaluations and assessments of big data initiatives to measure their impact
and identify areas for improvement.
References
[1] “Role of Big Data in Industry 4.0 and Beyond.” Nasscom the Official Community of Indian
IT Industry, community.nasscom.in/communities/big-data-analytics/role-big-data-
[2] Taylor, David. “Introduction to BIG DATA: What Is, Types, Characteristics & Example.”
[3] Yumpu.com. “Draft Big Data Strategy - Australian Government Information And ...”
Yumpu.com, www.yumpu.com/en/document/view/31766155/draft-big-data-strategy-
[4] Naimi, A. I., and D. J. Westreich. “Big Data: A Revolution That Will Transform How We
Live, Work, and Think.” American Journal of Epidemiology, vol. 179, no. 9, 8 Apr.
causal.unc.edu/files/2011/08/Naimi2014BigDataRevolutionReview.pdf,
https://doi.org/10.1093/aje/kwu085.
[5] Weiss, Sholom M., and Nitin Indurkhya. Predictive Data Mining: A Practical Guide. Google
books.google.com.bd/books?id=xzVD8C2YpnQC&printsec=frontcover&redir_esc=y#v=
www.goodreads.com/book/show/52669803-digital-transformation-through-cloud-
report/2010/02/27/data-data-everywhere.