File 1
Big data analytics describes the process of uncovering trends, patterns, and correlations in
large amounts of raw data to help make data-informed decisions. These processes use familiar
statistical analysis techniques—like clustering and regression—and apply them to more extensive
datasets with the help of newer tools.
1. Collect Data
Data collection looks different for every organization. With today’s technology, organizations
can gather both structured and unstructured data from a variety of sources — from cloud storage to
mobile applications to in-store IoT sensors and beyond. Some data will be stored in data
warehouses where business intelligence tools and solutions can access it easily. Raw or unstructured
data that is too diverse or complex for a warehouse may be assigned metadata and stored in a data
lake.
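The metadata-tagging step above can be sketched in a few lines. This is a minimal illustration, not any particular data lake's API: the function name `tag_for_data_lake` and the metadata fields are hypothetical, chosen only to show raw data being wrapped with descriptive metadata before landing in a lake.

```python
import json
from datetime import datetime, timezone

def tag_for_data_lake(raw_payload: bytes, source: str) -> dict:
    """Wrap a raw payload with descriptive metadata before storing it."""
    return {
        "source": source,  # e.g., an in-store IoT sensor or mobile app
        "ingested_at": datetime.now(timezone.utc).isoformat(),
        "size_bytes": len(raw_payload),
        "payload": raw_payload.decode("utf-8", errors="replace"),
    }

record = tag_for_data_lake(b'{"temp_c": 21.4}', source="store-42/shelf-sensor")
print(json.dumps(record, indent=2))
```

The metadata (source, ingestion time, size) is what later lets analysts find and interpret otherwise opaque raw objects in the lake.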
2. Process Data
Once data is collected and stored, it must be organized properly to get accurate results on
analytical queries, especially when it’s large and unstructured. Available data is growing
exponentially, making data processing a challenge for organizations. One processing option is batch
processing, which looks at large data blocks over time. Batch processing is useful when there is a
longer turnaround time between collecting and analyzing data. Stream processing looks at small
batches of data at once, shortening the delay time between collection and analysis for quicker
decision-making. Stream processing is more complex and often more expensive.
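The contrast between the two processing styles can be sketched as follows. This is a toy illustration (the event values and function names are made up): batch processing produces one answer after the whole block is collected, while stream processing yields an updated answer as each event arrives.

```python
from statistics import mean

events = [3, 7, 2, 9, 4, 6]  # e.g., per-minute sales counts

# Batch processing: accumulate the whole block, then analyze it in one pass.
def batch_average(all_events):
    return mean(all_events)

# Stream processing: update a running result as each event arrives,
# so an answer is available long before the batch window closes.
def stream_averages(event_iter):
    total, count = 0, 0
    for value in event_iter:
        total += value
        count += 1
        yield total / count  # current best estimate after each event

print(batch_average(events))          # one answer at the end
print(list(stream_averages(events)))  # an answer after every event
```

Both approaches converge on the same final average; the difference is how soon a usable answer exists, which is exactly the trade-off the paragraph above describes.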
3. Clean Data
Data big or small requires scrubbing to improve data quality and get stronger results; all data
must be formatted correctly, and any duplicative or irrelevant data must be eliminated or accounted
for. Dirty data can obscure and mislead, creating flawed insights.
4. Analyze Data
Getting big data into a usable state takes time. Once it’s ready, advanced analytics processes can
turn big data into big insights. Some of these big data analysis methods include:
Data mining sorts through large datasets to identify patterns and relationships by identifying
anomalies and creating data clusters.
Predictive analytics uses an organization’s historical data to make predictions about the
future, identifying upcoming risks and opportunities.
Deep learning imitates human learning patterns by using artificial intelligence and machine
learning to layer algorithms and find patterns in the most complex and abstract data.
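As a concrete taste of the clustering step that data mining builds on, here is a tiny one-dimensional k-means sketch. It is deliberately simplified (real work would use a library such as scikit-learn); the function name and sample readings are made up for illustration.

```python
def kmeans_1d(values, k=2, iters=20):
    """Tiny 1-D k-means: group values into k clusters by repeatedly
    assigning each value to its nearest centroid and re-averaging."""
    centroids = sorted(values)[:: max(1, len(values) // k)][:k]
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for v in values:
            nearest = min(range(k), key=lambda i: abs(v - centroids[i]))
            clusters[nearest].append(v)
        centroids = [sum(c) / len(c) if c else centroids[i]
                     for i, c in enumerate(clusters)]
    return centroids, clusters

readings = [1.0, 1.5, 2.0, 10.0, 10.5, 11.0]
centroids, clusters = kmeans_1d(readings)
print(sorted(centroids))  # [1.5, 10.5]
```

The two centroids reveal the two natural groups in the readings; values far from every centroid would be candidates for the anomalies the text mentions.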
3. Big data analytics tools and technology
Big data analytics cannot be narrowed down to a single tool or technology. Instead, several types
of tools work together to help you collect, process, cleanse, and analyze big data. Some of the major
players in big data ecosystems are listed below.
Hadoop is an open-source framework that efficiently stores and processes big datasets on
clusters of commodity hardware. This framework is free and can handle large amounts of
structured and unstructured data, making it a valuable mainstay for any big data operation.
NoSQL databases are non-relational data management systems that do not require a fixed
schema, making them a great option for big, raw, unstructured data. NoSQL stands for “not
only SQL,” and these databases can handle a variety of data models.
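The schema-less idea can be illustrated with plain Python dictionaries standing in for documents (this mimics the document model in stores like MongoDB but uses no real database; the collection and field names are hypothetical):

```python
# Two "documents" in the same collection need not share a schema:
docs = [
    {"_id": 1, "user": "ada", "likes": 42},
    {"_id": 2, "user": "grace", "bio": "compiler pioneer", "tags": ["cobol"]},
]

def find(collection, **criteria):
    """Match documents on whatever fields they happen to have."""
    return [d for d in collection
            if all(d.get(k) == v for k, v in criteria.items())]

print(find(docs, user="grace"))  # matches even though its fields differ
```

A relational table would force both rows into one fixed set of columns; here each document carries only the fields that make sense for it.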
MapReduce is an essential component of the Hadoop framework, serving two functions. The
first is mapping, which filters and distributes data to various nodes within the cluster. The second
is reducing, which organizes and aggregates the results from each node to answer a query.
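The two phases can be mimicked on a single machine with a classic word count. This is a single-process sketch of the idea, not Hadoop itself: each input line stands in for a block handled by one node.

```python
from collections import defaultdict
from itertools import chain

lines = ["big data big insights", "data lake data warehouse"]

# Map: each "node" turns its block of input into (key, value) pairs.
mapped = chain.from_iterable(
    ((word, 1) for word in line.split()) for line in lines
)

# Shuffle/Reduce: group the pairs by key and combine the values per key.
counts = defaultdict(int)
for word, n in mapped:
    counts[word] += n

print(dict(counts))
# {'big': 2, 'data': 3, 'insights': 1, 'lake': 1, 'warehouse': 1}
```

In a real cluster the map outputs are produced on many machines in parallel and shuffled across the network before reduction; the logic per key is the same.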
YARN stands for “Yet Another Resource Negotiator.” It is another component of
second-generation Hadoop. This cluster management technology helps with job scheduling and
resource management in the cluster.
Spark is an open-source cluster computing framework that uses implicit data parallelism and
fault tolerance to provide an interface for programming entire clusters. Spark can handle both
batch and stream processing for fast computation.
Tableau is an end-to-end data analytics platform that allows you to prep, analyze,
collaborate, and share your big data insights. Tableau excels in self-service visual analysis,
allowing people to ask new questions of governed big data and easily share those insights
across the organization.
Common examples of unstructured and semi-structured data include:
Social media: Social media messages carry a semi-structured component (e.g., fields that do
not conform to a rigid data model but have some structure), but the content of each social
media message itself is unstructured.
Email: While we sometimes consider email semi-structured, its message fields are free-text
fields that are not easily analyzed. Email content may also include video, audio, or photos,
making the messages unstructured.
Text files: Almost all traditional business files — including word processing documents (e.g.,
Google Docs or Microsoft Word), presentations (e.g., Microsoft PowerPoint), notes, and
PDFs — are classified as unstructured data.
Survey responses: When open-ended feedback is gathered via survey (e.g., text box) or
through respondents selecting "liked" photos, unstructured data is being gathered.
Scientific data: Scientific data can include field surveys, space exploration, seismic imagery,
atmospheric data, topographic data, weather data, and medical data. While these types of data
may have a base structure for collection, the data itself is often unstructured and may not lend
itself to traditional analysis tools and dashboards.
Machine and sensor data: Billions of small files from IoT (Internet of Things) devices, such
as mobile phones and iPads, generate significant amounts of unstructured data. Business
systems’ log files, which are not consistent in structure, also create vast amounts of
unstructured data.
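The distinction the social media item draws, between a semi-structured envelope and an unstructured body, can be seen in a small example. The JSON message below is invented for illustration; its named fields can be queried directly, while the free-text body needs text-processing techniques rather than a fixed schema.

```python
import json

post = json.loads("""
{
  "user": "ada",
  "posted_at": "2024-05-01T12:00:00Z",
  "likes": 17,
  "text": "Loved the keynote!! See everyone at the workshop tomorrow :)"
}
""")

# The envelope is semi-structured: named fields we can query directly.
print(post["user"], post["likes"])

# The message body is unstructured free text: even something as basic
# as a word count requires parsing it, not reading a column.
word_count = len(post["text"].split())
print(word_count)  # 10
```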