0% found this document useful (0 votes)
4 views

Module_1_Session_3 Analytic Processes and Tools _ Analysis vs Reporting _ Modern Data Analytic Tools

Big data analytics involves collecting, processing, cleaning, and analyzing large datasets to derive insights for organizations. Key processes include data collection from various sources, data processing through batch or stream methods, data cleaning to ensure quality, and advanced analysis techniques like data mining and predictive analytics. The document also distinguishes between analysis and reporting, highlighting the importance of analysis for actionable insights, and lists modern tools such as Hadoop, NoSQL databases, Spark, and R-Programming that facilitate big data operations.

Uploaded by

arrakkalabu15
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
4 views

Module_1_Session_3 Analytic Processes and Tools _ Analysis vs Reporting _ Modern Data Analytic Tools

Big data analytics involves collecting, processing, cleaning, and analyzing large datasets to derive insights for organizations. Key processes include data collection from various sources, data processing through batch or stream methods, data cleaning to ensure quality, and advanced analysis techniques like data mining and predictive analytics. The document also distinguishes between analysis and reporting, highlighting the importance of analysis for actionable insights, and lists modern tools such as Hadoop, NoSQL databases, Spark, and R-Programming that facilitate big data operations.

Uploaded by

arrakkalabu15
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 5

CS6CRT19 Big Data Analytics Module 1

Processes in Big Data Analytics


Big data analytics refers to collecting, processing, cleaning, and analyzing
large datasets to help organizations operationalize their big data.

1. Collect Data

Data collection looks different for every organization. With today’s


technology, organizations can gather both structured and unstructured data from a
variety of sources — from cloud storage to mobile applications to in-store IoT
sensors and beyond. Some data will be stored in data warehouses where business
intelligence tools and solutions can access it easily.

Raw or unstructured data that is too diverse or complex for a warehouse may
be assigned metadata and stored in a data lake.

2. Process Data

Once data is collected and stored, it must be organized properly to get accurate
results on analytical queries, especially when it’s large and unstructured. Available
data is growing exponentially, making data processing a challenge for organizations.

One processing option is batch processing, which looks at large data blocks
over time. Batch processing is useful when there is a longer turnaround time between
collecting and analyzing data.

Stream processing looks at small batches of data at once, shortening the


delay time between collection and analysis for quicker decision-making. Stream
processing is more complex and often more expensive.

3. Clean Data

Swamy Saswathikananda College, Poothotta 1


CS6CRT19 Big Data Analytics Module 1

Data requires scrubbing to improve data quality and get stronger results; all
data must be formatted correctly, and any duplicative or irrelevant data must be
eliminated or accounted for. Dirty data can obscure and mislead, creating flawed
insights.

4. Analyze Data

Getting big data into a usable state takes time. Once it’s ready, advanced
analytics processes can turn big data into big insights. Some of these big data
analysis methods include:

● Data mining sorts through large datasets to identify patterns and relationships

by identifying anomalies and creating data clusters.


● Predictive analytics uses an organization’s historical data to make

predictions about the future, identifying upcoming risks and opportunities.


● Deep learning imitates human learning patterns by using artificial

intelligence and machine learning to layer algorithms and find patterns in the
most complex and abstract data.

Analysis Vs Reporting

Following are the five major differences between Analysis and Reporting:

1. Purpose

Reporting helps companies monitor their data even before digital technology
booms. Various organizations have been dependent on the information it brings to
their business, as reporting extracts that and makes it easier to understand.

Swamy Saswathikananda College, Poothotta 2


CS6CRT19 Big Data Analytics Module 1

Analysis interprets data at a deeper level. While reporting can link between
cross-channels of data, provide comparison, and make understand information easier
(think of a dashboard, charts, and graphs, which are reporting tools and not analysis
reports), analysis interprets this information and provides recommendations on
actions.

2. Tasks

Reporting includes building, configuring, consolidating, organizing,


formatting, and summarizing. It’s very similar to the above mentioned like turning
data into charts, graphs, and linking data across multiple channels.

Analysis consists of questioning, examining, interpreting, comparing, and


confirming. With big data, predicting is possible as well.

3. Outputs

Reporting has a push approach, as it pushes information to users and outputs


come in the forms of canned reports, dashboards, and alerts.

Analysis has a pull approach, where a data analyst draws information to


further probe and to answer business questions. Analysis presentations are
comprised of insights, recommended actions, and a forecast of its impact on the
company—all in a language that’s easy to understand at the level of the user who’ll
be reading and deciding on it.

4. Delivery

Analysis requires a more custom approach, with human minds doing superior
reasoning and analytical thinking to extract insights, and technical skills to provide

Swamy Saswathikananda College, Poothotta 3


CS6CRT19 Big Data Analytics Module 1

efficient steps towards accomplishing a specific goal. This is why data analysts and
scientists are demanded these days, as organizations depend on them to come up
with recommendations for leaders or business executives to make decisions about
their businesses.

5. Value

Reporting itself is just numbers. Without drawing insights and getting reports
aligned with your organization’s big picture, you can’t make decisions based on
reports alone.

Data analysis is the most powerful tool to bring into your business. Employing
the powers of analysis can be comparable to finding gold in your reports, which
allows your business to increase profits and further develop.

Modern Big Data Analytics Tools and Technology

Big data analytics cannot be narrowed down to a single tool or technology.


Instead, several types of tools work together to help you collect, process, cleanse,
and analyze big data. Some of the major players in big data ecosystems are listed
below.

● Hadoop is an open-source framework that efficiently stores and processes big

datasets on clusters of commodity hardware. This framework is free and can


handle large amounts of structured and unstructured data, making it a valuable
mainstay for any big data operation.
● NoSQL databases are non-relational data management systems that do not

require a fixed scheme, making them a great option for big, raw, unstructured

Swamy Saswathikananda College, Poothotta 4


CS6CRT19 Big Data Analytics Module 1

data. NoSQL stands for “not only SQL,” and these databases can handle a
variety of data models.
● Spark is an open source cluster computing framework that uses implicit data

parallelism and fault tolerance to provide an interface for programming entire


clusters. Spark can handle both batch and stream processing for fast
computation.
● R-Programming R is a free open source software programming language
and a software environment for statistical computing and graphics. It is
used by data miners for developing statistical software and data analysis. It
has become a highly popular tool for big data in recent years.

Swamy Saswathikananda College, Poothotta 5

You might also like