0% found this document useful (0 votes)
9 views8 pages

Disciplines - Unit 3

Big data analytics requires a range of data management and analysis disciplines including statistics, machine learning, data mining, text mining, and database management systems. Statistics provides theories for testing hypotheses from data and identifying patterns. Machine learning uses data to train systems to learn and make decisions with minimal human intervention. Data mining extracts usable data from large raw datasets by recognizing hidden patterns. Text mining transforms unstructured text into structured data for analysis or machine learning. Database management systems handle high volumes of streaming data with continuous queries.

Uploaded by

Suja Mary
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
9 views8 pages

Disciplines - Unit 3

Big data analytics requires a range of data management and analysis disciplines including statistics, machine learning, data mining, text mining, and database management systems. Statistics provides theories for testing hypotheses from data and identifying patterns. Machine learning uses data to train systems to learn and make decisions with minimal human intervention. Data mining extracts usable data from large raw datasets by recognizing hidden patterns. Text mining transforms unstructured text into structured data for analysis or machine learning. Database management systems handle high volumes of streaming data with continuous queries.

Uploaded by

Suja Mary
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 8

Disciplines that support the big data analytics

process

Unit-3
Introuction
Big data analytics is going to the next frontier for
innovation, productivity and competitive advantage.
Automated data collection tools and mature database
technology lead to tremendous amount of data stored
in databases.
The statistical tools and Six sigma tools could be the
key to turning big data into manageable inferences.
A range of data management and analysis disciplines
are:
Statistics
Machine learning
Data mining
Text mining
Database management systems
Statistics

Statistical data analysis is a procedure of performing


various statistical operations.
 Quantitative research, which seeks to quantify the
data, and typically, applies some form of statistical
analysis.
It provides the theory for testing hypotheses about
various insights from data.
Machine Learning
Machine learning is a branch of artificial intelligence
based on the idea that systems can learn from data,
identify patterns and make decisions with minimal
human intervention.
ML is a discipline at the crossroads of big data and
artificial intelligence, which presents a discipline that
seeks to solve complex logical problems by
“imitating” the human cognitive system.
Data mining

 Data mining is a process of extracting usable data from a larger


set of raw data. It is a subset of data analysis. It implies an
efficient and continuous method of recognizing and discovering
hidden patterns and data throughout a huge dataset.
 Data mining is a particular step in this process application of
specific algorithms for extracting models from data.
 Data mining and knowledge discovery combines theory and
heuristics toward extracting knowledge. To this end, data
cleaning, learning and visualization might be also employed.
 The main task of data mining is using methods to automatically
extract useful information from these data and make them
available to decision-makers.
Text mining

Text mining is an artificial intelligence (AI)


technology that uses natural language processing
(NLP) to transform the free (unstructured) text in
documents and databases into normalized, structured
data suitable for analysis or to drive machine learning
(ML) algorithms.
The application of text mining techniques to solve
business problems is called text analytics.
Data streams management systems

These systems handle transient streams, including


continuous queries, while being able to handle data with
very high ingestion rates, including streams featuring
unpredictable arrival times and characteristics.
Conclusions
The analysis of these data will help us to better
understand our behaviors (strong points, weak points
and improvement points) to better intervene in future
because of the analysis models that can be developed in
the form of algorithms.

You might also like