0% found this document useful (0 votes)
25 views10 pages

MPP Is The Most Mature, Proven, and Widely Deployed Mechanism For Storing and Analyzing Large Amounts of Data

MPP (Massively Parallel Processing) systems spread large amounts of data across independent storage and CPU resources, allowing for much faster query execution. In MPP, many smaller, simultaneous queries can run in parallel instead of just one large query. Traditional databases have bottlenecks where processing is reduced to a single lane, whereas MPP is designed to handle processing in a very fast manner. MPP also provides benefits like redundancy, easy recovery from failures, resource management tools, and query optimizers. MPP can greatly improve data preparation and scoring tasks required for advanced analytics by parallelizing joins, aggregations, and other transformations across its independent resources.

Uploaded by

Ganesh Kumar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
25 views10 pages

MPP Is The Most Mature, Proven, and Widely Deployed Mechanism For Storing and Analyzing Large Amounts of Data

MPP (Massively Parallel Processing) systems spread large amounts of data across independent storage and CPU resources, allowing for much faster query execution. In MPP, many smaller, simultaneous queries can run in parallel instead of just one large query. Traditional databases have bottlenecks where processing is reduced to a single lane, whereas MPP is designed to handle processing in a very fast manner. MPP also provides benefits like redundancy, easy recovery from failures, resource management tools, and query optimizers. MPP can greatly improve data preparation and scoring tasks required for advanced analytics by parallelizing joins, aggregations, and other transformations across its independent resources.

Uploaded by

Ganesh Kumar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 10

Massively Parallel Processing Systems

MPP is the most mature, proven, and widely deployed


mechanism for storing and analyzing large amounts of
data
An MPP database spreads data out into independent
pieces managed by independent storage and central
processing unit (CPU) resources
In non-MPP database architectures, there are at least
a few points during processing where, the number of
lanes is reduced to one
This allows much faster query execution….
This allows much faster query execution, since many
independent smaller queries are running
simultaneously instead of just one big query.
Normal system its got very compicated in case of
processing due to several stage
In MPP it is built to handle that in very fast manner.
Traditional Query versus an MPP Query
MPP systems build in redundancy.
Possible to make a copy of data and accessing it
whenever it is required
Make recovery easy in cases where there’s equipment
failure
Resource management tools to manage the CPU and
disk space
Query optimizers to make sure queries are being
optimally executed
Using MPP Systems for Data Preparation and Scoring
MPP can be a huge benefit to advanced analytics
Data preparation is made up of joins, aggregations,
derivations, and transformations
This is the process of combining various data sources
to pull together all the information needed for an
analysis
Aggregations involve combining information from
multiple records into one
Data preparation tasks required for the majority of
analysis
Data preparation tasks required for the majority of
analysis

You might also like