Data Parallelism
Data Parallelism
(a)Horizontal Parallelism:
Horizontal parallelism in a data warehouse splits data rows across nodes to process the
same task simultaneously, boosting performance.
(b)Vertical Parallelism:
Vertical parallelism in a data warehouse runs different tasks, like scanning or sorting,
simultaneously to improve efficiency.
Intraquery Parallelism:
• Defines execution of a single query in parallel on multiple processors and
disks.
• Essential for speeding up long-running queries.
• DBMS vendors use intraquery parallelism to improve performance.
• Decomposes serial SQL query into lower-level operations like scan, join, sort,
and aggregation.
• Lower-level operations are executed concurrently in parallel.
Interquery Parallelism:
• Interquery parallelism allows multiple queries or transactions to execute in
parallel.
• Database vendors use parallel hardware architectures to handle large client
requests efficiently.
• Successful implementation on SMP systems increases throughput and
supports more concurrent users.
Shared-Nothing Architecture:
• Data partitioned across all disks.
• DBMS partitioned across multiple co-servers.
• Each node owns its disk and database partition.
• Parallelizes SQL query execution across multiple processing nodes.
• Each processor communicates with other processors via interconnection
network.
• Optimized for Multi-Process-Performer-Node (MPP) and cluster systems.
• Offers near-linear scalability, with each node capable of being a powerful
SMP system.
Advantages:
Disadvantages: