SCU: a GPU stream compaction unit for graph processing
… time in several graph applications. We propose to offload the stream compaction operations
to a specialized unit, the SCU. The SCU is an efficient, compact and small footprint unit that …
to a specialized unit, the SCU. The SCU is an efficient, compact and small footprint unit that …
Energy-efficient stream compaction through filtering and coalescing accesses in gpgpu memory partitions
… GPU extension that combines both the efficient SCU and the filtering mechanism of the IRU
to improve overall graph processing efficiency. We evaluate our proposal on top of a modern …
to improve overall graph processing efficiency. We evaluate our proposal on top of a modern …
High-performance and energy-efficient irregular graph processing on GPU architectures
A Segura Salvador - 2021 - upcommons.upc.edu
… to a programmable Stream Compaction Unit (SCU) hardware … the graph-based algorithm
are efficiently executed on the GPU cores. The SCU is a small unit tightly integrated in the GPU …
are efficiently executed on the GPU cores. The SCU is a small unit tightly integrated in the GPU …
Tdgraph: a topology-driven accelerator for high-performance streaming graph processing
… We analysed the characteristics of streaming graph processing and made two main observations.
… a compacting and filtering technique to prepare data for SMs of GPU for higher GPU …
… a compacting and filtering technique to prepare data for SMs of GPU for higher GPU …
Irregular accesses reorder unit: improving GPGPU memory coalescing for graph-based workloads
… Despite these efforts, we show that irregular graph processing … SCU [8] proposes a
programmable GPU hardware extension for graph processing that is tailored to stream compaction …
programmable GPU hardware extension for graph processing that is tailored to stream compaction …
SCU: a GPU stream compaction unit for graph processing
A Segura Salvador, JM Arnau Montañés… - recercat.cat
… stream compaction, and propose to offload this task to a … Stream Compaction Unit (SCU)
tailored to the requirements of this kernel. The SCU is a small unit tightly integrated in the GPU …
tailored to the requirements of this kernel. The SCU is a small unit tightly integrated in the GPU …
Improving streaming graph processing performance using input knowledge
… the performance of streaming graph processing. To improve graph update efficiency, we …
To complement adaptive batch reordering, we propose updating graphs dynamically, based …
To complement adaptive batch reordering, we propose updating graphs dynamically, based …
WER: Maximizing Parallelism of Irregular Graph Applications Through GPU Warp EqualizeR
… In the context of largescale graph processing, programmable General-Purpose Graphics
Processing Units (… GraphPEG [10] and SCU [11], on the other hand, proposed custom hardware …
Processing Units (… GraphPEG [10] and SCU [11], on the other hand, proposed custom hardware …
Redzone stream compaction: removing k items from a list in parallel O (k) time
J Bontes, J Gain - ACM Transactions on Parallel Computing, 2024 - dl.acm.org
… Redzone stream compaction, the first parallel stream compaction algorithm … GPU and CPU,
if k is proportionally small (k ≪ n), Redzone outperforms existing parallel stream compaction …
if k is proportionally small (k ≪ n), Redzone outperforms existing parallel stream compaction …
Near-Memory Parallel Indexing and Coalescing: Enabling Highly Efficient Indirect Access for SpMV
… González, “Scu: a gpu stream compaction unit for graph processing,” in Proceedings of
the 46th international symposium on computer architecture, 2019, pp. 424–435. [20] S. …
the 46th international symposium on computer architecture, 2019, pp. 424–435. [20] S. …