Parallel Sorting on the Heterogeneous AMD Fusion Accelerated Processing Unit
We explore efficient parallel radix sort for the AMD Fusion Accelerated Processing Unit (APU). Two challenges arise: partitioning data efficiently between the CPU and GPU, and allocating data in memory regions. Our coarse-grained implementation utilizes both the GPU and CPU by sharing data at...
Main Author: | |
---|---|
Other Authors: | |
Language: | en_ca |
Published: | 2013 |
Subjects: | |
Online Access: | http://hdl.handle.net/1807/35116 |