PMAM'17- Proceedings of the 8th International Workshop on Programming Models and Applications for Multicores and Manycores

Full Citation in the ACM Digital Library

Batched Gauss-Jordan Elimination for Block-Jacobi Preconditioner Generation on GPUs

TaskInsight: Understanding Task Schedules Effects on Memory and Performance

A high-performance portable abstract interface for explicit SIMD vectorization

PETRAS: Performance, Energy and Thermal Aware Resource Allocation and Scheduling for Heterogeneous Systems

Reduction to Tridiagonal Form for Symmetric Eigenproblems on Asymmetric Multicore Processors

High Performance Detection of Strongly Connected Components in Sparse Graphs on GPUs

Towards Composable GPU Programming: Programming GPUs with Eager Actions and Lazy Views

Assessing One-to-One Parallelism Levels Mapping for OpenMP Offloading to GPUs

A Framework for Developing Parallel Applications with high level Tasks on Heterogeneous Platforms