Heterogeneous computing with OpenCL B Gaster, L Howes, DR Kaeli, P Mistry, D Schaa Morgan Kaufmann, 2011 | 593 | 2011 |
A comparison of CPUs, GPUs, FPGAs, and massively parallel processor arrays for random number generation DB Thomas, L Howes, W Luk Proceedings of the ACM/SIGDA international symposium on Field programmable …, 2009 | 266 | 2009 |
Performance comparison of graphics processors to reconfigurable logic: A case study B Cope, PYK Cheung, W Luk, L Howes IEEE Transactions on computers 59 (4), 433-448, 2010 | 154 | 2010 |
Efficient random number generation and application using CUDA L Howes, D Thomas GPU gems 3, 805-830, 2007 | 120 | 2007 |
The OpenCL specification, version 2.0 L Howes, A Munshi Khronos Group, 2015 | 96 | 2015 |
Khronos SYCL for OpenCL: a tutorial R Keryell, R Reyes, L Howes Proceedings of the 3rd International Workshop on OpenCL, 1-1, 2015 | 63 | 2015 |
Design space exploration with a stream compiler O Mencer, DJ Pearce, LW Howes, W Luk Proceedings. 2003 IEEE International Conference on Field-Programmable …, 2003 | 55 | 2003 |
Can GPGPU Programming Be Liberated from the Data-Parallel Bottleneck? BR Gaster, L Howes IEEE Computer 45 (8), 42-52, 2012 | 52 | 2012 |
HRF-Relaxed: Adapting HRF to the complexities of industrial heterogeneous memory models BR Gaster, D Hower, L Howes ACM Transactions on Architecture and Code Optimization (TACO) 12 (1), 1-26, 2015 | 50 | 2015 |
Deriving efficient data movement from decoupled access/execute specifications LW Howes, A Lokhmotov, AF Donaldson, PHJ Kelly High Performance Embedded Architectures and Compilers: Fourth International …, 2009 | 45 | 2009 |
Optimized Context Switching for Long-Running Processes LW Howes, BR Gaster, M Mantor US Patent App. 13/691,066, 2014 | 42 | 2014 |
Comparing FPGAs to graphics accelerators and the PlayStation 2 using a unified source description LW Howes, P Price, O Mencer, O Beckmann, O Pell 2006 International Conference on Field Programmable Logic and Applications, 1-6, 2006 | 30 | 2006 |
Method and system for workitem synchronization LW Howes, BR Gaster, MC Houston, M Mantor, M Leather, N Rubin, ... US Patent 8,607,247, 2013 | 28 | 2013 |
Heterogeneous Parallel Primitives Programming Model BR Gaster, LW Howes US Patent App. 13/904,791, 2013 | 27 | 2013 |
Method and system for yield operation supporting thread-like behavior LW Howes, BR Gaster, MC Houston US Patent 9,697,003, 2017 | 19 | 2017 |
Method and system for synchronization of workitems with divergent control flow MC Houston, BR Gaster, LW Howes, M Mantor, D Behr US Patent 9,424,099, 2016 | 17 | 2016 |
Introduction to GPU radix sort T Harada, L Howes Heterogeneous Computing with OpenCL. Morgan Kaufmann, 2011 | 16 | 2011 |
Efficient processor load balancing using predication V Tipparaju, L Howes, T Scogland US Patent 10,296,378, 2019 | 15 | 2019 |
Register spill management for general purpose registers (GPRs) L Howes, M Kazakov US Patent 9,779,469, 2017 | 14 | 2017 |
High-performance SIMT code generation in an active visual effects library JLT Cornwall, L Howes, PHJ Kelly, P Parsonage, B Nicoletti Proceedings of the 6th ACM conference on Computing frontiers, 175-184, 2009 | 14 | 2009 |