Analyzing CUDA workloads using a detailed GPU simulator A Bakhoda, GL Yuan, WWL Fung, H Wong, TM Aamodt Performance Analysis of Systems and Software, 2009. ISPASS 2009. IEEE …, 2009 | 2133 | 2009 |
Dynamic warp formation and scheduling for efficient gpu control flow WWL Fung, I Sham, G Yuan, TM Aamodt Proceedings of the 40th Annual IEEE/ACM International Symposium on …, 2007 | 659 | 2007 |
Thread block compaction for efficient SIMT control flow WWL Fung, TM Aamodt High Performance Computer Architecture (HPCA), 2011 IEEE 17th International …, 2011 | 276 | 2011 |
Cache coherence for GPU architectures. I Singh, A Shriraman, WWL Fung, M O'Connor, TM Aamodt HPCA, 578-590, 2013 | 220 | 2013 |
Kilo TM: Hardware Transactional Memory for GPU Architectures WWL Fung, I Singh, A Brownsword, T Aamodt Micro, IEEE 32 (3), 7-16, 2012 | 156* | 2012 |
Hardware transactional memory for gpu architectures WWL Fung, I Singh, A Brownsword, TM Aamodt Proceedings of the 44th Annual IEEE/ACM International Symposium on …, 2011 | 142 | 2011 |
Dynamic warp formation: Efficient MIMD control flow on SIMD graphics hardware WWL Fung, I Sham, G Yuan, TM Aamodt ACM Transactions on Architecture and Code Optimization (TACO) 6 (2), 7, 2009 | 116 | 2009 |
General-Purpose Graphics Processor Architectures TM Aamodt, WWL Fung, TG Rogers Synthesis Lectures on Computer Architecture 13 (2), 1-140, 2018 | 105 | 2018 |
GPGPU-Sim 3. x manual TM Aamodt, WWL Fung, I Singh, A El-Shafiey, J Kwa, T Hetherington, ... 2012-08-08)[2013-08-08]. http:∥ gpgpu-sim. org/manual/index. php/GPGPU …, 2012 | 65 | 2012 |
Energy efficient GPU transactional memory via space-time optimizations WWL Fung, TM Aamodt Proceedings of the 46th Annual IEEE/ACM International Symposium on …, 2013 | 55 | 2013 |
Visualizing complex dynamics in many-core accelerator architectures A Ariel, WWL Fung, AE Turner, TM Aamodt Performance Analysis of Systems & Software (ISPASS), 2010 IEEE International …, 2010 | 49 | 2010 |
GPUDet: A Deterministic GPU Architecture H Jooybar, WWL Fung, M O’Connor, J Devietti, TM Aamodt | 44 | 2013 |
PUPIL: Programmable ultrasound platform and interface library R Rohling, W Fung, P Lajevardi International Conference on Medical Image Computing and Computer-Assisted …, 2003 | 24 | 2003 |
GPU computing architecture for irregular parallelism WWL Fung University of British Columbia, 2015 | 8 | 2015 |
Rotation, scaling, and translation-invariant multi-bit watermarking based on log-polar mapping and discrete Fourier transform WWL Fung, A Kunisa Multimedia and Expo, 2005. ICME 2005. IEEE International Conference on, 4 pp., 2005 | 7 | 2005 |
Dynamic warp formation: exploiting thread scheduling for efficient MIMD control flow on SIMD graphics hardware WWL Fung University of British Columbia, 2008 | 6 | 2008 |
Design of an Open-Architecture Ultrasound Acquisition System for Real-Time Processing and Control W Fung, R Rohling Technical Report TR002, University of British Columbia, 2003. http://www …, 2003 | 6 | 2003 |
Kilo TM Correctness: ABA Tolerance and Validation-Commit Indivisibility WWL Fung, I Singh, TM Aamodt Technical report, University of British Columbia, 2012. http://www. ece. ubc …, 2012 | 3 | 2012 |
EECE 474 Instrumentation Design Laboratory E Chan, M Chen, J Chuang, W Fung, B Yip, K Zhu | | |