Analyzing CUDA workloads using a detailed GPU simulator A Bakhoda, GL Yuan, WWL Fung, H Wong, TM Aamodt 2009 IEEE international symposium on performance analysis of systems and …, 2009 | 2134 | 2009 |
Demystifying GPU microarchitecture through microbenchmarking H Wong, MM Papadopoulou, M Sadooghi-Alvandi, A Moshovos 2010 IEEE International Symposium on Performance Analysis of Systems …, 2010 | 626 | 2010 |
Comparing FPGA vs. custom CMOS and the impact on processor microarchitecture H Wong, V Betz, J Rose Proceedings of the 19th ACM/SIGDA international symposium on Field …, 2011 | 157 | 2011 |
Pangaea: a tightly-coupled IA32 heterogeneous chip multiprocessor H Wong, A Bracy, E Schuchman, TM Aamodt, JD Collins, PH Wang, ... Proceedings of the 17th international conference on Parallel architectures …, 2008 | 86 | 2008 |
Intel Ivy Bridge Cache Replacement Policy H Wong http://blog.stuffedcow.net/2013/01/ivb-cache-replacement/, 2013 | 49 | 2013 |
Micro-benchmarking the GT200 GPU MM Papadopoulou, M Sadooghi-Alvandi, H Wong Computer Group, ECE, University of Toronto, Tech. Rep, 2009 | 49 | 2009 |
Quantifying the gap between fpga and custom cmos to aid microarchitectural design H Wong, V Betz, J Rose IEEE Transactions on Very Large Scale Integration (VLSI) Systems 22 (10 …, 2013 | 27 | 2013 |
A Comparison of Intel's 32nm and 22nm Core i5 CPUs: Power, Voltage, Temperature, and Frequency H Wong http://blog.stuffedcow.net/2012/10/intel32nm-22nm-core-i5-comparison/, 2012 | 17 | 2012 |
High performance instruction scheduling circuits for out-of-order soft processors H Wong, V Betz, J Rose 2016 IEEE 24th Annual International Symposium on Field-Programmable Custom …, 2016 | 12 | 2016 |
Store-to-Load Forwarding and Memory Disambiguation in x86 Processors H Wong http://blog.stuffedcow.net/2014/01/x86-memory-disambiguation/, 2014 | 12 | 2014 |
Measuring Reorder Buffer Capacity H Wong http://blog.stuffedcow.net/2013/05/measuring-rob-capacity/, 2013 | 12 | 2013 |
Efficient methods for out-of-order load/store execution for high-performance soft processors H Wong, V Betz, J Rose 2013 International Conference on Field-Programmable Technology (FPT), 442-445, 2013 | 11 | 2013 |
Microarchitecture and circuits for a 200 mhz out-of-order soft processor memory system H Wong, V Betz, J Rose ACM Transactions on Reconfigurable Technology and Systems (TRETS) 10 (1), 1-22, 2016 | 9 | 2016 |
The performance potential for single application heterogeneous systems H Wong, TM Aamodt 8th Workshop on Duplicating, Deconstructing, and Debunking, 2009 | 8 | 2009 |
High-performance instruction scheduling circuits for superscalar out-of-order soft processors H Wong, V Betz, J Rose ACM Transactions on Reconfigurable Technology and Systems (TRETS) 11 (1), 1-22, 2018 | 6 | 2018 |
A superscalar out-of-order x86 soft processor for fpga HTH Wong University of Toronto (Canada), 2017 | 4 | 2017 |
TLB and Pagewalk Coherence in x86 Processors H Wong http://blog.stuffedcow.net/2015/08/pagewalk-coherence/, 2015 | 4 | 2015 |
Microbenchmarking Return Address Branch Prediction H Wong http://blog.stuffedcow.net/2018/04/ras-microbenchmarks/, 2018 | 3 | 2018 |
Architectures and limits of GPU-CPU heterogeneous systems HTH Wong University of British Columbia, 2008 | 3 | 2008 |
The Microarchitecture Behind Meltdown H Wong http://blog.stuffedcow.net/2018/05/meltdown-microarchitecture/, 2018 | 2 | 2018 |