Full stack optimization of transformer inference: a survey S Kim, C Hooper, T Wattanawong, M Kang, R Yan, H Genc, G Dinh, ... ASSYST Workshop at ISCA 2023, 2023 | 95 | 2023 |
Llm2llm: Boosting llms with novel iterative data enhancement N Lee, T Wattanawong, S Kim, K Mangalam, S Shen, G Anumanchipali, ... ACL 2024, 2024 | 37 | 2024 |
Zone balancing in a multi cluster database system J Harjono, DG Karp, R Radut, S Rehmtulla, AK Shi, T Wattanawong US Patent 11,372,820, 2022 | 12 | 2022 |
Yakun Sophia Shao, and Amir Gholami. 2023. Full Stack Optimization of Transformer Inference: a Survey S Kim, C Hooper, T Wattanawong, M Kang, R Yan, H Genc, G Dinh, ... arXiv preprint arXiv:2302.14017, 2023 | 9 | 2023 |
Zone balancing in a multi cluster database system J Harjono, DG Karp, R Radut, S Rehmtulla, AK Shi, T Wattanawong US Patent 11,537,566, 2022 | 2 | 2022 |
Cluster instance balancing of a database system across zones J Harjono, DG Karp, R Radut, S Rehmtulla, AK Shi, T Wattanawong US Patent 11,698,886, 2023 | 1 | 2023 |
Cluster balancing for zones of a database system J Harjono, DG Karp, R Radut, S Rehmtulla, AK Shi, T Wattanawong US Patent 11,966,368, 2024 | | 2024 |
Hardware Software Co-design and Architectural Optimization of Deep Learning Models for Natural Language Processing T Wattanawong, K Keutzer | | 2023 |