Seguir
Jovan Stojkovic
Jovan Stojkovic
Dirección de correo verificada de illinois.edu - Página principal
Título
Citado por
Citado por
Año
Towards Greener LLMs: Bringing Energy-Efficiency to the Forefront of LLM Inference
J Stojkovic, E Choukse, C Zhang, I Goiri, J Torrellas
arXiv preprint arXiv:2403.20306, 2024
412024
Collabar: Edge-assisted collaborative image recognition for mobile augmented reality
Z Liu, G Lan, J Stojkovic, Y Zhang, C Joe-Wong, M Gorlatova
2020 19th ACM/IEEE International Conference on Information Processing in …, 2020
402020
MXFaaS: Resource Sharing in Serverless Environments for Parallelism and Efficiency
J Stojkovic, T Xu, H Franke, J Torrellas
Proceedings of the 50th Annual International Symposium on Computer …, 2023
272023
Parallel virtualized memory translation with nested elastic cuckoo page tables
J Stojkovic, D Skarlatos, A Kokolis, T Xu, J Torrellas
Proceedings of the 27th ACM International Conference on Architectural …, 2022
202022
DynamoLLM: Designing LLM Inference Clusters for Performance and Energy Efficiency
J Stojkovic, C Zhang, Í Goiri, J Torrellas, E Choukse
arXiv preprint arXiv:2408.00741, 2024
172024
Memory-Efficient Hashed Page Tables
J Stojkovic, N Mantri, D Skarlatos, T Xu, J Torrellas
2023 IEEE International Symposium on High-Performance Computer Architecture …, 2023
132023
SpecFaaS: Accelerating Serverless Applications with Speculative Function Execution
J Stojkovic, T Xu, H Franke, J Torrellas
2023 IEEE International Symposium on High-Performance Computer Architecture …, 2023
122023
Edge-assisted collaborative image recognition for mobile augmented reality
G Lan, Z Liu, Y Zhang, T Scargill, J Stojkovic, C Joe-Wong, M Gorlatova
ACM Transactions on Sensor Networks (TOSN) 18 (1), 1-31, 2021
122021
EcoFaaS: Rethinking the Design of Serverless Environments for Energy Efficiency
J Stojkovic, N Iliakopoulou, T Xu, H Franke, J Torrellas
Proceedings of the 51st Annual International Symposium on Computer …, 2024
102024
μManycore: A Cloud-Native CPU for Tail at Scale
J Stojkovic, C Liu, M Shahbaz, J Torrellas
Proceedings of the 50th Annual International Symposium on Computer …, 2023
72023
Edge-assisted collaborative image recognition for augmented reality: demo abstract
J Stojkovic, Z Liu, G Lan, C Joe-Wong, M Gorlatova
Proceedings of the 17th Conference on Embedded Networked Sensor Systems, 394-395, 2019
72019
SmartOClock: Workload-and Risk-Aware Overclocking in the Cloud
J Stojkovic, P Misra, Í Goiri, S Whitlock, E Choukse, M Das, C Bansal, ...
Proceedings of the 51st Annual International Symposium on Computer …, 2024
52024
TAPAS: Thermal-and Power-Aware Scheduling for LLM Inference in Cloud Platforms
J Stojkovic, C Zhang, Í Goiri, E Choukse, H Qiu, R Fonseca, J Torrellas, ...
arXiv preprint arXiv:2501.02600, 2025
22025
Workload Intelligence: Punching Holes Through the Cloud Abstraction
L Huang, A Parayil, J Zhang, X Qin, C Bansal, J Stojkovic, P Zardoshti, ...
arXiv preprint arXiv:2404.19143, 2024
22024
UniCache: The Next 700 Caches for Serverless Computing
J Stojkovic, T Xu, H Franke, J Torrellas
5th International Workshop on Cloud Intelligence / AIOps (AIOps '24), 2024
12024
Concord: Rethinking Distributed Coherence for Software Caches in Serverless Environments
J Stojkovic, C Alverti, A Andrade, N Iliakopoulou, H Franke, T Xu, ...
2025 IEEE International Symposium on High-Performance Computer Architecture …, 2025
2025
Managing Power for Serverless Computing
J Stojkovic, H Franke, A Buyuktosunoglu
US Patent App. 18/206,283, 2024
2024
Chameleon: Adaptive Caching and Scheduling for Many-Adapter LLM Inference Environments
N Iliakopoulou, J Stojkovic, C Alverti, T Xu, H Franke, J Torrellas
arXiv preprint arXiv:2411.17741, 2024
2024
Serverless Computing with Latency Reduction
J Stojkovic, H Franke
US Patent App. 18/152,341, 2024
2024
Serverless Computing Using Resource Multiplexing
J Stojkovic, H Franke, T Xu, J Torrellas
US Patent App. 18/049,125, 2024
2024
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–20