Missile: Fine-Grained, Hardware-Level GPU Resource Isolation for Multi-Tenant DNN Inference

Jul 1, 2024ยท
Yongkang Zhang
,
Haoxuan Yu
,
Chenxia Han
,
Cheng Wang
,
Baotong Lu
,
Yunzhe Li
,
Zhifeng Jiang
,
Yang Li
,
Xiaowen Chu
,
Huaicheng Li
ยท 0 min read
Type
Publication
arXiv preprint arXiv:2407.13996
Authors
Yongkang Zhang
Researcher
Researcher working on GPU computing and DNN inference.
Authors
Haoxuan Yu
Researcher
Researcher working on computer systems.
Authors
Chenxia Han
Researcher
Researcher working on computer systems.
Authors
Cheng Wang
Researcher
Researcher working on computer systems.
Authors
Baotong Lu
Researcher
Researcher working on computer systems.
Authors
Zhifeng Jiang
Researcher
Researcher working on computer systems.
Authors
Yang Li
Researcher
Researcher working on computer systems.
Authors
Xiaowen Chu
Professor
Professor working on computer systems and networking.