DocumentCode :
3575081
Title :
Optimizing GPU Virtualization with Address Mapping and Delayed Submission
Author :
Wang Xiao-Lin ; Wang Han-bing ; Sang Yan ; Wang Zhen-Lin ; Luo Ying-Wei
Author_Institution :
Sch. of EECS, Peking Univ., Beijing, China
fYear :
2014
Firstpage :
413
Lastpage :
416
Abstract :
The state-of-the-art GPU virtualization framework, gVirtuS, relies on an API remoting mechanism to set up a communication channel between a virtual machine and the host, so that a CUDA application in a virtual machine can be executed "remotely" on the host. We observe that this API remoting mechanism often involves large-volume and frequent data transmissions between the host OS and the guest OS, which lead to significant performance degradation. We present an address mapping scheme that lets the host directly access the machine memory space of the guest and thus avoid data copying between the guest and the host. To reduce the frequency of data transmissions, we introduce a delayed submission scheme. We implement both address mapping and delayed submission in KVM. Our evaluation on a set of CUDA benchmarks shows that address mapping can improve performance over the original gVirtuS by up to 6.5 times. Delayed submission further reduces the virtualization overhead by half in a pathological case.
Keywords :
application program interfaces; graphics processing units; operating systems (computers); parallel architectures; remote procedure calls; storage management; virtual machines; virtualisation; API remoting mechanism; CUDA benchmark; GPU virtualization framework; GPU virtualization optimization; KVM; address mapping scheme; communication channel; data copying; data transmission; delayed submission scheme; gVirtuS; guest OS; host OS; machine memory space access; performance degradation; remote CUDA application execution; virtual machine; virtualization overhead; Benchmark testing; Data communication; Graphics processing units; Memory management; Resource management; Virtual machining; Virtualization; GPU virtualization; address mapping; delayed submission; memory virtualization; performance improvement;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
2014 IEEE Intl Conf on High Performance Computing and Communications, 2014 IEEE 6th Intl Symp on Cyberspace Safety and Security, 2014 IEEE 11th Intl Conf on Embedded Software and Syst (HPCC, CSS, ICESS)
Print_ISBN :
978-1-4799-6122-1
Type :
conf
DOI :
10.1109/HPCC.2014.70
Filename :
7056774