Kai Wu (Kyle)

I am a senior engineer at Microsoft, where I work on performance optimization of inference workloads for large language models. Before Microsoft, I was a researcher at ByteDance Infrastructure System Lab, focusing on hardware acceleration for infrastructure systems. I received my Ph.D. in Electrical Engineering and Computer Sciences from the University of California, Merced, where I worked with Prof. Dong Li on system support for big, heterogeneous memory platforms. I also interned at Lawrence Livermore National Laboratory, Los Alamos National Laboratory, and ByteDance.

My interests include: 1) system optimization for high-performance computing, machine learning, and database workloads; 2) software/hardware co-design for data center infrastructure systems.

Google Scholar    LinkedIn

Selected Publications


I have served as a reviewer for the following journals and conferences: