My research studies the system foundations of large-scale AI — from new system abstractions to scheduling algorithms and parallelization strategies that enables more accessible and efficient AI computing infrastructure. I have led the design and operation of multiple production AI clusters.
My work has been published in top-tier conferences across computer networks, operating systems, computer architecture, and databases, including NSDI, OSDI, ASPLOS, and SIGMOD. I was a researcher at Google, where I worked on large model inference acceleration.
I received my PhD in Computer Science from HKUST, advised by Prof. Kai Chen, where my PhD dissertation received the Honorable Mention of the CSE Best PhD Dissertation Award.
My work seeks the synergy between technology and creativity. I aim to design exceptional user experiences in products that embed advanced technologies, transforming the way people interact with complex systems. The Wall Street Journal featured an exclusive interview with me for creating the world-recognized all-terrain robot HEXA.
I was named to the Forbes 30 Under 30 list twice, in 2020 (China) and 2022 (Asia).
Earlier in my career, I worked in serveral startups including as an algorithm engineer at Zhihu.com (NYSE: ZH) and a backend engineer at Tantan (now part of MOMO, NASDAQ: MOMO). See my LinkedIn profile for more details.