I am a Research Scientist at Meta since August 2020. I received my Ph.D. degree in Computer Science and Engineering from The Ohio State University at 2020. I was born in the beautiful country Taiwan and grew up there until I moved to US in 2014 for pursuing my PhD degree.
Computer Science and Engineering,
THE Ohio State
University, Columbus,
OH, USA
Dissertation: "Accelerator-enabled Communication Middleware for Large-scale Heterogeneous HPC Systems with Modern Interconnects"
Advisor: Dr. Dhabaleswar K. Panda
Computer Science and Information Engineering,
National
Central
University, Taiwan
Thesis: "Jitter-based TCP for Incast Communication on Data Center Networks"
Advisor: Dr. Eric Hsiao-Kuang Wu
Computer Science and Information Engineering,
National
Changhua
University of Education, Taiwan
MSL Infra Kernel and Optimization Team
AI System Co-design team
Contributed Open-source projects:
Department of Computer Science and Engineering
Network-based Computing Lab
(NOWLAB)
Advisor: Dr. DK Panda
Core developer of MVAPICH2-GDR, a CUDA-Aware MPI library in MVAPICH project.
GPU Communication team
Developed a OpenSHMEM-based Key-Value storing mechanism achieving 4.8X
speedup compared to SOTA GPU-based schemes.
Ching-Hsiang Chu, Potluri S, Goswami A, Gorentla Venkata M, Imam N, Newburn
CJ. "Designing High-Performance In-Memory Key-Value Operations with Persistent GPU Kernels
and OpenSHMEM".
In Workshop on OpenSHMEM and Related Technologies 2018 Aug 21 (pp. 148-164).
Institute of Information Science
Supervisor: Dr. Ling-Jyh Chen
Army of Republic of China (R.O.C.)
* Please visit my LinkedIn for more details.
I have been lucky enough to collaborate with many top-notch researchers, scientists and engineers, and co-authored 50+ peer-reviewed papers in the areas of HPC, ML Systems, Networking and Computer Architecture, you can find a near complete list in Google Scholar Citations
Meta AI teams, "Collective Communication for 100k+ GPUs", 2025.
Dhabaleswar K. (DK) Panda, Hari Subramoni, Ching-Hsiang Chu and Mohammadreza Bayatpour, "The MVAPICH Project: Transforming Research into High-Performance MPI Library for HPC Community," in Journal of Computational Science, Special issue on Translational Computer Science, Vol 52, 2021. (2019 Impact Factor: 2.644)
Ching-Hsiang Chu, Xiaoyi Lu, Ammar Ahmad Awan, Hari Subramoni, Bracy Elton, Dhabaleswar K. (DK) Panda, "Exploiting Hardware Multicast and GPUDirect RDMA for Efficient Broadcast," in IEEE Transactions on Parallel and Distributed Systems (TPDS), vol. 30, no. 3, pp. 575-588, 1 March 2019. (2019 Impact Factor: 2.6)
Weiwei Chu, Xinfeng Xie, Jiecao Yu, Jie Wang, Amar Phanishayee, Chunqiang Tang, Yuchen Hao, Jianyu Huang, Mustafa Ozdal, Jun Wang, Vedanuj Goswami, Naman Goyal, Abhishek Kadian, Andrew Gu, Chris Cai, Feng Tian, Xiaodong Wang, Min Si, Pavan Balaji, Ching-Hsiang Chu, and Jongsoo Park, "Scaling Llama 3 Training with Efficient Parallelism Strategies," In Proceedings of the 52nd Annual International Symposium on Computer Architecture (ISCA '25). Association for Computing Machinery, New York, NY, USA, 1703–1716.
Hao Feng, Boyuan Zhang, Fanjiang Ye, Min Si, Ching-Hsiang Chu, Jiannan Tian, Chunxing Yin, Summer Deng, Yuchen Hao, Pavan Balaji, Tong Geng, Dingwen Tao, "Accelerating Communication in Deep Learning Recommendation Model Training with Dual-Level Adaptive Lossy Compression," SC24: International Conference for High Performance Computing, Networking, Storage and Analysis, Atlanta, GA, USA, Nov. 17-22, 2024.
Kshiteej Mahajan, Ching-Hsiang Chu, Srinivas Sridharan, Aditya Akella, "Better Together: Jointly Optimizing ML Collective Scheduling and Execution Planning using SYNDICATE," 20th USENIX Symposium on Networked Systems Design and Implementation (NSDI 23), Boston, MA. 2023.
Q Zhou, Ching-Hsiang Chu, NS Kumar, Pouya Kousha, Seyedeh Mahdieh Ghazimirsaeed, Hari Subramoni, Dhabaleswar K Panda, "Designing High-Performance MPI Libraries with On-the-fly Compression for Modern GPU Clusters," 2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS), Portland, OR, USA. 17-21 May 2021
Best PaperChing-Hsiang Chu, Pouya Kousha, Ammar Awan, Kawthar Shafie Khorassani, Hari Subramoni and D. K. Panda, "NV-Group: Link-Efficient Reductions for Distributed Deep Learning on Modern Dense GPU Systems," The 34th ACM International Conference on Supercomputing (ICS-2020), Barcelona, Spain (ONLINE due to COVID-19), June 29 - July 2, 2020. (Acceptance rate: 30%, 40/132)
Ching-Hsiang Chu and Dhabaleswar Panda, "Efficient and Scalable Communication Middleware for Emerging Dense-GPU Clusters," The International Conference for High Performance Computing, Networking, Storage and Analysis (SC'19), Denver, CO, USA, Nov. 18-21, 2019. Doctoral Showcase with TCHPC Travel Award
Ching-Hsiang Chu, "Accelerator-enabled Communication Middleware for Large-scale Heterogeneous HPC Systems with Modern Interconnects," July, 2020.
kingchc0120_AT_gmail.com