Publications

Last Updated: 08/17/2022
My Google Scholar Citations

Useful Statistical Data and Document:

Acceptance Rate in Networking Conferences
Overall Journal Ranking
HPC Conference & Journal Ranking
HPC Conference & Journal Tiers
Impact Factor of Selected Journals

Dissertation

  1. Ching-Hsiang Chu, "Accelerator-enabled Communication Middleware for Large-scale Heterogeneous HPC Systems with Modern Interconnects, " July, 2020.

Journal

  1. Dhabaleswar K. (DK) Panda, Hari Subramoni, Ching-Hsiang Chu and Mohammadreza Bayatpour, "The MVAPICH Project: Transforming Research into High-Performance MPI Library for HPC Community," in Journal of Computational Science, Special issue on Translational Computer Science, Vol 52, 2021. (2019 Impact Factor: 2.644)
  2. Jahanzeb Maqbool Hashmi, Ching-Hsiang Chu, Sourav Chakraborty, Mohammadreza Bayatpour, Hari Subramoni and Dhabaleswar K. (DK) Panda, "FALCON-X: Zero-copy MPI Derived Datatype Processing on Modern CPU and GPU Architectures  ," in Journal of Parallel and Distributed Computing (JPDC), Vol. 144, pp. 1-13. 2020. (2019 Impact Factor: 2.296)
  3. Ammar Ahmad Awan, Arpan Jain, Ching-Hsiang Chu, Hari Subramoni and Dhabaleswar K. (DK) Panda, "Communication Profiling and Characterization of Deep Learning Workloads on Clusters with High-Performance Interconnects," in IEEE Micro, vol. 40, no. 1, pp. 35-43, 1 Jan.-Feb. 2020, doi: 10.1109/MM.2019.2949986. (2019 Impact Factor: 3.172)
  4. Ammar Ahmad Awan, Karthik Vadambacheri Manian, Ching-Hsiang Chu, Hari Subramoni and Dhabaleswar K. (DK) Panda, "Optimized Large-Message Broadcast for Deep Learning Workloads: MPI, MPI+NCCL, or NCCL2?," Parallel Computing, Volume 85, Pages 141-152, July 2019. (2019 Impact Factor: 1.119)
  5. Ching-Hsiang Chu, Xiaoyi Lu, Ammar Ahmad Awan, Hari Subramoni, Bracy Elton, Dhabaleswar K. (DK) Panda, "Exploiting Hardware Multicast and GPUDirect RDMA for Efficient Broadcast," in IEEE Transactions on Parallel and Distributed Systems (TPDS), vol. 30, no. 3, pp. 575-588, 1 March 2019. (2019 Impact Factor: 2.6)
  6. Min-Te Sun, Ching-Hsiang Chu, Eric Hsiao-Kuang Wu, Chi-Sen Hsiao, and Andy An-Kai Jeng, "Distributed Topology Control for Energy-Efficient and Reliable Wireless Communications," IEEE Systems Journal, vol. 12, no. 3, pp. 2152-2161, Sept. 2018. doi: 10.1109/JSYST.2017.2673830. (2018 Impact Factor: 4.463)
  7. Min-Te Sun, Ching-Hsiang Chu, Eric Hsiao-Kuang Wu, and Chi-Sen Hsiao, "Efficient Articulation Point Collaborative Exploration for Reliable Communications in Wireless Sensor Networks," in IEEE Sensors Journal, vol. 16, no. 23, pp. 8578-8588, Dec.1, 2016. (2016 Impact Factor: 2.512, 2016 Acceptance rate: 33%)
  8. Khaled Hamidouche, Akshay Venkatesh, Ammar Ahmad Awan, Hari Subramoni, Ching-Hsiang Chu, Dhabaleswar K. (DK) Panda, "CUDA-Aware OpenSHMEM: Extensions and Designs for High Performance OpenSHMEM on GPU Clusters," Parallel Computing, Volume 58, Pages 27-36, October 2016. (2016 Impact Factor: 1.362)
  9. Jyh-Ming Chen, Eric Hsiao-Kuang Wu, Hsiang-Wei Lu, Ching-Hsiang Chu, Meng-Feng Tsai, "Channel Condition Self-Clocked Packet Scheduling Scheme for Wireless Networks," EURASIP Journal on Wireless Communications and Networking, vol. 2013, Article ID 131, 2013. doi:10.1186/1687-1499-2013-131 (2013 Impact Factor: 0.80)
  10. Wei-Li Chang, Eric Hsiao-Kuang Wu, Min-Te Sun, Ching-Hsiang Chu, "LDFS: Localized Depth First Search for Member Loss Detection of Moving Group in Wireless Networks," Applied Mechanics and Materials, Vol 378, pp. 558-564, 2013. doi:10.4028/www.scientific.net/AMM.378.558
  11. Jyh-Ming Chen, Ching-Hsiang Chu, Eric Hsiao-Kuang Wu, Meng-Feng Tsai, and Jian-Ren Wang, "Improving SCTP Performance by Jitter-Based Congestion Control over Wired-Wireless Networks," EURASIP Journal on Wireless Communications and Networking, vol. 2011, Article ID 103027, 2011. doi:10.1155/2011/103027. (2011 Impact Factor: 0.87)
  12. Ing-Chau Chang, Chih-Sung Hsieh, Ching-Hsiang Chu, "HSMM: Hierarchical Synchronized Multimedia Multicast for Heterogeneous Mobile Networks, " Tamkang Journal of Science and Engineering, Vol. 13, No. 1 pp. 39-51, 2010.

Conference/Workshop

  1. Dheevatsa Mudigere, Yuchen Hao, Jianyu Huang, Zhihao Jia, Andrew Tulloch, Srinivas Sridharan, Xing Liu, Mustafa Ozdal, Jade Nie, Jongsoo Park, Liang Luo, Jie Yang, Leon Gao, Dmytro Ivchenko, Aarti Basant, Yuxi Hu, Jiyan Yang, Ehsan K Ardestani, Xiaodong Wang, Rakesh Komuravelli, Ching-Hsiang Chu, Serhat Yilmaz, Huayu Li, Jiyuan Qian, Zhuobo Feng, Yinbin Ma, Junjie Yang, Ellie Wen, Hong Li, Lin Yang, Chonglin Sun, Whitney Zhao, Dimitry Melts, Krishna Dhulipala, KR Kishore, Tyler Graf, Assaf Eisenman, Kiran Kumar Matam, Adi Gangidi, Guoqiang Jerry Chen, Manoj Krishnan, Avinash Nayak, Krishnakumar Nair, Bharath Muthiah, Mahmoud khorashadi, Pallab Bhattacharya, Petr Lapukhov, Maxim Naumov, Ajit Mathews, Lin Qiao, Mikhail Smelyanskiy, Bill Jia, Vijay Rao , "Software-hardware co-design for fast and scalable training of deep learning recommendation models, " in Proceedings of the 49th Annual International Symposium on Computer Architecture (ISCA 2022), June 2022.
  2. Kawthar Shafie Khorassani, Jahanzeb Maqbool Hashmi, Ching-Hsiang Chu, Chen-Chun Chen, Hari Subramoni and D. K. Panda, "Designing a ROCm-aware MPI Library for AMD GPUs: Early Experiences, " ISC HIGH PERFORMANCE 2021 Digital (due to COVID-19) JUNE 24 - JULY 2, 2021. (Accepted, Acceptance rate: 32%, 24/74)
  3. Qinghua Zhou, Ching-Hsiang Chu, N. Senthil Kumar, P. Kousha, Hari Subramoni and D. K. Panda, "Designing High-Performance MPI Libraries with On-the-fly Compression for Modern GPU Clusters, " 35th IEEE International Parallel and Distributed Processing Symposium (IPDPS), Portland, Oregon USA, May 17-21, 2021. (Accepted) (Best Paper Nominee)
  4. Kawthar Shafie Khorassani, Ching-Hsiang Chu, Quentin Anthony, Hari Subramoni and D. K. Panda, "Adaptive and Hierarchical Large Message All-to-all Communication Algorithms for Large-scale Dense GPU Systems, " The 21st IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGrid), Melbourne, Victoria, Australia (Virtual due to COVID-19), May 10-13, 2021. (Accepted)
  5. Ching-Hsiang Chu, Kawthar Shafie Khorassani, Qinghua Zhou, Hari Subramoni and D. K. Panda, "Dynamic Kernel Fusion for Bulk Non-contiguous Data Transfer on GPU Clusters, " IEEE CLUSTER 2020, Kobe, Japan (ONLINE due to COVID-19), Sep. 14-17, 2020. (Acceptance rate: 24%, 32/132 full paper)
  6. Ching-Hsiang Chu, Pouya Kousha, Ammar Awan, Kawthar Shafie Khorassani, Hari Subramoni and D. K. Panda, "NV-Group: Link-Efficient Reductions for Distributed Deep Learning on Modern Dense GPU Systems, " The 34th ACM International Conference on Supercomputing (ICS-2020), Barcelona, Spain (ONLINE due to COVID-19), June 29 - July 2, 2020. (Acceptance rate: 30%, 40/132)
  7. Ching-Hsiang Chu, Jahanzeb Hashmi, Kawthar Shafie Khorassani, Hari Subramoni and D. K. Panda, "High-Performance Adaptive MPI Derived Datatype Communication for Modern Multi-GPU Systems, " 26th IEEE International Conference on High Performance Computing, Data, Analytics and Data Science (HiPC '19), Hyderabad, India, Dec 17-20, 2019. (Acceptance rate: 23%, 39/171)
  8. Pouya Kousha, Bharath Ramesh, Kaushik Kandadi Suresh, Ching-Hsiang Chu, Arpan Jain , Nick Sarkauskas, Hari Subramoni and Dhabaleswar K. Panda, "Designing a Profiling and Visualization Tool for Scalable and In-Depth Analysis of High-Performance GPU Clusters, " 26th IEEE International Conference on High Performance Computing, Data, Analytics and Data Science (HiPC '19), Hyderabad, India, Dec 17-20, 2019.
  9. K. Vadambacheri Manian, Ching-Hsiang Chu, A. Ahmad Awan, K. Shafie Khorassani and H. Subramoni, "OMB-UM: Design, Implementation, and Evaluation of CUDA Unified Memory Aware MPI Benchmarks," 2019 IEEE/ACM Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS), held in conjunction with SC2019, Denver, CO, USA, 2019, pp. 82-92, doi: 10.1109/PMBS49563.2019.00015.
  10. Ammar Ahmad Awan, Arpan Jain, Ching-Hsiang Chu, Hari Subramoni and Dhabaleswar K. Panda, "Communication Profiling and Characterization of Deep Learning Workloads on Clusters with High-Performance Interconnects, " 26th IEEE biennial Symposium on High-Performance Interconnects (HOTI'19), Santa Clara, CA, USA, Aug 14-16, 2019.
  11. Kawthar Shafie Khorassani, Ching-Hsiang Chu, Hari Subramoni and Dhabaleswar K. Panda, "Performance Evaluation of MPI Libraries on GPU-enabled OpenPOWER Architectures: Early Experiences, " International Workshop on OpenPOWER for HPC, held in conjunction with ISC'19, Frankfurt, Germany, June 20, 2019
  12. Jie Zhang, Xiaoyi Lu, Ching-Hsiang Chu, and D. K. Panda "C-GDR: High-Performance Container-aware GPUDirect MPI Communication Schemes on RDMA Networks, " 33rd IEEE International Parallel & Distributed Processing Symposium (IPDPS '19), Rio de Janeiro, Brazil, May 20-24, 2019. (Acceptance rate: 27.7%, 103/372)
  13. Ching-Hsiang Chu, Sreeram Potluri, Anshuman Goswami, Manjunath Venkata, Neena Inam and Chris J. Newburn "Designing High-Performance In-Memory Key-Value Operations with Persistent GPU Kernels and OpenSHMEM, " Fifth Workshop on OpenSHMEM and Related Technologies (OpenSHMEM 2018), Baltimore, Maryland, Aug 21-23, 2018.
  14. Ammar Ahmad Awan, Ching-Hsiang Chu, Hari Subramoni and Dhabaleswar K. Panda, "Optimized Broadcast for Deep Learning Workloads on Dense-GPU InfiniBand Clusters: MPI or NCCL?, " EuroMPI, Barcelona, Spain, September 23 - 26, 2018.
  15. Ching-Hsiang Chu, Xiaoyi Lu, Ammar Ahmad Awan, Hari Subramoni, Jahanzeb Hashmi, Bracy Elton and Dhabaleswar Panda, "Efficient and Scalable Multi-Source Streaming Broadcast on GPU Clusters for Deep Learning, " The 46th International Conference on Parallel Processing (ICPP 2017), Bristol, UK, Aug 14-17, 2017. (Acceptance rate: 28.4%)
  16. Akshay Venkatesh, Khaled Hamidouche, Sreeram Potluri, Davide Rossetti, Ching-Hsiang Chu and Dhabaleswar Panda, "MPI-GDS: High Performance MPI Designs with GPUDirect-aSync for CPU-GPU Control Flow Decoupling, " The 46th International Conference on Parallel Processing (ICPP 2017), Bristol, UK, Aug 14-17, 2017. (Acceptance rate: 28.4%)
  17. Ching-Hsiang Chu, Kaled Hamidouche, Hari Subramoni, Akshay Venkatesh, Bracy Elton, and Dhabaleswar K. Panda, "Efficient Reliability Support for Hardware Multicast-based Broadcast in GPU-enabled Streaming Applications, " First Workshop on Optimization of Communication in HPC runtime systems (COM-HPC), held in conjunction with SC'16, Salt Lake City, UT, USA, Nov. 18, 2016. (Acceptance rate: 35%, 8/23)
  18. Ching-Hsiang Chu, Kaled Hamidouche, Hari Subramoni, Akshay Venkatesh, Bracy Elton, and Dhabaleswar K. Panda, "Designing High Performance Heterogeneous Broadcast for Streaming Applications on GPU Clusters, " 28th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD'16), Los Angeles, CA, USA, Oct. 26-28, 2016. (Acceptance rate: 35%, 27/77)
  19. Ching-Hsiang Chu, Kaled Hamidouche, Akshay Venkatesh, Dip Sankar Banerjee, Hari Subramoni, and Dhabaleswar K. Panda, "Exploiting Maximal Overlap for Non-Contiguous Data Movement Processing on Modern GPU-enabled Systems, " 30th IEEE International Parallel & Distributed Processing Symposium (IEEE IPDPS 2016), Chicago, IL, USA, May 23-27, 2016. (Acceptance rate: 23%, 114/496)
  20. Ching-Hsiang Chu, Kaled Hamidouche, Akshay Venkatesh, Ammar Ahmad Awan, and Dhabaleswar K. Panda, "CUDA Kernel based Collective Reduction Operations on Large-scale GPU Clusters, " 16th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (IEEE/ACM CCGrid 2016), Cartagena, Colombia, May 16-19, 2016. (Acceptance rate: ~20%)
  21. Ammar Ahmad Awan, Kaled Hamidouche, Ching-Hsiang Chu and Dhabaleswar K. Panda, "A Case for Non-Blocking Collectives in OpenSHMEM: Design, Implementation, and Performance Evaluation using MVAPICH2-X, " OpenSHMEM Workshop 2015, Annapolis, Maryland, USA, August 4-6, 2015.
  22. Kaled Hamidouche, Akshay Venkatesh, Ammar Ahmad Awan, Hari Subramoni, Ching-Hsiang Chu and Dhabaleswar K. Panda, "Exploiting GPUDirect RDMA in Designing High Performance OpenSHMEM for NVIDIA GPU Clusters, " IEEE Cluster 2015, Chicago, IL, USA, Sep. 8-11, 2015.
  23. Ching-Hsiang Chu, You-Ming Chen, Yu-Te Huang, Roberto Carvalho, Chiun-Chieh Hsu, and Ling-Jyn Chen, "Measurement of Long-distance Wi-Fi Connections: An Empirical Study, " 2014 IEEE International Conference on Communications - Mobile and Wireless Networking Symposium (ICC'14 MWN), Sydney, Australia, June 10-14, 2014. doi:10.1109/ICC.2014.6883685 (Link) (Presenter)(Acceptance rate: 38%, 995/2608)
  24. Ching-Hsiang Chu, Jyh-Ming Chen and Eric Hsiao-Kuang Wu, "A New Transport Protocol for Cloud Servers, " The 6th IEEE International Conference on Ubi-Media Computing (UMEDIA 2013), Aizu-Wakamatsu, Japan, November 2-4, 2013.
  25. Yu-Chen Huang, Ching-Hsiang Chu, Eric Hsiao-Kuang Wu, "A Novel Congestion Control Mechanism on TFRC for Streaming Applications over Wired-Wireless Networks," The 7th ACM* Workshop on Wireless Multimedia Networking and Computing (WMUNEP 2011), Miami Beach, Florida, USA, Oct. 31-Nov. 4, 2011. doi: 10.1145/2069117.2069122. (Presenter) (Link)
  26. Jyh-Ming Chen, Ching-Hsiang Chu, Eric Hsiao-Kuang Wu, Meng-Feng Tsai, and Jian-Ren Wang, "JSCTP: A Jitter-based Congestion Control Scheme for SCTP over Wired-Wireless Networks," IEEE 2011 International Conference on Wireless and Optical Communications (ICWOC 2011), Zhengzhou, China, May 21-22, 2011. (Presenter)
  27. Yi-Cheng Chan, Ming-Chun Liao, and Ching-Hsiang Chu, “A Collision-Aware Backoff Mechanisms for IEEE 802.11 WLANs, ” IEEE International Conference on Intelligent Computing and Intelligent Systems (ICIS 2009), Shanghai, China, November 2009. doi:10.1109/ICICISYS.2009.5358181. (Link)

Poster/Demo

  1. Ammar Ahmad Awan, Ching-Hsiang Chu, Xiaoyi Lu, Dhabaleswar K. Panda, Hari Subramoni, "Can Unified-Memory support on Pascal and Volta GPUs enable Out-of-Core DNN Training? " ISC Research Poster, Franfurt, Germany, June 26, 2018.
  2. Ching-Hsiang Chu and Dhabaleswar Panda, “High-Performance and Scalable Broadcast Schemes for Deep Learning on GPU Clusters, ” The International Conference for High Performance Computing, Networking, Storage and Analysis (SC17), Denver, CO, USA, Nov. 12-17, 2017. (ACM Student Research Competition Poster with Travel Award) (Acceptance rate: 47.5%, 28/59)
  3. Roberto Carvalho, Ching-Hsiang Chu and Ling-Jyh Chen*, “IVC: Imperceptible Video Communication, ” The 15th Annual International Workshop on Mobile Computing Systems and Applications (ACM HotMobile 2014), Santa Barbara, CA, USA, February 26-27, 2014. (Link)

Domestic Conference at Taiwan

  1. Ing-Chau Chang, Chih-Sung Hsieh, Ching-Hsiang Chu, “HSMM: Hierarchical Synchronized Multimedia Multicast for Heterogeneous Mobile Networks, ”Taiwan Academic Network Conference 2009 (TANET 2009), NCUE, October 28-30, 2009. (Presenter)
  2. 朱慶翔, 邱昶豪, 田惠文, 林煜泓, 詹益禎, “以協定為基礎的垃圾郵件防堵機制之研究, ” Taiwan Academic Network Conference 2009 (TANET 2009), pp. 196, NCUE, October 28-30, 2009. (Presenter)

Impact Factor of Selected Journals