Publications
GPU architecture research
QuCo: Efficient and Flexible Hardware-Driven Automatic Configuration of Tile Transfers in GPUs (HPCA 26)
HDPAT: Hierarchical Distributed Page Address Translation for Wafer-Scale GPUs (HPCA 26)
DaisenBot: Human-AI Collaboration in GPU Performance Analysis with Multi-Modal AI Assistant
Luthier: A Dynamic Binary Instrumentation Framework Targeting AMD GPUs
M. Raayai-Ardakani et al., “Luthier: A Dynamic Binary Instrumentation Framework Targeting AMD GPUs,” 2025 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), Ghent, Belgium, 2025, pp. 137-149, doi: 10.1109/ISPASS64960.2025.00022.
Exploring the Wafer-Scale GPUs
Daoxuan Xu, Le Xu, Jie Ren, and Yifan Sun. 2025. Exploring the Wafer-Scale GPUs. In Proceedings of the 17th Workshop on General Purpose Processing Using GPU (GPGPU ’25). Association for Computing Machinery, New York, NY, USA, 8–13. https://doi.org/10.1145/3725798.3725800
Machine learning
Impact of Raindrops on Camera-Based Detection in Software-Defined Vehicles
Y. Luo, D. Xu, G. Zhou, Y. Sun and S. Lu, “Impact of Raindrops on Camera-Based Detection in Software-Defined Vehicles,” 2024 IEEE International Conference on Mobility, Operations, Services and Technologies (MOST), Dallas, TX, USA, 2024, pp. 193-205, doi: 10.1109/MOST60774.2024.00028.