I am a fourth-year undergraduate student at the School of Computer Engineering and Science, Shanghai University. I am currently an intern at Shanghai AI Laboratory, where I work mainly on inference acceleration and graph optimization for large language models.
🔬 Research Interests
- Training and Inference Acceleration
- Graph Compilation and Optimization
- Machine Learning Systems
- High-Performance Computing
🚀 Skills
- C/C++
- CUDA
- PyTorch/LibTorch
- MLIR
- CMake