个人简介:
娄文启,现为中国科大软件学院特任副研究员,硕士生导师。2018年6月本科毕业于西北工业大学计算机学院,2023年12月于中国科学技术大学获得计算机系统结构博士学位,导师为周学海教授与王超教授。主要研究方向为智能加速器架构、FPGA加速器设计、软硬件协同优化等,致力于从算法与硬件角度缓解深度学习模型的部署压力。相关成果发表于IEEE TC、DATE、FPGA,CLUSTER等计算机系统结构领域知名期刊和会议。
电子邮箱: louwenqi@ustc.edu.cn
联系地址: 至德楼A1102-2,中国科大苏州高等研究院若水路校区
个人主页:http://home.ustc.edu.cn/~louwenqi/
主要研究方向:
FPGA加速器设计(CNN、Transformer等)
模型与硬件协同优化(模型稀疏化与量化、神经网络架构搜索等)
智能加速器架构
获奖情况:
英特尔中国奖学金 2022
中国科大姑苏一等奖学金 2021
主要学术论文及著作
Wenqi Lou, Lei Gong, Chao Wang, Zidong Du, Xuehai Zhou. "OctCNN: A High Throughput FPGA Accelerator for CNNs Using Octave Convolution Algorithm". IEEE Transactions on Computers (IEEE TC), 2021, 71(8): 1847-1859. (CCF-A)
Wenqi Lou, Jiaming Qian, Lei Gong, Xuan Wang, Chao Wang, Xuehai Zhou. "NAF: Deeper Network/Accelerator Co-Exploration for Customizing CNNs on FPGA". Design, Automation & Test in Europe Conference & Exhibition (DATE). IEEE, 2023 (CCF-B, EDA Flagship Conference)
娄文启, 王超, 宫磊, 周学海. 一种神经网络指令集扩展与代码映射机制. 软件学报, 2020. (CCF-A 类中文期刊)
Wenqi Lou, Chao Wang, Lei Gong, Xuehai Zhou. "OctCNN: An Energy-Efficient FPGA Accelerator for CNNs using Octave Convolution Algorithm". IEEE International Conference on Cluster Computing (CLUSTER). IEEE, 2020. (CCF-B)
Wenqi lou, Chao Wang, Lei Gong, Xuehai Zhou. "Neural Network Instruction Set Extension and Code Mapping Mechanism". International Journal of Software and Informatics (IJSI), 2020. (EI Index)
Wenqi Lou, Chao Wang, Lei Gong, Xuehai Zhou. "RV-CNN: Flexible and efficient instruction set for CNNs based on RISC-V processors" Advanced Parallel Processing Technologies: 13th International Symposium (APPT), 2019. (EI Index)
Xuan Wang, Lei Gong, Jing Cao, Wenqi Lou, Weiya Wang, Chao Wang, Xuehai Zhou. "hAP: A Spatial-von Neumann Heterogeneous Automata Processor with Optimized Resource and IO Overhead on FPGA". Proceedings of the 2023 ACM/SIGDA International Symposium on Field Programmable Gate Arrays (FPGA). 2023. (CCF-B, FPGA TOP Conference)