Wenqi Lou (娄文启)
About me
I was born in Xinxiang, Henan Province. I received my Bachelor's degree from the School of Computer Science, Northwestern Polytechnical University (NWPU), Xi'an, in June 2018.
In the same year, I was admitted without entrance examination to the M.Sc. program at the School of Computer Science and Technology, University of Science and Technology of China (USTC). In September 2020, I began my Ph.D. studies under the supervision of Professor Xuehai Zhou and Professor Chao Wang. I received my Ph.D. degree in computer science from USTC in December 2023.
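I am currently an Associate Research Fellow (特任副研究员) and master's supervisor at the School of Software Engineering, USTC. I graduated from the School of Computer Science, Northwestern Polytechnical University, in June 2018, and received my Ph.D. degree in computer architecture from the University of Science and Technology of China in December 2023, advised by Professor Xuehai Zhou and Professor Chao Wang.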
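My main research interests include intelligent accelerator architecture, FPGA accelerator design, and software-hardware co-optimization, with the aim of easing the deployment pressure of deep learning models from both the algorithm and hardware sides.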
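In recent years, I have published more than ten papers in the field of computer architecture, in top journals and conferences including IEEE TCAD, TC, FPGA, DATE, and CODES+ISSS. Six of these, published as first or corresponding author, are CCF-A/B or JCR Q1 papers.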
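The Energy-Efficient Intelligent Computing Systems Lab recruits graduate students, both recommended (exam-exempt) and entrance-exam candidates, for USTC's School of Computer Science and Technology, School of Software Engineering, and School of Data Science (including joint training with the School of Software Engineering). If you are interested, please feel free to contact me.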
Research Interests
- FPGA Accelerator Design (CNN, Transformer, etc.)
- Co-optimization of Algorithm and Hardware (Model Sparsification and Quantization, Neural Architecture Search, etc.)
- Intelligent accelerator architecture
Publications (* corresponding author)
Wenqi Lou, Yunji Qin, Xuan Wang, Lei Gong, Chao Wang, Xuehai Zhou.
"FlexBCM: Hybrid Block-Circulant Neural Network and Accelerator Co-Search on FPGAs".
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (IEEE TCAD), 2024, 43(11):3852-3863.
(CCF-A, accepted at CODES+ISSS 2024)
Wenqi Lou, Lei Gong, Chao Wang, Jiaming Qian, Xuan Wang, Changlong Li, Xuehai Zhou.
"Unleashing Network/Accelerator Co-Exploration Potential on FPGAs: A Deeper Joint Search".
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (IEEE TCAD), 2024, 43(10):3041-3054.
(CCF-A)
Wenqi Lou, Lei Gong, Chao Wang, Zidong Du, Xuehai Zhou.
"OctCNN: A High Throughput FPGA Accelerator for CNNs Using Octave Convolution Algorithm".
IEEE Transactions on Computers (IEEE TC), 2022, 71(8): 1847-1859.
(CCF-A)
Wenqi Lou, Jiaming Qian, Lei Gong, Xuan Wang, Chao Wang, Xuehai Zhou.
"NAF: Deeper Network/Accelerator Co-Exploration for Customizing CNNs on FPGA".
Design, Automation & Test in Europe Conference & Exhibition (DATE). IEEE, 2023.
(CCF-B, Flagship Conference in EDA Area)
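Wenqi Lou, Chao Wang, Lei Gong, Xuehai Zhou.
"一种神经网络指令集扩展与代码映射机制" (A Neural Network Instruction Set Extension and Code Mapping Mechanism).
软件学报 (Journal of Software), 2020.
(CCF-T1, Chinese Journal)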
Wenqi Lou, Chao Wang, Lei Gong, Xuehai Zhou.
"OctCNN: An Energy-Efficient FPGA Accelerator for CNNs using Octave Convolution Algorithm".
IEEE International Conference on Cluster Computing (CLUSTER). IEEE, 2020.
(CCF-B)
Wenqi Lou, Chao Wang, Lei Gong, Xuehai Zhou.
"RV-CNN: Flexible and efficient instruction set for CNNs based on RISC-V processors".
Advanced Parallel Processing Technologies: 13th International Symposium (APPT), 2019.
(EI Index, hosted by the CCF Architecture Committee)
Wenqi Lou, Chao Wang, Lei Gong, Xuehai Zhou.
"Neural Network Instruction Set Extension and Code Mapping Mechanism".
International Journal of Software and Informatics (IJSI), 2020. (EI Index)
Yunji Qin, Wenqi Lou*, Chao Wang*, Lei Gong, Xuehai Zhou.
"Enhancing Long Sequence Input Processing in FPGA-Based Transformer Accelerators through Attention Fusion".
Proceedings of the 2024 ACM Great Lakes Symposium on VLSI (GLVLSI). 2024. (CCF-C)
Wei Fu, Wenqi Lou*, Yunji Qin, Lei Gong, Chao Wang, Xuehai Zhou.
"MFNAS: Multi-Fidelity Exploration in Neural Architecture Search with Stable Zero-shot Proxy".
Pacific Rim International Conference on Artificial Intelligence (PRICAI). Springer, 2024:348-360. (CCF-C)
Wei Fu, Wenqi Lou*, Lei Gong, Chao Wang, Xuehai Zhou.
"Beyond Training: A Zero-Shot Framework to Neural Architecture and Accelerator Co-Exploration".
IEEE International Conference on Cluster Computing Workshops (CLUSTER Workshops). IEEE, 2024. (CCF-B)
Cheng Tang, Wenqi Lou*.
"TCL-Net: A Lightweight and Efficient Dehazing Network with Frequency-Domain Fusion and Multi-Angle Attention".
Asian Conference on Computer Vision (ACCV). Springer, 2024. (CCF-C, oral, 5.6%)
Yixuan Zhu, Wenqi Lou, Yinkang Gao, Binze Jiang, Xiaohang Gong, Xi Li.
"Fine-Grained Shared Cache Interference Analysis using Basic Block's Execution Time".
IEEE International Conference on Computer Design (ICCD), 2024.
(CCF-B)
Xuan Wang, Lei Gong, Jing Cao, Wenqi Lou, Weiya Wang, Chao Wang, Xuehai Zhou.
"hAP: A Spatial-von Neumann Heterogeneous Automata Processor with Optimized Resource and IO Overhead on FPGA".
Proceedings of the 2023 ACM/SIGDA International Symposium on Field Programmable Gate Arrays (FPGA). 2023. (CCF-B, Top Conference in FPGA Area)
Xiangjun Qu, Lei Gong, Wenqi Lou, Qianyu Cheng, Xianglan Chen, Chao Wang, Xuehai Zhou.
"AutoSparse: A Source-to-Source Format and Schedule Auto-Tuning Framework for Sparse Tensor Program".
IEEE International Conference on Computer Design (ICCD), 2024.
(CCF-B)
Zihan Wang, Lei Gong, Wenqi Lou, Qianyu Cheng, Xianglan Chen, Chao Wang, Xuehai Zhou.
"UniCoMo: A Unified Learning-Based Cost Model for Tensorized Program Tuning".
IEEE International Conference on Computer Design (ICCD), 2024.
(CCF-B)
Activities
Journal Reviewer
- Reviewer for IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (CCF-A)
- Reviewer for IEEE/ACM Transactions on Computational Biology and Bioinformatics (CCF-B)
- Reviewer for Microprocessors and Microsystems (SCIE)
- Reviewer for International Journal of Electronics (SCIE)
- Reviewer for IET Computers & Digital Techniques (EI)
Awards
- Intel Fellowship 2022
- USTC-Gusu First Class Scholarship 2021
- Outstanding Graduate of NWPU 2018
Projects
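- "Research on Software-Hardware Co-Optimization Methods for FPGA Accelerators Targeting Hybrid Convolution-Attention Models", Youth Innovation Fund of the School of Software Engineering, USTC, 2024-05 to 2026-05, CNY 200,000, Principal Investigator, ongoing.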
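- "New Principles, Structures, and Methods for Reconfigurable Neural Network Accelerators Based on Frequency-Domain Filtering Convolution", National Natural Science Foundation of China (NSFC) General Program, 2022-01 to 2025-12-31, CNY 590,000, key technical member, ongoing.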
Teaching
- 2024: "Practical Algorithm Design", foundation course, School of Software Engineering, USTC, 2024.09, lead instructor
- 2025: "Advanced Computer Architecture", elective course, School of Software Engineering, USTC, 2025.02, lead instructor