Wenqi Lou (娄文启)

Associate Researcher (特任副研究员),
School of Software Engineering
Suzhou Institute for Advanced Research,
University of Science and Technology of China (USTC)
Research Interests: Neural Network Accelerators, Hardware-Software Co-Optimization
Address: 508, ShaoJun Building, Suzhou Institute for Advanced Research, USTC, Suzhou, Jiangsu, China
E-mail: louwenqi@ustc.edu.cn

About me

I was born in Xinxiang, Henan Province. I received my Bachelor's degree from the School of Computer Science, Northwestern Polytechnical University (NWPU), Xi'an, in June 2018. In the same year, I was admitted without entrance examination to the M.Sc. program of the School of Computer Science and Technology, University of Science and Technology of China (USTC). In September 2020, I began my Ph.D. studies under the supervision of Professor Xuehai Zhou and Professor Chao Wang. I received my Ph.D. degree in computer science from USTC in December 2023.

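Wenqi Lou is currently an Associate Researcher and master's student supervisor at the School of Software Engineering, USTC. He received his Bachelor's degree from the School of Computer Science, Northwestern Polytechnical University, in June 2018, and his Ph.D. degree in computer architecture from the University of Science and Technology of China in December 2023, advised by Professor Xuehai Zhou and Professor Chao Wang. His main research directions include intelligent accelerator architecture, FPGA accelerator design, and hardware-software co-optimization, with the aim of easing the deployment burden of deep learning models from both the algorithm and hardware sides. His work has been published in well-known computer architecture journals and conferences, including IEEE TC, IEEE TCAD, DATE, FPGA, and CODES+ISSS.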

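The Energy-Efficient Intelligent Computing Systems Laboratory (高能效智能计算系统实验室) recruits graduate students, both recommended exam-exempt candidates and entrance-exam applicants, for USTC's School of Computer Science and Technology, School of Software Engineering, and School of Data Science (including joint training with the School of Software Engineering). If you are interested, please feel free to contact me.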

Research Interests

  • FPGA Accelerator Design (CNN, Transformer, etc.)
  • Co-optimization of Algorithm and Hardware (Model Sparsification and Quantization, Neural Architecture Search, etc.)
  • Intelligent Accelerator Architecture

Publications (* corresponding author)

  1. Wenqi Lou, Yunji Qin, Xuan Wang, Lei Gong, Chao Wang, Xuehai Zhou. "FlexBCM: Hybrid Block-Circulant Neural Network and Accelerator Co-Search on FPGAs". IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (IEEE TCAD), 2024. (CCF-A, accepted at CODES+ISSS 2024)

  2. Wenqi Lou, Lei Gong, Chao Wang, Jiaming Qian, Xuan Wang, Changlong Li, Xuehai Zhou. "Unleashing Network/Accelerator Co-Exploration Potential on FPGAs: A Deeper Joint Search". IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (IEEE TCAD), 2024. (CCF-A)

  3. Wenqi Lou, Lei Gong, Chao Wang, Zidong Du, Xuehai Zhou. "OctCNN: A High Throughput FPGA Accelerator for CNNs Using Octave Convolution Algorithm". IEEE Transactions on Computers (IEEE TC), 2021, 71(8): 1847-1859. (CCF-A)

  4. Wenqi Lou, Jiaming Qian, Lei Gong, Xuan Wang, Chao Wang, Xuehai Zhou. "NAF: Deeper Network/Accelerator Co-Exploration for Customizing CNNs on FPGA". Design, Automation & Test in Europe Conference & Exhibition (DATE). IEEE, 2023 (CCF-B, Flagship Conference in EDA Area)

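  5. Wenqi Lou, Chao Wang, Lei Gong, Xuehai Zhou. "Neural Network Instruction Set Extension and Code Mapping Mechanism" (一种神经网络指令集扩展与代码映射机制, in Chinese). Journal of Software (软件学报), 2020. (CCF-T1, Chinese Journal)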

  6. Wenqi Lou, Chao Wang, Lei Gong, Xuehai Zhou. "OctCNN: An Energy-Efficient FPGA Accelerator for CNNs using Octave Convolution Algorithm". IEEE International Conference on Cluster Computing (CLUSTER). IEEE, 2020. (CCF-B)

  7. Wenqi Lou, Chao Wang, Lei Gong, Xuehai Zhou. "RV-CNN: Flexible and efficient instruction set for CNNs based on RISC-V processors". Advanced Parallel Processing Technologies: 13th International Symposium (APPT), 2019. (EI Index, held by the CCF Architecture Committee)

  8. Wenqi Lou, Chao Wang, Lei Gong, Xuehai Zhou. "Neural Network Instruction Set Extension and Code Mapping Mechanism". International Journal of Software and Informatics (IJSI), 2020. (EI Index)

  9. Yunji Qin, Wenqi Lou*, Chao Wang*, Lei Gong, Xuehai Zhou. "Enhancing Long Sequence Input Processing in FPGA-Based Transformer Accelerators through Attention Fusion". Proceedings of the 2024 ACM Great Lakes Symposium on VLSI (GLVLSI). 2024. (CCF-C)

  10. Wei Fu, Wenqi Lou*, Yunji Qin, Lei Gong, Chao Wang, Xuehai Zhou. "MFNAS: Multi-Fidelity Exploration in Neural Architecture Search with Stable Zero-shot Proxy". Pacific Rim International Conference on Artificial Intelligence (PRICAI). Springer, 2024. (CCF-C)

  11. Wei Fu, Wenqi Lou*, Lei Gong, Chao Wang, Xuehai Zhou. "Beyond Training: A Zero-Shot Framework to Neural Architecture and Accelerator Co-Exploration". IEEE International Conference on Cluster Computing Workshops (CLUSTER Workshops). IEEE, 2024. (CCF-B)

  12. Cheng Tang, Wenqi Lou*. "TCL-Net: A Lightweight and Efficient Dehazing Network with Frequency-Domain Fusion and Multi-Angle Attention". Asian Conference on Computer Vision (ACCV). Springer, 2024. (CCF-C, oral, 5.6%)

  13. Yixuan Zhu, Wenqi Lou, Yinkang Gao, Binze Jiang, Xiaohang Gong, Xi Li. "Fine-Grained Shared Cache Interference Analysis using Basic Block's Execution Time". IEEE International Conference on Computer Design (ICCD), 2024. (CCF-B)

  14. Xuan Wang, Lei Gong, Jing Cao, Wenqi Lou, Weiya Wang, Chao Wang, Xuehai Zhou. "hAP: A Spatial-von Neumann Heterogeneous Automata Processor with Optimized Resource and IO Overhead on FPGA". Proceedings of the 2023 ACM/SIGDA International Symposium on Field Programmable Gate Arrays (FPGA). 2023. (CCF-B, Top Conference in FPGA Area)

  15. Xiangjun Qu, Lei Gong, Wenqi Lou, Qianyu Cheng, Xianglan Chen, Chao Wang, Xuehai Zhou. "AutoSparse: A Source-to-Source Format and Schedule Auto-Tuning Framework for Sparse Tensor Program". IEEE International Conference on Computer Design (ICCD), 2024. (CCF-B)

  16. Zihan Wang, Lei Gong, Wenqi Lou, Qianyu Cheng, Xianglan Chen, Chao Wang, Xuehai Zhou. "UniCoMo: A Unified Learning-Based Cost Model for Tensorized Program Tuning". IEEE International Conference on Computer Design (ICCD), 2024. (CCF-B)

Activities

Journal Reviewer

  • Reviewer for IEEE/ACM Transactions on Computational Biology and Bioinformatics (CCF-B)
  • Reviewer for Microprocessors and Microsystems (SCIE)
  • Reviewer for International Journal of Electronics (SCIE)
  • Reviewer for IET Computers & Digital Techniques (EI)

Awards

  • Intel Fellowship 2022
  • USTC-Gusu First Class Scholarship 2021
  • Outstanding Graduate of NWPU 2018

Teaching

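  • 2024: Practical Algorithm Design (《实用算法设计》), foundation course, School of Software Engineering, 2024.09, lead instructor
  • 2025: Advanced Computer Architecture (《高级计算机体系结构》), elective course, School of Software Engineering, 2025.02, lead instructor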