Zheren Fu

PhD student

School of Cyber Science and Technology
University of Science and Technology of China

A504, Xinzhi Building, Gaoxin Campus of USTC, Hefei, China
fzr@mail.ustc.edu.cn
[LinkedIn] [Zhihu] [GitHub]

Biography

I'm a final-year Ph.D. student at the University of Science and Technology of China (USTC), Hefei, China. I study in the National Engineering Laboratory for Brain-inspired Intelligence Technology and Application (NEL-BITA) and the Laboratory for Future Networks (LFN), supervised by Professor Zhendong Mao. My research interests include Artificial Intelligence, Computer Vision, Natural Language Processing, Large Multi-modal Models, and Cross-modal Alignment.

Education

  • University of Science and Technology of China, Hefei, 2020.9-2025.6
    Ph.D. Degree, Cyberspace Security

  • University of Science and Technology of China, Hefei, 2016.9-2020.6
    Bachelor's Degree, Electronic Information Engineering
    GPA: 3.81/4.3

Experiences

  • Multi-modal Algorithm Engineer (internship), 2024.5-Now
    Alibaba Group, Beijing
    I work in the Tongyi Laboratory, where I focus on supervised fine-tuning and preference alignment techniques for large multi-modal models.

  • Image Algorithm Engineer (internship), 2019.12-2020.10
    Kuaishou Technology, Hangzhou
    I joined the Search Technology Department (STD) and focused on the research and development of (commodity) image retrieval algorithms. During the internship, I published a top-conference paper as first author under the guidance of my mentor, Dr. Lei Zhang.

  • Teaching Assistant, 2019.8-2020.1
    University of Science and Technology of China, Hefei
    I was in charge of the Basic Circuit Theory course, which is offered to junior undergraduate students majoring in electronics and information. Here is the Course Website.

Publications

(* represents the corresponding author.)

  • Zheren Fu, Lei Zhang, Hou Xia, Zhendong Mao*, Linguistic-Aware Patch Slimming Framework for Fine-grained Cross-Modal Alignment, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR, CCF-A), 2024 [PDF] [Link] [Code]

  • Zheren Fu, Zhendong Mao*, Yan Song, Yongdong Zhang, Learning Semantic Relationship among Instances for Image-Text Matching, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR, CCF-A), 2023 [PDF] [Link] [Code]

  • Zheren Fu, Zhendong Mao*, Bo Hu, Anan Liu, Yongdong Zhang, Intra-class Adaptive Augmentation with Neighbor Correction for Deep Metric Learning, IEEE Transactions on Multimedia (T-MM, CCF-B), 2022 [PDF] [Link] [Code]

  • Zheren Fu, Zhendong Mao*, Chenggang Yan, Anan Liu, Hongtao Xie, Yongdong Zhang, Self-supervised Synthesis Ranking for Deep Metric Learning, IEEE Transactions on Circuits and Systems for Video Technology (T-CSVT, CCF-B), 2021 [PDF] [Link]

  • Zheren Fu, Yan Li, Zhendong Mao*, Quan Wang, Yongdong Zhang, Deep Metric Learning with Self-Supervised Ranking, AAAI Conference on Artificial Intelligence (AAAI, CCF-A), 2021 [PDF] [Link]

Honors & Awards

  • Student Funding Program of Cyberspace Security College, 2024

  • DAS-Security Scholarship, 2023

  • National Scholarship for Master Students, 2021

  • USTC Cyberspace Special Scholarship, 2020

  • USTC Excellent Graduation Project, 2020

  • USTC Outstanding Students Scholarship (Silver), 2019

  • CAS SIMIT Scholarship, 2018

  • USTC Outstanding Students Scholarship (Bronze), 2017

Services

  • Program Committee Member: AAAI25, CVPR24, AAAI24, AAAI23

  • Invited Reviewer: T-PAMI, T-MM, T-CSVT

Patents

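  • Zhendong Mao, Yongdong Zhang, Bowen Zhao, Zheren Fu, Multi-modal Rumor Detection Method Based on Image Sentiment Tendency, Application No.: 202010940956.5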

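  • Yongdong Zhang, Zhendong Mao, Xuran Deng, Zheren Fu, An Online Rumor Detection Method Based on a Pre-trained Language Model, Application No.: 201911379298.0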

Others

  • Bachelor Thesis: Self-Supervised Representation on Multi-Modal Scene, USTC Excellent Graduation Project [PDF] [PPT] [Code]