Biography
News Recent Updates
Feb 2026 · Paper CVPR 2026 paper accepted (Highlight) · Congratulations for Dr. Gao!
Publications
(C: Conference | J: Journal | P: Preprint | *: Equal Contribution)
Conference
[C1] C²FG: Control Classifier-Free Guidance via Score Discrepancy Analysis
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2026 Highlight
[C2] I-DRUID: Layout to Image Generation via Instance-Disentangled Representation and Unpaired Data
International Conference on Learning Representations (ICLR) 2026
[C3] Bidirectional Noise Injection: Enhancing Diffusion Models via Coordinated Input-Output Perturbation
AAAI Conference on Artificial Intelligence (AAAI) 2026 Oral
[C4] Pruning for Sparse Diffusion Models based on Gradient Flow
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2025 Oral
[C5] Non-uniform Timestep Sampling: Towards Faster Diffusion Model Training
ACM International Conference on Multimedia (MM) 2024
[C6] Beta-Tuned Timestep Diffusion Model
European Conference on Computer Vision (ECCV) 2024
[C7] FAMIM: A Novel Frequency-Domain Augmentation Masked Image Model Framework for Domain Generalizable Face Anti-Spoofing
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2024 Oral
Journal
[J1] Bidirectional Beta-Tuned Diffusion Model
IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI) 2026
[J2] Gradient Flow-Based Iterative Pruning for Efficient and High-Quality Lightweight Diffusion Models
Neural Networks (NN) 2025
[J3] EnfoMax: Domain Entropy and Mutual Information Maximization for Domain Generalized Face Anti-Spoofing
Neurocomputing (NC) 2025
[J4] Enhancing the Accuracy of Generative Adversarial Networks with Fokker-Planck Equations
Neurocomputing (NC) 2025
[J5] EBM-WGF: Training Energy-based Models with Wasserstein Gradient Flow
Neural Networks (NN) 2025
[J6] MFAE: Masked Frequency Autoencoders for Domain Generalization Face Anti-Spoofing
IEEE Transactions on Information Forensics and Security (T-IFS) 2024
Preprint
[P2] InfoTok: Regulating Information Flow for Capacity-Constrained Shared Visual Tokenization in Unified MLLMs
arXiv 2026
[P1] Towards Generalizable Data Protection With Transferable Unlearnable Examples
arXiv 2023
Research Experiences
WorkResearcher
Jul. 2025 - Present
- Focused on improving diffusion model architectures, including training efficiency, sampling quality, and controllability.
- Explored multimodal large language models (MLLMs), with emphasis on visual understanding and generation alignment.
InternResearch Intern
Mar. 2025 - Jun. 2025
- Investigated controllable image generation, focusing on the automated generation of advertising assets.
- Explored techniques for achieving fine-grained control over generative outputs in diffusion-based frameworks.
InternResearch Intern
Oct. 2023 - Mar. 2025
- Investigated the application and improvement of diffusion models for image generation and edit tasks.
InternResearch Intern
Jan. 2022 - Oct. 2023
- Conducted research on computer vision algorithms with a focus on self-supervised learning and representation learning for visual understanding.
- Explored domain generalization methods to improve model robustness across different data distributions, particularly for face anti-spoofing.
InternResearch Intern
Jul. 2021 - Dec. 2021
- Explored efficient inference techniques for deep learning models.
Honors & Awards
BYD Scholarship, Shanghai Jiao Tong University
2025
Outstanding Graduate Scholarship, Xidian University
2020
Outstanding Student Scholarship, Xidian University
2017, 2018, 2019