Junkun Yuan   袁俊坤

Research Scientist, Hunyuan Multimodal Model Group @ Tencent

yuanjk0921@outlook.com

Living and working in Shenzhen, China

Last updated on October 25, 2025 at 10:52 (UTC+8)

Biography

I have been working as a research scientist in the Foundation Model Team of the Hunyuan Multimodal Model Group at Tencent since Jul 2024. I focus on multimodal generative foundation models and their downstream applications.

From Sep 2023 to Jul 2024, I interned in the Hunyuan Multimodal Model Group at Tencent, working with Wei Liu. From Jul 2022 to Aug 2023, I interned in the Computer Vision Group at Baidu, working with Xinyu Zhang and Jingdong Wang.

I received my Ph.D. in Computer Science from Zhejiang University (2019-2024), co-supervised by Professors Kun Kuang, Lanfen Lin, and Fei Wu. I received my B.S. in Automation from Zhejiang University of Technology (2015-2019), co-supervised by Professors Qi Xuan and Li Yu.

I have been fortunate to work closely with friends such as Defang Chen and Yue Ma, whose insights have profoundly shaped my approach to research.

Publications

Google Scholar Profile

✳: (co-)first author    ✉: corresponding author

2025: AsynDM (arXiv 2025)       Follow-Your-Preference (arXiv 2025)       Follow-Your-Emoji-Faster (IJCV 2025)       Hunyuan-Game (arXiv 2025)

2024: HunyuanVideo (arXiv 2024)       MPL (IJCV 2024)       Follow-Your-Canvas (AAAI 2025)       Follow-Your-Emoji (SIGGRAPH-Asia 2024)       Domaindiff (ICASSP 2024)

2023: HAP (NeurIPS 2023)       MAP (ICCV 2023)       NPT (KDD 2024)       HTCL (KDD 2023)       CAM (ICCV 2023)       KDDRL (TMM 2023)       MPL (ICCV 2023)

2022: CAE v2 (TMLR 2023)       CEG (MM 2022)       ACDA (Neurocomputing 2022)

2021: DSBF (IJCV 2022)       CSAC (TKDE 2023)       IV-DG (TKDD 2023)       AutoIV (TKDD 2022)

2020: GAPGAN (ECAI 2020)       DeR-CFR (TKDE 2023)

2019: SGNs (TKDE 2021)

2025

Asynchronous Denoising Diffusion Models for Aligning Text-to-Image Generation

Zijing Hu, Yunze Tong, Fengda Zhang, Junkun Yuan, et al.

arXiv 2025

Oct 06, 2025   |   AsynDM   |   code

Follow-Your-Preference: Towards Preference-Aligned Image Inpainting

Yutao Shen, Junkun Yuan, Toru Aonishi, Hideki Nakayama, et al.

arXiv 2025

Sep 27, 2025   |   Follow-Your-Preference   |   code

Follow-Your-Emoji-Faster: Towards Efficient, Fine-Controllable, and Expressive Freestyle Portrait Animation

Yue Ma, Zexuan Yan, Hongyu Liu, Hongfa Wang, Heng Pan, Yingqing He, Junkun Yuan, et al.

International Journal of Computer Vision (IJCV), 2025

Sep 20, 2025   |   Follow-Your-Emoji-Faster   |   code

Hunyuan-Game: Industrial-grade Intelligent Game Creation Model

Hunyuan Multimodal Model Group at Tencent (as a group member)

arXiv 2025

May 20, 2025   |   Hunyuan-Game   |   code

2024

HunyuanVideo: A Systematic Framework For Large Video Generative Models

Hunyuan Multimodal Model Group at Tencent (as a group member)

arXiv 2024

Dec 03, 2024   |   HunyuanVideo   |   code

It introduces an open-source, 13B-parameter diffusion model for video generation. It has over 400 citations and over 11,000 GitHub stars (as of Oct 2025).

Mutual Prompt Learning for Vision Language Models

Sifan Long, Zhen Zhao, Junkun Yuan, Zichang Tan, et al.

International Journal of Computer Vision (IJCV), 2024

Sep 26, 2024   |   MPL

Follow-Your-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation

Qihua Chen, Yue Ma, Hongfa Wang, Junkun Yuan, et al.

AAAI Conference on Artificial Intelligence (AAAI), 2025

Sep 02, 2024   |   Follow-Your-Canvas   |   code

Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation

Yue Ma, Hongyu Liu, Hongfa Wang, Heng Pan, Yingqing He, Junkun Yuan, et al.

ACM SIGGRAPH Annual Conference in Asia (SIGGRAPH-Asia), 2024

Jun 04, 2024   |   Follow-Your-Emoji   |   code

Domaindiff: Boost Out-of-Distribution Generalization with Synthetic Data

Qiaowei Miao, Junkun Yuan, Shengyu Zhang, Fei Wu, et al.

International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2024

Apr 14, 2024   |   Domaindiff

2023

HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception

Junkun Yuan, Xinyu Zhang, Hao Zhou, Jian Wang, et al.

Advances in Neural Information Processing Systems (NeurIPS), 2023

Oct 31, 2023   |   HAP   |   code

MAP: Towards Balanced Generalization of IID and OOD through Model-Agnostic Adapters

Min Zhang, Junkun Yuan, Yue He, Wenbin Li, et al.

International Conference on Computer Vision (ICCV), 2023

Oct 02, 2023   |   MAP   |   code

Neural Collapse Anchored Prompt Tuning for Generalizable Vision-Language Models

Didi Zhu, Zexi Li, Min Zhang, Junkun Yuan, et al.

ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2024

Jun 28, 2023   |   NPT

Quantitatively Measuring and Contrastively Exploring Heterogeneity for Domain Generalization

Yunze Tong, Junkun Yuan, Min Zhang, Didi Zhu, et al.

ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2023

May 25, 2023   |   HTCL   |   code

Universal Domain Adaptation via Compressive Attention Matching

Didi Zhu, Yincuan Li, Junkun Yuan, Zexi Li, et al.

International Conference on Computer Vision (ICCV), 2023

Apr 24, 2023   |   CAM

Knowledge Distillation-Based Domain-Invariant Representation Learning for Domain Generalization

Ziwei Niu, Junkun Yuan, Xu Ma, Yingying Xu, et al.

IEEE Transactions on Multimedia (TMM), 2023

Apr 05, 2023   |   KDDRL

Task-Oriented Multi-Modal Mutual Leaning for Vision-Language Models

Sifan Long, Zhen Zhao, Junkun Yuan, Zichang Tan, et al.

International Conference on Computer Vision (ICCV), 2023

Mar 30, 2023   |   MPL

2022

CAE v2: Context Autoencoder with CLIP Target

Xinyu Zhang, Jiahui Chen, Junkun Yuan, Qiang Chen, et al.

Transactions on Machine Learning Research (TMLR), 2023

Nov 17, 2022   |   CAE v2   |   code

Label-Efficient Domain Generalization via Collaborative Exploration and Generalization

Junkun Yuan, Xu Ma, Defang Chen, Kun Kuang, et al.

ACM International Conference on Multimedia (MM), 2022

Aug 07, 2022   |   CEG   |   code

Attention-based Cross-Layer Domain Alignment for Unsupervised Domain Adaptation

Xu Ma, Junkun Yuan, Yen-Wei Chen, Ruofeng Tong, et al.

Neurocomputing 2022

Feb 27, 2022   |   ACDA

2021

Domain-Specific Bias Filtering for Single Labeled Domain Generalization

Junkun Yuan, Xu Ma, Defang Chen, Kun Kuang, et al.

International Journal of Computer Vision (IJCV), 2022

Oct 02, 2021   |   DSBF   |   code

Collaborative Semantic Aggregation and Calibration for Federated Domain Generalization

Junkun Yuan, Xu Ma, Defang Chen, Fei Wu, et al.

IEEE Transactions on Knowledge and Data Engineering (TKDE), 2023

Oct 13, 2021   |   CSAC   |   code

Instrumental Variable-Driven Domain Generalization with Unobserved Confounders

Junkun Yuan, Xu Ma, Kun Kuang, Ruoxuan Xiong, et al.

ACM Transactions on Knowledge Discovery from Data (TKDD), 2023

Oct 04, 2021   |   IV-DG   |   code

Auto IV: Counterfactual Prediction via Automatic Instrumental Variable Decomposition

Junkun Yuan, Anpeng Wu, Kun Kuang, Bo Li, et al.

ACM Transactions on Knowledge Discovery from Data (TKDD), 2022

Jul 13, 2021   |   AutoIV   |   code

2020

Black-Box Adversarial Attacks Against Deep Learning Based Malware Binaries Detection with GAN

Junkun Yuan, Shaofang Zhou, Lanfen Lin, Feng Wang, et al.

European Conference on Artificial Intelligence (ECAI), 2020

Aug 29, 2020   |   GAPGAN

Learning Decomposed Representation for Counterfactual Inference

Anpeng Wu, Junkun Yuan, Kun Kuang, Bo Li, et al.

IEEE Transactions on Knowledge and Data Engineering (TKDE), 2023

Jun 12, 2020   |   DeR-CFR   |   code

2019

Subgraph Networks with Application to Structural Feature Space Expansion

Qi Xuan, Jinhuan Wang, Minghao Zhao, Junkun Yuan, et al.

IEEE Transactions on Knowledge and Data Engineering (TKDE), 2021

Mar 21, 2019   |   SGNs   |   code

Professional Service