Junkun Yuan 袁俊坤Research Scientist, Hunyuan Multimodal Model Group @ Tencent yuanjk0921@outlook.com work and live in Shenzhen, China Last updated on October 25, 2025 at 10:52 (UTC+8) |
|
Biography Publications Professional Service
I have been working as a research scientist in the Foundation Model Team of the Hunyuan Multimodal Model Group at Tencent since Jul 2024. I am focusing on multimodal generative foundation models and their various downstream applications.
During Sep 2023 — Jul 2024, I interned in the Hunyuan Multimodal Model Group at Tencent, working with Wei Liu.
During Jul 2022 — Aug 2023, I interned in the Computer Vision Group at Baidu, working with Xinyu Zhang and Jingdong Wang.
I received my Ph.D. degree in Computer Science from Zhejiang University (2019 — 2024), co-supervised by professors of Kun Kuang, Lanfen Lin, and Fei Wu. I received my B.S. degree in Automation from Zhejiang University of Technology (2015 — 2019), co-supervised by professors of Qi Xuan and Li Yu.
I have been fortunate to work closely with some friends such as Defang Chen and Yue Ma, their insights also profoundly shape my approach to research.
✳: (co-)first author ✉: corresponding author
2025: AsynDM (arXiv 2025) Follow-Your-Preference✳✉ (arXiv 2025) Follow-Your-Emoji-Faster (IJCV 2025) Hunyuan-Game (arXiv 2025)
2024: HunyuanVideo (arXiv 2024) MPL (IJCV 2024) Follow-Your-Canvas✳✉ (AAAI 2025) Follow-Your-Emoji (SIGGRAPH-Asia 2024) Domaindiff (ICASSP 2024)
2023: HAP✳ (NeurIPS 2023) MAP (ICCV 2023) NPT (KDD 2024) HTCL (KDD 2023) CAM (ICCV 2023) KDDRL (TMM 2023) MPL✳ (ICCV 2023)
2022: CAE v2 (TMLR 2023) CEG✳ (MM 2022) ACDA (Neurocomputing 2022)
2021: DSBF✳ (IJCV 2022) CSAC✳ (TKDE 2023) IV-DG✳ (TKDD 2023) AutoIV✳ (TKDD 2022)
2020: GAPGAN✳ (ECAI 2020) DeR-CFR✳ (TKDE 2023)
2019: SGNs (TKDE 2021)
Asynchronous Denoising Diffusion Models for Aligning Text-to-Image Generation
Zijing Hu, Yunze Tong, Fengda Zhang, Junkun Yuan, et al.
arXiv 2025
Follow-Your-Preference: Towards Preference-Aligned Image Inpainting
Yutao Shen✳, Junkun Yuan✳✉, Toru Aonishi, Hideki Nakayama, et al.
arXiv 2025
Sep 27, 2025 | Follow-Your-Preference | code
Follow-Your-Emoji-Faster: Towards Efficient, Fine-Controllable, and Expressive Freestyle Portrait Animation
Yue Ma✳, Zexuan Yan✳, Hongyu Liu✳, Hongfa Wang, Heng Pan, Yingqing He, Junkun Yuan, et al.
International Journal of Computer Vision (IJCV), 2025
Sep 20, 2025 | Follow-Your-Emoji-Faster | code
Hunyuan-Game: Industrial-grade Intelligent Game Creation Model
Hunyuan Multimodal Model Group at Tencent (as a group member)
arXiv 2025
May 20, 2025 | Hunyuan-Game | code
HunyuanVideo: A Systematic Framework For Large Video Generative Models
Hunyuan Multimodal Model Group at Tencent (as a group member)
arXiv 2024
Dec 03, 2024 | HunyuanVideo | code
It introduces an open-source diffusion model for video generation with 13B parameters. It has over 400 citations and over 11,000 GitHub stars (as of Oct 2025).
Mutual Prompt Learning for Vision Language Models
Sifan Long✳, Zhen Zhao✳, Junkun Yuan✳, Zichang Tan✳, et al.
International Journal of Computer Vision (IJCV), 2024
Sep 26, 2024 | MPL
Follow-Your-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation
Qihua Chen✳, Yue Ma✳, Hongfa Wang✳, Junkun Yuan✳✉, et al.
AAAI Conference on Artificial Intelligence (AAAI), 2025
Sep 02, 2024 | Follow-Your-Canvas | code
Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation
Yue Ma✳, Hongyu Liu✳, Hongfa Wang✳, Heng Pan✳, Yingqing He, Junkun Yuan, et al.
ACM SIGGRAPH Annual Conference in Asia (SIGGRAPH-Asia), 2024
Jun 04, 2024 | Follow-Your-Emoji | code
Domaindiff: Boost Out-of-Distribution Generalization with Synthetic Data
Qiaowei Miao, Junkun Yuan, Shengyu Zhang, Fei Wu, et al.
International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2024
Apr 14, 2024 | Domaindiff
HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception
Junkun Yuan✳, Xinyu Zhang✳✉, Hao Zhou, Jian Wang, et al.
Advances in Neural Information Processing Systems (NeurIPS), 2023
MAP: Towards Balanced Generalization of IID and OOD through Model-Agnostic Adapters
Min Zhang, Junkun Yuan, Yue He, Wenbin Li, et al.
International Conference on Computer Vision (ICCV), 2023
Neural Collapse Anchored Prompt Tuning for Generalizable Vision-Language Models
Didi Zhu, Zexi Li, Min Zhang, Junkun Yuan, et al.
ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2024
Jun 28, 2023 | NPT
Quantitatively Measuring and Contrastively Exploring Heterogeneity for Domain Generalization
Yunze Tong, Junkun Yuan, Min Zhang, Didi Zhu, et al.
ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2023
Universal Domain Adaptation via Compressive Attention Matching
Didi Zhu✳, Yincuan Li✳, Junkun Yuan, Zexi Li, et al.
International Conference on Computer Vision (ICCV), 2023
Apr 24, 2023 | CAM
Knowledge Distillation-Based Domain-Invariant Representation Learning for Domain Generalization
Ziwei Niu, Junkun Yuan, Xu Ma, Yingying Xu, et al.
IEEE Transactions on Multimedia (TMM), 2023
Apr 05, 2023 | KDDRL
Task-Oriented Multi-Modal Mutual Leaning for Vision-Language Models
Sifan Long✳, Zhen Zhao✳, Junkun Yuan✳, Zichang Tan, et al.
International Conference on Computer Vision (ICCV), 2023
Mar 30, 2023 | MPL
CAE v2: Context Autoencoder with CLIP Target
Xinyu Zhang✳, Jiahui Chen✳, Junkun Yuan, Qiang Chen, et al.
Transactions on Machine Learning Research (TMLR), 2023
Label-Efficient Domain Generalization via Collaborative Exploration and Generalization
Junkun Yuan✳, Xu Ma✳, Defang Chen, Kun Kuang✉, et al.
International Conference on Multimedia (MM), 2022
Attention-based Cross-Layer Domain Alignment for Unsupervised Domain Adaptation
Xu Ma, Junkun Yuan, Yen-Wei Chen, Ruofeng Tong, et al.
Neurocomputing 2022
Feb 27, 2022 | ACDA
Domain-Specific Bias Filtering for Single Labeled Domain Generalization
Junkun Yuan✳, Xu Ma✳, Defang Chen, Kun Kuang✉, et al.
International Journal of Computer Vision (IJCV), 2022
Collaborative Semantic Aggregation and Calibration for Federated Domain Generalization
Junkun Yuan✳, Xu Ma✳, Defang Chen, Fei Wu, et al.
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2023
Instrumental Variable-Driven Domain Generalization with Unobserved Confounders
Junkun Yuan, Xu Ma, Kun Kuang✉, Ruoxuan Xiong, et al.
ACM Transactions on Knowledge Discovery from Data (TKDD), 2023
Auto IV: Counterfactual Prediction via Automatic Instrumental Variable Decomposition
Junkun Yuan✳, Anpeng Wu✳, Kun Kuang✉, Bo Li, et al.
ACM Transactions on Knowledge Discovery from Data (TKDD), 2022
Black-Box Adversarial Attacks Against Deep Learning Based Malware Binaries Detection with GAN
Junkun Yuan, Shaofang Zhou, Lanfen Lin✉, Feng Wang, et al.
European Conference on Artificial Intelligence (ECAI), 2020
Aug 29, 2020 | GAPGAN
Learning Decomposed Representation for Counterfactual Inference
Anpeng Wu✳, Junkun Yuan✳, Kun Kuang✉, Bo Li✉, et al.
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2023
Subgraph Networks with Application to Structural Feature Space Expansion
Qi Xuan✉, Jinhuan Wang, Minghao Zhao, Junkun Yuan, et al.
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2021