Junkun Yuan 袁俊坤Research Scientist, Hunyuan Multimodal Model Group @ Tencent yuanjk0921@outlook.com work and live in Shenzhen, China Last updated on September 15, 2025 at 16:48 (UTC+8) |
![]() |
I have been working as a research scientist in the Foundation Model Team of the Hunyuan Multimodal Model Group at Tencent since Jul 2024, working with Zhao Zhong and Liefeng Bo. I am focusing on multimodal generative foundation models and their various downstream applications.
During Sep 2023 — Jul 2024, I interned in the Hunyuan Multimodal Model Group at Tencent, working with Wei Liu.
During Jul 2022 — Aug 2023, I interned in the Computer Vision Group at Baidu, working with Xinyu Zhang and Jingdong Wang.
I received my Ph.D. degree in Computer Science from Zhejiang University (2019 — 2024), co-supervised by professors of Kun Kuang, Lanfen Lin, and
Fei Wu. I received my B.S. degree in Automation from Zhejiang University of Technology (2015 — 2019), co-supervised by professors of Qi Xuan and Li Yu.
I have been fortunate to work closely with some friends such as Defang Chen and Yue Ma, their insights also profoundly shape my approach to research.
Hunyuan-Game (arXiv 2025) HunyuanVideo (arXiv 2024) MPL (IJCV 2024) Follow-Your-Canvas (AAAI 2025) Follow-Your-Emoji (SIGGRAPH-Asia 2024) Domaindiff (ICASSP 2024) KDDRL (TMM 2023) HAP (NeurIPS 2023) MAP (ICCV 2023) NPT (KDD 2024) HTCL (KDD 2023) CAM (ICCV 2023) MPL (ICCV 2023) CAE v2 (TMLR 2023) CEG (MM 2022) ACDA (Neurocomputing 2022) DSBF (IJCV 2022) CSAC (TKDE 2023) DSBF (TKDD 2023) AutoIV (TKDD 2022) GAPGAN (ECAI 2020) DeR-CFR (TKDE 2023) SGNs (TKDE 2021)
Hunyuan-Game: Industrial-grade Intelligent Game Creation Model
Hunyuan Multimodal Model Group at Tencent (as a group member)
arXiv 2025
May 20, 2025 | Hunyuan-Game
HunyuanVideo: A Systematic Framework For Large Video Generative Models
Hunyuan Multimodal Model Group at Tencent (as a group member)
arXiv 2024
Dec 03, 2024 | HunyuanVideo
It is an open-source large-scale video generation model with 13B parameters. It has over 300 citations and over 11,000 GitHub stars (as of Sep 2025).
Mutual Prompt Learning for Vision Language Models
Sifan Long✳, Zhen Zhao✳, Junkun Yuan✳, Zichang Tan✳, Jiangjiang Liu, Jingyuan Feng, Shengsheng Wang✉, and Jingdong Wang
International Journal of Computer Vision (IJCV), 2024
Sep 26, 2024 | MPL
Follow-Your-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation
Qihua Chen✳, Yue Ma✳, Hongfa Wang✳, Junkun Yuan✳✉, Wenzhe Zhao, Qi Tian, Hongmei Wang, Shaobo Min, Qifeng Chen✉, and Wei Liu
AAAI Conference on Artificial Intelligence (AAAI), 2025
Sep 02, 2024 | Follow-Your-Canvas
Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation
Yue Ma✳, Hongyu Liu✳, Hongfa Wang✳, Heng Pan✳, Yingqing He, Junkun Yuan, Ailing Zeng, Chengfei Cai, Heung-Yeung Shum, Wei Liu✉, and Qifeng Chen✉
ACM SIGGRAPH Annual Conference in Asia (SIGGRAPH-Asia), 2024
Jun 04, 2024 | Follow-Your-Emoji
Domaindiff: Boost Out-of-Distribution Generalization with Synthetic Data
Qiaowei Miao, Junkun Yuan, Shengyu Zhang, Fei Wu, and Kun Kuang✉
International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2024
Apr 14, 2024 | Domaindiff
Knowledge Distillation-Based Domain-Invariant Representation Learning for Domain Generalization
Ziwei Niu, Junkun Yuan, Xu Ma, Yingying Xu, Jing Liu, Yen-Wei Chen, Ruofeng Tong, and Lanfen Lin✉
IEEE Transactions on Multimedia (TMM), 2023
Jan 01, 2024 | KDDRL
HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception
Junkun Yuan✳, Xinyu Zhang✳✉, Hao Zhou, Jian Wang, Zhongwei Qiu, Zhiyin Shao, Shaofeng Zhang, Sifan Long, and Kun Kuang✉, Kun Yao, Junyu Han, Errui Ding, Lanfen Lin, Fei Wu, and Jingdong Wang✉
Advances in Neural Information Processing Systems (NeurIPS), 2023
Oct 31, 2023 | HAP
MAP: Towards Balanced Generalization of IID and OOD through Model-Agnostic Adapters
Min Zhang, Junkun Yuan, Yue He, Wenbin Li, Zhengyu Chen, and Kun Kuang✉
International Conference on Computer Vision (ICCV), 2023
Oct 02, 2023 | MAP
Neural Collapse Anchored Prompt Tuning for Generalizable Vision-Language Models
Didi Zhu, Zexi Li, Min Zhang, Junkun Yuan, Yunfeng Shao, Jiashuo Liu, Kun Kuang✉, Yinchuan Li, and Chao Wu
ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2024
Jun 28, 2023 | NPT
Quantitatively Measuring and Contrastively Exploring Heterogeneity for Domain Generalization
Yunze Tong, Junkun Yuan, Min Zhang, Didi Zhu, Keli Zhang, Fei Wu, and Kun Kuang✉
ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2023
May 25, 2023 | HTCL
Universal Domain Adaptation via Compressive Attention Matching
Didi Zhu✳, Yincuan Li✳, Junkun Yuan, Zexi Li, Kun Kuang, and Chao Wu✉
International Conference on Computer Vision (ICCV), 2023
Apr 24, 2023 | CAM
Task-Oriented Multi-Modal Mutual Leaning for Vision-Language Models
Sifan Long✳, Zhen Zhao✳, Junkun Yuan✳, Zichang Tan, Jiangjiang Liu, Luping Zhou, Shengsheng Wang✉, and Jingdong Wang✉
International Conference on Computer Vision (ICCV), 2023
Mar 30, 2023 | MPL
CAE v2: Context Autoencoder with CLIP Target
Xinyu Zhang✳, Jiahui Chen✳, Junkun Yuan, Qiang Chen, Jian Wang, Xiaodi Wang, Shumin Han, Xiaokang Chen, Jimin Pi, Kun Yao, Junyu Han, Errui Ding, and Jingdong Wang✉
Transactions on Machine Learning Research (TMLR), 2023
Nov 17, 2022 | CAE v2
Label-Efficient Domain Generalization via Collaborative Exploration and Generalization
Junkun Yuan✳, Xu Ma✳, Defang Chen, Kun Kuang✉, Fei Wu, and Lanfen Lin
International Conference on Multimedia (MM), 2022
Aug 07, 2022 | CEG
Attention-based Cross-Layer Domain Alignment for Unsupervised Domain Adaptation
Xu Ma, Junkun Yuan, Yen-Wei Chen, Ruofeng Tong, and Lanfen Lin✉
Neurocomputing 2022
Feb 27, 2022 | ACDA
Domain-Specific Bias Filtering for Single Labeled Domain Generalization
Junkun Yuan✳, Xu Ma✳, Defang Chen, Kun Kuang✉, Fei Wu, and Lanfen Lin
International Journal of Computer Vision (IJCV), 2022
Oct 02, 2021 | DSBF
Collaborative Semantic Aggregation and Calibration for Federated Domain Generalization
Junkun Yuan✳, Xu Ma✳, Defang Chen, Fei Wu, Lanfen Lin, and Kun Kuang✉
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2023
Oct 13, 2021 | CSAC
Instrumental Variable-Driven Domain Generalization with Unobserved Confounders
Junkun Yuan, Xu Ma, Kun Kuang✉, Ruoxuan Xiong, Mingming Gong, and Lanfen Lin
ACM Transactions on Knowledge Discovery from Data (TKDD), 2023
Oct 04, 2021 | DSBF
Auto IV: Counterfactual Prediction via Automatic Instrumental Variable Decomposition
Junkun Yuan✳, Anpeng Wu✳, Kun Kuang✉, Bo Li, Runze Wu, Fei Wu, and Lanfen Lin
ACM Transactions on Knowledge Discovery from Data (TKDD), 2022
Jul 13, 2021 | AutoIV
Black-Box Adversarial Attacks Against Deep Learning Based Malware Binaries Detection with GAN
Junkun Yuan, Shaofang Zhou, Lanfen Lin✉, Feng Wang, and Jia Cui
European Conference on Artificial Intelligence (ECAI), 2020
Aug 29, 2020 | GAPGAN
Learning Decomposed Representation for Counterfactual Inference
Anpeng Wu✳, Junkun Yuan✳, Kun Kuang✉, Bo Li✉, Runze Wu, Qiang Zhu, Yueting Zhuang, and Fei Wu
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2023
Jun 12, 2020 | DeR-CFR
Subgraph Networks with Application to Structural Feature Space Expansion
Qi Xuan✉, Jinhuan Wang, Minghao Zhao, Junkun Yuan, Chenbo Fu, Zhongyuan Ruan✉, Guanrong Chen
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2021
Mar 21, 2019 | SGNs