Yikang Ding

Yikang Ding - 丁宜康

Researcher at KwaiVGI (@Kling Team) | M.S. of Tsinghua University

3D Reconstruction World Models Generative Models Computer Vision

Biography

I am currently a researcher at KwaiVGI (aka Kling Team), where I am working on the task of generative model and AIGC.

Before that, I worked at MEGVII as a researcher (2023-2025) and led a group to work on 3D reconstruction and generative models for autonomous driving. I got my master's degree and bachelor's degree at Tsinghua University (2020-2023) and Beihang University (2016-2020).

If you are interested in working with me, please feel free to contact me at dingyikang23@gmail.com, or you can add me on WeChat: yeecon_

Latest News

Sep 2025

We're delighted to release Kling-Avatar. Try it on Kling AI Page!

July 2025

One paper (MuDG) accepted to BMVC 2025. Congrats to Yingshuang!

Jun 2025

Two papers (DiST-4D, HERMES) accepted to ICCV 2025. Congrats to Jiazhe and Xin Zhou!

Jun 2025

One paper accepted to IROS 2025.

Feb 2025

One paper (UniScene) accepted to CVPR 2025. Congrats to my several interns!

Jul 2024

One paper (M2Depth) accepted to ECCV 2024 as Oral. Congrats to Yingshuang!

Mar 2023

Won Innovation Award of 3D Occupancy Prediction Challenge in CVPR 2023.

Mar 2023

One paper accepted to CVPR 2023.

Sep 2022

One paper accepted to NeurIPS 2022.

Jun 2022

Two papers accepted to ECCV 2022.

Mar 2023

One paper accepted to CVPR 2022.

Selected Publications

(* denotes equal contribution, denotes project leader.)

Kling-Avatar

Kling-Avatar: Grounding Multimodal Instructions for Cascaded Long-Duration Avatar Animation Synthesis

Yikang Ding*, Jiwen Liu*, Wenyuan Zhang, Zekun Wang, Wentao Hu, Liyuan Cui, Mingming Lao, Yingchao Shao, Hui Liu, Xiaohan Li, Ming Chen, Xiaoqiang Liu, Yu-Shen Liu, Pengfei Wan

Tech Report

DiST-4D

DiST-4D: Disentangled Spatiotemporal Diffusion with Metric Depth for 4D Driving Scene Generation

Jiazhe Guo*, Yikang Ding*, Xiwu Chen, Shuo Chen, Bohan Li, Yingshuang Zou, Xiaoyang Lyu, Feiyang Tan, Xiaojuan Qi, Zhiheng Li, Hao Zhao

ICCV 2025

HERMES

HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation

Xin Zhou*, Dingkang Liang*, Sifan Tu, Xiwu Chen, Yikang Ding, Dingyuan Zhang, Feiyang Tan, Hengshuang Zhao, Xiang Bai

ICCV 2025

MuDG

MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction

Yingshuang Zou*, Yikang Ding*, Chuanrui Zhang, Jiazhe Guo, Bohan Li, Xiaoyang Lyu, Feiyang Tan, Xiaojuan Qi, Haoqian Wang

BMVC 2025

UniScene

UniScene: Unified Occupancy-centric Driving Scene Generation

Bohan Li*, Jiazhe Guo*, Hongsi Liu*, Yingshuang Zou*, Yikang Ding*, Xiwu Chen, Hu Zhu, Other authors

CVPR 2025

M2Depth

M2Depth: Self-supervised Two-Frame Multi-camera Metric Depth Estimation

Yingshuang Zou*, Yikang Ding*, Xi Qiu, Haoqian Wang, Haotian Zhang

ECCV 2024 Oral

adamatcher

Adaptive Assignment for Geometry Aware Local Feature Matching

Dihe Huang, Ying Chen, Shang Xu, Yong Liu, Wenlong Wu, Yikang Ding, Chengjie Wang, Fan Tang

CVPR 2023

transmvsnet

TransMVSNet: Global Context-aware Multi-view Stereo Network with Transformers

Yikang Ding*, Wentao Yuan*, Qingtian Zhu, Haotian Zhang, Xiangyue Liu, Yuanjiang Wang, Xiao Liu

IEEE/CVF Conference on Computer Vision and Pattern Recognition

CVPR 2022

kdmvs

KD-MVS: Knowledge Distillation Based Self-supervised Learning for Multi-view Stereo

Yikang Ding, Qingtian Zhu, Xiangyue Liu, Wentao Yuan, Haotian Zhang, Chi Zhang

ECCV 2022

layerdiffusion

LayerDiffusion: Layered Controlled Image Editing with Diffusion Models

Pengzhi Li, Qinxuan Huang, Yikang Ding, Zhiheng Li

SIGGRAPH Asia 2023

wtmvs

WT-MVSNet: Window-based Transformers for Multi-view Stereo

Jinli Liao*, Yikang Ding*, Yoli Shavit, Dihe Huang, Shihao Ren, Jia Guo, Wensen Feng, Kai Zhang

NeurIPS 2022

Honors & Awards

  • Innovation Award

    3D Occupancy Prediction Challenge in CVPR 2023

  • 1st Place

    Indoor and Outdoor Visual Localization Challenge of ICCV 2021

  • Outstanding Graduate of Beijing

    2020.06

Academic Services

Conference Reviewer:

CVPR 2022- ECCV 2022- AAAI 2023- ACCV 2022- 3DV 2022-

Research Interests

3D Reconstruction

Multi-view stereo, depth estimation, scene reconstruction

World Models

Scene understanding, occupancy prediction, self-driving

Generative Models

Diffusion models, scene generation, image editing