News

  • ๐Ÿ†Our X-VLA has won 1st place in the AGIBOT World Challenge (Manipulation track) @ IROS 2025.
  • One paper (UniAct) on cross-embodiment universal actions is accepted to CVPR 2025.
  • ๐ŸŒŸDiffusion-Planner is selected as oral presentation at ICLR 2025.
  • One papers on autonomous driving (Diffusion-Planner) are accepted to ICLR 2025.
  • One paper (Robo-MUTUAL) on embodied representations is accepted to ICRA 2025.
  • ๐ŸŒŸIVM and DecisionNCE are selected as Outstanding Paper at MFM-EAI workshop @ ICML 2024.
  • One paper (IVM) on embodied foundation multimodal models is accepted to NeurIPS 2024.
  • One paper (DecisionNCE) on embodied multimodal representations is accepted to ICML 2024.
  • ๐ŸŒŸOne paper (GLID) on unified vision pretraining is accepted to CVPR 2024 .

Publications (* marks equal contribution)

  • X-VLA: Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model Jinliang Zheng*, Jianxiong Li*, Zhihao Wang, Dongxiu Liu, Xirui Kang, Yuchun Feng, Yinan Zheng, Jiayin Zou, Yilun Chen, Jia Zeng, Ya-Qin Zhang, Jiangmiao Pang, Jingjing Liu, Tai Wang, Xianyuan Zhan (1st place ๐Ÿ† @ AGIBOT World Challenge (Manipulation track), IROS 2025) 2025 Paper | Code | Page
  • Universal Actions for Enhanced Embodied Foundation Models Jinliang Zheng*, Jianxiong Li*, Dongxiu Liu*, Yinan Zheng, Zhihao Wang, Zhonghong Ou, Yu Liu, Jingjing Liu, Ya-Qin Zhang, Xianyuan Zhan CVPR 2025 2025 Paper | Code | Page
  • Instruction Guided Visual Masking Jinliang Zheng*, Jianxiong Li*, Sijie Cheng, Yinan Zheng, Jiaming Li, Jihao Liu, Yu Liu, Jingjing Liu, Xianyuan Zhan NeurIPS 2024 (Outstanding Paper @ ICML 2024 MFM-EAI Workshop) 2024 Paper | Code | Page | Dataset | Model
  • DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning Jianxiong Li*, Jinliang Zheng*, Yinan Zheng*, Liyuan Mao, Xiao Hu, Sijie Cheng, Haoyi Niu, Jihao Liu, Yu Liu, Jingjing Liu, Ya-Qin Zhang, Xianyuan Zhan ICML 2024 (Outstanding Paper @ ICML 2024 MFM-EAI Workshop) 2024 Paper | Code | Page
  • GLID: Pre-training a Generalist Encoder-Decoder Vision Model Jihao Liu*, Jinliang Zheng*, Yu Liu, Hongsheng Li CVPR 2024 2024 Paper |
  • Robo-MUTUAL: Robotic Multimodal Task Specification via Unimodal Learning Jianxiong Li*, Zhihao Wang*, Jinliang Zheng*, Xiaoai Zhou, Guanming Wang, Guanglu Song, Yu Liu, Jingjing Liu, Ya-Qin Zhang, Junzhi Yu, Xianyuan Zhan ICRA 2025 2025 Paper | Code | Page
  • Universal Actions for Enhanced Embodied Foundation Models Jinliang Zheng*, Jianxiong Li*, Dongxiu Liu*, Yinan Zheng, Zhihao Wang, Zhonghong Ou, Yu Liu, Jingjing Liu, Ya-Qin Zhang, Xianyuan Zhan CVPR 2025 2025 Paper | Code | Page
  • Instruction Guided Visual Masking Jinliang Zheng*, Jianxiong Li*, Sijie Cheng, Yinan Zheng, Jiaming Li, Jihao Liu, Yu Liu, Jingjing Liu, Xianyuan Zhan NeurIPS 2024 (Outstanding Paper @ ICML 2024 MFM-EAI Workshop) 2024 Paper | Code | Page | Dataset | Model
  • DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning Jianxiong Li*, Jinliang Zheng*, Yinan Zheng*, Liyuan Mao, Xiao Hu, Sijie Cheng, Haoyi Niu, Jihao Liu, Yu Liu, Jingjing Liu, Ya-Qin Zhang, Xianyuan Zhan ICML 2024 (Outstanding Paper @ ICML 2024 MFM-EAI Workshop) 2024 Paper | Code | Page
  • Robo-MUTUAL: Robotic Multimodal Task Specification via Unimodal Learning Jianxiong Li*, Zhihao Wang*, Jinliang Zheng*, Xiaoai Zhou, Guanming Wang, Guanglu Song, Yu Liu, Jingjing Liu, Ya-Qin Zhang, Junzhi Yu, Xianyuan Zhan ICRA 2025 2025 Paper | Code | Page
  • GLID: Pre-training a Generalist Encoder-Decoder Vision Model Jihao Liu*, Jinliang Zheng*, Yu Liu, Hongsheng Li CVPR 2024 2024 Paper |
  • Diffusion-Based Planning for Autonomous Driving with Flexible Guidance Yinan Zheng*, Ruiming Liang*, Kexin Zheng*, Jinliang Zheng, Liyuan Mao, Jianxiong Li, Weihao Gu, Rui Ai, Shengbo Eben Li, Xianyuan Zhan, Jingjing Liu ICLR 2025 (Oral, Top 2%) 2025 Paper | Code | Page
  • MM-Instruct: Generated Visual Instructions for Large Multimodal Model Alignment Jihao Liu*, Xin Huang*, Jinliang Zheng*, Jia Wang, Boxiao Liu, Osamu Yoshie, Yu Liu, Hongsheng Li Under Review 2024
  • Efficient Robotic Policy Learning via Latent Space Backward Planning Dongxiu Liu*, Haoyi Niu*, Zhihao Wang, Jinliang Zheng, Yinan Zheng, Zhonghong Ou, Jianming Hu, Jianxiong Li, Xianyuan Zhan Under Review 2025
  • GoBigger: A Scalable Platform for Cooperative-Competitive Multi-Agent Interactive Simulation Ming Zhang, Shenghan Zhang, Zhenjie Yang, Lekai Chen, Jinliang Zheng, Chao Yang, Chuming Li, Hang Zhou, Yazhe Niu, Yu Liu ICLR 2023 2023
  • MixMAE: Mixed and Masked Autoencoder for Efficient Pretraining of Hierarchical Vision Transformers Jihao Liu, Xin Huang, Jinliang Zheng, Yu Liu, Hongsheng Li CVPR 2023 2023

Professional Services

Reviewer for ICLR 25, ICML 25, NeurIPS 24-25, CVPR 25, ICCV 25