Wentao Cheng

prof_pic.jpg

Associate Professor

Beijing Normal-Hong Kong Baptist University

Email: wentaocheng@bnbu.edu.cn

I am an Associate Professor at Beijing Normal-Hong Kong Baptist University (BNBU).

Prior to joining BNBU in Feb 2025, I worked as an Associate Professor at Nankai University (2022–2024) and as a Senior Algorithm Engineer at Alibaba Group AI Lab (2020–2022).

I received my Joint Ph.D. degree in Computer Science and Engineering from Nanyang Technological University (NTU), Singapore, and Technische Universität Darmstadt, Germany. I obtained my B.Eng. degree from Harbin Institute of Technology (HIT).

My research interests lie in 3D Computer Vision, with a particular focus on:

  • Visual Localization
  • 3D Reconstruction
  • Efficient Feed-Forward 3D Foundation Models

I actively serve as a reviewer for top-tier conferences and journals, including CVPR, ICCV, ECCV, ICRA, TIP, and TRO. Additionally, I serve as an Executive Committee Member of the Technical Committee on CAD and Computer Graphics (CAD/CG), China Computer Federation (CCF).

Students who are interested in my research topics, please refer to this and contact me.

news

Mar 18, 2026 Two papers on 3D Foundation Models and Local Feature Learning have been accepted by ICME 2026! 🎉🎉
  • S-VGGT: Structure-Aware Subscene Decomposition for Scalable 3D Foundation Models.
  • D2Feat: Dual Distillation for Semantic and Geometric Local Feature Learning. Stay tuned for the full papers!
Dec 16, 2025 I served as the Lead Organizer for the CCF CAD&CG Academic Seminar (BNBU Station). We invited four distinguished experts to discuss Visual Perception and Graphics.
Oct 01, 2025 One paper titled “Beyond Flat Text: Dual Self-inherited Guidance for Visual Text Generation” was accepted to ICCV Workshops (HiGen) 2025.
Feb 01, 2025 I joined Beijing Normal-Hong Kong Baptist University (BNBU) as an Associate Professor!
Feb 27, 2024 One paper titled “ExtDM: Distribution Extrapolation Diffusion Model for Video Prediction” was accepted to CVPR 2024.

selected publications

  1. ICME
    S-VGGT: Structure-Aware Subscene Decomposition for Scalable 3D Foundation Models
    Xinze Li, Pengxu Chen, Yiyuan Wang, and 2 more authors
    In IEEE International Conference on Multimedia and Expo (ICME), 2026
  2. ICME
    D2Feat: Dual Distillation for Semantic and Geometric Local Feature Learning
    Yiyuan Wang, Xinze Li, Weifeng Su, and 1 more author
    In IEEE International Conference on Multimedia and Expo (ICME), 2026
  3. ICCVW
    Beyond Flat Text: Dual Self-inherited Guidance for Visual Text Generation
    Minxing Luo, Zixun Xia, Liaojun Chen, and 7 more authors
    In IEEE/CVF International Conference on Computer Vision (ICCV) HiGen Workshop, 2025
  4. CVPR
    ExtDM: Distribution Extrapolation Diffusion Model for Video Prediction
    Zhicheng Zhang, Junyao Hu, Wentao Cheng, and 2 more authors
    In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024
  5. SPL
    MVP: One-Shot Object Pose Estimation by Matching with Visible Points
    Wentao Cheng and Minxing Luo
    IEEE Signal Processing Letters (SPL), 2024
  6. RA-L
    Road Mapping and Localization Using Sparse Semantic Visual Features
    Wentao Cheng, Sheng Yang, Maomin Zhou, and 3 more authors
    IEEE Robotics and Automation Letters (RA-L), 2021
  7. ICCV
    Cascaded Parallel Filtering for Memory-Efficient Image-Based Localization
    Wentao Cheng, Weisi Lin, Kan Chen, and 1 more author
    In IEEE International Conference on Computer Vision (ICCV), 2019
  8. TIP
    A Two-Stage Outlier Filtering Framework for City-Scale Localization Using 3D SfM Point Clouds
    Wentao Cheng, Kan Chen, Weisi Lin, and 3 more authors
    IEEE Transactions on Image Processing (TIP), 2019
  9. TIP
    A Data-Driven Point Cloud Simplification Framework for City-Scale Image-Based Localization
    Wentao Cheng, Weisi Lin, Xinfeng Zhang, and 2 more authors
    IEEE Transactions on Image Processing (TIP), 2017