Wentao Cheng

Associate Professor

Beijing Normal-Hong Kong Baptist University

Email: wentaocheng@bnbu.edu.cn

I am an Associate Professor at Beijing Normal-Hong Kong Baptist University (BNBU).

Prior to joining BNBU in Feb 2025, I worked as an Associate Professor at Nankai University (2022–2024) and as a Senior Algorithm Engineer at Alibaba Group AI Lab (2020–2022).

I received my Joint Ph.D. degree in Computer Science and Engineering from Nanyang Technological University (NTU), Singapore, and Technische Universität Darmstadt, Germany. I obtained my B.Eng. degree from Harbin Institute of Technology (HIT).

My research interests lie in 3D Computer Vision, with a particular focus on:

  • Visual Localization
  • 3D Reconstruction
  • Efficient Feed-Forward 3D Foundation Models

I actively serve as a reviewer for top-tier conferences and journals, including CVPR, ICCV, ECCV, ICRA, TIP, and TRO. Additionally, I serve as an Executive Committee Member of the Technical Committee on CAD and Computer Graphics (CAD/CG), China Computer Federation (CCF).

news

Mar 18, 2026 Two papers on 3D Foundation Models and Local Feature Learning have been accepted by ICME 2026! 🎉🎉 Stay tuned for the full papers!
  • S-VGGT: Structure-Aware Subscene Decomposition for Scalable 3D Foundation Models.
  • D2Feat: Dual Distillation for Semantic and Geometric Local Feature Learning.
Dec 16, 2025 I served as the Lead Organizer for the CCF CAD&CG Academic Seminar (BNBU Station). We invited four distinguished experts to discuss Visual Perception and Graphics.
Oct 01, 2025 One paper titled “Beyond Flat Text: Dual Self-inherited Guidance for Visual Text Generation” was accepted to ICCV Workshops (HiGen) 2025.
Feb 01, 2025 I joined Beijing Normal-Hong Kong Baptist University (BNBU) as an Associate Professor!
Feb 27, 2024 One paper titled “ExtDM: Distribution Extrapolation Diffusion Model for Video Prediction” was accepted to CVPR 2024.

selected publications

  1. ICME
    S-VGGT: Structure-Aware Subscene Decomposition for Scalable 3D Foundation Models
    Xinze Li, Pengxu Chen, Yiyuan Wang, and 2 more authors
    In IEEE International Conference on Multimedia and Expo (ICME), 2026
  2. ICME
    D2Feat: Dual Distillation for Semantic and Geometric Local Feature Learning
    Yiyuan Wang, Xinze Li, Weifeng Su, and 1 more author
    In IEEE International Conference on Multimedia and Expo (ICME), 2026
  3. ICCVW
    Beyond Flat Text: Dual Self-inherited Guidance for Visual Text Generation
    Minxing Luo, Zixun Xia, Liaojun Chen, and 7 more authors
    In IEEE/CVF International Conference on Computer Vision (ICCV) HiGen Workshop, 2025
  4. CVPR
    ExtDM: Distribution Extrapolation Diffusion Model for Video Prediction
    Zhicheng Zhang, Junyao Hu, Wentao Cheng, and 2 more authors
    In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024
  5. SPL
    MVP: One-Shot Object Pose Estimation by Matching with Visible Points
    Wentao Cheng and Minxing Luo
    IEEE Signal Processing Letters (SPL), 2024
  6. RA-L
    Road Mapping and Localization Using Sparse Semantic Visual Features
    Wentao Cheng, Sheng Yang, Maomin Zhou, and 3 more authors
    IEEE Robotics and Automation Letters (RA-L), 2021
  7. ICCV
    Cascaded Parallel Filtering for Memory-Efficient Image-Based Localization
    Wentao Cheng, Weisi Lin, Kan Chen, and 1 more author
    In IEEE International Conference on Computer Vision (ICCV), 2019
  8. TIP
    A Two-Stage Outlier Filtering Framework for City-Scale Localization Using 3D SfM Point Clouds
    Wentao Cheng, Kan Chen, Weisi Lin, and 3 more authors
    IEEE Transactions on Image Processing (TIP), 2019
  9. TIP
    A Data-Driven Point Cloud Simplification Framework for City-Scale Image-Based Localization
    Wentao Cheng, Weisi Lin, Xinfeng Zhang, and 2 more authors
    IEEE Transactions on Image Processing (TIP), 2017