R.I.P.
👻
Ghosted
VistaBot: View-Robust Robot Manipulation via Spatiotemporal-Aware View Synthesis
April 23, 2026 · Grace Period · ICRA 2026
Authors
Songen Gu, Yuhang Zheng, Weize Li, Yupeng Zheng, Yating Feng, Xiang Li, Yilun Chen, Pengfei Li, Wenchao Ding
arXiv ID
2604.21914
Category
cs.RO: Robotics
Citations
0
Venue
ICRA 2026
Abstract
Recently, end-to-end robotic manipulation models have gained significant attention for their generalizability and scalability. However, they often suffer from limited robustness to camera viewpoint changes when training with a fixed camera. In this paper, we propose VistaBot, a novel framework that integrates feed-forward geometric models with video diffusion models to achieve view-robust closed-loop manipulation without requiring camera calibration at test time. Our approach consists of three key components: 4D geometry estimation, view synthesis latent extraction, and latent action learning. VistaBot is integrated into both action-chunking (ACT) and diffusion-based ($\pi_0$) policies and evaluated across simulation and real-world tasks. We further introduce the View Generalization Score (VGS) as a new metric for comprehensive evaluation of cross-view generalization. Results show that VistaBot improves VGS by 2.79$\times$ and 2.63$\times$ over ACT and $\pi_0$, respectively, while also achieving high-quality novel view synthesis. Our contributions include a geometry-aware synthesis model, a latent action planner, a new benchmark metric, and extensive validation across diverse environments. The code and models will be made publicly available.
Similar Papers
In the same crypt: Robotics
R.I.P.
👻
Ghosted
AirSim: High-Fidelity Visual and Physical Simulation for Autonomous Vehicles
The Cartographer
A Survey of Motion Planning and Control Techniques for Self-driving Urban Vehicles
The Cartographer
Unmanned Aerial Vehicles: A Survey on Civil Applications and Key Research Challenges
The Cartographer
A Survey of Autonomous Driving: Common Practices and Emerging Technologies
R.I.P.
👻
Ghosted