SSCNav: Confidence-Aware Semantic Scene Completion for Visual Semantic Navigation

December 08, 2020 · Entered Twilight · 🏛 IEEE International Conference on Robotics and Automation

🌅 TWILIGHT: Old Age
Predates the code-sharing era, a pioneer of its time

"No code URL or promise found in abstract"
"Code repo scraped from project page (backfill)"

Evidence collected by the PWNC Scanner

Repo contents: LICENSE, README.md, SCNav_agent.py, configs, data, models, push.sh, requirements.txt, test_agent.py, train_agent.py, train_cmplt.py, train_conf.py, utils

Authors: Yiqing Liang, Boyuan Chen, Shuran Song
arXiv ID: 2012.04512
Category: cs.CV (Computer Vision)
Cross-listed: cs.RO
Citations: 89
Venue: IEEE International Conference on Robotics and Automation
Repository: https://github.com/columbia-ai-robotics/SSCNav (⭐ 50)
Last Checked: 29 days ago
Abstract
This paper focuses on visual semantic navigation, the task of producing actions for an active agent to navigate to a specified target object category in an unknown environment. To complete this task, the algorithm should simultaneously locate and navigate to an instance of the category. Compared with traditional point-goal navigation, this task requires the agent to have a stronger contextual prior about indoor environments. We introduce SSCNav, an algorithm that explicitly models scene priors using a confidence-aware semantic scene completion module to complete the scene and guide the agent's navigation planning. Given a partial observation of the environment, SSCNav first infers a complete scene representation with semantic labels for the unobserved scene, together with a confidence map associated with its own prediction. Then, a policy network infers the action from the scene completion result and confidence map. Our experiments demonstrate that the proposed scene completion module improves the efficiency of the downstream navigation policies. Video, code, and data: https://sscnav.cs.columbia.edu/
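The two-stage flow described in the abstract (complete the scene with a confidence map, then choose an action from both) can be illustrated with a toy sketch. This is not the authors' implementation: SSCNav uses learned networks, whereas the hypothetical `complete_scene` and `choose_action` below are hand-written stand-ins (nearest-neighbor label filling and a greedy step rule) on a small top-down grid, and the action names are assumptions made for illustration.

```python
from typing import List, Optional, Tuple

Grid = List[List[Optional[str]]]  # None marks an unobserved cell


def complete_scene(partial: Grid) -> Tuple[List[List[str]], List[List[float]]]:
    """Toy stand-in for a completion module (assumption, not the paper's network).

    Each unobserved cell takes the label of its nearest observed cell, and its
    confidence decays with Manhattan distance to that cell.
    """
    observed = [(r, c, label)
                for r, row in enumerate(partial)
                for c, label in enumerate(row) if label is not None]
    if not observed:
        raise ValueError("need at least one observed cell")
    labels, confidence = [], []
    for r, row in enumerate(partial):
        label_row, conf_row = [], []
        for c in range(len(row)):
            # Distance is 0 for observed cells, so they keep their own label.
            dist, label = min((abs(r - orow) + abs(c - ocol), lab)
                              for orow, ocol, lab in observed)
            label_row.append(label)
            conf_row.append(1.0 / (1.0 + dist))
        labels.append(label_row)
        confidence.append(conf_row)
    return labels, confidence


def choose_action(labels: List[List[str]], confidence: List[List[float]],
                  agent: Tuple[int, int], target: str) -> str:
    """Toy stand-in for a policy: step toward the most confident target cell."""
    candidates = [(confidence[r][c], -(abs(r - agent[0]) + abs(c - agent[1])), r, c)
                  for r, row in enumerate(labels)
                  for c, label in enumerate(row) if label == target]
    if not candidates:
        return "explore"  # no cell is predicted to contain the target
    # Highest confidence wins; ties go to the cell closest to the agent.
    _, _, goal_r, goal_c = max(candidates)
    if (goal_r, goal_c) == agent:
        return "stop"
    dr, dc = goal_r - agent[0], goal_c - agent[1]
    if abs(dr) >= abs(dc):
        return "south" if dr > 0 else "north"
    return "east" if dc > 0 else "west"


partial_map: Grid = [
    ["floor", "floor", None],
    ["floor", None, None],
    [None, None, "chair"],
]
labels, confidence = complete_scene(partial_map)
print(choose_action(labels, confidence, agent=(0, 0), target="chair"))  # south
```

In this sketch the confidence map only breaks ties between candidate goals; in the paper it is an input to a policy network, which can learn how far to trust the completed regions.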