From USD Scenes to Knowledge Graphs: Zero-Shot Ontology Grounding with LLMs

June 08, 2026 ยท Grace Period ยท ๐Ÿ› the IEEE ICRA 2026 International Joint Workshop on Ontologies

โณ Grace Period
This paper is less than 90 days old. We give authors time to release their code before passing judgment.
Authors Jiangtao Shuai, Zongxiong Chen, Manfred Hauswirth, Sonja Schimmler arXiv ID 2606.09134 Category cs.RO: Robotics Cross-listed cs.AI, cs.CL, cs.CV, cs.GR Citations 0 Venue the IEEE ICRA 2026 International Joint Workshop on Ontologies
Abstract
Constructing knowledge graphs from 3D simulation scenes is essential for robot task reasoning, but the key bottleneck, grounding scene objects to formal ontology classes, still relies on manually curated dictionaries that are brittle and do not generalize across assets. We investigate whether large language models (LLMs) can automate this grounding step for Universal Scene Description (USD) scenes as a zero-shot, training-free alternative. On a kitchen scene (125 objects) with SOMA-HOME Ontology, LLMs achieve 90-96% exact-match accuracy with descriptive names and 49-89% with abbreviated names, substantially outperforming dictionary and embedding baselines. Under fully opaque names, context-augmented prompting recovers up to 48%. Feature ablation reveals that LLMs primarily exploit semantic cues in the scene graph (sibling names and parent paths); anonymizing these cues reduces accuracy to 0-6%, while geometry alone yields only 4-17%.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Robotics