Landmark Detection using Transformer Toward Robot-assisted Nasal Airway Intubation

August 05, 2023 Β· Entered Twilight Β· πŸ› Procedia Computer Science

πŸ’€ TWILIGHT: Eternal Rest
Repo abandoned since publication

Repo contents: README.md, samdefDeTR_mmdet, samdefdetr_config_glottis.py, samdefdetr_config_nostril.py

Authors Tianhang Liu, Hechen Li, Long Bai, Yanan Wu, An Wang, Mobarakol Islam, Hongliang Ren arXiv ID 2308.02845 Category eess.IV: Image & Video Processing Cross-listed cs.CV, cs.RO Citations 1 Venue Procedia Computer Science Repository https://github.com/ConorLTH/airway_intubation_landmarks_detection ⭐ 1 Last Checked 3 months ago
Abstract
Robot-assisted airway intubation application needs high accuracy in locating targets and organs. Two vital landmarks, nostrils and glottis, can be detected during the intubation to accommodate the stages of nasal intubation. Automated landmark detection can provide accurate localization and quantitative evaluation. The Detection Transformer (DeTR) leads object detectors to a new paradigm with long-range dependence. However, current DeTR requires long iterations to converge, and does not perform well in detecting small objects. This paper proposes a transformer-based landmark detection solution with deformable DeTR and the semantic-aligned-matching module for detecting landmarks in robot-assisted intubation. The semantics aligner can effectively align the semantics of object queries and image features in the same embedding space using the most discriminative features. To evaluate the performance of our solution, we utilize a publicly accessible glottis dataset and automatically annotate a nostril detection dataset. The experimental results demonstrate our competitive performance in detection accuracy. Our code is publicly accessible.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

πŸ“œ Similar Papers

In the same crypt β€” Image & Video Processing