A Partial Replication of MaskFormer in TensorFlow on TPUs for the TensorFlow Model Garden

April 29, 2024 ยท Entered Twilight ยท ๐Ÿ› arXiv.org

๐Ÿ’ค TWILIGHT: Eternal Rest
Repo abandoned since publication

Repo contents: AUTHORS, CODEOWNERS, CODE_OF_CONDUCT.md, CONTRIBUTING.md, ISSUES.md, LICENSE, README.md, SECURITY.md, community, docs, official, orbit, research, tensorflow_models

Authors Vishal Purohit, Wenxin Jiang, Akshath R. Ravikiran, James C. Davis arXiv ID 2404.18801 Category cs.CV: Computer Vision Cross-listed cs.LG, cs.SE Citations 1 Venue arXiv.org Repository https://github.com/PurdueDualityLab/tf-maskformer/tree/main/official/projects/maskformer โญ 6 Last Checked 3 months ago
Abstract
This paper undertakes the task of replicating the MaskFormer model a universal image segmentation model originally developed using the PyTorch framework, within the TensorFlow ecosystem, specifically optimized for execution on Tensor Processing Units (TPUs). Our implementation exploits the modular constructs available within the TensorFlow Model Garden (TFMG), encompassing elements such as the data loader, training orchestrator, and various architectural components, tailored and adapted to meet the specifications of the MaskFormer model. We address key challenges encountered during the replication, non-convergence issues, slow training, adaptation of loss functions, and the integration of TPU-specific functionalities. We verify our reproduced implementation and present qualitative results on the COCO dataset. Although our implementation meets some of the objectives for end-to-end reproducibility, we encountered challenges in replicating the PyTorch version of MaskFormer in TensorFlow. This replication process is not straightforward and requires substantial engineering efforts. Specifically, it necessitates the customization of various components within the TFMG, alongside thorough verification and hyper-parameter tuning. The replication is available at: https://github.com/PurdueDualityLab/tf-maskformer/tree/main/official/projects/maskformer
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Computer Vision

๐ŸŒ… ๐ŸŒ… Old Age

Fast R-CNN

Ross Girshick

cs.CV ๐Ÿ› ICCV ๐Ÿ“š 27.7K cites 11 years ago