R.I.P.
👻
Ghosted
PeaTMOSS: Mining Pre-Trained Models in Open-Source Software
October 05, 2023 · Entered Twilight · arXiv.org
Repo contents: .DS_Store, .gitignore, Examples, PeaTMOSS.py, PeaTMOSS.sql, PeaTMOSS_SAMPLE.db.zip, README.md, environment.yml, globus.py
Authors
Wenxin Jiang, Jason Jones, Jerin Yasmin, Nicholas Synovic, Rajeev Sashti, Sophie Chen, George K. Thiruvathukal, Yuan Tian, James C. Davis
arXiv ID
2310.03620
Category
cs.SE: Software Engineering
Cross-listed
cs.AI
Citations
2
Venue
arXiv.org
Repository
https://github.com/PurdueDualityLab/PeaTMOSS-Demos
⭐ 13
Last Checked
3 months ago
Abstract
Developing and training deep learning models is expensive, so software engineers have begun to reuse pre-trained deep learning models (PTMs) and fine-tune them for downstream tasks. Despite the widespread use of PTMs, we know little about the corresponding software engineering behaviors and challenges. To enable the study of software engineering with PTMs, we present the PeaTMOSS dataset: Pre-Trained Models in Open-Source Software. PeaTMOSS has three parts: a snapshot of (1) 281,638 PTMs, (2) 27,270 open-source software repositories that use PTMs, and (3) a mapping between PTMs and the projects that use them. We challenge PeaTMOSS miners to discover software engineering practices around PTMs. A demo and link to the full dataset are available at: https://github.com/PurdueDualityLab/PeaTMOSS-Demos.
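The three-part structure described in the abstract (PTMs, repositories, and a mapping between them) lends itself to a relational layout. The following is a minimal sketch only: the table names `ptm`, `repo`, and `ptm_usage`, their columns, and the sample rows are hypothetical and chosen for illustration. They are not the actual PeaTMOSS schema, which is defined in PeaTMOSS.sql in the repository.

```python
import sqlite3

# Hypothetical three-table layout mirroring the abstract's three parts.
# Table and column names are illustrative assumptions, NOT the real
# PeaTMOSS schema (consult PeaTMOSS.sql in the repository for that).
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE ptm  (id INTEGER PRIMARY KEY, name TEXT NOT NULL);
CREATE TABLE repo (id INTEGER PRIMARY KEY, url  TEXT NOT NULL);
CREATE TABLE ptm_usage (
    ptm_id  INTEGER REFERENCES ptm(id),
    repo_id INTEGER REFERENCES repo(id),
    PRIMARY KEY (ptm_id, repo_id)
);
""")

# Made-up sample rows, purely to make the query below runnable.
conn.executemany("INSERT INTO ptm VALUES (?, ?)",
                 [(1, "model-a"), (2, "model-b")])
conn.executemany("INSERT INTO repo VALUES (?, ?)",
                 [(1, "example.org/r1"), (2, "example.org/r2"), (3, "example.org/r3")])
conn.executemany("INSERT INTO ptm_usage VALUES (?, ?)",
                 [(1, 1), (1, 2), (1, 3), (2, 2)])


def reuse_counts(connection):
    """Return (model name, number of repositories using it), most reused first."""
    return connection.execute("""
        SELECT ptm.name, COUNT(ptm_usage.repo_id) AS n_repos
        FROM ptm
        LEFT JOIN ptm_usage ON ptm_usage.ptm_id = ptm.id
        GROUP BY ptm.id, ptm.name
        ORDER BY n_repos DESC, ptm.name
    """).fetchall()


counts = reuse_counts(conn)
for name, n_repos in counts:
    print(f"{name}: used by {n_repos} repositories")
```

Pointing `sqlite3.connect` at the unzipped sample database instead of `:memory:` would be the natural next step, with the query rewritten against the real table names.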
Similar Papers
In the same crypt: Software Engineering
R.I.P.
👻
Ghosted
Microservices: yesterday, today, and tomorrow
The Cartographer
A Survey of Machine Learning for Big Code and Naturalness
R.I.P.
👻
Ghosted
An Overview on Smart Contracts: Challenges, Advances and Platforms
R.I.P.
👻
Ghosted
Slither: A Static Analysis Framework For Smart Contracts
R.I.P.
👻
Ghosted