R.I.P.
π»
Ghosted
GitHub Copilot: the perfect Code compLeeter?
June 17, 2024 Β· Entered Twilight Β· π arXiv.org
Repo contents: .gitignore, C++, Copilot_Solutions, Java, Python3, README.md, Rust, config.py, correctness_by_topic.csv, grand_concat_table.csv, leetcode_config.py, main.py, script.applescript, utils.py
Authors
Ilja SiroΕ‘, Dave SingelΓ©e, Bart Preneel
arXiv ID
2406.11326
Category
cs.SE: Software Engineering
Cross-listed
cs.AI
Citations
5
Venue
arXiv.org
Repository
https://github.com/IljaSir/CopilotSolverForLeetCode
Last Checked
3 months ago
Abstract
This paper aims to evaluate GitHub Copilot's generated code quality based on the LeetCode problem set using a custom automated framework. We evaluate the results of Copilot for 4 programming languages: Java, C++, Python3 and Rust. We aim to evaluate Copilot's reliability in the code generation stage, the correctness of the generated code and its dependency on the programming language, problem's difficulty level and problem's topic. In addition to that, we evaluate code's time and memory efficiency and compare it to the average human results. In total, we generate solutions for 1760 problems for each programming language and evaluate all the Copilot's suggestions for each problem, resulting in over 50000 submissions to LeetCode spread over a 2-month period. We found that Copilot successfully solved most of the problems. However, Copilot was rather more successful in generating code in Java and C++ than in Python3 and Rust. Moreover, in case of Python3 Copilot proved to be rather unreliable in the code generation phase. We also discovered that Copilot's top-ranked suggestions are not always the best. In addition, we analysed how the topic of the problem impacts the correctness rate. Finally, based on statistics information from LeetCode, we can conclude that Copilot generates more efficient code than an average human.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β Software Engineering
R.I.P.
π»
Ghosted
Microservices: yesterday, today, and tomorrow
π
π
The Cartographer
A Survey of Machine Learning for Big Code and Naturalness
R.I.P.
π»
Ghosted
An Overview on Smart Contracts: Challenges, Advances and Platforms
R.I.P.
π»
Ghosted
Slither: A Static Analysis Framework For Smart Contracts
R.I.P.
π»
Ghosted