A Taxonomy of Inefficiencies in LLM-Generated Python Code

March 08, 2025 · The Cartographer · 🏛 IEEE International Conference on Software Maintenance and Evolution

"No code URL or promise found in abstract"
"Title-pattern auto-detect: A Taxonomy of Inefficiencies in LLM-Generated Python Code"

Evidence collected by the PWNC Scanner

Authors Altaf Allah Abbassi, Leuson Da Silva, Amin Nikanjam, Foutse Khomh arXiv ID 2503.06327 Category cs.SE: Software Engineering Citations 7 Venue IEEE International Conference on Software Maintenance and Evolution Last Checked 3 days ago

Abstract

Large Language Models (LLMs) are widely adopted for automated code generation with promising results. Although prior research has assessed LLM-generated code and identified various quality issues -- such as redundancy, poor maintainability, and sub-optimal performance a systematic understanding and categorization of these inefficiencies remain unexplored. Without such knowledge, practitioners struggle to optimize LLM-generated code for real-world applications, limiting its adoption. This study can also guide improving code LLMs, enhancing the quality and efficiency of code generation. Therefore, in this study, we empirically investigate inefficiencies in LLM-generated code by state-of-the-art models, i.e., CodeLlama, DeepSeek-Coder, and CodeGemma. To do so, we analyze 492 generated code snippets in the HumanEval++ dataset. We then construct a taxonomy of inefficiencies in LLM-generated code that includes 5 categories General Logic, Performance, Readability, Maintainability, and Errors) and 19 subcategories of inefficiencies. We then validate the proposed taxonomy through an online survey with 58 LLM practitioners and researchers. Our study indicates that logic and performance-related inefficiencies are the most popular, relevant, and frequently co-occur and impact overall code quality inefficiency. Our taxonomy provides a structured basis for evaluating the quality LLM-generated code and guiding future research to improve code generation efficiency.