Low-Cost Language Models: Survey and Performance Evaluation on Python Code Generation

Espejel, Jessica López; Alassan, Mahaman Sanoussi Yahaya; Bouhandi, Merieme; Dahhane, Walid; Ettifouri, El Hassane

arXiv:2404.11160 (cs)

[Submitted on 17 Apr 2024]

Abstract: Large Language Models (LLMs) have become the go-to solution for many Natural Language Processing (NLP) tasks due to their ability to tackle various problems and produce high-quality results. Specifically, they are increasingly used to automatically generate code, easing the burden on developers by handling repetitive tasks. However, this improvement in quality has led to high computational and memory demands, making LLMs inaccessible to users with limited resources. In this paper, we focus on Central Processing Unit (CPU)-compatible models and conduct a thorough semi-manual evaluation of their strengths and weaknesses in generating Python code. We enhance their performance by introducing a Chain-of-Thought prompt that guides the model in problem-solving. Additionally, we propose a dataset of 60 programming problems with varying difficulty levels for evaluation purposes. Our assessment also includes testing these models on two state-of-the-art datasets: HumanEval and EvalPlus. We commit to sharing our dataset and experimental results publicly to ensure transparency.

Comments: Under review at Elsevier's Engineering Applications of Artificial Intelligence
Subjects: Artificial Intelligence (cs.AI)
Cite as: arXiv:2404.11160 [cs.AI] (or arXiv:2404.11160v1 [cs.AI] for this version)
DOI: https://doi.org/10.48550/arXiv.2404.11160

Submission history

From: El Hassane Ettifouri
[v1] Wed, 17 Apr 2024 08:16:48 UTC (204 KB)
