5 results found Sort:

70
831
apache-2.0
15
We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 Turbo (April 2024) and GPT-4o.
Created 2024-05-13
52 commits to main branch, last one 6 months ago
Self-evaluating interview for AI coders
Created 2023-05-27
763 commits to main branch, last one 7 days ago
36
393
mit
11
Run evaluation on LLMs using human-eval benchmark
Created 2023-07-01
76 commits to main branch, last one about a year ago
SkyCode是一个多语言开源编程大模型,采用GPT3模型结构,支持Java, JavaScript, C, C++, Python, Go, shell等多种主流编程语言,并能理解中文注释。模型可以对代码进行补全,拥有强大解题能力,使您从编程中解放出来,专心于解决更重要的问题。| SkyCode is an open source programming model, which adopts...
Created 2022-12-14
19 commits to main branch, last one about a year ago
Evaluate LLM-generated COBOL
Created 2024-03-20
4 commits to main branch, last one 8 months ago