5 results found Sort:
We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 Turbo (April 2024) and GPT-4o.
Created
2024-05-13
52 commits to main branch, last one 6 months ago
Self-evaluating interview for AI coders
Created
2023-05-27
763 commits to main branch, last one 7 days ago
Run evaluation on LLMs using human-eval benchmark
Created
2023-07-01
76 commits to main branch, last one about a year ago
SkyCode是一个多语言开源编程大模型,采用GPT3模型结构,支持Java, JavaScript, C, C++, Python, Go, shell等多种主流编程语言,并能理解中文注释。模型可以对代码进行补全,拥有强大解题能力,使您从编程中解放出来,专心于解决更重要的问题。| SkyCode is an open source programming model, which adopts...
Created
2022-12-14
19 commits to main branch, last one about a year ago
Evaluate LLM-generated COBOL
Created
2024-03-20
4 commits to main branch, last one 8 months ago