-
nuprl/MultiPL-E
Viewer • Updated • 12.7k • 64.8k • 60 -
openai/openai_humaneval
Viewer • Updated • 164 • 118k • 360 -
Big Code Models Leaderboard
📈1.48kExplore and compare code generation models on a leaderboard
-
Copilot Evaluation Harness: Evaluating LLM-Guided Software Programming
Paper • 2402.14261 • Published • 10
Shaun
drgitt
AI & ML interests
None yet
Organizations
None yet
codegen_eval
-
nuprl/MultiPL-E
Viewer • Updated • 12.7k • 64.8k • 60 -
openai/openai_humaneval
Viewer • Updated • 164 • 118k • 360 -
Running1.48k
Big Code Models Leaderboard
📈1.48kExplore and compare code generation models on a leaderboard
-
Copilot Evaluation Harness: Evaluating LLM-Guided Software Programming
Paper • 2402.14261 • Published • 10
Interesting LLMs
datasets
0
None public yet