Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

EvoEval

university
https://evo-eval.github.io/
evo-eval
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

nevetsaix  authored a paper about 1 month ago
Live-SWE-agent: Can Software Engineering Agents Self-Evolve on the Fly?
Yinlin  authored a paper over 1 year ago
Agentless: Demystifying LLM-based Software Engineering Agents
nevetsaix  authored a paper over 1 year ago
Is Your Code Generated by ChatGPT Really Correct? Rigorous Evaluation of Large Language Models for Code Generation
View all activity

Deng's profile picture Chunqiu Steven Xia's profile picture

evoeval 's datasets 5

evoeval/EvoEval_tool_use

Viewer • Updated Mar 27, 2024 • 100 • 36 • 3

evoeval/EvoEval_combine

Viewer • Updated Mar 27, 2024 • 100 • 32

evoeval/EvoEval_subtle

Viewer • Updated Mar 27, 2024 • 100 • 28

evoeval/EvoEval_creative

Viewer • Updated Mar 27, 2024 • 100 • 23

evoeval/EvoEval_difficult

Viewer • Updated Mar 27, 2024 • 100 • 36 • 2
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs