Rename SE-Arena to SWE-Arena in README
README.md CHANGED

@@ -1,19 +1,19 @@
 ---
-title: SE-Arena
+title: SWE-Arena
 emoji: 🛠️
 colorFrom: blue
 colorTo: purple
 sdk: gradio
-sdk_version: 5.
+sdk_version: 5.49.1
 app_file: app.py
 hf_oauth: true
 pinned: false
 short_description: The chatbot arena for software engineering
 ---

-# SWE Arena: An Interactive Platform for Evaluating Foundation Models in Software Engineering
+# SWE-Arena: An Interactive Platform for Evaluating Foundation Models in Software Engineering

-Welcome to **SWE Arena**, an open-source platform designed for evaluating software engineering-focused foundation models (FMs), particularly large language models (LLMs). SWE Arena benchmarks models in iterative, context-rich workflows that are characteristic of software engineering (SE) tasks.
+Welcome to **SWE-Arena**, an open-source platform designed for evaluating software engineering-focused foundation models (FMs), particularly large language models (LLMs). SWE-Arena benchmarks models in iterative, context-rich workflows that are characteristic of software engineering (SE) tasks.

 ## Key Features

@@ -26,9 +26,9 @@ Welcome to **SWE Arena**, an open-source platform designed for evaluating softwa
 - **Consistency score**: Quantify model determinism and reliability through self-play matches
 - **Transparent, Open-Source Leaderboard**: View real-time model rankings across diverse SE workflows with full transparency.

-## Why SWE Arena?
+## Why SWE-Arena?

-Existing evaluation frameworks (like Chatbot Arena, WebDev Arena, and Copilot Arena) often don't address the complex, iterative nature of SE tasks. SWE Arena fills critical gaps by:
+Existing evaluation frameworks (like Chatbot Arena, WebDev Arena, and Copilot Arena) often don't address the complex, iterative nature of SE tasks. SWE-Arena fills critical gaps by:

 - Supporting context-rich, multi-turn evaluations to capture iterative workflows
 - Integrating repository-level context through RepoChat to simulate real-world development scenarios

@@ -51,7 +51,7 @@ Existing evaluation frameworks (like Chatbot Arena, WebDev Arena, and Copilot Ar

 ### Usage

-1. Navigate to the [SWE Arena platform](https://huggingface.co/spaces/SE-Arena/Software-Engineering-Arena)
+1. Navigate to the [SWE-Arena platform](https://huggingface.co/spaces/SE-Arena/Software-Engineering-Arena)
 2. Sign in with your Hugging Face account
 3. Enter your SE task prompt (optionally include a repository URL for RepoChat)
 4. Engage in multi-round interactions and vote on model performance

@@ -66,7 +66,7 @@ We welcome contributions from the community! Here's how you can help:

 ## Privacy Policy

-Your interactions are anonymized and used solely for improving SWE Arena and FM benchmarking. By using SWE Arena, you agree to our Terms of Service.
+Your interactions are anonymized and used solely for improving SWE-Arena and FM benchmarking. By using SWE-Arena, you agree to our Terms of Service.

 ## Future Plans
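
The front-matter block in this diff is the full Hugging Face Spaces configuration: `sdk: gradio` plus `hf_oauth: true` is what provisions the OAuth flow behind step 2 of the Usage section, and `app_file: app.py` names the entry point that consumes it. `app.py` itself is not part of this commit; the following is a minimal sketch of how a Gradio app typically picks up that OAuth session (the `greet` handler and its messages are invented for illustration):

```python
import gradio as gr

def greet(profile: gr.OAuthProfile | None) -> str:
    # Gradio injects the signed-in user's profile into any handler that
    # declares a parameter typed gr.OAuthProfile; it stays None until the
    # visitor completes the Hugging Face sign-in.
    if profile is None:
        return "Please sign in with Hugging Face to start a match."
    return f"Welcome, {profile.name}!"

with gr.Blocks() as demo:
    gr.LoginButton()   # renders the "Sign in with Hugging Face" button
    status = gr.Markdown()
    demo.load(greet, inputs=None, outputs=status)  # refresh greeting on page load

if __name__ == "__main__":
    demo.launch()
```

On a Space with `hf_oauth: true`, the login button completes the OAuth handshake with credentials the platform injects automatically, so no client ID or secret has to be checked into `app.py`.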
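
The **Consistency score** bullet passes through this diff unchanged, and the README does not spell out its formula. Purely as a reading aid, here is one plausible interpretation of "quantify determinism through self-play matches": treat the score as a model's tie rate when it is anonymously paired against itself. The function name and match-record layout below are invented for illustration:

```python
from collections import defaultdict

def consistency_scores(matches: list[dict]) -> dict[str, float]:
    """Illustrative sketch: consistency = tie rate over self-play matches.

    Each record is assumed to look like
    {"model_a": "m1", "model_b": "m1", "winner": "tie" | "model_a" | "model_b"}.
    """
    ties: dict[str, int] = defaultdict(int)
    total: dict[str, int] = defaultdict(int)
    for m in matches:
        if m["model_a"] != m["model_b"]:
            continue  # only self-play matches count toward consistency
        total[m["model_a"]] += 1
        if m["winner"] == "tie":
            ties[m["model_a"]] += 1
    # A fully deterministic model should always tie with itself.
    return {model: ties[model] / total[model] for model in total}
```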