Skip to content

Commit

Permalink
Add SWE-bench eval upgrade announcement
Browse files Browse the repository at this point in the history
  • Loading branch information
john-b-yang committed Jun 27, 2024
1 parent 244a7b3 commit 8fb5a54
Show file tree
Hide file tree
Showing 2 changed files with 6 additions and 6 deletions.
6 changes: 3 additions & 3 deletions index.html
Original file line number Diff line number Diff line change
Expand Up @@ -94,9 +94,9 @@ <h3 style="font-size: 20px; padding-top: 1.2em">ICLR 2024</h3>
<section class="main-container">
<div class="content-wrapper" style="display: flex; justify-content: center; align-items: center;">
<div style="background-color: black; padding: 1.5em 1em; color: white; border-radius: 1em; text-align: center; width: 80%;">
🎉 Check out our latest work,
<a href="https://swe-agent.com/" class="light-blue-link" target="_blank" rel="noopener noreferrer">SWE-agent</a>,
which achieves a 12.47% resolve rate on SWE-bench!
🔥 Evaluating on SWE-bench just became a lot more reliable!
SWE-bench evaluation now uses <b>Docker</b> for easier, containerized, reproducible evaluation.
[<a style="color:#0ca7ff" href="https://github.com/princeton-nlp/SWE-bench/tree/main/docs/20240627_docker">Report</a>]
</div>
</div>
<div class="content-wrapper">
Expand Down
6 changes: 3 additions & 3 deletions template/template.html
Original file line number Diff line number Diff line change
Expand Up @@ -94,9 +94,9 @@ <h3 style="font-size: 20px; padding-top: 1.2em">ICLR 2024</h3>
<section class="main-container">
<div class="content-wrapper" style="display: flex; justify-content: center; align-items: center;">
<div style="background-color: black; padding: 1.5em 1em; color: white; border-radius: 1em; text-align: center; width: 80%;">
🎉 Check out our latest work,
<a href="https://swe-agent.com/" class="light-blue-link" target="_blank" rel="noopener noreferrer">SWE-agent</a>,
which achieves a 12.47% resolve rate on SWE-bench!
🔥 Evaluating on SWE-bench just became a lot more reliable!
SWE-bench evaluation now uses <b>Docker</b> for easier, containerized, reproducible evaluation.
[<a style="color:#0ca7ff" href="https://github.com/princeton-nlp/SWE-bench/tree/main/docs/20240627_docker">Report</a>]
</div>
</div>
<div class="content-wrapper">
Expand Down

0 comments on commit 8fb5a54

Please sign in to comment.