Skip to content

Commit

Permalink
Add results to lite
Browse files Browse the repository at this point in the history
  • Loading branch information
carlosejimenez committed Mar 19, 2024
1 parent fbe0695 commit 45ab640
Show file tree
Hide file tree
Showing 2 changed files with 6 additions and 1 deletion.
Binary file added img/swe-bench_lite_results.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
7 changes: 6 additions & 1 deletion lite.html
Original file line number Diff line number Diff line change
Expand Up @@ -69,7 +69,12 @@ <h3>A Canonical Subset for Efficient Evaluation of Language Models as Software E
<br/>
<img src="img/swebench-lite-pie.png" style="width: 50%; max-width: 400px; margin: auto; display: block;"/>
<p class="text-content" style="width: 50%; margin: auto; text-align: center;">
SWE-bench lite distribution across repositories. Compare to the full SWE-bench in Figure 3 of the <a href="https://arxiv.org/abs/2310.06770">SWE-bench paper</a>.
SWE-bench Lite distribution across repositories. Compare to the full SWE-bench in Figure 3 of the <a href="https://arxiv.org/abs/2310.06770">SWE-bench paper</a>.
</p>
</br>
<img src="img/swe-bench_lite_results.png" style="width: 50%; max-width: 400px; margin: auto; display: block;"/>
<p class="text-content" style="width: 50%; margin: auto; text-align: center;">
SWE-bench Lite performance for our baselines. Compare to the full SWE-bench baseline performance in Table 5 of the <a href="https://arxiv.org/abs/2310.06770">SWE-bench paper</a>.
</p>
</div>
</div>
Expand Down

0 comments on commit 45ab640

Please sign in to comment.