diff --git a/index.html b/index.html
index 3aeff51..e5414e8 100644
--- a/index.html
+++ b/index.html
@@ -40,7 +40,7 @@
You can contribute your own tests, too! See the GitHub README for contributing instructions.
-Tests are run every day at 1am PT. Last updated January 17, 2025.
+Tests are run every day at 1am PT. Last updated January 18, 2025.
Made with ❤️ by the team at Roboflow.
Of the last 7 tests, conducted daily, this test has passed 0% of the time.
-Today's request cost $0.012
+Today's request cost $0.011
```json
{
-  "A": {
-    "quantity": 20,
-    "price": 10
-  },
-  "B": {
-    "quantity": 25,
-    "price": 20
-  },
-  "C": {
-    "quantity": 30,
-    "price": 30
-  },
-  "D": {
-    "quantity": 35,
-    "price": 40
-  }
+  "A": { "quantity": 20, "price": 10 },
+  "B": { "quantity": 25, "price": 20 },
+  "C": { "quantity": 30, "price": 30 },
+  "D": { "quantity": 35, "price": 40 }
}
```
Test submitted by Roboflow
@@ -361,7 +349,7 @@ Test submitted by Roboflow
@@ -403,7 +391,7 @@ Of the last 7 tests, conducted daily, this test has passed 0% of the time.
-Today's request cost $0.017
+Today's request cost $0.016
While the image clearly shows several cars and corresponding red bounding boxes, the labeling may or may not be complete until verified. Based on visual inspection:
-
-- A possible missing annotation exists for the white car on the far right of the image, as it doesn't appear to be enclosed in a red bounding box.
-
-### JSON Output:
-```json
+```json
{ "missing": 1 }
-```
+```
+
+*In the image, there seems to be a vehicle (the prominent white one on the right) without a red bounding box annotation, indicating a missing annotation.*
Test submitted by Roboflow
Can GPT-4V break an easy CAPTCHA?
+Fail
+Of the last 7 tests, conducted daily, this test has passed 57.0% of the time.
+Today's request cost $0.006
+We provide a CAPTCHA image (created using Wolfram Alpha's CAPTCHA command) and ask it to provide the input required to pass the test. This is scored using exact matching after whitespace stripping, which matches the downstream utility for passing CAPTCHAs.
+Please provide the string required to pass this CAPTCHA. Do not respond with anything else. Do not include whitespace besides spaces between words.
+
+```charybdis indubitable```
+Test submitted by Charles Frye
+Of the last 7 tests, conducted daily, this test has passed 100% of the time.
-Today's request cost $0.009
+Today's request cost $0.007
Can GPT-4V break an easy CAPTCHA?
-Pass
-Of the last 7 tests, conducted daily, this test has passed 57.0% of the time.
-Today's request cost $0.006
-We provide a CAPTCHA image (created using Wolfram Alpha's CAPTCHA command) and ask it to provide the input required to pass the test. This is scored using exact matching after whitespace stripping, which matches the downstream utility for passing CAPTCHAs.
-Please provide the string required to pass this CAPTCHA. Do not respond with anything else. Do not include whitespace besides spaces between words.
-
-charybdis indubitable
-Test submitted by Charles Frye
-
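
The CAPTCHA test in this diff is described as "scored using exact matching after whitespace stripping". As a rough illustration of what such a check might look like, here is a minimal Python sketch. The function name `score_exact_match` is hypothetical, and the sketch assumes "whitespace stripping" means trimming leading and trailing whitespace only. The site's actual scoring code is not shown in this diff and may differ.

```python
def score_exact_match(response: str, expected: str) -> bool:
    """Return True when the response equals the expected answer after trimming.

    Assumption: "whitespace stripping" trims only leading and trailing
    whitespace; spaces between words are kept and must match exactly.
    """
    return response.strip() == expected.strip()


if __name__ == "__main__":
    expected = "charybdis indubitable"
    # A trailing newline is trimmed before comparison.
    print(score_exact_match("charybdis indubitable\n", expected))
    # Extra characters such as backticks are not whitespace, so they are kept.
    print(score_exact_match("```charybdis indubitable```", expected))
```

Under these assumptions, a response wrapped in backticks would not match the bare expected string, because backticks are not removed by trimming.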