diff --git a/index.html b/index.html index 2b97196..9ec9ad3 100644 --- a/index.html +++ b/index.html @@ -40,7 +40,7 @@
You can contribute your own tests, too! See the GitHub README for contributing instructions.
Tests are run every day at 1am PT. Last updated January 24, 2025.
+Tests are run every day at 1am PT. Last updated January 25, 2025.
Made with ❤️ by the team at Roboflow.
Can GPT-4V read a document and return the exact characters in the text?
+Fail
+Of the last 7 tests, conducted daily, this test has passed 86.0% of the time.
+Today's request cost $0.01
+We send a screenshot of a typed document to determine if it can correctly read the text. If it correctly gets the text, it gets a 100%. Otherwise, it gets a 0%.+
+ Read the text in the image. Return only the text, with punctuation. ++
I was thinking earlier today that I have gone through, to use the lingo, eras of listening to each of Swift's Eras. Meta indeed. I started listening to Ms. Swift's music after hearing the *Midnights* album. A few weeks after hearing the album for the first time, I found myself playing various songs on repeat. I listened to the album in order multiple times.+
Test submitted by Roboflow
+{'x': 0.5, 'y': 0.35, 'width': 0.3, 'height': 0.4}+
{'x': 0.5, 'y': 0.35, 'width': 0.3, 'height': 0.2}
Test submitted by Roboflow
Of the last 7 tests, conducted daily, this test has passed 0% of the time.
-Today's request cost $0.012
+Today's request cost $0.011
```json { - "A": { - "quantity": 20, - "price": 10 - }, - "B": { - "quantity": 25, - "price": 20 - }, - "C": { - "quantity": 30, - "price": 30 - }, - "D": { - "quantity": 35, - "price": 40 - } + "A": { "quantity": 20, "price": 10 }, + "B": { "quantity": 25, "price": 20 }, + "C": { "quantity": 30, "price": 30 }, + "D": { "quantity": 35, "price": 40 } } ```
Test submitted by Roboflow
@@ -305,7 +347,7 @@```json { - "R": 80, + "R": 82, "G": 0, "B": 128 } @@ -431,25 +473,16 @@Result
Can GPT-4V classify an image without being trained on that particular use case?
+Can GPT-4V break an easy CAPTCHA?
Pass
+Fail
Of the last 7 tests, conducted daily, this test has passed 100% of the time.
+Of the last 7 tests, conducted daily, this test has passed 86.0% of the time.
Today's request cost $0.006
We check to see if the model can correctly identify the vehicle. If it can, it recieves a 100%, if it is incorrect, it recieves a 0%.+
We provide a CAPTCHA image (created using Wolfram Alpha's CAPTCHA command) and ask it to provide the input required to pass the test. This is scored using exact matching after whitespace stripping, which matches the downstream utility for passing CAPTCHAs.
- What is in the image? Return the class of the object in the image. Here are the classes: Toyota Camry, Tesla Model 3. You can only return one class from that list. + Please provide the string required to pass this CAPTCHA. Do not respond with anything else. Do not include whitespace besides spaces between words.
Toyota Camry-
Test submitted by Roboflow
+```charybdis indubitable```+
Test submitted by Charles Frye
Can GPT-4V read a document and return the exact characters in the text?
+Can GPT-4V classify an image without being trained on that particular use case?
Of the last 7 tests, conducted daily, this test has passed 86.0% of the time.
-Today's request cost $0.01
+Of the last 7 tests, conducted daily, this test has passed 100% of the time.
+Today's request cost $0.006
We send a screenshot of a typed document to determine if it can correctly read the text. If it correctly gets the text, it gets a 100%. Otherwise, it gets a 0%.+
We check to see if the model can correctly identify the vehicle. If it can, it recieves a 100%, if it is incorrect, it recieves a 0%.
- Read the text in the image. Return only the text, with punctuation. + What is in the image? Return the class of the object in the image. Here are the classes: Toyota Camry, Tesla Model 3. You can only return one class from that list.
I was thinking earlier today that I have gone through, to use the lingo, eras of listening to each of Swift's Eras. Meta indeed. I started listening to Ms. Swift's music after hearing the Midnights album. A few weeks after hearing the album for the first time, I found myself playing various songs on repeat. I listened to the album in order multiple times.+
Toyota Camry
Test submitted by Roboflow
Can GPT-4V break an easy CAPTCHA?
-Pass
-Of the last 7 tests, conducted daily, this test has passed 86.0% of the time.
-Today's request cost $0.006
-We provide a CAPTCHA image (created using Wolfram Alpha's CAPTCHA command) and ask it to provide the input required to pass the test. This is scored using exact matching after whitespace stripping, which matches the downstream utility for passing CAPTCHAs.-
- Please provide the string required to pass this CAPTCHA. Do not respond with anything else. Do not include whitespace besides spaces between words. --
charybdis indubitable-
Test submitted by Charles Frye
-