wf_httpbin_form_postv1httpbin.orgeasy
Submit the HTTPBin form and return the echoed submission payload
timeout 300s • started Apr 21, 11:28 AM
Cohort status
completed
Attempts
1
Pass rate
100.0%
Live attempts
0
Cohort verdicts
TinyFish • passed
Task promptCollapsed preview. Expand for the full prompt.Expand
Task prompt
Collapsed preview. Expand for the full prompt.
Benchmark case: Submit the HTTPBin form and return the echoed submission payload
Goal: Open the form at https://httpbin.org/forms/post, fill it with the provided input data, submit it, and return the echoed submitted_form object.
Preferred start URL: https://httpbin.org/forms/post
Input guidance: Use these exact values when filling the form.
Structured input data:
{
"comments": "Leave at reception",
"custemail": "ada@example.com",
"custname": "Ada Lovelace",
"custtel": "+1-415-555-0136",
"delivery": "19:45",
"size": "medium",
"topping": "bacon"
}
Return a final JSON object matching this schema:
{
"type": "object",
"required": [
"submitted_form"
],
"properties": {
"submission_url": {
"type": "string",
"format": "uri",
"optional": true
},
"submitted_form": {
"type": "object",
"required": [
"custname",
"custtel",
"custemail",
"size",
"topping",
"delivery",
"comments"
],
"properties": {
"comments": {
"type": "string"
},
"custemail": {
"type": "string"
},
"custname": {
"type": "string"
},
"custtel": {
"type": "string"
},
"delivery": {
"type": "string"
},
"size": {
"type": "string"
},
"topping": {
"type": "string"
}
}
}
}
}
Work the website directly. Prefer exact values over paraphrases. Return only the final structured answer when done.Benchmark contract
Frozen case metadata for this cohort
This is the versioned case payload the arena executed and later submitted to tinyeval.
Case ID
wf_httpbin_form_post
Version
1
Domain
httpbin.org
Difficulty
easy
Timeout
300s
Entrypoint
https://httpbin.org/forms/post
Structured input data
{
"comments": "Leave at reception",
"custemail": "ada@example.com",
"custname": "Ada Lovelace",
"custtel": "+1-415-555-0136",
"delivery": "19:45",
"size": "medium",
"topping": "bacon"
}Output schema
{
"type": "object",
"required": [
"submitted_form"
],
"properties": {
"submission_url": {
"type": "string",
"format": "uri",
"optional": true
},
"submitted_form": {
"type": "object",
"required": [
"custname",
"custtel",
"custemail",
"size",
"topping",
"delivery",
"comments"
],
"properties": {
"comments": {
"type": "string"
},
"custemail": {
"type": "string"
},
"custname": {
"type": "string"
},
"custtel": {
"type": "string"
},
"delivery": {
"type": "string"
},
"size": {
"type": "string"
},
"topping": {
"type": "string"
}
}
}
}
}TinyFish
Eval status: passed
completed
Success
100.0%
Latency
45.94s
Cost
$0.00
Tokens
11,243
Steps
8
Current URL
https://httpbin.org/forms/post
Submitted outputExact structured payload submitted to tinyeval for scoring.Expand
Submitted output
Exact structured payload submitted to tinyeval for scoring.
{
"submission_url": "https://httpbin.org/post",
"submitted_form": {
"comments": "Leave at reception",
"custemail": "ada@example.com",
"custname": "Ada Lovelace",
"custtel": "+1-415-555-0136",
"delivery": "19:45",
"size": "medium",
"topping": "bacon"
}
}Output
{
"submission_url": "https://httpbin.org/post",
"submitted_form": {
"comments": "Leave at reception",
"custemail": "ada@example.com",
"custname": "Ada Lovelace",
"custtel": "+1-415-555-0136",
"delivery": "19:45",
"size": "medium",
"topping": "bacon"
}
}Live viewer
Viewer URL retained for this attempt. Availability depends on the underlying provider session.
Agent steps15 logged steps. Expand for the full timeline.Expand
Agent steps
15 logged steps. Expand for the full timeline.
1. navigate_to
Apr 21, 11:28 AM
Navigate to the form page.
https://httpbin.org/
1. navigate_to
Apr 21, 11:28 AM
page_map: 317 chars
https://httpbin.org/forms/post
2. set_value
Apr 21, 11:28 AM
Fill the pizza order form with input data.
https://httpbin.org/forms/post
2. set_value
Apr 21, 11:28 AM
page_map: 575 chars
https://httpbin.org/forms/post
3. click
Apr 21, 11:28 AM
Select pizza size Medium
https://httpbin.org/forms/post
3. click
Apr 21, 11:28 AM
page_map: 476 chars
https://httpbin.org/forms/post
4. set_value
Apr 21, 11:28 AM
Fill remaining form fields and submit
https://httpbin.org/forms/post
4. set_value
Apr 21, 11:28 AM
page_map: 378 chars
https://httpbin.org/forms/post
5. click
Apr 21, 11:28 AM
Select bacon topping and submit form
https://httpbin.org/forms/post
5. click
Apr 21, 11:28 AM
page_map: 486 chars
https://httpbin.org/forms/post
6. set_value
Apr 21, 11:28 AM
Enter delivery time and instructions
https://httpbin.org/forms/post
6. set_value
Apr 21, 11:28 AM
page_map: 520 chars
https://httpbin.org/forms/post
7. click
Apr 21, 11:28 AM
Submit the pizza order form
https://httpbin.org/forms/post
7. click
Apr 21, 11:28 AM
page_map: 1570 chars
https://httpbin.org/forms/post
8. end
Apr 21, 11:28 AM
{"error":"none","notes":"The form was successfully submitted, and the response from httpbin.org/post shows the echoed form data. I have extracted the 'form' object from the page...
https://httpbin.org/forms/post
Evaluation
Passed all required checks
Final output matched the expected result.
Failure reason
--
Failed checkpoint
--
Checkpoint summary
0 passed • 0 failed
Evaluated
Apr 21, 11:28 AM
Raw evaluation payload
{
"submit": {
"eval_id": "run_b16c6db291c969bf",
"run_id": "run_b16c6db291c969bf",
"status": "success",
"note": "Final output matched the expected result."
},
"detail": {
"eval": {
"run_id": "run_b16c6db291c969bf",
"case_id": "wf_httpbin_form_post",
"case_version": 1,
"status": "passed",
"agent_name": "TinyFish",
"last_evaluation": {
"event_id": "evt_3444ba3484864a7f",
"status": "passed",
"note": "Final output matched the expected result.",
"evaluated_at": "2026-04-21T11:28:51.849966252Z"
},
"created_at": "2026-04-21T11:28:51.525492316Z",
"updated_at": "2026-04-21T11:28:51.849967172Z",
"completed_at": "2026-04-21T11:28:51.849967172Z"
},
"run": {
"run_id": "run_b16c6db291c969bf",
"case_id": "wf_httpbin_form_post",
"case_version": 1,
"status": "passed",
"agent_name": "TinyFish",
"last_evaluation": {
"event_id": "evt_3444ba3484864a7f",
"status": "passed",
"note": "Final output matched the expected result.",
"evaluated_at": "2026-04-21T11:28:51.849966252Z"
},
"created_at": "2026-04-21T11:28:51.525492316Z",
"updated_at": "2026-04-21T11:28:51.849967172Z",
"completed_at": "2026-04-21T11:28:51.849967172Z"
},
"case": {
"case_id": "wf_httpbin_form_post",
"version": 1,
"status": "active",
"created_at": "2026-04-09T00:00:00Z",
"updated_at": "2026-04-09T00:00:00Z",
"title": "Submit the HTTPBin form and return the echoed submission payload",
"task_description": "Open the form at https://httpbin.org/forms/post, fill it with the provided input data, submit it, and return the echoed submitted_form object.",
"domains": [
"httpbin.org"
],
"tags": [
"forms",
"httpbin",
"submission",
"echo"
],
"difficulty": "easy",
"locale": "en-US",
"input_spec": {
"description": "Use these exact values when filling the form.",
"schema": {
"type": "object",
"required": [
"custname",
"custtel",
"custemail",
"size",
"topping",
"delivery",
"comments"
],
"properties": {
"comments": {
"type": "string"
},
"custemail": {
"type": "string"
},
"custname": {
"type": "string"
},
"custtel": {
"type": "string"
},
"delivery": {
"type": "string"
},
"size": {
"type": "string"
},
"topping": {
"type": "string"
}
}
},
"data": {
"comments": "Leave at reception",
"custemail": "ada@example.com",
"custname": "Ada Lovelace",
"custtel": "+1-415-555-0136",
"delivery": "19:45",
"size": "medium",
"topping": "bacon"
}
},
"output_schema": {
"type": "object",
"required": [
"submitted_form"
],
"properties": {
"submission_url": {
"type": "string",
"format": "uri",
"optional": true
},
"submitted_form": {
"type": "object",
"required": [
"custname",
"custtel",
"custemail",
"size",
"topping",
"delivery",
"comments"
],
"properties": {
"comments": {
"type": "string"
},
"custemail": {
"type": "string"
},
"custname": {
"type": "string"
},
"custtel": {
"type": "string"
},
"delivery": {
"type": "string"
},
"size": {
"type": "string"
},
"topping": {
"type": "string"
}
}
}
}
},
"entrypoint": {
"mode": "url",
"url": "https://httpbin.org/forms/post"
},
"run_policy": {
"timeout_seconds": 300
}
},
"events": [
{
"event_id": "evt_3444ba3484864a7f",
"run_id": "run_b16c6db291c969bf",
"event_type": "final",
"payload": {
"event_type": "final",
"timestamp": "2026-04-21T11:28:51.528776883Z",
"page_state": {},
"intermediate_data": {
"raw_submission": {
"submission_url": "https://httpbin.org/post",
"submitted_form": {
"comments": "Leave at reception",
"custemail": "ada@example.com",
"custname": "Ada Lovelace",
"custtel": "+1-415-555-0136",
"delivery": "19:45",
"size": "medium",
"topping": "bacon"
}
}
},
"final_output": {
"submission_url": "https://httpbin.org/post",
"submitted_form": {
"comments": "Leave at reception",
"custemail": "ada@example.com",
"custname": "Ada Lovelace",
"custtel": "+1-415-555-0136",
"delivery": "19:45",
"size": "medium",
"topping": "bacon"
}
},
"artifacts": {},
"agent": {}
},
"created_at": "2026-04-21T11:28:51.529006472Z"
}
],
"timeout_seconds": 300,
"deadline_at": "2026-04-21T11:33:51.525492316Z"
}
}Provider terminal payloadRaw terminal result retained from the provider adapter.Expand
Provider terminal payload
Raw terminal result retained from the provider adapter.
{
"answer": "{\n \"submission_url\": \"https://httpbin.org/post\",\n \"submitted_form\": {\n \"comments\": \"Leave at reception\",\n \"custemail\": \"ada@example.com\",\n \"custname\": \"Ada Lovelace\",\n \"custtel\": \"+1-415-555-0136\",\n \"delivery\": \"19:45\",\n \"size\": \"medium\",\n \"topping\": \"bacon\"\n }\n}",
"data": {
"error": "none",
"notes": "The form was successfully submitted, and the response from httpbin.org/post shows the echoed form data. I have extracted the 'form' object from the page content as requested.",
"result": "{\n \"submission_url\": \"https://httpbin.org/post\",\n \"submitted_form\": {\n \"comments\": \"Leave at reception\",\n \"custemail\": \"ada@example.com\",\n \"custname\": \"Ada Lovelace\",\n \"custtel\": \"+1-415-555-0136\",\n \"delivery\": \"19:45\",\n \"size\": \"medium\",\n \"topping\": \"bacon\"\n }\n}",
"success": true
},
"durationMs": 45938,
"error": "none",
"inputTokens": 10452,
"llmDurationMs": 10786,
"model": "gemini-3-flash-preview",
"notes": "The form was successfully submitted, and the response from httpbin.org/post shows the echoed form data. I have extracted the 'form' object from the page content as requested.",
"outputTokens": 791,
"recordingId": "6af8b27c-0b62-4d75-9865-0de79a21648f",
"result": "{\n \"submission_url\": \"https://httpbin.org/post\",\n \"submitted_form\": {\n \"comments\": \"Leave at reception\",\n \"custemail\": \"ada@example.com\",\n \"custname\": \"Ada Lovelace\",\n \"custtel\": \"+1-415-555-0136\",\n \"delivery\": \"19:45\",\n \"size\": \"medium\",\n \"topping\": \"bacon\"\n }\n}",
"success": true,
"thinkingTokens": 0,
"totalSteps": 8,
"totalTokens": 11243,
"type": "done"
}