wf_httpbin_form_postv1httpbin.orgeasy
Submit the HTTPBin form and return the echoed submission payload
timeout 300s • started Apr 21, 11:04 AM
Cohort status
completed
Attempts
1
Pass rate
100.0%
Live attempts
0
Cohort verdicts
TinyFishpassed
Task prompt
Collapsed preview. Expand for the full prompt.
Expand
Benchmark case: Submit the HTTPBin form and return the echoed submission payload Goal: Open the form at https://httpbin.org/forms/post, fill it with the provided input data, submit it, and return the echoed submitted_form object. Preferred start URL: https://httpbin.org/forms/post Input guidance: Use these exact values when filling the form. Structured input data: { "comments": "Leave at reception", "custemail": "ada@example.com", "custname": "Ada Lovelace", "custtel": "+1-415-555-0136", "delivery": "19:45", "size": "medium", "topping": "bacon" } Return a final JSON object matching this schema: { "type": "object", "required": [ "submitted_form" ], "properties": { "submission_url": { "type": "string", "format": "uri", "optional": true }, "submitted_form": { "type": "object", "required": [ "custname", "custtel", "custemail", "size", "topping", "delivery", "comments" ], "properties": { "comments": { "type": "string" }, "custemail": { "type": "string" }, "custname": { "type": "string" }, "custtel": { "type": "string" }, "delivery": { "type": "string" }, "size": { "type": "string" }, "topping": { "type": "string" } } } } } Work the website directly. Prefer exact values over paraphrases. Return only the final structured answer when done.
Benchmark case: Submit the HTTPBin form and return the echoed submission payload

Goal: Open the form at https://httpbin.org/forms/post, fill it with the provided input data, submit it, and return the echoed submitted_form object.

Preferred start URL: https://httpbin.org/forms/post

Input guidance: Use these exact values when filling the form.

Structured input data:
{
  "comments": "Leave at reception",
  "custemail": "ada@example.com",
  "custname": "Ada Lovelace",
  "custtel": "+1-415-555-0136",
  "delivery": "19:45",
  "size": "medium",
  "topping": "bacon"
}

Return a final JSON object matching this schema:
{
  "type": "object",
  "required": [
    "submitted_form"
  ],
  "properties": {
    "submission_url": {
      "type": "string",
      "format": "uri",
      "optional": true
    },
    "submitted_form": {
      "type": "object",
      "required": [
        "custname",
        "custtel",
        "custemail",
        "size",
        "topping",
        "delivery",
        "comments"
      ],
      "properties": {
        "comments": {
          "type": "string"
        },
        "custemail": {
          "type": "string"
        },
        "custname": {
          "type": "string"
        },
        "custtel": {
          "type": "string"
        },
        "delivery": {
          "type": "string"
        },
        "size": {
          "type": "string"
        },
        "topping": {
          "type": "string"
        }
      }
    }
  }
}

Work the website directly. Prefer exact values over paraphrases. Return only the final structured answer when done.
Benchmark contract
Frozen case metadata for this cohort
This is the versioned case payload the arena executed and later submitted to tinyeval.
Case ID
wf_httpbin_form_post
Version
1
Domain
httpbin.org
Difficulty
easy
Timeout
300s
Entrypoint
https://httpbin.org/forms/post
Structured input data
{
  "comments": "Leave at reception",
  "custemail": "ada@example.com",
  "custname": "Ada Lovelace",
  "custtel": "+1-415-555-0136",
  "delivery": "19:45",
  "size": "medium",
  "topping": "bacon"
}
Output schema
{
  "type": "object",
  "required": [
    "submitted_form"
  ],
  "properties": {
    "submission_url": {
      "type": "string",
      "format": "uri",
      "optional": true
    },
    "submitted_form": {
      "type": "object",
      "required": [
        "custname",
        "custtel",
        "custemail",
        "size",
        "topping",
        "delivery",
        "comments"
      ],
      "properties": {
        "comments": {
          "type": "string"
        },
        "custemail": {
          "type": "string"
        },
        "custname": {
          "type": "string"
        },
        "custtel": {
          "type": "string"
        },
        "delivery": {
          "type": "string"
        },
        "size": {
          "type": "string"
        },
        "topping": {
          "type": "string"
        }
      }
    }
  }
}
TinyFish
Eval status: passed
completed
Success
100.0%
Latency
11.63s
Cost
$0.00
Tokens
8,766
Steps
6
Current URL
https://httpbin.org/forms/post
Submitted output
Exact structured payload submitted to tinyeval for scoring.
Expand
{
  "submission_url": "https://httpbin.org/post",
  "submitted_form": {
    "comments": "Leave at reception",
    "custemail": "ada@example.com",
    "custname": "Ada Lovelace",
    "custtel": "+1-415-555-0136",
    "delivery": "19:45",
    "size": "medium",
    "topping": "bacon"
  }
}
Output
{
  "submission_url": "https://httpbin.org/post",
  "submitted_form": {
    "comments": "Leave at reception",
    "custemail": "ada@example.com",
    "custname": "Ada Lovelace",
    "custtel": "+1-415-555-0136",
    "delivery": "19:45",
    "size": "medium",
    "topping": "bacon"
  }
}
Live viewer
Viewer URL retained for this attempt. Availability depends on the underlying provider session.
Open viewer
Agent steps
11 logged steps. Expand for the full timeline.
Expand
1. set_value
Apr 21, 11:04 AM
Fill out the pizza order form
https://httpbin.org/forms/post
1. set_value
Apr 21, 11:04 AM
page_map: 572 chars
https://httpbin.org/forms/post
2. click
Apr 21, 11:04 AM
Select pizza size medium
https://httpbin.org/forms/post
2. click
Apr 21, 11:04 AM
page_map: 473 chars
https://httpbin.org/forms/post
3. click
Apr 21, 11:04 AM
Select pizza topping bacon
https://httpbin.org/forms/post
3. click
Apr 21, 11:04 AM
page_map: 483 chars
https://httpbin.org/forms/post
4. set_value
Apr 21, 11:04 AM
Enter delivery time and instructions
https://httpbin.org/forms/post
4. set_value
Apr 21, 11:04 AM
page_map: 517 chars
https://httpbin.org/forms/post
5. click
Apr 21, 11:04 AM
Submit the form
https://httpbin.org/forms/post
5. click
Apr 21, 11:04 AM
page_map: 1570 chars
https://httpbin.org/forms/post
6. end
Apr 21, 11:04 AM
{"error":"none","notes":"The form was successfully submitted and the echoed form data is displayed on the page. I am extracting the 'form' object as 'submitted_form'.","result":...
https://httpbin.org/forms/post
Evaluation
Passed all required checks
Final output matched the expected result.
passed
Failure reason
--
Failed checkpoint
--
Checkpoint summary
0 passed • 0 failed
Evaluated
Apr 21, 11:04 AM
Raw evaluation payload
{
  "submit": {
    "eval_id": "run_ff08fe1b9f3c96ff",
    "run_id": "run_ff08fe1b9f3c96ff",
    "status": "success",
    "note": "Final output matched the expected result."
  },
  "detail": {
    "eval": {
      "run_id": "run_ff08fe1b9f3c96ff",
      "case_id": "wf_httpbin_form_post",
      "case_version": 1,
      "status": "passed",
      "agent_name": "TinyFish",
      "last_evaluation": {
        "event_id": "evt_d2ef0b0faa2d6296",
        "status": "passed",
        "note": "Final output matched the expected result.",
        "evaluated_at": "2026-04-21T11:04:33.371749221Z"
      },
      "created_at": "2026-04-21T11:04:33.132321656Z",
      "updated_at": "2026-04-21T11:04:33.371750082Z",
      "completed_at": "2026-04-21T11:04:33.371750082Z"
    },
    "run": {
      "run_id": "run_ff08fe1b9f3c96ff",
      "case_id": "wf_httpbin_form_post",
      "case_version": 1,
      "status": "passed",
      "agent_name": "TinyFish",
      "last_evaluation": {
        "event_id": "evt_d2ef0b0faa2d6296",
        "status": "passed",
        "note": "Final output matched the expected result.",
        "evaluated_at": "2026-04-21T11:04:33.371749221Z"
      },
      "created_at": "2026-04-21T11:04:33.132321656Z",
      "updated_at": "2026-04-21T11:04:33.371750082Z",
      "completed_at": "2026-04-21T11:04:33.371750082Z"
    },
    "case": {
      "case_id": "wf_httpbin_form_post",
      "version": 1,
      "status": "active",
      "created_at": "2026-04-09T00:00:00Z",
      "updated_at": "2026-04-09T00:00:00Z",
      "title": "Submit the HTTPBin form and return the echoed submission payload",
      "task_description": "Open the form at https://httpbin.org/forms/post, fill it with the provided input data, submit it, and return the echoed submitted_form object.",
      "domains": [
        "httpbin.org"
      ],
      "tags": [
        "forms",
        "httpbin",
        "submission",
        "echo"
      ],
      "difficulty": "easy",
      "locale": "en-US",
      "input_spec": {
        "description": "Use these exact values when filling the form.",
        "schema": {
          "type": "object",
          "required": [
            "custname",
            "custtel",
            "custemail",
            "size",
            "topping",
            "delivery",
            "comments"
          ],
          "properties": {
            "comments": {
              "type": "string"
            },
            "custemail": {
              "type": "string"
            },
            "custname": {
              "type": "string"
            },
            "custtel": {
              "type": "string"
            },
            "delivery": {
              "type": "string"
            },
            "size": {
              "type": "string"
            },
            "topping": {
              "type": "string"
            }
          }
        },
        "data": {
          "comments": "Leave at reception",
          "custemail": "ada@example.com",
          "custname": "Ada Lovelace",
          "custtel": "+1-415-555-0136",
          "delivery": "19:45",
          "size": "medium",
          "topping": "bacon"
        }
      },
      "output_schema": {
        "type": "object",
        "required": [
          "submitted_form"
        ],
        "properties": {
          "submission_url": {
            "type": "string",
            "format": "uri",
            "optional": true
          },
          "submitted_form": {
            "type": "object",
            "required": [
              "custname",
              "custtel",
              "custemail",
              "size",
              "topping",
              "delivery",
              "comments"
            ],
            "properties": {
              "comments": {
                "type": "string"
              },
              "custemail": {
                "type": "string"
              },
              "custname": {
                "type": "string"
              },
              "custtel": {
                "type": "string"
              },
              "delivery": {
                "type": "string"
              },
              "size": {
                "type": "string"
              },
              "topping": {
                "type": "string"
              }
            }
          }
        }
      },
      "entrypoint": {
        "mode": "url",
        "url": "https://httpbin.org/forms/post"
      },
      "run_policy": {
        "timeout_seconds": 300
      }
    },
    "events": [
      {
        "event_id": "evt_d2ef0b0faa2d6296",
        "run_id": "run_ff08fe1b9f3c96ff",
        "event_type": "final",
        "payload": {
          "event_type": "final",
          "timestamp": "2026-04-21T11:04:33.134412437Z",
          "page_state": {},
          "intermediate_data": {
            "raw_submission": {
              "submission_url": "https://httpbin.org/post",
              "submitted_form": {
                "comments": "Leave at reception",
                "custemail": "ada@example.com",
                "custname": "Ada Lovelace",
                "custtel": "+1-415-555-0136",
                "delivery": "19:45",
                "size": "medium",
                "topping": "bacon"
              }
            }
          },
          "final_output": {
            "submission_url": "https://httpbin.org/post",
            "submitted_form": {
              "comments": "Leave at reception",
              "custemail": "ada@example.com",
              "custname": "Ada Lovelace",
              "custtel": "+1-415-555-0136",
              "delivery": "19:45",
              "size": "medium",
              "topping": "bacon"
            }
          },
          "artifacts": {},
          "agent": {}
        },
        "created_at": "2026-04-21T11:04:33.134579912Z"
      }
    ],
    "timeout_seconds": 300,
    "deadline_at": "2026-04-21T11:09:33.132321656Z"
  }
}
Provider terminal payload
Raw terminal result retained from the provider adapter.
Expand
{
  "answer": "{\n  \"submission_url\": \"https://httpbin.org/post\",\n  \"submitted_form\": {\n    \"comments\": \"Leave at reception\",\n    \"custemail\": \"ada@example.com\",\n    \"custname\": \"Ada Lovelace\",\n    \"custtel\": \"+1-415-555-0136\",\n    \"delivery\": \"19:45\",\n    \"size\": \"medium\",\n    \"topping\": \"bacon\"\n  }\n}",
  "data": {
    "error": "none",
    "notes": "The form was successfully submitted and the echoed form data is displayed on the page. I am extracting the 'form' object as 'submitted_form'.",
    "result": "{\n  \"submission_url\": \"https://httpbin.org/post\",\n  \"submitted_form\": {\n    \"comments\": \"Leave at reception\",\n    \"custemail\": \"ada@example.com\",\n    \"custname\": \"Ada Lovelace\",\n    \"custtel\": \"+1-415-555-0136\",\n    \"delivery\": \"19:45\",\n    \"size\": \"medium\",\n    \"topping\": \"bacon\"\n  }\n}",
    "success": true
  },
  "durationMs": 11631,
  "error": "none",
  "inputTokens": 8205,
  "llmDurationMs": 8637,
  "model": "gemini-3-flash-preview",
  "notes": "The form was successfully submitted and the echoed form data is displayed on the page. I am extracting the 'form' object as 'submitted_form'.",
  "outputTokens": 561,
  "recordingId": "96eea7f4-23e4-4012-8983-0526d1a6a04f",
  "result": "{\n  \"submission_url\": \"https://httpbin.org/post\",\n  \"submitted_form\": {\n    \"comments\": \"Leave at reception\",\n    \"custemail\": \"ada@example.com\",\n    \"custname\": \"Ada Lovelace\",\n    \"custtel\": \"+1-415-555-0136\",\n    \"delivery\": \"19:45\",\n    \"size\": \"medium\",\n    \"topping\": \"bacon\"\n  }\n}",
  "success": true,
  "thinkingTokens": 0,
  "totalSteps": 6,
  "totalTokens": 8766,
  "type": "done"
}