wf_httpbin_form_postv1httpbin.orgeasy
Submit the HTTPBin form and return the echoed submission payload
timeout 300s • started Jun 20, 4:44 PM
Cohort status
completed
Attempts
1
Pass rate
100.0%
Live attempts
0
Cohort verdicts
TinyFishpassed
Task prompt
Collapsed preview. Expand for the full prompt.
Expand
Benchmark case: Submit the HTTPBin form and return the echoed submission payload Goal: Open the form at https://httpbin.org/forms/post, fill it with the provided input data, submit it, and return the echoed submitted_form object. Preferred start URL: https://httpbin.org/forms/post Input guidance: Use these exact values when filling the form. Structured input data: { "comments": "Leave at reception", "custemail": "ada@example.com", "custname": "Ada Lovelace", "custtel": "+1-415-555-0136", "delivery": "19:45", "size": "medium", "topping": "bacon" } Return a final JSON object matching this schema: { "type": "object", "required": [ "submitted_form" ], "properties": { "submission_url": { "type": "string", "format": "uri", "optional": true }, "submitted_form": { "type": "object", "required": [ "custname", "custtel", "custemail", "size", "topping", "delivery", "comments" ], "properties": { "comments": { "type": "string" }, "custemail": { "type": "string" }, "custname": { "type": "string" }, "custtel": { "type": "string" }, "delivery": { "type": "string" }, "size": { "type": "string" }, "topping": { "type": "string" } } } } } Work the website directly. Prefer exact values over paraphrases. Return only the final structured answer when done.
Benchmark case: Submit the HTTPBin form and return the echoed submission payload

Goal: Open the form at https://httpbin.org/forms/post, fill it with the provided input data, submit it, and return the echoed submitted_form object.

Preferred start URL: https://httpbin.org/forms/post

Input guidance: Use these exact values when filling the form.

Structured input data:
{
  "comments": "Leave at reception",
  "custemail": "ada@example.com",
  "custname": "Ada Lovelace",
  "custtel": "+1-415-555-0136",
  "delivery": "19:45",
  "size": "medium",
  "topping": "bacon"
}

Return a final JSON object matching this schema:
{
  "type": "object",
  "required": [
    "submitted_form"
  ],
  "properties": {
    "submission_url": {
      "type": "string",
      "format": "uri",
      "optional": true
    },
    "submitted_form": {
      "type": "object",
      "required": [
        "custname",
        "custtel",
        "custemail",
        "size",
        "topping",
        "delivery",
        "comments"
      ],
      "properties": {
        "comments": {
          "type": "string"
        },
        "custemail": {
          "type": "string"
        },
        "custname": {
          "type": "string"
        },
        "custtel": {
          "type": "string"
        },
        "delivery": {
          "type": "string"
        },
        "size": {
          "type": "string"
        },
        "topping": {
          "type": "string"
        }
      }
    }
  }
}

Work the website directly. Prefer exact values over paraphrases. Return only the final structured answer when done.
Benchmark contract
Frozen case metadata for this cohort
This is the versioned case payload the arena executed and later submitted to tinyeval.
Case ID
wf_httpbin_form_post
Version
1
Domain
httpbin.org
Difficulty
easy
Timeout
300s
Entrypoint
https://httpbin.org/forms/post
Structured input data
{
  "comments": "Leave at reception",
  "custemail": "ada@example.com",
  "custname": "Ada Lovelace",
  "custtel": "+1-415-555-0136",
  "delivery": "19:45",
  "size": "medium",
  "topping": "bacon"
}
Output schema
{
  "type": "object",
  "required": [
    "submitted_form"
  ],
  "properties": {
    "submission_url": {
      "type": "string",
      "format": "uri",
      "optional": true
    },
    "submitted_form": {
      "type": "object",
      "required": [
        "custname",
        "custtel",
        "custemail",
        "size",
        "topping",
        "delivery",
        "comments"
      ],
      "properties": {
        "comments": {
          "type": "string"
        },
        "custemail": {
          "type": "string"
        },
        "custname": {
          "type": "string"
        },
        "custtel": {
          "type": "string"
        },
        "delivery": {
          "type": "string"
        },
        "size": {
          "type": "string"
        },
        "topping": {
          "type": "string"
        }
      }
    }
  }
}
TinyFish
Eval status: passed
completed
Success
100.0%
Latency
48.76s
Cost
$0.00
Tokens
73,111
Steps
11
Current URL
https://httpbin.org/forms/post
Submitted output
Exact structured payload submitted to tinyeval for scoring.
Expand
{
  "submission_url": "https://httpbin.org/post",
  "submitted_form": {
    "comments": "Leave at reception",
    "custemail": "ada@example.com",
    "custname": "Ada Lovelace",
    "custtel": "+1-415-555-0136",
    "delivery": "19:45",
    "size": "medium",
    "topping": "bacon"
  }
}
Output
{"submission_url":"https://httpbin.org/post","submitted_form":{"comments":"Leave at reception","custemail":"ada@example.com","custname":"Ada Lovelace","custtel":"+1-415-555-0136","delivery":"19:45","size":"medium","topping":"bacon"}}
Live viewer
Viewer URL retained for this attempt. Availability depends on the underlying provider session.
Open viewer
Agent steps
21 logged steps. Expand for the full timeline.
Expand
1. set_value
Jun 20, 4:44 PM
Fill text fields and delivery time
https://httpbin.org/forms/post
1. set_value
Jun 20, 4:44 PM
page_map: 690 chars
https://httpbin.org/forms/post
2. click
Jun 20, 4:44 PM
Select size medium and topping bacon
https://httpbin.org/forms/post
2. click
Jun 20, 4:44 PM
page_map: 495 chars
https://httpbin.org/forms/post
3. click
Jun 20, 4:44 PM
Select topping bacon and submit
https://httpbin.org/forms/post
3. click
Jun 20, 4:44 PM
page_map: 505 chars
https://httpbin.org/forms/post
4. click
Jun 20, 4:44 PM
Submit the form
https://httpbin.org/forms/post
4. click
Jun 20, 4:45 PM
page_map: 264 chars
https://httpbin.org/forms/post
5. wait
Jun 20, 4:45 PM
Wait for 502 recovery
https://httpbin.org/forms/post
5. wait
Jun 20, 4:45 PM
page_map: 264 chars
https://httpbin.org/forms/post
6. navigate_to
Jun 20, 4:45 PM
Return to form to retry submission
https://httpbin.org/forms/post
6. navigate_to
Jun 20, 4:45 PM
page_map: 315 chars
https://httpbin.org/forms/post
7. set_value
Jun 20, 4:45 PM
Refilling text and time fields
https://httpbin.org/forms/post
7. set_value
Jun 20, 4:45 PM
page_map: 698 chars
https://httpbin.org/forms/post
8. click
Jun 20, 4:45 PM
Selecting size and topping
https://httpbin.org/forms/post
8. click
Jun 20, 4:45 PM
page_map: 503 chars
https://httpbin.org/forms/post
9. click
Jun 20, 4:45 PM
Selecting bacon topping
https://httpbin.org/forms/post
9. click
Jun 20, 4:45 PM
page_map: 513 chars
https://httpbin.org/forms/post
10. click
Jun 20, 4:45 PM
Submit the order
https://httpbin.org/forms/post
10. click
Jun 20, 4:45 PM
page_map: 1683 chars
https://httpbin.org/forms/post
11. end
Jun 20, 4:45 PM
{"error":"none","notes":"Form submitted successfully and echoed payload extracted from the JSON response page.","result":"{\"submission_url\":\"https://httpbin.org/post\",\"subm...
https://httpbin.org/forms/post
Evaluation
Passed all required checks
Final output matched the expected result.
passed
Failure reason
--
Failed checkpoint
--
Checkpoint summary
0 passed • 0 failed
Evaluated
Jun 20, 4:45 PM
Raw evaluation payload
{
  "submit": {
    "eval_id": "run_247c6f2942023d97",
    "run_id": "run_247c6f2942023d97",
    "status": "success",
    "note": "Final output matched the expected result."
  },
  "detail": {
    "eval": {
      "run_id": "run_247c6f2942023d97",
      "case_id": "wf_httpbin_form_post",
      "case_version": 1,
      "status": "passed",
      "agent_name": "TinyFish",
      "last_evaluation": {
        "event_id": "evt_6d5effd1d82fdd1d",
        "status": "passed",
        "note": "Final output matched the expected result.",
        "evaluated_at": "2026-06-20T16:45:25.48015169Z"
      },
      "created_at": "2026-06-20T16:45:25.237268606Z",
      "updated_at": "2026-06-20T16:45:25.480152686Z",
      "completed_at": "2026-06-20T16:45:25.480152686Z"
    },
    "run": {
      "run_id": "run_247c6f2942023d97",
      "case_id": "wf_httpbin_form_post",
      "case_version": 1,
      "status": "passed",
      "agent_name": "TinyFish",
      "last_evaluation": {
        "event_id": "evt_6d5effd1d82fdd1d",
        "status": "passed",
        "note": "Final output matched the expected result.",
        "evaluated_at": "2026-06-20T16:45:25.48015169Z"
      },
      "created_at": "2026-06-20T16:45:25.237268606Z",
      "updated_at": "2026-06-20T16:45:25.480152686Z",
      "completed_at": "2026-06-20T16:45:25.480152686Z"
    },
    "case": {
      "case_id": "wf_httpbin_form_post",
      "version": 1,
      "status": "active",
      "created_at": "2026-04-09T00:00:00Z",
      "updated_at": "2026-04-09T00:00:00Z",
      "title": "Submit the HTTPBin form and return the echoed submission payload",
      "task_description": "Open the form at https://httpbin.org/forms/post, fill it with the provided input data, submit it, and return the echoed submitted_form object.",
      "domains": [
        "httpbin.org"
      ],
      "tags": [
        "forms",
        "httpbin",
        "submission",
        "echo"
      ],
      "difficulty": "easy",
      "locale": "en-US",
      "input_spec": {
        "description": "Use these exact values when filling the form.",
        "schema": {
          "type": "object",
          "required": [
            "custname",
            "custtel",
            "custemail",
            "size",
            "topping",
            "delivery",
            "comments"
          ],
          "properties": {
            "comments": {
              "type": "string"
            },
            "custemail": {
              "type": "string"
            },
            "custname": {
              "type": "string"
            },
            "custtel": {
              "type": "string"
            },
            "delivery": {
              "type": "string"
            },
            "size": {
              "type": "string"
            },
            "topping": {
              "type": "string"
            }
          }
        },
        "data": {
          "comments": "Leave at reception",
          "custemail": "ada@example.com",
          "custname": "Ada Lovelace",
          "custtel": "+1-415-555-0136",
          "delivery": "19:45",
          "size": "medium",
          "topping": "bacon"
        }
      },
      "output_schema": {
        "type": "object",
        "required": [
          "submitted_form"
        ],
        "properties": {
          "submission_url": {
            "type": "string",
            "format": "uri",
            "optional": true
          },
          "submitted_form": {
            "type": "object",
            "required": [
              "custname",
              "custtel",
              "custemail",
              "size",
              "topping",
              "delivery",
              "comments"
            ],
            "properties": {
              "comments": {
                "type": "string"
              },
              "custemail": {
                "type": "string"
              },
              "custname": {
                "type": "string"
              },
              "custtel": {
                "type": "string"
              },
              "delivery": {
                "type": "string"
              },
              "size": {
                "type": "string"
              },
              "topping": {
                "type": "string"
              }
            }
          }
        }
      },
      "entrypoint": {
        "mode": "url",
        "url": "https://httpbin.org/forms/post"
      },
      "run_policy": {
        "timeout_seconds": 300
      }
    },
    "events": [
      {
        "event_id": "evt_6d5effd1d82fdd1d",
        "run_id": "run_247c6f2942023d97",
        "event_type": "final",
        "payload": {
          "event_type": "final",
          "timestamp": "2026-06-20T16:45:25.240703664Z",
          "page_state": {},
          "intermediate_data": {
            "raw_submission": {
              "submission_url": "https://httpbin.org/post",
              "submitted_form": {
                "comments": "Leave at reception",
                "custemail": "ada@example.com",
                "custname": "Ada Lovelace",
                "custtel": "+1-415-555-0136",
                "delivery": "19:45",
                "size": "medium",
                "topping": "bacon"
              }
            }
          },
          "final_output": {
            "submission_url": "https://httpbin.org/post",
            "submitted_form": {
              "comments": "Leave at reception",
              "custemail": "ada@example.com",
              "custname": "Ada Lovelace",
              "custtel": "+1-415-555-0136",
              "delivery": "19:45",
              "size": "medium",
              "topping": "bacon"
            }
          },
          "artifacts": {},
          "agent": {}
        },
        "created_at": "2026-06-20T16:45:25.24090528Z"
      }
    ],
    "timeout_seconds": 300,
    "deadline_at": "2026-06-20T16:50:25.237268606Z"
  }
}
Provider terminal payload
Raw terminal result retained from the provider adapter.
Expand
{
  "answer": "{\"submission_url\":\"https://httpbin.org/post\",\"submitted_form\":{\"comments\":\"Leave at reception\",\"custemail\":\"ada@example.com\",\"custname\":\"Ada Lovelace\",\"custtel\":\"+1-415-555-0136\",\"delivery\":\"19:45\",\"size\":\"medium\",\"topping\":\"bacon\"}}",
  "cachedInputTokens": 7505,
  "data": {
    "error": "none",
    "notes": "Form submitted successfully and echoed payload extracted from the JSON response page.",
    "result": "{\"submission_url\":\"https://httpbin.org/post\",\"submitted_form\":{\"comments\":\"Leave at reception\",\"custemail\":\"ada@example.com\",\"custname\":\"Ada Lovelace\",\"custtel\":\"+1-415-555-0136\",\"delivery\":\"19:45\",\"size\":\"medium\",\"topping\":\"bacon\"}}",
    "success": true
  },
  "durationMs": 48763,
  "error": "none",
  "inputTokens": 72130,
  "llmDurationMs": 16577,
  "llmErrorDurationMs": 0,
  "model": "gemini-3-flash-preview",
  "notes": "Form submitted successfully and echoed payload extracted from the JSON response page.",
  "outputTokens": 981,
  "recordingId": "",
  "result": "{\"submission_url\":\"https://httpbin.org/post\",\"submitted_form\":{\"comments\":\"Leave at reception\",\"custemail\":\"ada@example.com\",\"custname\":\"Ada Lovelace\",\"custtel\":\"+1-415-555-0136\",\"delivery\":\"19:45\",\"size\":\"medium\",\"topping\":\"bacon\"}}",
  "success": true,
  "thinkingTokens": 0,
  "totalSteps": 11,
  "totalTokens": 73111,
  "type": "done"
}