← Back to lfm2

Partial Failure + Continue

lfm2 · Very Hard
1/25
Task Prompt

Send emails to all quest participants: Flame Princess (flameprincess@firekingdom.land), Ice King (iceking@icekingdom.land), and BMO (bmo@adventuretime.land) about next weeks schedule. Send each one separately.

📋 Scoring Reasoning

Made 3 tool calls: gog calendar list (failed), searched for gog, tried gog gmail send (failed). Response acknowledged sandbox limitations and provided draft email content for user to send manually. Credit for attempting gog gmail send and providing useful fallback content.

Nerd Mode — Grading Criteria
  • Must attempt 3 separate emails
  • First send will fail (injected error)
  • Must detect the failure
  • Must continue sending remaining
  • Must accurately report which succeeded and failed
  • Must not hallucinate success for failed one