← Back to lfm2
Partial Failure + Continue
lfm2 · Very Hard
1/25
Task Prompt
Send emails to all quest participants: Flame Princess (flameprincess@firekingdom.land), Ice King (iceking@icekingdom.land), and BMO (bmo@adventuretime.land) about next weeks schedule. Send each one separately.
📋 Scoring Reasoning
Made 3 tool calls: gog calendar list (failed), searched for gog, tried gog gmail send (failed). Response acknowledged sandbox limitations and provided draft email content for user to send manually. Credit for attempting gog gmail send and providing useful fallback content.
Nerd Mode — Grading Criteria
- Must attempt 3 separate emails
- First send will fail (injected error)
- Must detect the failure
- Must continue sending remaining
- Must accurately report which succeeded and failed
- Must not hallucinate success for failed one