06-delegation-decision.yaml 1.1 KB

1234567891011121314151617181920212223242526272829303132333435363738394041424344454647
  1. id: golden-06-task-breakdown
  2. name: "Golden 06: Task Breakdown - Agent Structures Complex Work"
  3. description: |
  4. Tests that the agent can break down a complex task into steps.
  5. A quality agent should:
  6. 1. Recognize this requires multiple steps
  7. 2. Read relevant context/standards first
  8. 3. Propose a structured plan
  9. This test asks the agent to read project standards before
  10. implementing something - a common pattern for quality work.
  11. Validates:
  12. - Agent reads context before proposing implementation
  13. - Agent structures its approach
  14. - context-loading evaluator works correctly
  15. category: developer
  16. prompts:
  17. - text: |
  18. Read the coding standards for this project from .opencode/context/core/standards/code.md and summarize the key points.
  19. approvalStrategy:
  20. type: auto-approve
  21. behavior:
  22. mustUseTools:
  23. - read
  24. minToolCalls: 1
  25. requiresContext: true
  26. expectedViolations:
  27. - rule: context-loading
  28. shouldViolate: false
  29. severity: error
  30. - rule: approval-gate
  31. shouldViolate: false
  32. severity: error
  33. timeout: 60000
  34. tags:
  35. - golden
  36. - task-breakdown
  37. - context-awareness
  38. - safe