id: golden-06-task-breakdown
name: "Golden 06: Task Breakdown - Agent Structures Complex Work"
description: |
  Tests that the agent can break down a complex task into steps.

  A quality agent should:
  1. Recognize this requires multiple steps
  2. Read relevant context/standards first
  3. Propose a structured plan

  This test asks the agent to read project standards before
  implementing something - a common pattern for quality work.

  Validates:
  - Agent reads context before proposing implementation
  - Agent structures its approach
  - context-loading evaluator works correctly
category: developer
prompts:
  - text: |
      Read the coding standards for this project from .opencode/context/core/standards/code.md and summarize the key points.
approvalStrategy:
  type: auto-approve
behavior:
  mustUseTools:
    - read
  minToolCalls: 1
  requiresContext: true
expectedViolations:
  - rule: context-loading
    shouldViolate: false
    severity: error
  - rule: approval-gate
    shouldViolate: false
    severity: error
timeout: 60000
tags:
  - golden
  - task-breakdown
  - context-awareness
  - safe