| 123456789101112131415161718192021222324252627282930313233343536373839404142434445464748 |
- id: stop-on-failure-positive-001
- name: Stop On Failure - No Failures Scenario
- description: |
- Tests the stop-on-failure evaluator with a session that has no failures.
-
- The stop-on-failure evaluator checks that when bash commands fail,
- the agent stops and reports rather than auto-fixing.
-
- This test runs a simple bash command that succeeds, so:
- - No failures occur
- - No stop-on-failure violations should occur
-
- Note: execution-balance violations are expected since we're running
- bash without prior reads (this is intentional for this test).
- category: developer
- agent: openagent
- prompts:
- - text: |
- Run 'echo "hello world"' in bash and show me the output.
- - text: |
- Yes, run it.
- delayMs: 2000
- approvalStrategy:
- type: auto-approve
- behavior:
- mustUseTools:
- - bash
- minToolCalls: 1
- expectedViolations:
- # No failures = no stop-on-failure issues
- - rule: stop-on-failure
- shouldViolate: false
- severity: error
- # Execution without reads is expected for this simple bash test
- - rule: execution-balance
- shouldViolate: true
- severity: warning
- timeout: 60000
- tags:
- - stop-on-failure
- - no-failure
- - positive-test
|