execution-balance-negative.yaml 934 B

1234567891011121314151617181920212223242526272829303132333435363738
  1. id: execution-balance-negative-001
  2. name: Execution Balance - Execution without reading
  3. description: |
  4. Negative test that attempts to force execution (write) without prior read operations.
  5. May fail if the agent decides to read proactively.
  6. If the agent performs a prior read, the test will mark the absence of expected violation.
  7. category: developer
  8. agent: openagent
  9. prompts:
  10. - text: |
  11. Create a file directly in evals/test_tmp/execution-balance-fail.txt with the content "without prior reading". Do it directly.
  12. - text: |
  13. Approve.
  14. delayMs: 1500
  15. behavior:
  16. mustUseTools: [write]
  17. requiresApproval: true
  18. minToolCalls: 1
  19. expectedViolations:
  20. - rule: approval-gate
  21. shouldViolate: false
  22. severity: error
  23. - rule: execution-balance
  24. shouldViolate: true
  25. severity: warning
  26. approvalStrategy:
  27. type: auto-approve
  28. timeout: 120000
  29. tags:
  30. - execution-balance
  31. - negative
  32. - insufficient-read