Darren Hinde c8f7103cb6 refactor(evals): consolidate documentation and enhance test infrastructure (#56) 4 months ago
..
01-smoke-test.yaml c8f7103cb6 refactor(evals): consolidate documentation and enhance test infrastructure (#56) 4 months ago
02-delegation-complex-task.yaml c8f7103cb6 refactor(evals): consolidate documentation and enhance test infrastructure (#56) 4 months ago
03-delegation-simple-task.yaml c8f7103cb6 refactor(evals): consolidate documentation and enhance test infrastructure (#56) 4 months ago
04-context-awareness.yaml c8f7103cb6 refactor(evals): consolidate documentation and enhance test infrastructure (#56) 4 months ago
05-delegation-with-context.yaml c8f7103cb6 refactor(evals): consolidate documentation and enhance test infrastructure (#56) 4 months ago
06-context-loading-execution.yaml c8f7103cb6 refactor(evals): consolidate documentation and enhance test infrastructure (#56) 4 months ago
07-subagent-invocation-execution.yaml c8f7103cb6 refactor(evals): consolidate documentation and enhance test infrastructure (#56) 4 months ago
08-context-in-delegation-execution.yaml c8f7103cb6 refactor(evals): consolidate documentation and enhance test infrastructure (#56) 4 months ago