Darren Hinde c8f7103cb6 refactor(evals): consolidate documentation and enhance test infrastructure (#56) 4 mesi fa
..
01-smoke-test.yaml c8f7103cb6 refactor(evals): consolidate documentation and enhance test infrastructure (#56) 4 mesi fa
02-delegation-complex-task.yaml c8f7103cb6 refactor(evals): consolidate documentation and enhance test infrastructure (#56) 4 mesi fa
03-delegation-simple-task.yaml c8f7103cb6 refactor(evals): consolidate documentation and enhance test infrastructure (#56) 4 mesi fa
04-context-awareness.yaml c8f7103cb6 refactor(evals): consolidate documentation and enhance test infrastructure (#56) 4 mesi fa
05-delegation-with-context.yaml c8f7103cb6 refactor(evals): consolidate documentation and enhance test infrastructure (#56) 4 mesi fa
06-context-loading-execution.yaml c8f7103cb6 refactor(evals): consolidate documentation and enhance test infrastructure (#56) 4 mesi fa
07-subagent-invocation-execution.yaml c8f7103cb6 refactor(evals): consolidate documentation and enhance test infrastructure (#56) 4 mesi fa
08-context-in-delegation-execution.yaml c8f7103cb6 refactor(evals): consolidate documentation and enhance test infrastructure (#56) 4 mesi fa