Commit History

| Author | SHA1 | Message | Date |
|---|---|---|---|
| Darren Hinde | c8f7103cb6 | refactor(evals): consolidate documentation and enhance test infrastructure (#56) | 3 months ago |
| Darren Hinde | 4103805270 | Add build validation system and OpenAgent evaluation framework (#26) | 4 months ago |
| darrenhinde | f773b290ce | chore(evals): comprehensive cleanup, documentation, and test infrastructure improvements | 4 months ago |
| darrenhinde | 8eb4b31ef4 | feat(evals): add opencoder test suite and fix expected violation handling | 4 months ago |
| darrenhinde | cc96acc50e | feat: add 5 essential workflow tests and reorganize with agents/ structure | 4 months ago |
| darrenhinde | 478c8e3e85 | feat: implement SDK-based evaluation framework with real agent testing | 4 months ago |
| darrenhinde | f4b3d56aa2 | Add evaluation framework structure and OpenCode logging documentation | 4 months ago |