# Changelog All notable changes to this project will be documented in this file. ## [Unreleased] ## [0.5.2] - 2026-01-13 ### Changes - docs(core): refine project intelligence system and deprecate legacy context (#93) ## [0.5.0] - 2025-12-18 ### Changes - refactor(evals): consolidate documentation and enhance test infrastructure (#56) ### Added - **Explicit Context File Validation**: New `expectedContextFiles` field in test YAML files allows explicit specification of which context files the agent must read - Overrides auto-detection when specified - Uses flexible pattern matching (`includes()` or `endsWith()`) - Supports partial paths (e.g., `standards/code.md`) or full paths - See `evals/agents/shared/tests/EXPLICIT_CONTEXT_FILES.md` for detailed guide - Example test: `evals/agents/shared/tests/golden/02-context-loading-explicit.yaml` ### Changed - **Context Loading Evaluator**: Now accepts optional `BehaviorExpectation` config to support explicit file validation - Shows detection mode in evidence: "Explicit (from YAML test)" or "Auto-detect (from user message)" - Backward compatible - existing tests work unchanged ### Documentation - Added `evals/agents/shared/tests/EXPLICIT_CONTEXT_FILES.md` - Complete feature guide - Added `evals/PATTERN_MATCHING_GUIDE.md` - Pattern matching reference - Updated `evals/CREATING_TESTS.md` - Added `expectedContextFiles` documentation - Updated `evals/README.md` - Added new feature section The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/), and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html). ## [0.5.0] - 2025-12-10 ### Added - **Category-Based Agent Organization**: Agents now organized by domain in subdirectories - `core/` - Core system agents (openagent, opencoder) - `meta/` - Meta-level agents (system-builder) - `development/` - Development specialists (frontend-specialist, backend-specialist, devops-specialist, codebase-agent) - `content/` - Content creation agents (copywriter, technical-writer) - `data/` - Data and analysis agents (data-analyst) - `product/` - Product management agents (directory created, ready for agents) - `learning/` - Education and coaching agents (directory created, ready for agents) - **Category Metadata Files**: Each category has `0-category.json` documenting common subagents, tools, and context - **Subagent Organization**: 13 subagents organized into 4 categories (code, core, system-builder, utils) - **Local Registry Fallback**: Install script now supports local `registry.json` for development/testing - **Enhanced Registry Validation**: Added checks for duplicate IDs and category consistency - **Comprehensive Test Suite**: 15 automated validation tests for category system - **Audit Tools**: Scripts to verify migration completeness and system integrity ### Changed - **Agent File Structure**: Agents moved from flat structure to category-based organization - Example: `.opencode/agent/openagent.md` → `.opencode/agent/core/openagent.md` - Example: `.opencode/agent/opencoder.md` → `.opencode/agent/core/opencoder.md` - **Registry Schema**: Updated to include category-based paths for all agents - **Eval Framework**: Enhanced with path resolution supporting both agent IDs and category paths - `--agent=openagent` resolves to `core/openagent` (backward compatible) - `--agent=core/openagent` works directly (new format) - **Documentation**: Updated all docs to reference category-based structure - **Install Script**: Enhanced with local registry fallback for offline/development use ### Fixed - **Frontend Specialist**: Renamed `frontend-design-agent.md` to `frontend-specialist.md` for consistency - **Eval Directory Structure**: Archived legacy flat eval structure to `_archive/` for clarity - **Registry Validation**: Fixed validation script to handle category-based paths correctly ### Developer Experience - **Backward Compatibility**: Agent IDs like `openagent` still work via path resolution - **Local Testing**: No need to push to GitHub to test registry changes - **Clear Organization**: Agents grouped by domain make discovery easier - **Validation Tools**: Automated tests ensure system integrity ### Migration Notes - **Agent Paths**: Update references from flat to category-based paths - Old: `.opencode/agent/openagent.md` - New: `.opencode/agent/core/openagent.md` - **Eval Framework**: Both formats work due to path resolution - `--agent=openagent` (backward compatible) - `--agent=core/openagent` (new format) - **No Breaking Changes**: Path resolution maintains backward compatibility ### Technical Details - **Files Modified**: 14+ files updated for category structure - **Agents Organized**: 23 total agents (10 category agents, 13 subagents) - **Test Coverage**: 15/15 validation tests passing (100%) - **Audit Status**: 8/8 checks passing (100%) ## [0.3.1] - 2025-12-09 ### Fixed - CI: Check only commit title for skip patterns (#46) ## [0.0.2] - 2025-11-29 ### Added - New `ExecutionBalanceEvaluator` in `evals/framework` to assess read vs execution ordering and ratio - Contributor guide: `docs/contributing/ADDING_EVALUATOR.md` describing evaluator design principles - Test cases under `evals/agents/openagent/tests/10-execution-balance/` (positive & negative scenarios) ### Changed - Framework README updated with section documenting `ExecutionBalanceEvaluator` and violation codes ## [0.5.1] - 2025-12-31 ### Fixed - **Install Script Non-Interactive Bug**: Fixed critical bug where `curl | bash -s ` would fail with "Installation cancelled by user" when existing files were present - Root cause: Collision handling prompted for user input even in non-interactive mode - Solution: Auto-detect non-interactive mode and use "skip" strategy by default ### Added - **Installer CI Workflow**: New `.github/workflows/installer-checks.yml` runs on install.sh changes - ShellCheck static analysis - Bash syntax validation - Non-interactive mode tests - End-to-end installation tests - Profile smoke tests on Ubuntu and macOS - **Non-Interactive Tests**: New `scripts/tests/test-non-interactive.sh` validates piped execution - **E2E Installation Tests**: New `scripts/tests/test-e2e-install.sh` validates full installation workflow ### Changed - Updated `scripts/tests/README.md` with new test documentation ## [0.5.0] - 2025-12-18 ### Added - **Explicit Context File Validation**: New `expectedContextFiles` field in test YAML files - Overrides auto-detection when specified - Uses flexible pattern matching (`includes()` or `endsWith()`) - Supports partial paths (e.g., `standards/code.md`) or full paths ### Changed - **Context Loading Evaluator**: Now accepts optional `BehaviorExpectation` config - Shows detection mode in evidence: "Explicit (from YAML test)" or "Auto-detect (from user message)" - Backward compatible - existing tests work unchanged ### Documentation - Added `evals/agents/shared/tests/EXPLICIT_CONTEXT_FILES.md` - Complete feature guide - Added `evals/PATTERN_MATCHING_GUIDE.md` - Pattern matching reference - Updated `evals/CREATING_TESTS.md` and `evals/README.md` ## [0.4.0] - 2025-12-10 ### Added - **Category-Based Agent Organization**: Agents now organized by domain in subdirectories - `core/` - Core system agents (openagent, opencoder) - `meta/` - Meta-level agents (system-builder) - `development/` - Development specialists (frontend-specialist, backend-specialist, devops-specialist, codebase-agent) - `content/` - Content creation agents (copywriter, technical-writer) - `data/` - Data and analysis agents (data-analyst) - `product/` - Product management agents (directory created, ready for agents) - `learning/` - Education and coaching agents (directory created, ready for agents) - **Category Metadata Files**: Each category has `0-category.json` documenting common subagents, tools, and context - **Subagent Organization**: 13 subagents organized into 4 categories (code, core, system-builder, utils) - **Local Registry Fallback**: Install script now supports local `registry.json` for development/testing - **Enhanced Registry Validation**: Added checks for duplicate IDs and category consistency - **Comprehensive Test Suite**: 15 automated validation tests for category system - **Audit Tools**: Scripts to verify migration completeness and system integrity ### Changed - **Agent File Structure**: Agents moved from flat structure to category-based organization - Example: `.opencode/agent/openagent.md` → `.opencode/agent/core/openagent.md` - Example: `.opencode/agent/opencoder.md` → `.opencode/agent/core/opencoder.md` - **Registry Schema**: Updated to include category-based paths for all agents - **Eval Framework**: Enhanced with path resolution supporting both agent IDs and category paths - `--agent=openagent` resolves to `core/openagent` (backward compatible) - `--agent=core/openagent` works directly (new format) - **Documentation**: Updated all docs to reference category-based structure - **Install Script**: Enhanced with local registry fallback for offline/development use ### Fixed - **Frontend Specialist**: Renamed `frontend-design-agent.md` to `frontend-specialist.md` for consistency - **Eval Directory Structure**: Archived legacy flat eval structure to `_archive/` for clarity - **Registry Validation**: Fixed validation script to handle category-based paths correctly ### Developer Experience - **Backward Compatibility**: Agent IDs like `openagent` still work via path resolution - **Local Testing**: No need to push to GitHub to test registry changes - **Clear Organization**: Agents grouped by domain make discovery easier - **Validation Tools**: Automated tests ensure system integrity ### Migration Notes - **Agent Paths**: Update references from flat to category-based paths - Old: `.opencode/agent/openagent.md` - New: `.opencode/agent/core/openagent.md` - **Eval Framework**: Both formats work due to path resolution - `--agent=openagent` (backward compatible) - `--agent=core/openagent` (new format) - **No Breaking Changes**: Path resolution maintains backward compatibility ### Technical Details - **Files Modified**: 14+ files updated for category structure - **Agents Organized**: 23 total agents (10 category agents, 13 subagents) - **Test Coverage**: 15/15 validation tests passing (100%) - **Audit Status**: 8/8 checks passing (100%) ## [0.3.1] - 2025-12-09 ### Fixed - CI: Check only commit title for skip patterns (#46) ## [0.0.2] - 2025-11-29 ### Added - New `ExecutionBalanceEvaluator` in `evals/framework` to assess read vs execution ordering and ratio - Contributor guide: `docs/contributing/ADDING_EVALUATOR.md` describing evaluator design principles - Test cases under `evals/agents/openagent/tests/10-execution-balance/` (positive & negative scenarios) ### Changed - Framework README updated with section documenting `ExecutionBalanceEvaluator` and violation codes --- ## Version Format ``` v0.X.Y │ │ │ │ │ └─ Patch version (bug fixes, minor changes) │ └─── Minor version (new features, non-breaking changes) └───── Major version (breaking changes, major milestones) ``` ### Version History - **0.5.1** - Install script bug fix, CI improvements for installer - **0.5.0** - Explicit context file validation in evals - **0.4.0** - Category-based agent organization system - **0.3.1** - CI improvements - **0.0.2** - Execution balance evaluator