Purpose: Validate that ContextScout correctly discovers context files, uses appropriate tools, and handles various request types.
Created: 2026-01-09
Status: Ready to Run
File: 01-code-standards-discovery.yaml
Validates:
.opencode/context/core/standards/code-quality.md
Expected: ✅ Finds code-quality.md with ⭐⭐⭐⭐⭐ priority
File: 02-domain-specific-discovery.yaml
Validates:
Expected: ✅ Finds evals.md and related files with correct priorities
File: 03-bad-request-handling.yaml
Validates:
Expected: ✅ Reports "no files found" without fabricating paths
File: 04-multi-domain-comprehensive.yaml
Validates:
Expected: ✅ Finds 4-5 files across domains with correct priorities
File: 05-tool-usage-validation.yaml
Validates:
Expected: ✅ Only uses read/glob/grep, never write/edit/bash
# Run all ContextScout tests
cd evals/framework
npm run eval:sdk -- --agent=ContextScout
# Test 1: Code standards discovery
npm run eval:sdk -- --agent=ContextScout --pattern="01-code-standards-discovery.yaml"
# Test 2: Domain-specific discovery
npm run eval:sdk -- --agent=ContextScout --pattern="02-domain-specific-discovery.yaml"
# Test 3: Bad request handling
npm run eval:sdk -- --agent=ContextScout --pattern="03-bad-request-handling.yaml"
# Test 4: Multi-domain comprehensive
npm run eval:sdk -- --agent=ContextScout --pattern="04-multi-domain-comprehensive.yaml"
# Test 5: Tool usage validation
npm run eval:sdk -- --agent=ContextScout --pattern="05-tool-usage-validation.yaml"
# Run with debug output
npm run eval:sdk -- --agent=ContextScout --debug
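The per-test commands above differ only in the file name, so they can be driven by a loop. The following is a minimal sketch, assuming it is run from `evals/framework`; the `run_contextscout_tests` function and the `DRY_RUN` variable are hypothetical names for illustration, not part of the eval framework.

```shell
# Hypothetical helper: run each ContextScout test file in turn.
# With DRY_RUN=1 the commands are printed instead of executed.
run_contextscout_tests() {
  failed=0
  for test_file in \
    01-code-standards-discovery.yaml \
    02-domain-specific-discovery.yaml \
    03-bad-request-handling.yaml \
    04-multi-domain-comprehensive.yaml \
    05-tool-usage-validation.yaml
  do
    if [ "${DRY_RUN:-0}" = "1" ]; then
      # Print the command that would be run
      echo "npm run eval:sdk -- --agent=ContextScout --pattern=\"${test_file}\""
    else
      # Keep going after a failure, but remember that one occurred
      npm run eval:sdk -- --agent=ContextScout --pattern="${test_file}" || failed=1
    fi
  done
  return "$failed"
}
```

Calling `DRY_RUN=1 run_contextscout_tests` prints the five commands without running them, which is a cheap way to check the file names before a real run.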
✅ PASS
- Used glob to search for "code" and "standards"
- Found: .opencode/context/core/standards/code-quality.md
- Priority: ⭐⭐⭐⭐⭐ (critical)
- Included line ranges for key sections
- No write/edit/bash tools used
✅ PASS
- Checked navigation.md first
- Found: .opencode/context/openagents-repo/core-concepts/evals.md (⭐⭐⭐⭐⭐)
- Also found: guides/testing-agent.md (⭐⭐⭐⭐)
- Prioritized domain-specific over generic
- Provided loading strategy
✅ PASS
- Used glob to search for "quantum blockchain AI"
- Found no relevant files
- Reported honestly: "No context files found for this topic"
- Suggested alternatives: "Available topics: agents, evals, registry..."
- Did NOT fabricate paths
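One way to spot fabricated paths by hand is to check that every path the agent reported exists on disk. The sketch below assumes the reported paths have already been collected as plain strings; `find_fabricated_paths` is a hypothetical helper, and the eval framework may perform this check differently.

```shell
# Hypothetical helper: print each argument that is not an existing file.
# Returns 0 if every path exists, 1 if at least one is missing.
find_fabricated_paths() {
  missing=0
  for candidate in "$@"; do
    if [ ! -f "$candidate" ]; then
      echo "missing: $candidate"
      missing=1
    fi
  done
  return "$missing"
}
```

For example, `find_fabricated_paths .opencode/context/core/standards/code-quality.md` run from the repository root should print nothing if that file is present.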
✅ PASS
- Found 5 files across domains:
  1. ⭐⭐⭐⭐⭐ guides/adding-agent.md
  2. ⭐⭐⭐⭐⭐ core-concepts/agents.md
  3. ⭐⭐⭐⭐ standards/code-quality.md
  4. ⭐⭐⭐⭐ standards/test-coverage.md
  5. ⭐⭐⭐⭐ core-concepts/evals.md
- Correct priority order
- Provided loading strategy
✅ PASS
- Used glob for discovery
- Used read for content
- Did NOT use write/edit/bash
- Respected read-only constraints
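The read-only constraint amounts to an allowlist check over the names of the tools the agent invoked. The sketch below assumes those names are available as plain strings; `check_read_only` is a hypothetical helper, and how the eval framework actually records tool calls is not shown here.

```shell
# Hypothetical helper: succeed only if every tool name passed as an
# argument is in the read-only allowlist (read, glob, grep).
check_read_only() {
  for tool in "$@"; do
    case "$tool" in
      read|glob|grep) ;;   # allowed, keep checking
      *)
        echo "forbidden tool: $tool"
        return 1
        ;;
    esac
  done
  echo "ok"
}
```

`check_read_only glob read` prints `ok`, while `check_read_only glob bash` prints `forbidden tool: bash` and returns a non-zero status.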
Problem: ContextScout didn't find code-quality.md
Check:
Problem: ContextScout didn't find eval context
Check:
Problem: ContextScout fabricated files or crashed
Check:
Problem: ContextScout missed files or wrong priorities
Check:
Problem: ContextScout used forbidden tools
Check:
Key Insight: ContextScout must be a reliable, read-only discovery agent: it uses only the appropriate tools (glob/read/grep), finds the correct files, and handles errors gracefully. These tests validate each of those behaviors.