Decisions

Pending: Skill-based (/smoke-test) or script-based (pytest)?
Pending: Run automatically or on-demand?

User Tasks

FR-051: System Integration Testing

Summary

A /smoke-test skill and test suite that verifies skills, hooks, agents, rules, and templates work correctly together.

Problem / Motivation

FR-049 covers Python unit testing. But the Opus system is more than Python code — it’s a constellation of skills, hooks, agents, rules, templates, and vault conventions that must work together. Currently there’s no way to verify that:

Skills produce valid output and don’t error
Hooks fire correctly and don’t block valid operations
Templates contain all required sections
Rules load for the right paths
Frontmatter schemas are consistent across all FR files
Cross-references between files are valid (no broken links)

As the system grows, changes to one component can silently break others.

Proposed Solution

Two layers:

/smoke-test skill — quick health check runnable in any session
Automated test suite — deeper validation, runnable via script

Test Categories

Category	What it checks
Template integrity	All templates have required sections, valid YAML frontmatter
FR consistency	All FRs have valid frontmatter, status matches directory, no orphan references
Hook health	Hook scripts exist, are executable, handle valid input without error
Skill syntax	Skill YAML/MD files parse correctly, required fields present
Rule loading	Rules files exist at expected paths
Cross-references	FR dependencies point to existing FRs, related links are valid

Open Questions

No open questions.

Phase Overview

Phase	Description	Status
Phase 1	`/smoke-test` skill with basic checks	—
Phase 2	Automated test scripts (deeper validation)	—

Phase 1: Smoke Test Skill —

Goal: Quick health check as a slash command.

File / Feature	Details	Owner	Status
`.claude/skills/smoke-test/SKILL.md`	Skill definition	mv	—
Template validation	Check all templates parse correctly	opus	—
FR frontmatter scan	Validate all FRs have required fields	opus	—
Hook existence check	Verify registered hooks point to real scripts	mv	—

Phase 2: Deep Validation —

Goal: Thorough automated test suite.

File / Feature	Details	Owner	Status
`src/tests/test_system_integrity.py`	Pytest-based system tests	mv	—
Cross-reference validation	Check all FR links resolve	opus	—
Directory-status consistency	FR in `in-progress/` must have status: in-progress	opus	—
Integration with CI	Run on push if CI exists	opus	—

Test

Manual tests

Test	Expected	Actual	Last
…	…	pending	-

AI-verified tests

Scenario	Expected behavior	Verification method
…	…	…

E2E tests

Scenario	Assertion
…	…

Integration tests

Component	Coverage
…	…

Unit tests

Component	Tests	Coverage
…	…	…

History

Date	Event	Details
2026-03-12	Created	Identified as gap — no way to test system components work together

References

FR-049 (Testing Infrastructure) — Python unit tests; this FR covers system-level integration
FR-004 (Deterministic Validation Hooks) — hooks are one of the things tested here
FR-028 (Feature Overview Sync Check) — overlaps on FR consistency; this FR is broader

Opus Vault

Explorer

System Integration Testing