nestharus/ADR.md

Created January 1, 2026 12:18

Star (0) You must be signed in to star a gist
Fork (0) You must be signed in to fork a gist

Select an option

Learn more about clone URLs
Clone this repository at <script src="https://gist.github.com/nestharus/b61dc15e37e6b2f2302cfa678718b1ff.js"></script>
Save nestharus/b61dc15e37e6b2f2302cfa678718b1ff to your computer and use it in GitHub Desktop.

Download ZIP

Planning Documents

Raw

ADR.md

ADR-### — {decision title}

Status: {Proposed | Accepted | Superseded | Deprecated} Date: YYYY-MM-DD Supersedes: ADR-### Superseded-by: ADR-###

ADRs record decisions about invariants, boundaries, and relationships. Do not specify concrete schema field/key names or data types; express impact as ID-level changes and link to PRD/Design Map IDs.

Context

Links: (GOAL-, INV-, SET-, ART-, COM-, CON-, IAR-, TASK-)
Drivers: (INV-, SET-, MET-__) (IDs only)

Decision

{one-line decision (state at invariant/boundary level; avoid schema field/key names and data types)}

Options considered

Option A — {short}
Option B — {short}

Consequences

Positive:
- ...
Negative:
- ...

References

PRD: (INV-, {DOMAIN}-)
Design Map: (COM-, CON-, ART-, IAR-, OBL-__)
Plan: (PHASE-, TASK-)

Raw

Component.md

Components (Packages) — Structure

Component packages are where components live. A component package groups:

COM-XX components (may be purely architectural composition of other COM-XX)
ALG-XX algorithms (Mermaid flowcharts + cross-references back to PRD IDs)
optional state holders (IAR-XX) used or owned by the component
CON-XX surfaces (contracts/boundaries) expressed as input/output invariants (not schemas)

This module defines the standard structure for component packages. It intentionally specifies invariants on inputs/outputs (what they MUST include / guarantee), not concrete schemas, field/key names, data types, or strict data "shapes".

Invariants

INV-COM-01 — PRD traceability: every COM-XX / ALG-XX / CON-XX / IAR-XX section MUST cite PRD IDs via Cross-references: or Implements:.
INV-COM-02 — Surfaces are invariant contracts: surfaces MUST be documented as input/output invariants per .tasks/processes/design map structure.md.
INV-COM-03 — Start with one component: begin with a single root component package that contains the full component graph and all algorithms; split only when the decomposition is clear.
INV-COM-04 — No shape embedding: do not embed schema definitions, field/key names, data types, or example payload shapes inside component packages; express "shape" requirements as invariant IDs and reference them.
INV-COM-05 — Architectural components allowed: a COM-XX may exist only to compose other components (no algorithms/state); it still participates in the graph and must reference relevant invariants and boundaries.

Root Component Package: `architecture.md`

The root component package is the starting point:

Contains the initial component graph and all ALG-XX algorithms
Defines the component's surfaces (CON-XX) and their input/output invariants
Acts as the index when additional component packages are created

When splitting into multiple components:

Create new component packages as com-XX-<kebab-name>.md
Move the relevant ALG-XX sections into the new package
Keep architecture.md as the root index + cross-component invariants + system diagram

Templates

Template: Root component package (`architecture.md`)

# Component: Architecture (root)

Sources: {links to PRD, Design Map, ADRs}

## Component and surface map (optional until split)

```mermaid
flowchart LR
  %% Components + boundaries; keep state-holder (`IAR-XX`) detail in the Design Map
  A["COM-__"] -->|CON-__| B["COM-__"]
```

## Surfaces (CON-XX)

### Contract: CON-__

Between: (COM-__, COM-__)
For: (ART-__/IAR-__)
Interaction: {request-response|pub-sub|batch|stream|filesystem|...}
Cross-references: (requires: INV-__; satisfies: SET-__; decided-by: ADR-###)

Input MUST include (IDs only; no schema fields/types):

- {ID} — {required semantic element / operational metadata / gating invariant}

Output MUST include (IDs only; no schema fields/types):

- {ID} — {required semantic element / operational metadata / gating invariant}

Boundary obligations:

- OBL-__ — {short obligation description}
  Cross-references: (requires: INV-__; satisfies: SET-__)

## Algorithms (ALG-XX)

### ALG-01: {algorithm name}

```mermaid
flowchart TD
  %% Cross-references: (requires: RULE-01, INV-02)
  A["Start"] --> B{"Decision?"}
  B -->|Yes| C["Action"]
  B -->|No| D["Other action"]
```

Template: Split component package (`com-XX-*.md`)

# Component: COM-__ — {component name}

Sources: {links}

## Surfaces (CON-XX)

### Contract: CON-__
... (same surface sections/invariants as above)

## Algorithms (ALG-XX)

### ALG-__: {algorithm name}
... (Mermaid flowchart; cite PRD IDs in Cross-references)

Reference Example (splitting)

For a concrete example of a root architecture.md plus split component package(s), see:

.tasks/plans/parallel linter/lint dispatcher/architecture.md
.tasks/plans/parallel linter/lint dispatcher/com-02-run-linters-orchestrator.md

Raw

Design.md

Design Map Structure

A Design Map defines components, state holders (IAR-XX), and contracts/boundaries (CON-XX), and maps them back to PRD IDs. Decision rationale must not be embedded in the Design Map; link to ADRs by ID only: (decided-by: ADR-###). ADRs live in .tasks/processes/adr/.

Invariants

INV-DM-01 — PRD traceability: every COM-XX / CON-XX / IAR-XX node MUST cite PRD IDs via Implements: or Cross-references:.
INV-DM-02 — Derived requirements allowed, PRD escalation only for ambiguity: Design Map nodes MAY introduce derived requirements discovered during exploration (pattern/boundary/ecosystem constraints). These derived requirements remain in the Design Map unless they cannot be resolved autonomously without an implicit choice. If autonomous derivation fails, the Design Map MUST surface the missing requirement(s) needed to decide (not the decision), and those missing requirements MUST be escalated to the PRD (typically as Q-XX).
INV-DM-03 — Boundary obligations: every introduced boundary (CON-XX and any IAR-XX that creates a boundary) MUST list its boundary obligation IDs (OBL-XX) (without describing algorithms here).
INV-DM-04 — ADR-only decisions: decisions appear only as (decided-by: ADR-###) with no rationale prose in PRD/Plan/Design Map.
INV-DM-05 — Surface invariants, not shapes: Design Maps MUST define boundary behavior as invariants/obligations identified by IDs (what inputs/outputs MUST include / guarantee), not as concrete schema definitions, field/key names, data types, or example payload shapes.
INV-DM-06 — Graph-first IDs: every node and relationship in a Design Map instance MUST be representable as IDs + typed edges/relationships (avoid free-form prose that cannot be attached to an ID).

Derived requirement classification (Design Map behavior)

DR-1 (Deterministic derived requirement): can be derived from PRD + explored facts/pattern constraints with no preference tradeoff → stays in Design Map.
DR-2 (Decision-derivable): multiple valid options exist, but one option is deterministically implied by existing constraints → record ADR; Design Map links (decided-by: ADR-###); no PRD change.
DR-3 (Ambiguity / missing intent): cannot be derived without preference/tradeoff → Design Map must surface missing requirement(s); escalate to PRD as Q-XX (or equivalent).
Escalation rule: Only Needs items that are ambiguities requiring user intent are escalated to PRD (as Q-XX or equivalent). Deterministic derived requirements MUST remain in Design Map and MUST NOT be rewritten into PRD.

Identifier Vocabulary

PRD IDs (inputs):

GOAL-XX, INV-XX, SET-XX, ART-XX, ALG-XX, RES-XX, Q-XX, {DOMAIN}-XX

Derived requirements introduced/resolved inside a Design Map may use domain-specific prefixes in the {DOMAIN}-XX space (e.g., DM-XX, BND-XX, ECO-XX) as long as they are unique and cross-referenced.

Process IDs (links only):

ADR-### — Architecture Decision Record

Design Map IDs (this document):

COM-XX — component
CON-XX — contract / boundary (component↔component or component↔artifact)
IAR-XX — internal artifact / state holder / boundary (queue, table, topic, cache, internal API, filesystem path)
OBL-XX — boundary obligation (referenceable obligation applied at a boundary)

Deterministic Derivation (Minimum Required Invariants)

For deterministic derivation from a Design Map instance:

Each COM-* must specify:
- chosen pattern (name/ID) and its required contract types;
- optional composition (Composed-of: (COM-*, ...)) for purely architectural components;
- its required inputs/outputs (as ART-* and/or IAR-* IDs);
- its surface list (CON-*) (provided and/or consumed).
Each CON-* must specify:
- interaction type (request/response, pub/sub, batch, stream, filesystem, etc.);
- input invariants (what inbound payload + metadata MUST include);
- output invariants (what outbound payload + metadata MUST include);
- required invariants/constraint sets (by ID references);
- boundary obligation IDs (OBL-*).
Each IAR-* must specify:
- kind + access contracts (CON-*);
- state invariants (what the artifact MUST represent/include/guarantee).

Minimal Node Templates

Needs: IDs of unresolved inputs required to complete deterministic derivation of this node without making an implicit decision. Needs items can be:

Derived requirements resolved inside Design Map, or
Open questions / missing intent that must be escalated to PRD.

Cross-references: optional typed relations. Relation labels are extensible; include only those relevant to the node.

Component Template

## Component: COM-__

Composed-of (optional): (COM-__, COM-__)
Pattern: {pattern-name-or-id}
Implements: (ALG-__, {DOMAIN}-__, GOAL-__)
Cross-references (optional): (requires: INV-__; uses: RES-__; satisfies: SET-__;
  impacts: ART-__; decided-by: ADR-###)
Needs: ({DOMAIN}-__, Q-__)
Consumes: (ART-__, IAR-__)
Produces: (ART-__, IAR-__)

### Surfaces (contracts)

- CON-__: {provides|consumes} for (ART-__/IAR-__) Cross-references (optional):
  (requires: INV-__; satisfies: SET-__; decided-by: ADR-###)

Contract (Boundary) Template

## Contract: CON-__

Pattern: {pattern-name-or-id}
Between: (COM-__, COM-__)
For: (ART-__/IAR-__)
Interaction: {request-response|pub-sub|batch|stream|filesystem|...}
Implements: (ALG-__, {DOMAIN}-__, GOAL-__)
Cross-references (optional): (requires: INV-__; uses: RES-__; satisfies: SET-__;
  impacts: ART-__; decided-by: ADR-###)
Needs: ({DOMAIN}-__, Q-__)

Input MUST include (IDs only; no schema fields/types):

- {ID} — {required semantic element / operational metadata / gating invariant}

Output MUST include (IDs only; no schema fields/types):

- {ID} — {required semantic element / operational metadata / gating invariant}

Boundary obligations:

- OBL-__ — {short obligation description}
  Cross-references: (requires: INV-__; satisfies: SET-__)

Internal Artifact / Boundary Template

## Internal Artifact / State Holder / Boundary: IAR-__

Pattern: {pattern-name-or-id}
Kind: {queue|topic|table|index|cache|internal-api|filesystem|job|timer}
Owned-by: (COM-__)
Implements: (ALG-__, {DOMAIN}-__, GOAL-__)
Cross-references (optional): (requires: INV-__; uses: RES-__; satisfies: SET-__;
  impacts: ART-__; decided-by: ADR-###)
Needs: ({DOMAIN}-__, Q-__)

State MUST include / guarantee (IDs only; no schema fields/types):

- {ID} — {required state element / invariants the artifact enforces}

### Contracts (how it is accessed)

- CON-__: {read|write|publish|subscribe|mutate} Cross-references (optional):
  (derived-from: IAR-__; requires: INV-__; satisfies: SET-__)

Boundary obligations:

- OBL-__ — {short obligation description}
  Cross-references: (requires: INV-__; satisfies: SET-__)

Raw

PRD.md

PRD Structure and Composition Theory

Purpose

This document defines the principles and structure for composing Product Requirements Documents (PRDs) in this project. The format prioritizes machine-parseable structure, traceability, and zero duplication over traditional prose-based documentation.

Core Principles

1. No Prose

PRDs must avoid narrative paragraphs. All content should be expressed as:

Bulleted lists
Indexed rules with unique identifiers
Tables (for structured comparisons)
Diagrams (Mermaid flowcharts/component diagrams)

Prose exception (only if present): Problem Statement (single paragraph).

Why: Prose introduces ambiguity, makes requirements hard to trace, and encourages duplication through restating the same concept in different words.

Bad example (prose):

The system should extract facts from documents. These facts need to be atomic,
meaning each fact contains exactly one assertion. The extraction process must
preserve the original location in the source document for traceability purposes.

Good example (structured):

- **EX-05 — Atomicity:** no compound facts; split conjunctions ("and/or/but") into
  separate facts.
- **EX-06 — Source context required:** each fact carries provenance offsets (not
  ownership).

2. Indexed Identifiers

Every discrete requirement, resource, goal, invariant, constraint set, artifact/boundary, algorithm, or rule receives a unique identifier following these conventions:

Prefix	Meaning	Example
`RES-XX`	Resource (tool, library, external dependency)	`RES-01` Claude Code
`GOAL-XX`	High-level objective	`GOAL-03` Atomic information units
`INV-XX`	Invariant (immutable constraint)	`INV-01` Byte-exact provenance
`SET-XX`	Constraint set (bundle of actual system constraints: INV/RULE)	`SET-01` Auditability set
`ART-XX`	External artifact/boundary requirement (external only)	`ART-01` GitHub webhook payload
`ALG-XX`	Algorithm (documented in Components; not embedded in PRDs)	`ALG-01` Fact extraction
`MET-XX`	Success metric (measurable verification)	`MET-01` Fact extraction accuracy
`TEST-XX`	Test artifact / verification evidence	`TEST-01` Webhook signature verification tests
`Q-XX`	Open question (unresolved decision requiring future input)	`Q-01` PDF input support
`ADR-###`	Architecture Decision Record (decision rationale/history; not PRD content)	`ADR-012` Storage engine choice
`XXX-XX`	Domain-specific rule (e.g., EX, VAL, REC)	`EX-05` Atomicity

This prefix table is a starter set; PRD instances may define additional prefixes as needed. Unrecognized prefixes should be treated as domain-specific XXX-XX rules as long as IDs are unique and cross-referenced.

SET-XX rules:

SET-XX members are actual system constraints (INV/RULE), not document meta-format requirements.
Apply sets by attachment (do not restate set contents): Cross-references: (satisfies: SET-XX).

SET-XX template:

- **SET-01 — {set name}:** bundle of commonly-applied constraints.
  Cross-references: (includes: INV-05, SEC-02, RET-01)

ART-XX rules:

PRDs list external ART-XX only (inputs/outputs/interfaces that are requirements).
Internal artifacts/boundaries (queues, tables, internal APIs) belong in the Design Map.

Why: Indexed identifiers enable:

Cross-referencing without duplication
Traceability from implementation to requirement
Precise citations in discussions and reviews
Machine parsing for coverage analysis

3. DRY (Don't Repeat Yourself)

Each concept appears exactly once. When a rule depends on or relates to another, use typed cross-references instead of restating.

Cross-reference format (machine-parseable):

Use a Cross-references: field with labeled relations.
Grammar: Cross-references: (requires: INV-01; uses: RES-03; satisfies: SET-02; validated-by: MET-01)
Delimiters:
- Relations separated by ;
- IDs separated by ,

Initial relation vocabulary (extend only when needed):

requires: hard dependency
uses: resource/tooling dependency
satisfies: compliance/constraint satisfaction (often a SET-XX)
validated-by: metric (MET-XX) or test artifact (TEST-XX)
derived-from: decomposition of another ID
impacts: non-required effect (rare)
decided-by: ADR link (ID only, no prose)

Decision rationale rule (keep PRD timeless):

If something requires a choice, apply decision-gating:
- If it is deterministically derivable from existing requirements + explored facts → record as an ADR decision and link from PRD items via Cross-references: (decided-by: ADR-###) (ID only).
- If it is not derivable → surface the missing requirement(s) needed to derive it (not the choice itself), ideally as an Open Questions Q-XX item.
Do not put decision history/justifications inside PRD prose.

Single extension (justified):

includes: membership list for SET-XX (constraint sets must expose members for machine parsing)

Bad (duplication):

- **VAL-G-01:** All facts must be atomic with one assertion each.
- **EX-05:** Each stored fact contains exactly one assertion.

Good (cross-reference):

- **EX-05 — Atomicity:** no compound facts; split conjunctions into separate facts.
  Cross-references: (validated-by: MET-01)
- **MET-01 — Atomicity conformance:** % of extracted facts that contain exactly one
  assertion.
  Cross-references: (derived-from: EX-05)

4. Precedence Through Invariants

Invariants are constraints that:

Apply globally across the entire document
Override any conflicting requirements
Cannot be violated by lower-level rules

Place invariants first and explicitly state their precedence:

### Invariants

- **Precedence:** invariants apply globally and override any conflicting requirements.
- **INV-01 — Byte-exact provenance:** ...
- **INV-REF-01 — No orphan requirements:** every non-invariant rule MUST be referenced
  by at least one of:
  - an `ALG-XX` algorithm (in component packages), or
  - an external `ART-XX` requirement, or
  - a `MET-XX` success metric.

5. Hierarchical Organization

Group related rules into logical categories. Standard categories include:

Category	Purpose
Invariants	Global constraints that override all other rules
Execution rules	How the system runs (harness, orchestration)
Input rules	Input formats, validation, canonicalization
Processing rules	Core logic, extraction, transformation
Validation rules	Quality gates, error detection
Output rules	Artifacts, formats, packaging
Maintenance rules	Updates, merges, idempotency
Performance rules	Constraints on resources, timing
QA rules	Testing strategy, coverage requirements

6. Visual Representation

Use Mermaid diagrams for:

Component diagrams: System architecture showing major components and data flow
Algorithm flowcharts (components): Step-by-step decision logic with rule annotations

Annotate diagram nodes with rule references:

flowchart TD
  A["Input: document<br/>Cross-references: (requires: IN-01, CAN-01)"] -->
    B["Processing<br/>Cross-references: (requires: EX-01, EX-02)"]

7. Invariants, Not Concrete Shapes

Requirements and contracts must be defined as invariants/obligations identified by IDs, not as concrete data schemas.

Rules:

Do not specify data types, field/key names, or example payload shapes in PRDs.
If a boundary "needs a field", define the semantic requirement as an ID (e.g., ERR-01, TRACE-01) and reference that ID from ART-XX / CON-XX / ALG-XX documentation.

Standard Sections

Required Sections

These sections should appear in every PRD:

1. Sources (optional header line)

References to related documents that inform or extend this PRD.

Sources: QA_strategy.md · requirements.md

2. Resources

Indexed list of external dependencies (tools, libraries, APIs, models).

## Resources

- `RES-01` Claude Code — execution harness
- `RES-02` Python — deterministic helper scripts runtime
- `RES-03` SQLite — local DB engine / file format

Guidelines:

One resource per line
Include brief description of role/purpose
Resources may reference each other via Cross-references: (requires: RES-XX) when relevant

3. External Artifacts / Boundaries

Indexed list of external interfaces/deliverables that are requirements.

## External Artifacts / Boundaries

- **ART-01 — {artifact/boundary name}:** exists as an external interface/deliverable.
  Cross-references: (satisfies: SET-01; validated-by: MET-03; requires: INV-02)

Guidelines:

One artifact/boundary per line
PRDs list external ART-XX only
Internal artifacts/boundaries belong in the Design Map

4. Problem Statement

Single paragraph (exception to no-prose rule) explaining:

What problem exists
Why it matters
What success looks like

Keep under 200 words. This is the only prose allowed.

5. Goal List

Indexed objectives the system must achieve.

## Goal List

- **GOAL-01 — Structured fact index:** Extract all unique, atomic facts...
- **GOAL-02 — Completeness by derivability:** Preserve base facts sufficient for...

Guidelines:

Use GOAL-XX — format with short title and description
Goals are outcome-focused (what), not implementation-focused (how)
Goals should be measurable or verifiable

6. Indexed Rule List

The bulk of the PRD. Organized into subsections by category.

## Indexed rule list

### Invariants

- **INV-01 — Rule name:** Description.
- **INV-02 — Rule name:** Description.
  Cross-references: (requires: INV-01; derived-from: GOAL-02)

### Execution rules

- **EXEC-01 — Rule name:** Description.

Guidelines:

Each rule has unique prefix+number identifier
Include Cross-references: (...) with typed relations where applicable
Rules should be atomic (one concern per rule)
Use consistent verb tense (present/imperative)

7. Component Diagrams

Mermaid diagrams showing system architecture.

## Component diagrams

### Component diagram: System Name

```mermaid
flowchart LR
  subgraph INPUTS["Inputs"]
    DOC["Documents<br/>Cross-references: (requires: IN-01)"]
  end
  ...
```

Guidelines:

Annotate nodes with relevant rule IDs
Use subgraphs to group related components
Show data flow direction with arrows
Include legend if diagram is complex

In PRDs:

Prefer external ART-XX boundaries and major components only
Put internal artifacts/boundaries in the Design Map

8. Components (Packages)

Algorithms do not live in PRDs. They live in component packages under a sibling components/ folder. Component package structure is defined in .tasks/processes/components/architecture.md.

## Components (packages)

- `components/architecture.md` — root component package (start here; contains all
  `ALG-XX` initially)

Guidelines:

Treat each component as a package of component composition (COM-XX), ALG-XX algorithms, and state holders (IAR-XX), plus its exposed surfaces (CON-XX).
Start with a single root component (components/architecture.md) that contains all algorithms; split into additional component packages only when the decomposition is clear.
Component surfaces MUST be documented as input/output invariants per .tasks/processes/design map structure.md.

9. Success Metrics

Indexed, measurable criteria that define when goals are achieved. Each metric links to specific goals and provides objective verification criteria.

## Success Metrics

| ID | Metric | Target | Measurement | Cross-references |
|----|--------|--------|-------------|------------------|
| `MET-01` | Fact extraction accuracy | >95% | Manual review of 100 samples | (derived-from: GOAL-01, GOAL-03) |
| `MET-02` | Reconstruction success rate | 100% | Automated byte-comparison | (derived-from: GOAL-02, INV-01) |
| `MET-03` | Processing throughput | <5min/page | Benchmark on 50-page doc | (derived-from: GOAL-10, PERF-02) |
| `MET-04` | Deduplication precision | >99% | No false merges in test set | (derived-from: GOAL-04, DED-05) |

Guidelines:

Use MET-XX prefix for metric identifiers
Each metric must include at least one typed cross-reference (e.g., (derived-from: GOAL-XX[, INV-XX]))
Each goal/rule should cite its verification via Cross-references: (validated-by: MET-XX)
Targets must be specific and testable (not "high accuracy" but ">95%")
Measurement column specifies HOW the metric is validated
Include both functional metrics (accuracy, coverage) and operational metrics (speed, resource use)

Metric Categories:

Category	Purpose	Examples
Correctness	Validates functional requirements	Accuracy, precision, recall, F1
Completeness	Validates coverage requirements	% of source covered, reconstruction rate
Performance	Validates resource constraints	Throughput, latency, memory usage
Reliability	Validates stability requirements	Error rate, idempotency, crash frequency

Writing Good Metrics:

Derived from Goals: Every goal should have at least one metric derived from it and cite it via validated-by
Binary Testable: At any point, you can definitively say pass/fail
Realistic Targets: Based on baseline measurements or industry standards
Measurable Now: Don't defer measurement to "later" — define the test procedure
Independent: Metrics should not be redundant with each other

Bad metrics:

| Metric | Target |
|--------|--------|
| Quality | High |
| Speed | Fast enough |
| User satisfaction | Good |

Good metrics:

| ID | Metric | Target | Measurement |
|----|--------|--------|-------------|
| `MET-01` | Fact precision | >95% | Manual audit of 100 random facts |
| `MET-02` | Cold start time | <10s | Time from invocation to first output |
| `MET-03` | Task completion rate | 100% | No CQ artifacts in golden test set |

Optional Sections

Include when applicable:

Personas / User Scenarios

When the product serves distinct user types or use cases.

## Personas

### Primary: Developer

- Uses CLI interface
- Needs batch processing
- Values accuracy over speed

### Secondary: Reviewer

- Uses output artifacts
- Needs clear provenance
- Values auditability

Open Questions

Unresolved decisions that require future input.

## Open Questions

- [ ] **Q-01:** Should we support PDF input directly?
  Cross-references: (requires: RES-XX)
- [x] **Q-02:** Embedding model selection — resolved: RES-10 Qwen-3
  Cross-references: (uses: RES-10)

Composition Workflow

Step 1: Define Problem and Goals

Start with:

Problem Statement (single paragraph)
Goal List (3-10 indexed objectives)

These frame all subsequent requirements.

Step 2: Define Success Metrics

For each goal, define at least one measurable metric:

What measurement validates this goal?
What is the specific target threshold?
How will measurement be performed?

This ensures goals are verifiable from the start.

Step 3: Identify Resources

List all external dependencies. This inventory constrains what rules can specify.

Step 4: Establish Invariants

Define immutable constraints that must never be violated. These are your "constitutional" requirements.

Step 5: Derive Rules by Category

For each category (execution, input, processing, etc.):

Enumerate the rules needed to achieve goals
Ensure each rule is atomic
Add typed Cross-references: (...) fields
Avoid duplicating concepts already covered

Step 6: Add Visual Diagrams

Create component diagrams (in PRDs) and algorithm flowcharts (in component packages) that:

Summarize the system visually
Annotate with rule references
Clarify complex interactions

Step 7: Review for DRY Violations

Scan the document for:

Similar wording in multiple rules (consolidate or cross-reference)
Implicit dependencies (make explicit via references)
Prose creep (convert to structured format)

Step 8: Validate Metric Coverage

Verify that:

Every GOAL has at least one validated-by: MET-XX reference
Every MET has a defined measurement procedure
No MET is redundant with another

Rule Writing Guidelines

Atomic Rules

Each rule captures exactly one requirement:

Bad (compound):

- **EX-01:** Extract facts atomically, deduplicate them, and store with provenance.

Good (atomic):

- **EX-01:** Extract facts from input documents.
- **EX-02:** Ensure extracted facts are atomic (one assertion each).
- **DED-01:** Deduplicate identical facts.
- **EX-06:** Attach provenance offsets to each fact.

Self-Contained Description

Rules should be understandable without reading others, but may reference others for context:

- **REC-03 — Byte-for-byte comparison:** reconstruction validation requires exact
  character match between original and reconstructed text (not semantic equivalence).

Consistent Formatting

Use this template for rules:

- **PREFIX-XX — Short title:** Description in imperative or declarative form.
  Additional detail if needed. Cross-references: (requires: INV-01; derived-from:
  GOAL-02).

Expanded Sections

For complex invariants or rules, add an expanded subsection:

### Invariants

- **INV-01 — Byte-exact provenance:** canonical UTF-8 source; reconstruction success.

#### INV-01: Byte-exact Provenance (expanded)

- **Canonical source text:** ingestion produces canonical UTF-8 text representation.
  Cross-references: (requires: CAN-01)
- **Provenance via offsets:** references stored as character offsets.
  Cross-references: (requires: GOAL-06)
- **Pass condition:** byte-for-byte match required.
  Cross-references: (requires: REC-03)

Common Anti-Patterns

1. Prose Descriptions

Problem: Narrative paragraphs that bury requirements in sentences.

Fix: Convert to indexed rules with explicit IDs.

2. Duplicate Concepts

Problem: Same requirement stated differently in multiple places.

Fix: Consolidate to single rule; use cross-references elsewhere.

3. Implicit Dependencies

Problem: Rule assumes another without citing it.

Fix: Add explicit cross-reference: Cross-references: (requires: INV-01)

4. Vague Rules

Problem: "The system should handle errors appropriately."

Fix: Specify exact behavior as invariant obligations (IDs), not schema fields: "EXEC-05 — On extraction failure, emit a failure artifact. Cross-references: (requires: ERR-01, TRACE-01, PROV-01)."

5. Implementation Details in Goals

Problem: Goals that specify how, not what.

Fix: Keep goals outcome-focused. Move implementation to rules.

6. Missing Diagrams

Problem: Complex multi-component system described only in text.

Fix: Add component diagram showing relationships and data flow.

Summary

The PRD format used in this project differs from traditional prose-based PRDs by:

Eliminating prose in favor of indexed, atomic rules
Enabling traceability through unique identifiers
Preventing duplication through cross-references
Establishing precedence through invariants
Visualizing complexity through Mermaid diagrams
Organizing hierarchically by concern (execution, validation, output, etc.)

This approach produces documents that are:

Machine-parseable for coverage analysis
Unambiguous for implementation
Maintainable through modular updates
Traceable from requirement to code

The trade-off is reduced readability for casual readers. These PRDs are optimized for precision and completeness, not narrative flow.

nestharus/ADR.md

ADR-### — {decision title}

Context

Decision

Options considered

Consequences

References

Components (Packages) — Structure

Invariants

Root Component Package: architecture.md

Templates

Template: Root component package (architecture.md)

Template: Split component package (com-XX-*.md)

Reference Example (splitting)

Design Map Structure

Invariants

Derived requirement classification (Design Map behavior)

Identifier Vocabulary

Deterministic Derivation (Minimum Required Invariants)

Minimal Node Templates

Component Template

Contract (Boundary) Template

Internal Artifact / Boundary Template

PRD Structure and Composition Theory

Purpose

Core Principles

1. No Prose

2. Indexed Identifiers

3. DRY (Don't Repeat Yourself)

4. Precedence Through Invariants

5. Hierarchical Organization

6. Visual Representation

7. Invariants, Not Concrete Shapes

Standard Sections

Required Sections

1. Sources (optional header line)

2. Resources

3. External Artifacts / Boundaries

4. Problem Statement

5. Goal List

6. Indexed Rule List

7. Component Diagrams

8. Components (Packages)

9. Success Metrics

Optional Sections

Personas / User Scenarios

Open Questions

Composition Workflow

Step 1: Define Problem and Goals

Step 2: Define Success Metrics

Step 3: Identify Resources

Step 4: Establish Invariants

Step 5: Derive Rules by Category

Step 6: Add Visual Diagrams

Step 7: Review for DRY Violations

Step 8: Validate Metric Coverage

Rule Writing Guidelines

Atomic Rules

Self-Contained Description

Consistent Formatting

Expanded Sections

Common Anti-Patterns

1. Prose Descriptions

2. Duplicate Concepts

3. Implicit Dependencies

4. Vague Rules

5. Implementation Details in Goals

6. Missing Diagrams

Summary

Root Component Package: `architecture.md`

Template: Root component package (`architecture.md`)

Template: Split component package (`com-XX-*.md`)