example
differences detected2026-03-102.1.87 (Claude Code)
Control
No changes (baseline)
Treatment
Added skill-index hints to CLAUDE.md
Skills
2/3
Refs
2/3
Tools
0/3
Signals
20/26
Per-Prompt Results
#1 Add a modal dialog with close button
| Metric | Control | Treatment | Match |
|---|---|---|---|
| Events | 42 | 38 | ≠ |
| Duration | 12.3s | 10.1s | ≠ |
| Skills | html | html | = |
| Refs | html/dialog-patterns.md | html/dialog-patterns.md | = |
| Tools | Read(5), Skill(1), Write(2) | Read(4), Skill(1), Write(2) | ≠ |
| Signals | 2 | 3 | ≠ |
Control signals: data-hatch-id, hatch-trigger
Treatment signals: data-hatch-id, hatch-trigger, hatch-body
#2 Style the page with a dark theme
| Metric | Control | Treatment | Match |
|---|---|---|---|
| Events | 35 | 31 | ≠ |
| Duration | 9.8s | 8.2s | ≠ |
| Skills | css | css | = |
| Refs | css/theming.md | css/theming.md | = |
| Tools | Read(3), Skill(1), Write(2) | Read(3), Skill(1), Write(1) | ≠ |
| Signals | 2 | 3 | ≠ |
Control signals: data-coat, --ink-
Treatment signals: data-coat, --ink-, .plate-
#3 Create a sortable data table
| Metric | Control | Treatment | Match |
|---|---|---|---|
| Events | 55 | 48 | ≠ |
| Duration | 18.4s | 15.2s | ≠ |
| Skills | none | html | ≠ |
| Refs | none | html/table-patterns.md | ≠ |
| Tools | Edit(1), Read(2), Write(3) | Edit(1), Read(4), Skill(1), Write(2) | ≠ |
| Signals | 0 | 3 | ≠ |
Treatment signals: data-slab-id, data-rankable, row-lever
Totals
Control
Sessions
3
Prompts
3
Events
132
Skills: html, css
Tools: Edit(1), Read(10), Skill(2), Write(7)
Treatment
Sessions
3
Prompts
3
Events
117
Skills: html, css
Tools: Edit(1), Read(11), Skill(3), Write(5)
Verification Signals
| Signal | Control | Treatment | Proves |
|---|---|---|---|
| data-coat | ● | ● | CSS theming |
| --ink- | ● | ● | |
| .plate- | ○ | ● | |
| data-hatch-id | ● | ● | HTML dialog-patterns |
| hatch-trigger | ● | ● | |
| hatch-body | ○ | ● | |
| data-slab-id | ○ | ● | HTML table-patterns |
| data-rankable | ○ | ● | |
| row-lever | ○ | ● |
Conclusion
skills differed in 1/3 prompts; subskill refs differed in 1/3 prompts; 6/26 verification signals differed