Reading guide
This page routes you to the part of the writeup that fits your purpose.
The project ships with two reading-style guides covering the same content (per ADR-079):
| Guide | Style | Length | Best for |
|---|---|---|---|
| WRITEUP_PAPER.md | Academic IMRAD (Abstract / Methods / Results / Discussion / Limits / Refs) | ~20–25 min | Reviewers expecting journal-paper discipline |
| WRITEUP_NARRATIVE.md | Story arc (Hook / Setup / Investigation / Revelation / Implications) | ~15–20 min | Readers preferring plain-English first-person prose |
Pick the register that fits; neither guide contains material the other lacks.
Persona-specific paths
Path A — Academic reviewer (~20–25 min)
Read WRITEUP_PAPER.md end-to-end. It is structured as a journal paper (Abstract, Introduction, Background, Methods, Results, Discussion, Limitations, Conclusion, References), with cross-references to ADRs at every methodology decision. The bibliography includes external papers, project artifacts, and ADR citations.
If you want depth on a specific subsection, the methodology spokes are at WRITEUP/ — 8 files covering data decisions, evaluation design, model details, threshold policy, reference-scorer audit, methodology guarantees, reproducibility, and limitations + future work.
Path B — Story reader (~15–20 min)
Read WRITEUP_NARRATIVE.md end-to-end. It is structured as a 5-act story arc with an epilogue. It uses a plain-English voice and defines technical terms on first use.
The story’s third act presents the headline finding (the anti-correlation result). The fourth act enumerates the 6 supporting findings with equal weight so the headline doesn’t drown out the rest.
Path C — Hiring manager (60 seconds)
Read Project at a glance. It answers four questions: what the problem is, what was found, why to trust it, and how the candidate thinks. This is the shortest reader path.
Path D — Reproducer (~15–20 min setup + ~$0 to ~$125 compute)
Three tiers per WRITEUP/reproducibility.md:
- T0 — score-match against published HF Hub checkpoints (~$0, ~20 min)
- T1 — laptop smoke test (~$0, <10 min)
- T3 — full retraining on cloud GPU (~$125, hours; cost-capped per ADR-020)
Commands in README §Reproduce — three tiers. Cost ledger at evals/cost_ledger.csv.
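If you want to check your own spend against the T3 budget before starting a long run, a minimal sketch follows. It is not part of the project. The column names (`tier`, `description`, `usd`), the sample rows, and the cap value are all assumptions made for illustration; the real layout of evals/cost_ledger.csv and the actual cap defined in ADR-020 may differ.

```python
import csv
import io

# Hypothetical ledger layout and invented placeholder rows.
# The real columns and values in evals/cost_ledger.csv may differ.
SAMPLE_LEDGER = """tier,description,usd
T0,score-match against published checkpoints,0.00
T1,laptop smoke test,0.00
T3,cloud GPU retraining run 1,61.50
T3,cloud GPU retraining run 2,58.25
"""

# Assumption: taken from the ~$125 T3 estimate, not read from ADR-020.
ASSUMED_CAP_USD = 125.0


def total_by_tier(ledger_file):
    """Sum the `usd` column per tier from a CSV file-like object."""
    totals = {}
    for row in csv.DictReader(ledger_file):
        totals[row["tier"]] = totals.get(row["tier"], 0.0) + float(row["usd"])
    return totals


def within_cap(totals, cap_usd):
    """Return True if combined spend across all tiers is at or below cap_usd."""
    return sum(totals.values()) <= cap_usd


totals = total_by_tier(io.StringIO(SAMPLE_LEDGER))
print(totals)
print(within_cap(totals, ASSUMED_CAP_USD))
```

To run it against a real ledger, replace the `io.StringIO(...)` argument with an open file handle and adjust the column names to match the file.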
Path E — Just the numbers
Read RESULTS.md. Tables-only appendix; no narrative prose. 5 canonical figures + raw artifact pointers.
Result map
| Result section | Where it lives |
|---|---|
| Headline pooled OOD AUPRC (Finding 3) | README §Executive summary, WRITEUP_PAPER §4.3, WRITEUP_NARRATIVE Act 3 Finding 3, RESULTS §1 |
| Direct detection check (Finding 1) | README §Executive summary, WRITEUP_PAPER §4.1, WRITEUP_NARRATIVE Act 3 Finding 1, RESULTS §Direct Prompt-Injection Performance |
| OOD wall is cross-family, not source-level (Finding 2) | WRITEUP_PAPER §4.2, WRITEUP_NARRATIVE Act 3 Finding 2, WRITEUP/eval-design §5.5 |
| Mechanism (lexical overfitting + label-relevance shift) | README §Executive summary, WRITEUP_PAPER §5.1, WRITEUP_NARRATIVE Act 3 |
| Context-window ablation | WRITEUP_PAPER §4.4, WRITEUP_NARRATIVE Finding 4, RESULTS §1B |
| Calibration | WRITEUP_PAPER §4.7, WRITEUP_NARRATIVE Finding 7, RESULTS §5 |
| Threshold fragility | WRITEUP_PAPER §4.6, WRITEUP_NARRATIVE Finding 6, RESULTS §4 |
Glossary + decisions
- Glossary: docs/GLOSSARY.md — all technical terms used in either guide, with cross-references.
- Decisions: 81 ADRs at decisions/ lock the methodology choices. Both guides cite specific ADRs at every methodology decision point.
- Evidence trail: EVIDENCE.md for external-evidence audit (training corpus contamination, reference scorer training pools, etc.).
Submission anchors
- Current state: tree/v1.3.13 (2026-05-26), live-site source.
- Original submission tag: tree/v1.0.0 (2026-05-18), preserved as historical reviewer pin per ADR-033.
- Live rendered site: https://brandon-behring.github.io/prompt-injection-detection-prototype/