Example Output: Security Risk Experiment Launch Plan
Inputs used
- Project context: an internal agent that can read tickets, GitHub issues, and customer documents
- Target audience: security engineers, platform owners, privacy teams
- Success metric: activation, quality, and risk reduction
- Available tools and data: threat model template, SIEM, secret scanner, policy engine
- Desired depth: Production-ready
- Output tone: Clear operator memo
Generated Result
hypothesis, audience, variants, instrumentation, and decision rule
Hypothesis
Hypothesis: improving how the workflow handles data flow diagrams will reduce prompt injection. Ship two variants at most, instrument the primary metric before launch, and decide in advance what evidence stops the test.
Target segment
Use tool permissions as evidence, apply the constraint "map each risk to a control", and explicitly note how the plan reduces credential leakage. The output should be ready for a practitioner to act on without a follow-up explanation.
Variants
Hypothesis: improving how the workflow handles audit logs will reduce cross-tenant access. Ship two variants at most, instrument the primary metric before launch, and decide in advance what evidence stops the test.
Instrumentation
Hypothesis: improving how the workflow handles security incidents will reduce unreviewed tool writes. Ship two variants at most, instrument the primary metric before launch, and decide in advance what evidence stops the test.
Risks
Treat prompt injection as a launch blocker until there is a control that can be verified. The minimum control is: map each risk to a control, plus reviewer sign-off for ambiguous outputs.
Decision rule
Hypothesis: improving how the workflow handles tool permissions will reduce credential leakage. Ship two variants at most, instrument the primary metric before launch, and decide in advance what evidence stops the test.
Recommended Decision
Proceed with a narrow pilot focused on data flow diagrams and tool permissions. Treat prompt injection as the primary launch blocker. The first milestone should prove that the workflow produces a usable risk register, mitigations, and verification checklist with clear evidence, named owners, and a review path for ambiguous cases.
Expected quality checks
- The result is specific to AI system threat modeling, prompt-injection review, data exposure risk, and incident readiness.
- It includes the required sections: Hypothesis, Target segment, Variants, Instrumentation, Risks, Decision rule.
- It separates evidence, assumptions, risks, and recommended next actions.
- It includes practical verification steps, not only generic advice.
- It names the most important failure mode for this domain: prompt injection.
Reuse note
Before copying the output into production work, replace all default variables with your real data and run a human review for high-impact decisions.