App 04 — Studio
Prompts deserve version control.
Versioned templates, A/B experiments, model benchmarks, and snapshot tests. Prompt engineering as infrastructure, not guesswork.
The API
Templates, experiments, benchmarks, and snapshots are all first-class REST resources. Version your prompts the same way you version your code.
# Create a template
curl -X POST /api/studio/templates \
  -d '{"slug":"summarize-doc", "body":"Summarize: {{text}}",
       "model":"gpt-4o", "variables":["text"]}'

# Add a new version of that template
curl -X POST /api/studio/templates/summarize-doc/versions \
  -d '{"body":"Summarize in 3 bullets: {{text}}", "note":"tighter format"}'

# Create an A/B experiment across two variants
curl -X POST /api/studio/experiments \
  -d '{"template":"code-review", "variants":["v6","v7"],
       "split":[30,70], "metric":"quality"}'

# Benchmark one template across several models
curl -X POST /api/studio/benchmarks/run \
  -d '{"template":"summarize-doc",
       "models":["gpt-4o","claude-sonnet-4-20250514","gpt-4o-mini"]}'

# Snapshot a template version under a tag
curl -X POST /api/studio/snapshots \
  -d '{"template":"extract-entities", "version":"v5", "tag":"pre-model-update"}'
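The calls above use relative paths and omit headers for brevity. To run them against a real instance, a small wrapper along these lines may help. This is a minimal sketch, not documented Studio behavior: the STUDIO_URL variable, its localhost default, the helper names, and the JSON Content-Type header are all assumptions.

```shell
# Hypothetical base URL; point this at your own Stockyard instance.
STUDIO_URL="${STUDIO_URL:-http://localhost:8080}"

# Build the full URL for a Studio API path such as /api/studio/templates.
# A trailing slash on STUDIO_URL is dropped so the path joins cleanly.
studio_url() {
  printf '%s%s\n' "${STUDIO_URL%/}" "$1"
}

# POST a JSON body to a Studio path.
# Assumes the API accepts application/json; --fail makes curl exit
# non-zero on HTTP error responses.
studio_post() {
  curl --fail -sS -X POST "$(studio_url "$1")" \
    -H 'Content-Type: application/json' \
    -d "$2"
}
```

With the helpers loaded, the snapshot call becomes: studio_post /api/studio/snapshots '{"template":"extract-entities", "version":"v5", "tag":"pre-model-update"}'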
Stop guessing. Start testing.
Studio ships with every Stockyard instance. Self-hosted or Cloud.