Use Cases

The sandbox is the core of Veris — your agent running against simulated users, triggers, and services. Once your agent is integrated (see Quickstart), the sandbox supports several distinct workflows. Most teams start with the development loop and pick up others as they grow.

Development loop

This is the default path for most Veris users. Simulate → read the report → fix the highest-leverage issue → push → re-simulate. Each pass narrows in on the real failures. Most of the value of Veris lives here.

→ Development loop guide

CI/CD regression gating

Wire Veris into the PR path. On every push, build the agent image, push a PR-specific tag, run the same scenario set you use in development, and fail the PR if eval scores drop below a baseline. veris run exits non-zero on evaluation failure, so it drops into CI as a gate without special plumbing.
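As a rough sketch of why no special plumbing is needed, the wrapper below runs a command and hands its exit code back to the caller, which is all a CI step checks. The helper name `run_gate` is hypothetical. The bare `["veris", "run"]` invocation in the usage note is an assumption, so substitute whatever arguments your project actually requires.

```python
import subprocess
import sys


def run_gate(command):
    """Run `command` and return its exit code (0 means the gate passed)."""
    completed = subprocess.run(command)
    if completed.returncode != 0:
        # Surface the failure in the CI log before the step exits.
        print(f"gate failed: exit code {completed.returncode}", file=sys.stderr)
    return completed.returncode
```

A CI step could end with `sys.exit(run_gate(["veris", "run"]))`, assuming that invocation matches your setup. Most CI systems treat a non-zero exit as a failed step, so calling the command directly from the pipeline has the same effect.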

→ CI/CD regression gating guide

Regulatory QA

For agents in regulated domains (financial services, healthcare, public-sector workflows), the same simulation loop becomes the evidence base. Scenarios cover the regulated cases. Graders encode the policies. Reports document what the agent did, where it deviated, and how the team responded.

This is a positioning frame rather than a distinct technical workflow — the mechanics are the dev loop and CI gating. What changes is which scenarios you prioritize, what your graders encode, and who reads the reports.

Training — Reinforcement Learning

Use the simulation environment as a live training ground. Graders provide the reward signal; the model is updated to favor higher-scoring behaviors.
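To make "graders provide the reward signal" concrete, here is a toy sketch. It is not Veris's training pipeline. It applies the expected policy-gradient update to a softmax policy over a few discrete behaviors, with a fixed list of numbers standing in for grader scores. All names, scores, and hyperparameters are assumptions chosen for illustration.

```python
import math


def softmax(logits):
    """Convert logits to a probability distribution."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def policy_gradient_step(logits, rewards, learning_rate=0.5):
    """Apply one expected policy-gradient update to a softmax policy.

    rewards[i] stands in for the grader score of behavior i (hypothetical).
    """
    probs = softmax(logits)
    expected = sum(p * r for p, r in zip(probs, rewards))
    # d(expected reward) / d(logit_i) = p_i * (r_i - expected reward)
    return [
        logit + learning_rate * p * (r - expected)
        for logit, p, r in zip(logits, probs, rewards)
    ]


def train(rewards, steps=200):
    """Start from a uniform policy and repeat the update `steps` times."""
    logits = [0.0] * len(rewards)
    for _ in range(steps):
        logits = policy_gradient_step(logits, rewards)
    return softmax(logits)
```

Calling `train([0.2, 0.9, 0.5])` shifts probability toward the second behavior, the one with the highest stand-in score. A real setup would sample full simulation episodes and update model weights rather than three logits.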

→ RL training guide · RL reference

Training — Supervised Fine-Tuning

Use high-scoring simulation transcripts as training data for SFT. This distills behavior from a larger model into a smaller, cheaper one. It depends on having a trustworthy grader and coverage of your real production distribution.
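The selection step can be sketched as a score filter followed by serialization. The record shape (`messages` plus a numeric `score`), the 0.9 threshold, and the JSONL output are all assumptions for illustration. They are not the Veris transcript format, so adapt the field names to whatever your reports contain.

```python
import json


def select_sft_examples(transcripts, min_score=0.9):
    """Keep transcripts whose grader score is at least `min_score`."""
    return [t for t in transcripts if t["score"] >= min_score]


def to_jsonl(examples):
    """Serialize the selected conversations, one JSON object per line."""
    return "\n".join(json.dumps({"messages": e["messages"]}) for e in examples)
```

The threshold is only as meaningful as the grader behind it. If the grader rewards the wrong behavior, filtering on its score bakes that behavior into the fine-tuned model.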

→ SFT training guide · SFT reference

All of these workflows run against the same sandbox and the same scenario sets. You don’t need separate integrations — once your agent is pushed, every use case is available.