You don't need to read docs. You don't need a config file. Install one package, write a test, run it. Here's exactly how.
One package. It brings pytest along if you don't have it.
Don't want to write tests yet? Run a single command. It fires 10 safety prompts at any model and gives you a score instantly.
gpt-4o, claude-sonnet-4-6, etc.) and get a safety baseline in seconds. When you're ready for custom assertions, keep reading.
Create a file called test_my_agent.py and paste this in. These tests run completely offline — no API key, no network calls. You're testing proofagent's assertion engine against hardcoded results.
Just pytest. Nothing special.
Here's what you'll get. Three tests, all passing, sub-second. No API calls were made.
This is where it gets real. Set your API key and let proofagent actually call the model. Add this test to the same file or a new one.
proofagent_run is a pytest fixture. It sends the prompt to the model, captures the response, tracks cost and latency, and returns an LLMResult you can assert against.
Same command. proofagent calls the model, checks assertions, and reports cost.
Now make it run on every push. Drop this into .github/workflows/eval.yml and bad deploys get blocked automatically.
tests/ folder. They'll run in CI too, adding a fast safety net that doesn't cost anything.
Got an agent running behind a URL? You don't need to integrate an SDK. Just point proofagent at the endpoint.
You've got the basics. Here are a few directions to go from here:
proofagent dashboard to see results in your browser