How it works
- Quality scores. Every call that an Observe rule judges is stored with a score and an observation.
- Feedback. Feedback you send from your SDK attaches a human signal to a call, alongside the judge’s scores.
- Better outputs. Steer uses that signal to select few-shot examples from your top-scoring calls and to tune the prompt, so quality can climb as more scored calls come in.
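To make the selection step concrete, here is a minimal sketch of picking few-shot examples from scored calls. It is an illustration only, not Steer's implementation: the `ScoredCall` record, the 0.0 to 1.0 score range, the `min_score` threshold, and the rule that a thumbs-up outranks the judge score are all assumptions made for the example.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ScoredCall:
    """Hypothetical record of one call; field names are assumptions."""
    call_id: str
    prompt: str
    response: str
    judge_score: float               # assumed range: 0.0 (worst) to 1.0 (best)
    feedback: Optional[bool] = None  # True = thumbs-up, False = thumbs-down, None = no feedback


def select_few_shot(calls: List[ScoredCall], k: int = 3, min_score: float = 0.8) -> List[ScoredCall]:
    """Return up to k calls to use as few-shot examples."""
    # Drop low-scoring calls and anything a human marked thumbs-down.
    eligible = [c for c in calls if c.judge_score >= min_score and c.feedback is not False]
    # Rank thumbs-up calls first, then by judge score (an assumed policy).
    eligible.sort(key=lambda c: (c.feedback is True, c.judge_score), reverse=True)
    return eligible[:k]


def build_prompt(base_prompt: str, examples: List[ScoredCall], user_input: str) -> str:
    """Prepend the selected examples to the new input."""
    shots = "\n\n".join(f"Input: {c.prompt}\nOutput: {c.response}" for c in examples)
    return f"{base_prompt}\n\n{shots}\n\nInput: {user_input}\nOutput:"


calls = [
    ScoredCall("a", "Summarize ticket 1", "Summary A", 0.95),
    ScoredCall("b", "Summarize ticket 2", "Summary B", 0.85, feedback=True),
    ScoredCall("c", "Summarize ticket 3", "Summary C", 0.99, feedback=False),
    ScoredCall("d", "Summarize ticket 4", "Summary D", 0.50, feedback=True),
    ScoredCall("e", "Summarize ticket 5", "Summary E", 0.90),
]

picked = select_few_shot(calls, k=2)
print([c.call_id for c in picked])  # ['b', 'a']
print(build_prompt("You summarize support tickets.", picked, "Summarize ticket 6"))
```

Call `c` is excluded despite its high judge score because of the thumbs-down, and call `d` is excluded despite its thumbs-up because the judge score is below the threshold. How the two signals are actually weighed is a design choice; this sketch just shows one plausible way to combine them.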
The loop in practice
- Score. Judge the calls you care about with an Observe rule, and send thumbs-up/down feedback from your app wherever you collect it.
- Accumulate signal. As scored calls and feedback build up, Steer learns which responses are your strongest.
- Improve. Steer selects few-shot examples from the top-scoring calls and tunes the prompt, so later calls to the same function start from better material.
- Watch it climb. Because Observe keeps scoring, you can see quality trend up over time in Analytics.
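If you want to reason about the trend outside of Analytics, the idea behind it can be sketched as a per-day average of judge scores. This is a hypothetical illustration under stated assumptions: the `(date, score)` input shape and the 0.0 to 1.0 score range are invented for the example, and nothing here reflects how Analytics computes or exports its charts.

```python
from collections import defaultdict
from datetime import date
from statistics import mean
from typing import Iterable, List, Tuple


def daily_mean_scores(scored: Iterable[Tuple[date, float]]) -> List[Tuple[date, float]]:
    """Group (day, score) pairs by day and return the mean score per day, oldest first."""
    by_day = defaultdict(list)
    for day, score in scored:
        by_day[day].append(score)
    return [(day, mean(scores)) for day, scores in sorted(by_day.items())]


# Assumed sample data: judge scores for calls to one function over three days.
scored_calls = [
    (date(2024, 1, 1), 0.50),
    (date(2024, 1, 1), 0.75),
    (date(2024, 1, 2), 0.75),
    (date(2024, 1, 3), 0.75),
    (date(2024, 1, 3), 1.00),
]

trend = daily_mean_scores(scored_calls)
for day, avg in trend:
    print(day.isoformat(), avg)

# A simple check that the last day averaged higher than the first.
improved = trend[-1][1] > trend[0][1]
```

A daily mean is the simplest possible summary; with few calls per day it will be noisy, so a rolling window or a minimum sample size per bucket may give a steadier picture.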