Overview

Teresa Torres built an app to automate feedback on discovery calls:

Major takeaways & concepts

Evals, Failure modes
Tools: airtable for eval tracking, jupyter notebooks for eval analysis
- Used jupyter notebooks to quickly evaluate responses because it was easy to compare. Really embraced engineering mindset. Rolled her own eval tool.
Judges
Traces
Context engineering

Automated coach takes in transcript from a customer discovery call to give you feedback, e.g. . Ran into a couple of failure modes recommendations:

Leading questions: where the question implies the answer
General questions: tell me about your morning routine ⇒ tell me about your morning routine this morning

Uses Claude Code inside of VSCode extensively — doesn’t “vibe code” in the sense that she doesn’t simply let claude write code blindly.

Teresa:

Has transitioned to using notebook for analysis, visualization, individual transcript details.

Teresa Plans to integrate automated coach into vistaly: https://www.vistaly.com/