Challenge
What had to change
The team had a working prototype that impressed in demos but failed in production: it hallucinated, lacked auditability, and ran without per-user cost limits. Clinical leadership would not approve it for ward use.
AI · Healthcare
DrRobot wanted an AI assistant that read patient context safely and shortened a real workflow — without becoming another tab clinicians had to switch into.
Approach
Before any model work, we built a golden test set drawn from real anonymised cases. Every later change had to pass that set before it could ship.
Retrieval-augmented generation grounded in the patient record, with row-level access controls. The model could only see what the user could see.
Per-user spend caps, hard rate limits, and a fallback path to a smaller model when a user's budget was hit.
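As a rough illustration of the last two points, the sketch below filters records by what the user is allowed to see before any retrieval happens, then picks a model based on the user's remaining budget. It is a minimal sketch, not the shipped implementation: every name in it (Record, UserBudget, visible_records, retrieve, choose_model, the model labels and the dollar figures) is hypothetical, the keyword ranking is a stand-in for a real retriever, and rate limiting is left out.

```python
from dataclasses import dataclass

# All names, model labels and limits below are hypothetical, for illustration only.


@dataclass(frozen=True)
class Record:
    patient_id: str
    text: str


@dataclass
class UserBudget:
    cap_usd: float           # per-user spend cap for the current period
    spent_usd: float = 0.0   # spend recorded so far in that period

    def remaining(self) -> float:
        return max(self.cap_usd - self.spent_usd, 0.0)


def visible_records(records, allowed_patient_ids):
    """Row-level filter applied before retrieval: drop rows the user cannot see."""
    return [r for r in records if r.patient_id in allowed_patient_ids]


def retrieve(records, query, k=3):
    """Toy keyword-overlap ranking standing in for a real retriever."""
    terms = set(query.lower().split())
    scored = [(len(terms & set(r.text.lower().split())), r) for r in records]
    scored = [(score, r) for score, r in scored if score > 0]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [r for _, r in scored[:k]]


def choose_model(budget, estimated_cost_usd):
    """Use the primary model while the budget allows, else the smaller fallback."""
    if budget.remaining() >= estimated_cost_usd:
        return "primary-model"
    return "small-fallback-model"


# Usage: the user may only see patient p1, so p2 never reaches the model context.
records = [
    Record("p1", "penicillin allergy noted on admission"),
    Record("p2", "penicillin course completed"),
]
context = retrieve(visible_records(records, {"p1"}), "penicillin allergy")
model = choose_model(UserBudget(cap_usd=1.0, spent_usd=0.99), estimated_cost_usd=0.05)
```

Filtering before retrieval, rather than after generation, means a record the user cannot see is never a candidate for the model's context in the first place.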
Outcomes
What shipped
Stack
They told us the prototype was not safe to ship before we asked. That conversation saved us a launch.
Let's talk
Send the details through WhatsApp and we'll route it to the right person.
Trusted by founders across healthcare, hospitality and professional services. London HQ · Bilingual EN/AR delivery · NDA-friendly