The offering measures how well and how safely AI models handle realistic medical conversations, scoring their responses against physician-created rubrics with GPT-4.1 as the grader.
