Large language models — generative artificial intelligence tools that process and generate text — are proving popular in healthcare. Because LLMs can respond to diverse prompts and process complex concepts, they show promise for augmenting medical research, patient education and clinical documentation.
That said, models trained on general data often fall short when it comes to nuanced healthcare questions. A 2024 study from Mayo Clinic found accuracy rates of less than 40% for prompts on ChatGPT, Microsoft Bing Chat and Google Bard AI compared with in-depth literature searches for…