A large, preregistered randomized study from the University of Oxford has delivered a sobering verdict: while today’s large language models (LLMs) can store and generate medical knowledge at benchmark-beating levels, they routinely fail when paired with real people seeking medical advice —...