Advanced AI fails test on grid knowledge

By Peter Behr | 12/10/2025 06:40 AM EST

The Electric Power Research Institute quizzed large language models’ ability to handle issues confronting U.S. power systems. It didn’t go well.

Signage of artificial intelligence is displayed.

OpenAI's GPT-5 scored the best on tough, open-ended engineering questions. Manaure Quintero/AFP via Getty Images

Grid operators don’t have to fear losing their control room jobs to artificial intelligence — at least not yet.

The Electric Power Research Institute released results of a survey Tuesday that quizzed the leading AI large language model apps on questions that electrical engineers, planners and other grid professionals should know.

OpenAI’s GPT-5 got the best score, followed by Google’s Gemini 2.5 Pro and Anthropic’s Claude Sonnet. Perplexity’s Sonar Pro was fourth, and xAI’s Grok 4 came in last. But they didn’t go to the head of the class.

Advertisement

While GPT-5 and Gemini managed B-minus grades on the easiest questions helped by multiple-choice prompts, and the others trailed, EPRI reported that all the AI assistants embarrassed themselves on the hardest, open-ended questions. GPT-5 scored 63 percent right. The rest were between 46 percent and 59 percent correct, exposing reliability risks, EPRI said.

GET FULL ACCESS