Google DeepMind trains AI agents in video games to build…

Google DeepMind trains AI agents in video games to build smarter robots

Nov 13·2 min read·London, United Kingdom·64 views

Originally reported by MIT Technology Review ↗ · Rewritten for clarity and brevity by Brightcast

Google DeepMind has built an AI agent called SIMA 2 that can navigate and solve problems across different 3D video game worlds — a step toward machines that can follow human instructions in complex environments and eventually control real robots.

The agent plays games like No Man's Sky and Goat Simulator 3 by watching the screen pixel by pixel and learning which keyboard and mouse inputs accomplish specific tasks. What makes SIMA 2 different from previous game-playing AI is that it doesn't chase preset victory conditions. Instead, it learns to follow instructions from humans — via text chat, voice commands, or even drawings on the screen.

Learning from failure

SIMA 2 was trained on footage of humans playing eight commercial games plus three custom virtual worlds. The agent learned to match inputs to actions, but the real breakthrough came from connecting it to Gemini, Google DeepMind's large language model. Gemini helps SIMA 2 understand what it's being asked to do, ask clarifying questions, and generate its own practice tasks.

Wait—What is Brightcast?

We're a new kind of news feed.

Regular news is designed to drain you. We're a non-profit built to restore you. Every story we publish is scored for impact, progress, and hope.

Start Your News Detox

When SIMA 2 fails at something, Gemini offers tips. The agent tries again. Repeat this enough times, and it improves through trial and error — much like a human learning to cook by attempting the same recipe multiple times.

Researchers tested SIMA 2 in environments it had never encountered. In one experiment, they used another Google AI system called Genie 3 to generate entirely new virtual worlds from scratch, then dropped SIMA 2 into them. The agent could navigate and follow instructions in these unfamiliar spaces, suggesting the skills it learned weren't locked to specific games.

The robot question

The long-term goal is clear: use these agents to control physical robots. Joe Marino, a research scientist at Google DeepMind, argues that skills SIMA 2 has learned — navigating spaces, using tools, collaborating with humans — are essential building blocks for future robot companions. Even simple actions in games involve complex sequences of steps. Lighting a lantern, for instance, might require finding the right object, positioning yourself correctly, and executing the right command.

But there are real limitations. SIMA 2 struggles with tasks requiring many steps over long periods. It only remembers recent interactions (the team deliberately cut its long-term memory to make it more responsive). And it's still far worse than humans at using a keyboard and mouse.

AI researchers are split on whether this translates to better robots. Julian Togelius at New York University notes that previous attempts to train single systems across multiple games have mostly failed — controlling games from visual input alone is genuinely difficult. But he's cautiously optimistic about the robot application, though he points out that real-world physics don't work like video game rules. Matthew Guzdial at the University of Alberta is more skeptical, noting that most games use similar keyboard-and-mouse controls, and that real-world camera images are far messier than game graphics designed for human eyes.

Google DeepMind isn't stopping here. Marino mentioned plans to create an "endless virtual training dojo" where Genie 3 generates worlds for SIMA to learn in, with Gemini providing feedback. "We've kind of just scratched the surface of what's possible," he said.

Brightcast Impact Score (BIS)

The article describes the development of a new video game-playing agent called SIMA 2 by Google DeepMind, which is a step towards more general-purpose agents and better real-world robots. The agent can navigate and solve problems in a wide range of 3D virtual worlds, and it can carry out complex tasks, figure out how to solve challenges, and improve itself through trial and error. This represents evidence of progress in the field of AI and robotics, with the potential for positive real-world applications in the future.

Hope15/40

Emotional uplift and inspirational potential

Reach11/30

Audience impact and shareability

Verification11/30

Source credibility and content accuracy

Minimal

37/100

Positive but limited scope

Start a ripple of hope

Share it and watch how far your hope travels · View analytics →

Spread hope

You

friendstheir friendsand beyond...

Wall of Hope

0/20

Be the first to share how this story made you feel

How does this make you feel?

Google DeepMind trains AI agents in video games to build smarter robots

Learning from failure

We're a new kind of news feed.

The robot question

Brightcast Impact Score (BIS)

Start a ripple of hope

Wall of Hope

More stories that restore faith in humanity

World’s first transatlantic robotic stroke surgery performed from Florida to Dundee

Fei-Fei Li wins Queen Elizabeth Prize for Engineering

Robert Therrien’s World of Everyday Wonder Emerges in a LA Retrospective

An improved way to detach cells from culture surfaces

Researchers Have Identified the Names of Five Million Victims Murdered in the Holocaust

Student Inventors Create Low-Cost Prosthetics Using 3D Printing