Skip to main content

Google DeepMind trains AI agents in video games to build smarter robots

Elena Voss
Elena Voss
·2 min read·London, United Kingdom·8 views
Share

Google DeepMind has built an AI agent called SIMA 2 that can navigate and solve problems across different 3D video game worlds — a step toward machines that can follow human instructions in complex environments and eventually control real robots.

The agent plays games like No Man's Sky and Goat Simulator 3 by watching the screen pixel by pixel and learning which keyboard and mouse inputs accomplish specific tasks. What makes SIMA 2 different from previous game-playing AI is that it doesn't chase preset victory conditions. Instead, it learns to follow instructions from humans — via text chat, voice commands, or even drawings on the screen.

Learning from failure

SIMA 2 was trained on footage of humans playing eight commercial games plus three custom virtual worlds. The agent learned to match inputs to actions, but the real breakthrough came from connecting it to Gemini, Google DeepMind's large language model. Gemini helps SIMA 2 understand what it's being asked to do, ask clarifying questions, and generate its own practice tasks.

Wait—What is Brightcast?

We're a new kind of news feed.

Regular news is designed to drain you. We're a non-profit built to restore you. Every story we publish is scored for impact, progress, and hope.

Start Your News Detox

When SIMA 2 fails at something, Gemini offers tips. The agent tries again. Repeat this enough times, and it improves through trial and error — much like a human learning to cook by attempting the same recipe multiple times.

Researchers tested SIMA 2 in environments it had never encountered. In one experiment, they used another Google AI system called Genie 3 to generate entirely new virtual worlds from scratch, then dropped SIMA 2 into them. The agent could navigate and follow instructions in these unfamiliar spaces, suggesting the skills it learned weren't locked to specific games.

The robot question

The long-term goal is clear: use these agents to control physical robots. Joe Marino, a research scientist at Google DeepMind, argues that skills SIMA 2 has learned — navigating spaces, using tools, collaborating with humans — are essential building blocks for future robot companions. Even simple actions in games involve complex sequences of steps. Lighting a lantern, for instance, might require finding the right object, positioning yourself correctly, and executing the right command.

But there are real limitations. SIMA 2 struggles with tasks requiring many steps over long periods. It only remembers recent interactions (the team deliberately cut its long-term memory to make it more responsive). And it's still far worse than humans at using a keyboard and mouse.

AI researchers are split on whether this translates to better robots. Julian Togelius at New York University notes that previous attempts to train single systems across multiple games have mostly failed — controlling games from visual input alone is genuinely difficult. But he's cautiously optimistic about the robot application, though he points out that real-world physics don't work like video game rules. Matthew Guzdial at the University of Alberta is more skeptical, noting that most games use similar keyboard-and-mouse controls, and that real-world camera images are far messier than game graphics designed for human eyes.

Google DeepMind isn't stopping here. Marino mentioned plans to create an "endless virtual training dojo" where Genie 3 generates worlds for SIMA to learn in, with Gemini providing feedback. "We've kind of just scratched the surface of what's possible," he said.

37
MinimalPositive but limited scope

Brightcast Impact Score

The article describes the development of a new video game-playing agent called SIMA 2 by Google DeepMind, which is a step towards more general-purpose agents and better real-world robots. The agent can navigate and solve problems in a wide range of 3D virtual worlds, and it can carry out complex tasks, figure out how to solve challenges, and improve itself through trial and error. This represents evidence of progress in the field of AI and robotics, with the potential for positive real-world applications in the future.

15

Hope

Moderate

11

Reach

Moderate

11

Verified

Moderate

Wall of Hope

0/50

Be the first to share how this story made you feel

How does this make you feel?

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
Share

Originally reported by MIT Technology Review · Verified by Brightcast

Get weekly positive news in your inbox

No spam. Unsubscribe anytime. Join thousands who start their week with hope.

More stories that restore faith in humanity