Upgrade & Secure Your Future with DevOps, SRE, DevSecOps, MLOps!

We spend hours on Instagram and YouTube and waste money on coffee and fast food, but won’t spend 30 minutes a day learning skills to boost our careers.
Master in DevOps, SRE, DevSecOps & MLOps!

Learn from Guru Rajesh Kumar and double your salary in just one year.

Get Started Now!

Facebook’s RIDE encourages AI agents to explore their environments

Source: venturebeat.com

A preprint paper coauthored by scientists at Facebook AI Research describes Rewarding Impact-Driven Exploration (RIDE), an intrinsic reward method that encourages AI-driven agents to take actions in an environment. The researchers say that it outperforms state-of-the-art methods on hard exploration tasks in procedurally generated worlds, a sign it might be a candidate for devices like robot vacuums that must often navigate new environments.

As the researchers explain, reinforcement learning, where the goal is to spur an agent to complete tasks via systems of rewards, learn to act in new environments through trial and error. But many environments of interest — particularly those closer to real-world problems — don’t provide a steady stream of rewards for agents to learn from, requiring many episodes before agents come across rewards.

The researchers’ proposed solution, then — RIDE — drives agents to try out actions that have a significant impact on the environment.

The team evaluated RIDE in procedurally generated environments from the open source tool MiniGrid, where the world is a partially observable grid and each tile in the grid contains at most one object of a discrete color (a wall, door, key, ball, box, or goal). Separately, they tasked it with navigating levels in VizDoom, a Doom-based AI research platform for reinforcement learning. While VizDoom is visually more complex than MiniGrid, they’re both challenging domains in the sense that the chance of randomly stumbling upon extrinsic rewards is extremely low.

The researchers report that, compared with baseline algorithms, RIDE considers certain states to be “novel” or “surprising” even after long periods of training and after seeing similar states in the past or learning to almost perfectly predict the next state in a subset of the environment. As a consequence, its intrinsic rewards don’t diminish during training, and agents manage to distinguish between actions that lead to novel or surprising states from those that do not, avoiding becoming trapped in some parts of the state space.

“RIDE has a number of desirable properties,” wrote the study’s coauthors. “It attracts agents to states where they can affect the environment, it provides a signal to agents even after training for a long time, and it is conceptually simple as well as compatible with other intrinsic or extrinsic rewards and any deep [reinforcement learning] algorithm … Furthermore, RIDE explores procedurally generated environments more efficiently than other exploration methods.”

They leave to future work improving RIDE by making use of symbolic information to measure the agent’s impact or considering longer-term effects of the agent’s actions. They also hope to investigate algorithms that can distinguish between desirable and undesirable types of impact, effectively constraining the agent to act safely and avoid distractions.

Related Posts

DeepMind open-sources Lab2D to support creation of 2D environments for AI and machine learning

Source: computing.co.uk Alphabet subsidiary DeepMind announced on Monday that it has open-sourced Lab2D, a scalable environment simulator for artificial intelligence (AI) research that facilitates researcher-led experimentation with environment Read More

Read More

A VR Film/Game with AI Characters Can Be Different Every Time You Watch or Play

Source: technologyreview.com The square-faced, three-legged alien shoves and jostles to get at the enormous plant taking over its tiny planet. But each bite just makes the forbidden Read More

Read More

Researchers detail LaND, AI that learns from autonomous vehicle disengagements

Source: venturebeat.com UC Berkeley AI researchers say they’ve created AI for autonomous vehicles driving in unseen, real-world landscapes that outperforms leading methods for delivery robots driving on Read More

Read More

Google Teases Large Scale Reinforcement Learning Infrastructurean

Source: alyticsindiamag.com The current state-of-the-art reinforcement learning techniques require many iterations over many samples from the environment to learn a target task. For instance, the game Dota Read More

Read More

Plan2Explore: Active Model-Building for Self-Supervised Visual Reinforcement Learning

Source: bair.berkeley.edu To operate successfully in unstructured open-world environments, autonomous intelligent agents need to solve many different tasks and learn new tasks quickly. Reinforcement learning has enabled Read More

Read More

Is AI an Existential Threat?

Source: unite.ai When discussing Artificial Intelligence (AI), a common debate is whether AI is an existential threat. The answer requires understanding the technology behind Machine Learning (ML), and recognizing Read More

Read More
Subscribe
Notify of
guest
0 Comments
Oldest
Newest Most Voted
Inline Feedbacks
View all comments
0
Would love your thoughts, please comment.x
()
x