Upgrade & Secure Your Future with DevOps, SRE, DevSecOps, MLOps!

We spend hours on Instagram and YouTube and waste money on coffee and fast food, but won’t spend 30 minutes a day learning skills to boost our careers.
Master in DevOps, SRE, DevSecOps & MLOps!

Learn from Guru Rajesh Kumar and double your salary in just one year.

Get Started Now!

Using Rotation, Translation, and Cropping to Boost Generalization in Deep Reinforcement Learning Models

Source: syncedreview.com

“Generalization” is an AI buzzword these days for good reason: most scientists would love to see the models they’re training in simulations and video game environments evolve and expand to take on meaningful real-world challenges — for example in safety, conservation, medicine, etc.

One concerned research area is deep reinforcement learning (DRL), which implements deep learning architectures with reinforcement learning algorithms to enable AI agents to learn the best actions possible to attain their goals in virtual environments. DRL has been widely applied in games and robotics.

Such DRL agents have an impressive track record on Starcraft II and Dota-2. But because they were trained in fixed environments, studies suggest DRL agents can fail to generalize to even slight variations of their training environments.

In a new paper, researchers from the New York University and Modl.ai, a company applying machine learning to game developing, suggest that simple spacial processing methods such as rotation, translation and cropping could help increase model generality.

The ability to learn directly from pixels as output by various games was one of the reasons for DRL’s surge in popularity over the last few years. But many researchers have begun to question what the models actually learn from those pixels. One way to investigate what models trained with DRL learn from pixel data is by studying their generalization capacity.

Starting from the hypothesis that DRL cannot easily learn generalizable policies on games using a static third-person perspective, the researchers discovered that the lack of generalization is partly due to the input representations. This means that while DRL models for games with static third-person representations do not tend to learn generalizable policies, they have a better chance of doing so if the game is “seen” from a more agent-centric perspective.

Because an agent’s immediate surroundings can greatly affect its ability to learn in DRL scenarios, the team proposed providing agents with a first-person view. They applied three basic image processing techniques — rotating, translating, and cropping — to the observable areas around agents.

Rotation keeps the agents always facing forward, so any action they take always happens from the same perspective. Translation then orients the observations around the agent so it is always at the center of its view. Finally, cropping shrinks observations down to just local information around the agent.

In their experiments the researchers observed that these three simple transformations enable better learning for agents, and the polices that are learned generalize much better to new environments.

The technique has so far only been tested on two game variants — a GVGAI port for the dungeon system in The Legend of Zelda and a simplified version of the game, Simple Zelda. For future work, the researchers intend to continue testing the generalization effects on different games, and improve their understanding of the effects of each transformation.

Related Posts

DeepMind open-sources Lab2D to support creation of 2D environments for AI and machine learning

Source: computing.co.uk Alphabet subsidiary DeepMind announced on Monday that it has open-sourced Lab2D, a scalable environment simulator for artificial intelligence (AI) research that facilitates researcher-led experimentation with environment Read More

Read More

A VR Film/Game with AI Characters Can Be Different Every Time You Watch or Play

Source: technologyreview.com The square-faced, three-legged alien shoves and jostles to get at the enormous plant taking over its tiny planet. But each bite just makes the forbidden Read More

Read More

Researchers detail LaND, AI that learns from autonomous vehicle disengagements

Source: venturebeat.com UC Berkeley AI researchers say they’ve created AI for autonomous vehicles driving in unseen, real-world landscapes that outperforms leading methods for delivery robots driving on Read More

Read More

Google Teases Large Scale Reinforcement Learning Infrastructurean

Source: alyticsindiamag.com The current state-of-the-art reinforcement learning techniques require many iterations over many samples from the environment to learn a target task. For instance, the game Dota Read More

Read More

Plan2Explore: Active Model-Building for Self-Supervised Visual Reinforcement Learning

Source: bair.berkeley.edu To operate successfully in unstructured open-world environments, autonomous intelligent agents need to solve many different tasks and learn new tasks quickly. Reinforcement learning has enabled Read More

Read More

Is AI an Existential Threat?

Source: unite.ai When discussing Artificial Intelligence (AI), a common debate is whether AI is an existential threat. The answer requires understanding the technology behind Machine Learning (ML), and recognizing Read More

Read More
Subscribe
Notify of
guest
0 Comments
Oldest
Newest Most Voted
Inline Feedbacks
View all comments
0
Would love your thoughts, please comment.x
()
x