. 24/7 Space News .
ROBO SPACE
Army research leads to more effective training model for robots
by Staff Writers
Adelphi MD (SPX) Dec 31, 2020

illustration only

Multi-domain operations, the Army's future operating concept, requires autonomous agents with learning components to operate alongside the warfighter. New Army research reduces the unpredictability of current training reinforcement learning policies so that they are more practically applicable to physical systems, especially ground robots.

These learning components will permit autonomous agents to reason and adapt to changing battlefield conditions, said Army researcher Dr. Alec Koppel from the U.S. Army Combat Capabilities Development Command, now known as DEVCOM, Army Research Laboratory.

The underlying adaptation and re-planning mechanism consists of reinforcement learning-based policies. Making these policies efficiently obtainable is critical to making the MDO operating concept a reality, he said.

According to Koppel, policy gradient methods in reinforcement learning are the foundation for scalable algorithms for continuous spaces, but existing techniques cannot incorporate broader decision-making goals such as risk sensitivity, safety constraints, exploration and divergence to a prior.

Designing autonomous behaviors when the relationship between dynamics and goals are complex may be addressed with reinforcement learning, which has gained attention recently for solving previously intractable tasks such as strategy games like go, chess and videogames such as Atari and Starcraft II, Koppel said.

Prevailing practice, unfortunately, demands astronomical sample complexity, such as thousands of years of simulated gameplay, he said. This sample complexity renders many common training mechanisms inapplicable to data-starved settings required by MDO context for the Next-Generation Combat Vehicle, or NGCV.

"To facilitate reinforcement learning for MDO and NGCV, training mechanisms must improve sample efficiency and reliability in continuous spaces," Koppel said. "Through the generalization of existing policy search schemes to general utilities, we take a step towards breaking existing sample efficiency barriers of prevailing practice in reinforcement learning."

Koppel and his research team developed new policy search schemes for general utilities, whose sample complexity is also established. They observed that the resulting policy search schemes reduce the volatility of reward accumulation, yield efficient exploration of an unknown domains and a mechanism for incorporating prior experience.

"This research contributes an augmentation of the classical Policy Gradient Theorem in reinforcement learning," Koppel said. "It presents new policy search schemes for general utilities, whose sample complexity is also established. These innovations are impactful to the U.S. Army through their enabling of reinforcement learning objectives beyond the standard cumulative return, such as risk sensitivity, safety constraints, exploration and divergence to a prior."

Notably, in the context of ground robots, he said, data is costly to acquire.

"Reducing the volatility of reward accumulation, ensuring one explores an unknown domain in an efficient manner, or incorporating prior experience, all contribute towards breaking existing sample efficiency barriers of prevailing practice in reinforcement learning by alleviating the amount of random sampling one requires in order to complete policy optimization," Koppel said.

The future of this research is very bright, and Koppel has dedicated his efforts towards making his findings applicable for innovative technology for Soldiers on the battlefield.

"I am optimistic that reinforcement-learning equipped autonomous robots will be able to assist the warfighter in exploration, reconnaissance and risk assessment on the future battlefield," Koppel said. "That this vision is made a reality is essential to what motivates which research problems I dedicate my efforts."

The next step for this research is to incorporate the broader decision-making goals enabled by general utilities in reinforcement learning into multi-agent settings and investigate how interactive settings between reinforcement learning agents give rise to synergistic and antagonistic reasoning among teams.

According to Koppel, the technology that results from this research will be capable of reasoning under uncertainty in team scenarios.


Related Links
US Army Research Laboratory
All about the robots on Earth and beyond!


Thanks for being there;
We need your help. The SpaceDaily news network continues to grow but revenues have never been harder to maintain.

With the rise of Ad Blockers, and Facebook - our traditional revenue sources via quality network advertising continues to decline. And unlike so many other news sites, we don't have a paywall - with those annoying usernames and passwords.

Our news coverage takes time and effort to publish 365 days a year.

If you find our news sites informative and useful then please consider becoming a regular supporter or for now make a one off contribution.
SpaceDaily Monthly Supporter
$5+ Billed Monthly


paypal only
SpaceDaily Contributor
$5 Billed Once


credit card or paypal


ROBO SPACE
U.S. Army, Clemson University partner on autonomous vehicle project
Washington DC (UPI) Dec 18, 2020
The U.S. Army and Clemson University announced a partnership to study conversion of Bradley tanks and armored personnel carriers to autonomous use. The study for the conversion of existing Army equipment to self-driving vehicles is enabled by an $18 million Defense Department grant in the school's Virtual Prototyping of Ground Systems, and a partnership between the U.S. Army Ground Vehicle Systems Center and the Clemson University International Center for Automotive Research, the South Carolina ... read more

Comment using your Disqus, Facebook, Google or Twitter login.



Share this article via these popular social media networks
del.icio.usdel.icio.us DiggDigg RedditReddit GoogleGoogle

ROBO SPACE
Rice seeds carried to the moon and back sprout

Marsquakes, water on other planets, asteroid hunting highlight 2020 in space

China to launch core module of space station in first half of 2021

US may buy seat on Russia's Soyuz for astronaut's flight to ISS in Spring 2021,

ROBO SPACE
SDA awards contract to SpaceX

Launch of Long March 4C closes out China 2020 space plan

Russia plans more Proton-M launches in 2021

mu Space to push Thai space industry, planning to build its first spaceship in 2021

ROBO SPACE
NASA video shows Perseverance rover's planned 'terror' landing on Mars

Fluvial Mapping of Mars

A Martian Roundtrip: NASA's Perseverance Rover Sample Tubes

How to get people from Earth to Mars and safely back again

ROBO SPACE
China's space achievements out of this world

China's Chang'e-5 orbiter embarks on new mission to gravitationally stable spot at L1

China plans to launch four manned spacecraft in next two years

Mission accomplished, now on to the next: China Daily editorial

ROBO SPACE
Record Year for FAA Commercial Space Activity

Voyager Space Holdings to buy all of Nanoracks

Lockheed Martin To Acquire Aerojet Rocketdyne

Russia lifts UK telecom satellites into orbit

ROBO SPACE
Scientists and philosopher team up, propose a new way to categorize minerals

New radiation vest technology protects astronauts, doctors

Order and disorder in crystalline ice explained

Spontaneous robot dances highlight a new kind of order in active matter

ROBO SPACE
Discovery boosts theory that life on Earth arose from RNA-DNA mix

Astronomers detect possible radio emission from exoplanet

Key building block for organic molecules discovered in meteorites

Device mimics life's first steps in outer space

ROBO SPACE
Dark Storm on Neptune reverses direction, possibly shedding a fragment

The 'Great' Conjunction of Jupiter and Saturn

NASA's Juno Spacecraft Updates Quarter-Century Jupiter Mystery

Swedish space instrument participates in the search for life around Jupiter









The content herein, unless otherwise known to be public domain, are Copyright 1995-2024 - Space Media Network. All websites are published in Australia and are solely subject to Australian law and governed by Fair Use principals for news reporting and research purposes. AFP, UPI and IANS news wire stories are copyright Agence France-Presse, United Press International and Indo-Asia News Service. ESA news reports are copyright European Space Agency. All NASA sourced material is public domain. Additional copyrights may apply in whole or part to other bona fide parties. All articles labeled "by Staff Writers" include reports supplied to Space Media Network by industry news wires, PR agencies, corporate press officers and the like. Such articles are individually curated and edited by Space Media Network staff on the basis of the report's information value to our industry and professional readership. Advertising does not imply endorsement, agreement or approval of any opinions, statements or information provided by Space Media Network on any Web page published or hosted by Space Media Network. General Data Protection Regulation (GDPR) Statement Our advertisers use various cookies and the like to deliver the best ad banner available at one time. All network advertising suppliers have GDPR policies (Legitimate Interest) that conform with EU regulations for data collection. By using our websites you consent to cookie based advertising. If you do not agree with this then you must stop using the websites from May 25, 2018. Privacy Statement. Additional information can be found here at About Us.