24/7 Space News
ROBO SPACE
Pitt researcher uses video games to unlock new levels of AI
by Staff Writers
Pittsburgh PA (SPX) Nov 06, 2018


Expectations for artificial intelligence are very real and very high. An analysis in Forbes projects that revenues from A.I. will skyrocket from $1.62 billion in 2018 to $31.2 billion in 2025. The report also included a survey revealing that 84 percent of enterprises believe investing in A.I. will lead to competitive advantages.

"It is exciting to see the tremendous successes and progress made in recent years," says Daniel Jiang, assistant professor of industrial engineering at the University of Pittsburgh Swanson School of Engineering. "To continue this trend, we are looking to develop more sophisticated methods for algorithms to learn strategies for optimal decision making."

Dr. Jiang designs algorithms that learn decision strategies in complex and uncertain environments. When tested in simulated environments, these algorithms can learn from their mistakes while discovering and reinforcing strategies for success. To perfect this process, Dr. Jiang and many researchers in his field require simulations that mirror the real world.

"As industrial engineers, we typically work on problems with an operational focus. For example, transportation, logistics and supply chains, energy systems and health care are several important areas," he says. "All of those problems are high-stakes operations with real-world consequences. They don't make the best environments for trying out experimental technologies, especially when many of our algorithms can be thought of as clever ways of repeated 'trial and error' over all possible actions."

One strategy for preparing advanced A.I. to take on real-world scenarios and complications is to use historical data. For instance, algorithms could run through decades' worth of data to find which decisions were effective and which led to less-than-optimal results. However, researchers have found it difficult to test algorithms that are designed to learn adaptive behaviors using only data from the past.

Dr. Jiang explains, "Historical data can be a problem because people's actions fix the consequences and don't present alternative possibilities. In other words, it is difficult for an algorithm to ask the question 'how would things be different if I chose door B instead of door A?' In historical data, all we can see are the consequences of door A."

Video games, as an alternative, offer rich testing environments full of complex decision making without the dangers of putting an immature A.I. fully in charge. Unlike the real world, they provide a safe way for an algorithm to learn from its mistakes.

"Video game designers aren't building games with the goal to test models or simulations," Dr. Jiang says. "They're often designing games with a two-fold mission: to create environments that mimic the real world and to challenge players to make difficult decisions. These goals happen to align with what we are looking for as well. Also, games are much faster. In a few hours of real time, we can evaluate the results of hundreds of thousands of gameplay decisions."

To test his algorithm, Dr. Jiang used a genre of video games called Multiplayer Online Battle Arena, or MOBA. Games such as League of Legends and Heroes of the Storm are popular MOBAs in which players control one of several "hero" characters and try to destroy opponents' bases while protecting their own.

A successful algorithm for training a gameplay A.I. must overcome several challenges, such as real-time decision making and long decision horizons - a mathematical term for when the consequences of some decisions are not known until much later.
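One common way to reason about delayed consequences is to credit a later outcome back to the earlier decisions that led to it. The sketch below is only an illustration of that idea, not the method from Dr. Jiang's paper: the function name `returns_to_go` and the discount factor `gamma` are assumptions introduced here, and the article does not say how the researchers handled credit assignment.

```python
def returns_to_go(rewards, gamma=0.99):
    """For each step, sum the rewards from that step onward,
    shrinking each later reward by `gamma` per step of delay.

    `gamma` is an illustrative assumption, not a value from the article.
    """
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk backward so each step can reuse the total computed for the next one.
    for t in range(len(rewards) - 1, -1, -1):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


# A three-step episode whose only reward (a win) arrives at the very end.
print(returns_to_go([0.0, 0.0, 1.0], gamma=0.5))
```

With a long horizon, the early decisions receive only a faint share of the final outcome, which is one reason such problems are hard to learn from.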

"We designed the algorithm to evaluate 41 pieces of information and then output one of 22 different actions, including movement, attacks and special moves," says Dr. Jiang. "We compared different training methods against one another. The most successful player used a method called Monte Carlo tree search to generate data, which is then fed into a neural network."

Monte Carlo tree search is a strategy for decision making in which the player simulates many playthroughs of a simulation or a video game, choosing moves at random at first. The algorithm then analyzes the results to give more weight to the more successful actions. Over many iterations of the game, the more successful actions persist, and the player becomes better at winning the game.
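The loop described above can be sketched on a toy game. The example below is a minimal illustration, assuming a simple game of Nim (players alternately take 1 to 3 stones, and whoever takes the last stone wins) in place of a MOBA; it is not the algorithm from the paper. The names `Node`, `ucb1`, `rollout` and `mcts_visit_counts`, the UCB1 selection rule and the exploration constant `c` are all assumptions made for this sketch.

```python
import math
import random


class Node:
    """One state in the search tree for the toy Nim game."""

    def __init__(self, stones, parent=None, action=None):
        self.stones = stones      # stones left in the pile
        self.parent = parent
        self.action = action      # move that led from parent to this node
        self.children = []
        self.untried = [a for a in (1, 2, 3) if a <= stones]
        self.visits = 0
        self.wins = 0.0           # wins for the player who made `action`


def ucb1(child, parent_visits, c):
    """Average result so far plus a bonus for rarely tried moves."""
    bonus = c * math.sqrt(math.log(parent_visits) / child.visits)
    return child.wins / child.visits + bonus


def rollout(stones, rng):
    """Play random moves to the end; True if the player to move wins."""
    mover = 0
    while stones > 0:
        stones -= rng.randint(1, min(3, stones))
        if stones == 0:
            return mover == 0
        mover ^= 1
    return False  # pile already empty: the player to move has lost


def mcts_visit_counts(root_stones, iterations=2000, c=1.4, seed=None):
    """Run the search and return {first move: number of times explored}."""
    rng = random.Random(seed)
    root = Node(root_stones)
    for _ in range(iterations):
        node = root
        # 1. Selection: follow the most promising known moves.
        while not node.untried and node.children:
            parent = node
            node = max(parent.children,
                       key=lambda ch: ucb1(ch, parent.visits, c))
        # 2. Expansion: try one move that has not been tried yet.
        if node.untried:
            action = node.untried.pop(rng.randrange(len(node.untried)))
            child = Node(node.stones - action, parent=node, action=action)
            node.children.append(child)
            node = child
        # 3. Simulation: finish the game with random moves.
        reward = 0.0 if rollout(node.stones, rng) else 1.0
        # 4. Backpropagation: update statistics, flipping sides each level.
        while node is not None:
            node.visits += 1
            node.wins += reward
            reward = 1.0 - reward
            node = node.parent
    return {ch.action: ch.visits for ch in root.children}


visits = mcts_visit_counts(10, iterations=2000, seed=0)
print(max(visits, key=visits.get), visits)
```

The move explored most often is taken as the recommendation. In the setup the article describes, decisions produced this way serve as training data for a neural network; that step is not shown here.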

"Our research also gave some theoretical results to show that Monte Carlo tree search is an effective strategy for training an agent to succeed at making difficult decisions in real-time, even when operating in an uncertain world," Dr. Jiang explains.

Dr. Jiang published his research in a paper co-authored with Emmanuel Ekwedike and Han Liu and presented the results at the 2018 International Conference on Machine Learning in Stockholm, Sweden, this past summer.

At the University of Pittsburgh, he continues to work in the area of sequential decision making with Ph.D. students Yijia Wang and Ibrahim El-Shar. The team focuses on problems related to ride-sharing, energy markets, and public health. As industries prepare to put A.I. in charge of critical responsibilities, Dr. Jiang ensures the underlying algorithms stay at the top of their game.















