. 24/7 Space News .
INTERNET SPACE
A new model of vision
by Staff Writers
Boston MA (SPX) Mar 05, 2020

MIT cognitive scientists have developed a computer model of face recognition that performs a series of computations that reverse the steps that a computer graphics program would use to generate a 2D representation of a face. MIT cognitive scientists have developed a computer model of face recognition that performs a series of computations that reverse the steps that a computer graphics program would use to generate a 2D representation of a face.

When we open our eyes, we immediately see our surroundings in great detail. How the brain is able to form these richly detailed representations of the world so quickly is one of the biggest unsolved puzzles in the study of vision.

Scientists who study the brain have tried to replicate this phenomenon using computer models of vision, but so far, leading models only perform much simpler tasks such as picking out an object or a face against a cluttered background. Now, a team led by MIT cognitive scientists has produced a computer model that captures the human visual system's ability to quickly generate a detailed scene description from an image, and offers some insight into how the brain achieves this.

"What we were trying to do in this work is to explain how perception can be so much richer than just attaching semantic labels on parts of an image, and to explore the question of how do we see all of the physical world," says Josh Tenenbaum, a professor of computational cognitive science and a member of MIT's Computer Science and Artificial Intelligence Laboratory (CSAIL) and the Center for Brains, Minds, and Machines (CBMM).

The new model posits that when the brain receives visual input, it quickly performs a series of computations that reverse the steps that a computer graphics program would use to generate a 2D representation of a face or other object. This type of model, known as efficient inverse graphics (EIG), also correlates well with electrical recordings from face-selective regions in the brains of nonhuman primates, suggesting that the primate visual system may be organized in much the same way as the computer model, the researchers say.

Ilker Yildirim, a former MIT postdoc who is now an assistant professor of psychology at Yale University, is the lead author of the paper, which appears in Science Advances. Tenenbaum and Winrich Freiwald, a professor of neurosciences and behavior at Rockefeller University, are the senior authors of the study. Mario Belledonne, a graduate student at Yale, is also an author.

Inverse graphics
Decades of research on the brain's visual system has studied, in great detail, how light input onto the retina is transformed into cohesive scenes. This understanding has helped artificial intelligence researchers develop computer models that can replicate aspects of this system, such as recognizing faces or other objects.

"Vision is the functional aspect of the brain that we understand the best, in humans and other animals," Tenenbaum says. "And computer vision is one of the most successful areas of AI at this point. We take for granted that machines can now look at pictures and recognize faces very well, and detect other kinds of objects."

However, even these sophisticated artificial intelligence systems don't come close to what the human visual system can do, Yildirim says.

"Our brains don't just detect that there's an object over there, or recognize and put a label on something," he says. "We see all of the shapes, the geometry, the surfaces, the textures. We see a very rich world."

More than a century ago, the physician, physicist, and philosopher Hermann von Helmholtz theorized that the brain creates these rich representations by reversing the process of image formation. He hypothesized that the visual system includes an image generator that would be used, for example, to produce the faces that we see during dreams. Running this generator in reverse would allow the brain to work backward from the image and infer what kind of face or other object would produce that image, the researchers say.

However, the question remained: How could the brain perform this process, known as inverse graphics, so quickly? Computer scientists have tried to create algorithms that could perform this feat, but the best previous systems require many cycles of iterative processing, taking much longer than the 100 to 200 milliseconds the brain requires to create a detailed visual representation of what you're seeing. Neuroscientists believe perception in the brain can proceed so quickly because it is implemented in a mostly feedforward pass through several hierarchically organized layers of neural processing.

The MIT-led team set out to build a special kind of deep neural network model to show how a neural hierarchy can quickly infer the underlying features of a scene - in this case, a specific face. In contrast to the standard deep neural networks used in computer vision, which are trained from labeled data indicating the class of an object in the image, the researchers' network is trained from a model that reflects the brain's internal representations of what scenes with faces can look like.

Their model thus learns to reverse the steps performed by a computer graphics program for generating faces. These graphics programs begin with a three-dimensional representation of an individual face and then convert it into a two-dimensional image, as seen from a particular viewpoint. These images can be placed on an arbitrary background image. The researchers theorize that the brain's visual system may do something similar when you dream or conjure a mental image of someone's face.

The researchers trained their deep neural network to perform these steps in reverse - that is, it begins with the 2D image and then adds features such as texture, curvature, and lighting, to create what the researchers call a "2.5D" representation. These 2.5D images specify the shape and color of the face from a particular viewpoint. Those are then converted into 3D representations, which don't depend on the viewpoint.

"The model gives a systems-level account of the processing of faces in the brain, allowing it to see an image and ultimately arrive at a 3D object, which includes representations of shape and texture, through this important intermediate stage of a 2.5D image," Yildirim says.

Model performance
The researchers found that their model is consistent with data obtained by studying certain regions in the brains of macaque monkeys. In a study published in 2010, Freiwald and Doris Tsao of Caltech recorded the activity of neurons in those regions and analyzed how they responded to 25 different faces, seen from seven different viewpoints. That study revealed three stages of higher-level face processing, which the MIT team now hypothesizes correspond to three stages of their inverse graphics model: roughly, a 2.5D viewpoint-dependent stage; a stage that bridges from 2.5 to 3D; and a 3D, viewpoint-invariant stage of face representation.

"What we show is that both the quantitative and qualitative response properties of those three levels of the brain seem to fit remarkably well with the top three levels of the network that we've built," Tenenbaum says.

The researchers also compared the model's performance to that of humans in a task that involves recognizing faces from different viewpoints. This task becomes harder when researchers alter the faces by removing the face's texture while preserving its shape, or distorting the shape while preserving relative texture. The new model's performance was much more similar to that of humans than computer models used in state-of-the-art face-recognition software, additional evidence that this model may be closer to mimicking what happens in the human visual system.

The researchers now plan to continue testing the modeling approach on additional images, including objects that aren't faces, to investigate whether inverse graphics might also explain how the brain perceives other kinds of scenes. In addition, they believe that adapting this approach to computer vision could lead to better-performing AI systems.

"If we can show evidence that these models might correspond to how the brain works, this work could lead computer vision researchers to take more seriously and invest more engineering resources in this inverse graphics approach to perception," Tenenbaum says. "The brain is still the gold standard for any kind of machine that sees the world richly and quickly."


Related Links
Massachusetts Institute Of Technology
Satellite-based Internet technologies


Thanks for being there;
We need your help. The SpaceDaily news network continues to grow but revenues have never been harder to maintain.

With the rise of Ad Blockers, and Facebook - our traditional revenue sources via quality network advertising continues to decline. And unlike so many other news sites, we don't have a paywall - with those annoying usernames and passwords.

Our news coverage takes time and effort to publish 365 days a year.

If you find our news sites informative and useful then please consider becoming a regular supporter or for now make a one off contribution.
SpaceDaily Monthly Supporter
$5+ Billed Monthly


paypal only
SpaceDaily Contributor
$5 Billed Once


credit card or paypal


INTERNET SPACE
Apple agrees to $500 mn deal in iPhone-slowing suit
San Francisco (AFP) March 2, 2020
Apple has agreed to pay up to $500 million to settle a class-action lawsuit over claims it covertly slowed older iPhones to get users to upgrade. A federal judge in California presiding over a group of lawsuits will be asked to approve the proposed settlement at a hearing in early April, according to a court filing on Friday. Apple did not immediately respond to a request for comment. The litigation centers on stealthy mobile operating software changes in the name of avoiding "unintended pow ... read more

Comment using your Disqus, Facebook, Google or Twitter login.



Share this article via these popular social media networks
del.icio.usdel.icio.us DiggDigg RedditReddit GoogleGoogle

INTERNET SPACE
No going back: Bali's Chinese tourists fear virus-hit homeland

Insects, seaweed and lab-grown meat could be the foods of the future

Katherine Johnson, NASA mathematician, dies at 101

US-China tensions colour race to head global patent agency

INTERNET SPACE
Northrop Grumman completes key test for Orion Launch Abort System Attitude Control Motor

AFRL, Masten Space Systems, NASA, collaborate on successful testing of methane engine

Simple, fuel-efficient rocket engine could enable cheaper, lighter spacecraft

SpaceX announces partnership to send four tourists into deep orbit

INTERNET SPACE
Seismic activity on Mars resembles that found in the Swabian Jura

Ancient meteorite site on Earth could reveal new clues about Mars' past

The seismicity of Mars

Magnetic field at Martian surface ten times stronger than expected

INTERNET SPACE
China's Yuanwang-5 sails to Pacific Ocean for space monitoring mission

Construction of China's space station begins with start of LM-5B launch campaign

China Prepares to Launch Unknown Satellite Aboard Long March 7A Rocket

China's Long March-5B carrier rocket arrives at launch site

INTERNET SPACE
Kleos Space secures 3M Euro loan agreement with Dubai family office

Europlanet launches 10M euro Research Infrastructure to support planetary science

Boeing buying Russian components for Starliner

NSW Government establishes a home for space industry initiatives

INTERNET SPACE
Hope for a new permanent magnet that's cheap and sustainable

Cloud data speeds set to soar with aid of laser mini-magnets

Creating custom light using 2D materials

Raytheon awarded $17 million for dual band radar spares for USS Ford

INTERNET SPACE
Salmon parasite is world's first non-oxygen breathing animal

Sub-Neptune sized planet validated with the habitable-zone planet finder

Planet on edge of destruction in 18-hour year frenzy

LOFAR pioneers new way to study exoplanet environments

INTERNET SPACE
Ultraviolet instrument delivered for ESA's Jupiter mission

One Step Closer to the Edge of the Solar System

TRIDENT Mission Concept Selected by NASA's Discovery Program

Findings from Juno Update Jupiter Water Mystery









The content herein, unless otherwise known to be public domain, are Copyright 1995-2024 - Space Media Network. All websites are published in Australia and are solely subject to Australian law and governed by Fair Use principals for news reporting and research purposes. AFP, UPI and IANS news wire stories are copyright Agence France-Presse, United Press International and Indo-Asia News Service. ESA news reports are copyright European Space Agency. All NASA sourced material is public domain. Additional copyrights may apply in whole or part to other bona fide parties. All articles labeled "by Staff Writers" include reports supplied to Space Media Network by industry news wires, PR agencies, corporate press officers and the like. Such articles are individually curated and edited by Space Media Network staff on the basis of the report's information value to our industry and professional readership. Advertising does not imply endorsement, agreement or approval of any opinions, statements or information provided by Space Media Network on any Web page published or hosted by Space Media Network. General Data Protection Regulation (GDPR) Statement Our advertisers use various cookies and the like to deliver the best ad banner available at one time. All network advertising suppliers have GDPR policies (Legitimate Interest) that conform with EU regulations for data collection. By using our websites you consent to cookie based advertising. If you do not agree with this then you must stop using the websites from May 25, 2018. Privacy Statement. Additional information can be found here at About Us.