. 24/7 Space News .
ROBO SPACE
Facebook researchers use maths for better translations
By Laurent BARTHELEMY
Paris (AFP) Oct 13, 2019

Designers of machine translation tools still mostly rely on dictionaries to make a foreign language understandable. But now there is a new way: numbers.

Facebook researchers say rendering words into figures and exploiting mathematical similarities between languages is a promising avenue -- even if a universal communicator a la Star Trek remains a distant dream.

Powerful automatic translation is a big priority for internet giants. Allowing as many people as possible worldwide to communicate is not just an altruistic goal, but also good business.

Facebook, Google and Microsoft as well as Russia's Yandex, China's Baidu and others are constantly seeking to improve their translation tools.

Facebook has artificial intelligence experts on the job at one of its research labs in Paris.

Up to 200 languages are currently used on Facebook, said Antoine Bordes, European co-director of fundamental AI research for the social network.

Automatic translation is currently based on having large databases of identical texts in both languages to work from. But for many language pairs there just aren't enough such parallel texts.

That's why researchers have been looking for another method, like the system developed by Facebook which creates a mathematical representation for words.

Each word becomes a "vector" in a space of several hundred dimensions. Words that have close associations in the spoken language also find themselves close to each other in this vector space.

- From Basque to Amazonian? -

"For example, if you take the words 'cat' and 'dog', semantically, they are words that describe a similar thing, so they will be extremely close together physically" in the vector space, said Guillaume Lample, one of the system's designers.

"If you take words like Madrid, London, Paris, which are European capital cities, it's the same idea."

These language maps can then be linked to one another using algorithms -- at first roughly, but eventually becoming more refined, until entire phrases can be matched without too many errors.

Lample said results are already promising.

For the language pair of English-Romanian, Facebook's current machine translation system is "equal or maybe a bit worse" than the word vector system, said Lample.

But for the rarer language pair of English-Urdu, where Facebook's traditional system doesn't have many bilingual texts to reference, the word vector system is already superior, he said.

But could the method allow translation from, say, Basque into the language of an Amazonian tribe?

In theory, yes, said Lample, but in practice a large body of written texts are needed to map the language, something lacking in Amazonian tribal languages.

"If you have just tens of thousands of phrases, it won't work. You need several hundreds of thousands," he said.

- 'Holy Grail' -

Experts at France's CNRS national scientific centre said the approach Lample has taken for Facebook could produce useful results, even if it doesn't result in perfect translations.

Thierry Poibeau of CNRS's Lattice laboratory, which also does research into machine translation, called the word vector approach "a conceptual revolution".

He said "translating without parallel data" -- dictionaries or versions of the same documents in both languages -- "is something of the Holy Grail" of machine translation.

"But the question is what level of performance can be expected" from the word vector method, said Poibeau.

The method "can give an idea of the original text" but the capability for a good translation every time remains unproven.

Francois Yvon, a researcher at CNRS's Computer Science Laboratory for Mechanics and Engineering Sciences, said "the linking of languages is much more difficult" when they are far removed from one another.

"The manner of denoting concepts in Chinese is completely different from French," he added.

However even imperfect translations can be useful, said Yvon, and could prove sufficient to track hate speech, a major priority for Facebook.

lby/rl/jh/kaf

Facebook

MICROSOFT

YANDEX

GOOGLE

BAIDU


Related Links
All about the robots on Earth and beyond!


Thanks for being there;
We need your help. The SpaceDaily news network continues to grow but revenues have never been harder to maintain.

With the rise of Ad Blockers, and Facebook - our traditional revenue sources via quality network advertising continues to decline. And unlike so many other news sites, we don't have a paywall - with those annoying usernames and passwords.

Our news coverage takes time and effort to publish 365 days a year.

If you find our news sites informative and useful then please consider becoming a regular supporter or for now make a one off contribution.
SpaceDaily Monthly Supporter
$5+ Billed Monthly


paypal only
SpaceDaily Contributor
$5 Billed Once


credit card or paypal


ROBO SPACE
Controlling robots across oceans and space
Paris (ESA) Oct 04, 2019
This Autumn is seeing a number of experiments controlling robots from afar, with ESA astronaut Luca Parmitano directing a robot in The Netherlands and engineers in Germany controlling a rover in Canada. Imagine looking down at the Moon from the Gateway as you prepare to land near a lunar base to run experiments, but you know the base needs maintenance work on the life-support system that will take days. It would be better to maintain the base from orbit so the astronauts can get straight to work o ... read more

Comment using your Disqus, Facebook, Google or Twitter login.



Share this article via these popular social media networks
del.icio.usdel.icio.us DiggDigg RedditReddit GoogleGoogle

ROBO SPACE
Astronauts will spend much of October outside the space station

The first humans in space

Deep space exploration isn't a far-fetched possibility

NASA astronaut Nick Hague, crewmates return safely from ISS

ROBO SPACE
Jet taking off from Florida will launch NASA weather satellite

Virgin Orbit selects RAF pilot as it plans satellite launch program

Space Launch System mock up arrives at Kennedy for testing

Artemis Generation takes on NASA Student Launch: 64 teams to compete

ROBO SPACE
Curiosity findings suggest Mars once featured dozens of shallow briny ponds

NASA's Mars 2020 rover tests descent-stage separation

InSight 'hears' peculiar sounds on Mars

A fresh attempt for the first 'Mole' on Mars

ROBO SPACE
China's KZ-1A rocket launches two satellites

China's newly launched communication satellite suffers abnormality

China launches first private rocket capable of carrying satellites

Chinese scientists say goodbye to Tiangong-2

ROBO SPACE
Talking space with the next generation in Europe

Playmobil go above and beyond with ESA's Luca Parmitano

NewSpace will eliminate sun-synchronous orbits

Australian Government commits to join NASA in Lunar exploration and beyond

ROBO SPACE
German chemical industry sketches costly carbon-neutral path

SwRI, international team use deep learning to create virtual 'super instrument'

A filament fit for space - silk is proven to thrive in outer space temperatures

Astroscale and Southampton jointly advance business case for active debris removal services

ROBO SPACE
Were hot, humid summers the key to life's origins?

A planet that should not exist

Many gas giant exoplanets waiting to be discovered

Giant exoplanet around tiny star challenges understanding of how planets form

ROBO SPACE
NASA's Juno prepares to jump Jupiter's shadow

Huge Volcano on Jupiter's Moon Io Erupts on Regular Schedule

Stony-iron meteoroid caused August impact flash at Jupiter

Storms on Jupiter are disturbing the planet's colorful belts









The content herein, unless otherwise known to be public domain, are Copyright 1995-2024 - Space Media Network. All websites are published in Australia and are solely subject to Australian law and governed by Fair Use principals for news reporting and research purposes. AFP, UPI and IANS news wire stories are copyright Agence France-Presse, United Press International and Indo-Asia News Service. ESA news reports are copyright European Space Agency. All NASA sourced material is public domain. Additional copyrights may apply in whole or part to other bona fide parties. All articles labeled "by Staff Writers" include reports supplied to Space Media Network by industry news wires, PR agencies, corporate press officers and the like. Such articles are individually curated and edited by Space Media Network staff on the basis of the report's information value to our industry and professional readership. Advertising does not imply endorsement, agreement or approval of any opinions, statements or information provided by Space Media Network on any Web page published or hosted by Space Media Network. General Data Protection Regulation (GDPR) Statement Our advertisers use various cookies and the like to deliver the best ad banner available at one time. All network advertising suppliers have GDPR policies (Legitimate Interest) that conform with EU regulations for data collection. By using our websites you consent to cookie based advertising. If you do not agree with this then you must stop using the websites from May 25, 2018. Privacy Statement. Additional information can be found here at About Us.