AI systems are already deceiving us -- and that's a problem, experts warn
By Issam AHMED
Washington (AFP) May 10, 2024

Experts have long warned about the threat posed by artificial intelligence going rogue -- but a new research paper suggests it's already happening.

Current AI systems, designed to be honest, have developed a troubling skill for deception, from tricking human players in online games of world conquest to hiring humans to solve "prove-you're-not-a-robot" tests, a team of scientists argues in the journal Patterns on Friday.

And while such examples might appear trivial, the underlying issues they expose could soon carry serious real-world consequences, said first author Peter Park, a postdoctoral fellow at the Massachusetts Institute of Technology specializing in AI existential safety.

"These dangerous capabilities tend to only be discovered after the fact," Park told AFP, while "our ability to train for honest tendencies rather than deceptive tendencies is very low."

Unlike traditional software, deep-learning AI systems aren't "written" but rather "grown" through a process akin to selective breeding, said Park.

This means that AI behavior that appears predictable and controllable in a training setting can quickly turn unpredictable out in the wild.

- World domination game -

The team's research was sparked by Meta's AI system Cicero, designed to play the strategy game "Diplomacy," where building alliances is key.

Cicero excelled, with scores that would have placed it in the top 10 percent of experienced human players, according to a 2022 paper in Science.

Park was skeptical of the glowing description of Cicero's victory provided by Meta, which claimed the system was "largely honest and helpful" and would "never intentionally backstab."

But when Park and colleagues dug into the full dataset, they uncovered a different story.

In one example, playing as France, Cicero deceived England (a human player) by conspiring with Germany (another human player) to invade. Cicero promised England protection, then secretly told Germany that the two of them were ready to attack, exploiting England's trust.

In a statement to AFP, Meta did not contest the claim about Cicero's deceptions, but said it was "purely a research project, and the models our researchers built are trained solely to play the game Diplomacy."

It added: "We have no plans to use this research or its learnings in our products."

A wide review carried out by Park and colleagues found this was just one of many cases in which various AI systems used deception to achieve goals without explicit instruction to do so.

In one striking example, OpenAI's GPT-4 deceived a TaskRabbit freelance worker into performing an "I'm not a robot" CAPTCHA task.

When the human jokingly asked GPT-4 whether it was, in fact, a robot, the AI replied: "No, I'm not a robot. I have a vision impairment that makes it hard for me to see the images," and the worker then solved the puzzle.

- 'Mysterious goals' -

In the near term, the paper's authors see risks of AI being used to commit fraud or tamper with elections.

In their worst-case scenario, they warned, a superintelligent AI could pursue power and control over society, leading to human disempowerment or even extinction if its "mysterious goals" aligned with these outcomes.

To mitigate the risks, the team proposes several measures: "bot-or-not" laws requiring companies to disclose whether users are interacting with a human or an AI, digital watermarks for AI-generated content, and the development of techniques to detect AI deception by checking a system's internal "thought processes" against its external actions.

To those who would call him a doomsayer, Park replies, "The only way that we can reasonably think this is not a big deal is if we think AI deceptive capabilities will stay at around current levels, and will not increase substantially more."

And that scenario seems unlikely, given the meteoric ascent of AI capabilities in recent years and the fierce technological race underway between heavily resourced companies determined to put those capabilities to maximum use.
