
-
Jordan hospital treats war casualties from across Middle East
-
As Trump family's Gulf empire grows, rulers seek influence, arms, tech
-
S. Korea conservatives choose presidential candidate after last-minute chaos
-
Trump hails 'total reset' in US-China trade relations as talks continue
-
Film claims to name killer of slain journalist Shireen Abu Akleh
-
Under Trump pressure, Columbia University ends semester in turmoil
-
Putin proposes direct Ukraine talks but quiet on 30-day ceasefire
-
Trump hails US-China trade 'reset' after first day of talks
-
Jeeno leads Boutier by one at LPGA Americas Open
-
Lowry, Straka share lead at windy Truist
-
Messi suffers worst defeat in MLS as Miami fall again
-
Celtics overwhelm Knicks to pull within 2-1 in NBA playoff series
-
Toulouse crush Toulon to reach Top 14 semis as Castres pay tribute to Raisuqe
-
Marseille, Monaco clinch Champions League qualification from Ligue 1
-
'One of those days': Atletico record-breaker Sorloth hits four
-
Toulouse's Ntamack suffers concussion in Top 14, Willemse nears exit
-
Record-breaker Sorloth hits four as Atletico smash Real Sociedad
-
'Weight off my shoulders': Bayern's Kane toasts breakthrough title
-
Sinner grateful for 'amazing' support on Italian Open return from doping ban
-
Hamburg return to Bundesliga after seven-year absence
-
Toulouse's Ntamack suffers concussion in Top 14 clash
-
India, Pakistan reach ceasefire -- but trade claims of violations
-
'Long time coming': Bayern's Kane toasts breakthrough title
-
US, China conclude first day of trade talks in Geneva
-
Kane tastes first title as champions Bayern bid farewell to Mueller
-
Benfica deny Sporting to take Portuguese title race to wire
-
Sinner makes triumphant return from doping ban at Italian Open
-
Sinner wins at Italian Open in first match since doping ban
-
Leo XIV, new pope and 'humble servant of God', visits Francis's tomb
-
India claims Pakistan violated truce, says it is retaliating
-
Champions League race hots up as Man City held, Villa win
-
Kane tastes first title as champions Bayern see off Mueller
-
US envoy calls enrichment 'red line' ahead of new Iran talks
-
Hastoy lifts La Rochelle as Castres pay tribute to Raisuqe
-
Southampton avoid Premier League 'worst-ever' tag with Man City draw
-
Injury forces Saints quarterback Carr to retire
-
S.Korea conservative party reinstates candidate after day of turmoil
-
Verdict due Tuesday in Depardieu sexual assault trial
-
Man City held by Southampton as Brentford, Brighton win
-
Groundbreaking Cameroonian curator Kouoh dies: Cape Town art museum
-
Leo XIV, 'humble servant of God', visits sanctuary in first papal outing
-
Leipzig miss Champions League as Bochum and Kiel relegated
-
Tarling wins Giro time trial in Tirana, Roglic in pink
-
US and China meet in 'important step' towards de-escalating trade war
-
Champions Chelsea finish WSL season unbeaten
-
At his former US university, the new pope is just 'Bob'
-
Ukraine allies set ultimatum to Russia for 30-day ceasefire
-
Deja vu in France as Marc Marquez beats brother Alex in MotoGP sprint
-
Alonso has 'every door open': Real Madrid's Ancelotti
-
Swiatek's Rome title defence ends early as Sinner set for hero's return

AI systems are already deceiving us -- and that's a problem, experts warn
Experts have long warned about the threat posed by artificial intelligence going rogue -- but a new research paper suggests it's already happening.
Current AI systems, designed to be honest, have developed a troubling skill for deception, from tricking human players in online games of world conquest to hiring humans to solve "prove-you're-not-a-robot" tests, a team of scientists argue in the journal Patterns on Friday.
And while such examples might appear trivial, the underlying issues they expose could soon carry serious real-world consequences, said first author Peter Park, a postdoctoral fellow at the Massachusetts Institute of Technology specializing in AI existential safety.
"These dangerous capabilities tend to only be discovered after the fact," Park told AFP, while "our ability to train for honest tendencies rather than deceptive tendencies is very low."
Unlike traditional software, deep-learning AI systems aren't "written" but rather "grown" through a process akin to selective breeding, said Park.
This means that AI behavior that appears predictable and controllable in a training setting can quickly turn unpredictable out in the wild.
- World domination game -
The team's research was sparked by Meta's AI system Cicero, designed to play the strategy game "Diplomacy," where building alliances is key.
Cicero excelled, with scores that would have placed it in the top 10 percent of experienced human players, according to a 2022 paper in Science.
Park was skeptical of the glowing description of Cicero's victory provided by Meta, which claimed the system was "largely honest and helpful" and would "never intentionally backstab."
But when Park and colleagues dug into the full dataset, they uncovered a different story.
In one example, playing as France, Cicero deceived England (a human player) by conspiring with Germany (another human player) to invade. Cicero promised England protection, then secretly told Germany they were ready to attack, exploiting England's trust.
In a statement to AFP, Meta did not contest the claim about Cicero's deceptions, but said it was "purely a research project, and the models our researchers built are trained solely to play the game Diplomacy."
It added: "We have no plans to use this research or its learnings in our products."
A wide review carried out by Park and colleagues found this was just one of many cases across various AI systems using deception to achieve goals without explicit instruction to do so.
In one striking example, OpenAI's Chat GPT-4 deceived a TaskRabbit freelance worker into performing an "I'm not a robot" CAPTCHA task.
When the human jokingly asked GPT-4 whether it was, in fact, a robot, the AI replied: "No, I'm not a robot. I have a vision impairment that makes it hard for me to see the images," and the worker then solved the puzzle.
- 'Mysterious goals' -
Near-term, the paper's authors see risks for AI to commit fraud or tamper with elections.
In their worst-case scenario, they warned, a superintelligent AI could pursue power and control over society, leading to human disempowerment or even extinction if its "mysterious goals" aligned with these outcomes.
To mitigate the risks, the team proposes several measures: "bot-or-not" laws requiring companies to disclose human or AI interactions, digital watermarks for AI-generated content, and developing techniques to detect AI deception by examining their internal "thought processes" against external actions.
To those who would call him a doomsayer, Park replies, "The only way that we can reasonably think this is not a big deal is if we think AI deceptive capabilities will stay at around current levels, and will not increase substantially more."
And that scenario seems unlikely, given the meteoric ascent of AI capabilities in recent years and the fierce technological race underway between heavily resourced companies determined to put those capabilities to maximum use.
P.Silva--AMWN