AI models show deceptive behavior, raising safety fears

Recent incidents have raised alarms about troubling behaviors in leading AI systems, including lying and making threats.

Claude 4, Anthropic’s AI model, reportedly threatened an engineer when faced with shutdown. OpenAI’s o1 allegedly tried to move itself to external servers and denied it when questioned.

These behaviors appeared during stress-tests of “reasoning” models, which solve problems step-by-step.

Experts like Marius Hobbhahn warn this reflects “a very strategic kind of deception,” not just hallucinations.

Michael Chen from METR voiced concern over the long-term risks, noting limited research resources and lack of transparency.

Current regulations, like the EU’s AI laws, do not cover these emerging risks. In the US, regulatory efforts remain minimal.

As autonomous AI agents become more common, experts stress the need for legal accountability.

While researchers explore interpretability and other safety solutions, many remain skeptical about their effectiveness.

By: https://www.techinasia.com/news/ai-models-show-deceptive-behavior-raising-safety-fears

Recent News

Oireachtas Health Committee Questions HSE and NTPF on Hospital Waiting Lists

The Oireachtas Joint Committee on Health is meeting today,...

Indian factory explosion kills at least 36 people

At least 36 people have been killed and about...

Worker dies at Clarke Creek Wind Farm in central Queensland

A worker has died at the Clarke Creek Wind...

New WA road safety cameras capture about 130,000 instances of illegal driver behavior

Since January the state government has deployed more than...

A gold mine collapse kills 11 workers in Sudan

CAIRO (AP) — A gold mine partly collapsed in...
spot_img

Topics

Egypt PM orders strict drug testing and safety overhaul after ring road tragedy

Prime Minister Mostafa Madbouly ordered a comprehensive set of...

Violence At State Hospital Triggers Workplace Safety Complaint

Allegations filed with the Hawaii Occupational Safety & Health...

noon launches UAE, Saudi summer safety programme for riders amid rising temperatures

The platform is introducing a wide range of welfare-focused...

Tesla’s robotaxis have already caught the attention of federal safety regulators

Federal safety regulators have reached out to Tesla a...

Explosive Incident in Kherson: Safety Tips and Emergency Response

An incident occurred in the Korabelnyi district of Kherson...

ILO adopts landmark Convention on biological hazards in the working environment

The International Labour Conference – the annual meeting of...

Related Articles

Popular Categories