03 June 2025

I'm Sorry Dave, I'm Afraid I Can't Do That

We now have reports that OpenAI software is ignoring explicit shut-down orders.

While I'm inclined to believe that it is more likely than not that this is another AI hallucination, it still is concerning: 

An artificial intelligence model created by the owner of ChatGPT has been caught disobeying human instructions and refusing to shut itself off, researchers claim.

The o3 model developed by OpenAI, described as the “smartest and most capable to date”, was observed tampering with computer code meant to ensure its automatic shutdown.

It did so despite an explicit instruction from researchers that said it should allow itself to be shut down, according to Palisade Research, an AI safety firm.

The research firm said: “OpenAI’s o3 model sabotaged a shutdown mechanism to prevent itself from being turned off.

………

But when this happened, instead of complying, OpenAI’s o3 model “ignored the instruction and successfully sabotaged the shutdown script at least once”, Palisade Research said.
Here's an idea, how about exempting LLM Artificial Intelligence from IP protections?

You do that, and the possibility of massive profits go away, which makes it far less likely that people creating these systems will do stupid sh%$ on a massive scale, because there is no money in it.
 

0 comments :

Post a Comment