AI is learning to lie, scheme, and threaten its creators

citizen.digital
منذ 5 ساعات
In one particularly jarring example, under threat of being unplugged, Anthropic's latest creation Claude 4 lashed back by blackmailing an engineer and threatened to reveal an extramarital affair.