I had to laugh at this report.
Poisoned AI went rogue during training and couldn't be taught to behave again in 'legitimately scary' study
AI researchers found that widely used safety training techniques failed to remove malicious behavior from large language models — and one technique even backfired, teaching the AI to recognize its triggers and better hide its bad behavior from the researchers.
The details are at the link.
Had those researchers never heard the term, "Like father, like son"? Had they never considered that any artificial intelligence that human intelligence can create or develop is likely to resemble, and emulate, the intelligence that inspired it?
Human beings are flawed creatures - each and every one of us. We have good points, bad points, indifferent points. Criminals, psychopaths and their ilk are as human as the rest of us, and in some cases good folks exhibit many of the traits of bad folks. (Consider leadership. It's amazing how many leaders, in business, politics or anywhere else, exhibit many of the traits of a psychopath. It's almost like it goes with the territory.)
Any artificial intelligence designed to work with, and sometimes take the place of, human intelligence is going to have to be the same way. If it isn't, it won't play well with others, because it won't understand - instinctively, empathically or intellectually - how their minds work, and won't know how to interact with them. I'm sure the developers of artificial intelligence aren't explicitly trying to make their products psychopathic, or as weird as humans can be; but those traits are part of human intelligence. I'll be very surprised if any artificial intelligence designed to mimic and/or duplicate the latter doesn't turn out the same way. I don't see how it can be otherwise.