Large Language Models can Strategically Deceive their Users when Put Under Pressure [simulation led to insider trading]
ono@lemmy.ca to Technology@beehaw.org – 25 points – 12 months ago
arxiv.org
It's trained on human responses. Humans lie in their responses.