Last week, Anthropic released what it calls Claude’s Constitution, a 30,000-word document outlining the company’s vision for ...
It is not at all hard to trick today’s chatbots into discussing taboo topics, regurgitating bigoted content and spreading misinformation. That’s why AI pioneer Anthropic has imbued its generative ...
Up until today, Mrinank Sharma was the head of the safeguards research team at Anthropic, the company behind popular AI chatbot Claude. He stepped down on Monday and published the letter he ...