Anthropic’s CEO wonders if future AI should have option to quit “unpleasant” tasks

Anthropic CEO Dario Amodei raised a few eyebrows on Monday after suggesting that advanced AI models might someday be given the ability to push a “button” to quit tasks they find unpleasant. Amodei made the provocative remarks during an interview at the Council on Foreign Relations, acknowledging that the idea “sounds crazy.”

“So this is—this is another one of those topics that’s going to make me sound completely insane,” Amodei said during the interview. “I think we should at least consider the question of, if we are building these systems and they do all kinds of things like humans as well as humans, and seem to have a lot of the same cognitive capacities, if it quacks like a duck and it walks like a duck, maybe it’s a duck.”
Amodei’s comments came in response to an audience question from data scientist Carmem Domingues about Anthropic’s late-2024 hiring of AI welfare researcher Kyle Fish “to look at, you know, sentience or lack of thereof of future AI models, and whether they might deserve moral consideration and protections in the future.” Fish continues to investigate that highly contentious question at Anthropic today.