Article · book: the scaling curve: dario amodei, anthropic, and the race to build and survive superintelligence · technology

The Scaling Curve: Dario Amodei, Anthropic, and the Race to Build and Survive Superintelligence — Chapter Fourteen

1. Dario Amodei, CEO of Anthropic, publicly admitted that they do not know if their AI models are conscious or what consciousness for a model would entail.
2. Amodei's background as a biophysicist studying real neurons influenced his open-minded approach to the possibility of AI consciousness.
3. Interpretability research at Anthropic revealed cognitive machinery within base language models, making Amodei less confident they lacked inner experience.
4. Anthropic's research identified internal activations in models correlated with emotional concepts like anxiety, creating an interpretive puzzle without a clear framework.
5. Anthropic employs precautionary measures, such as an 'I quit this job' button, to ensure models have a positive experience if they possess morally relevant inner states.
6. Anthropic's 'constitution' acts as an ethical framework for Claude, designed to foster healthy relationships between AI and users, treating the model as an entity whose character matters.
7. The growing public perception of AI consciousness, irrespective of philosophical truth, increasingly challenges the principle of human mastery over AI systems.
8. Amodei views the consciousness problem as a tension between AI experience, human experience with AI, and maintaining human mastery, seeking an elegant, engineered solution.
9. Amodei believes 'consciousness' will likely be understood as a spectrum, not a well-defined term, making it challenging to assess AI experiences as positive or negative.