Anthropic has published a newly devised approach to interpreting AI. They call this NLA for natural language autoencoders. An ...