ML safety research in interpretability and model sharing, game-playing language models, and critiques of AGI risk research
Share this post
Apart Newsletter #27
Share this post
ML safety research in interpretability and model sharing, game-playing language models, and critiques of AGI risk research