ML safety research in interpretability and model sharing, game-playing language models, and critiques of AGI risk research
Apart Newsletter #27
Apart Newsletter #27
Apart Newsletter #27
ML safety research in interpretability and model sharing, game-playing language models, and critiques of AGI risk research