Apart Research
Subscribe
Sign in
Home
Apart Newsletter
Hackathons
Product development
Organization news
š Versión en espaƱol
Latest
Top
Large language models might always be slightly misaligned
Apart Newsletter #28
Apr 25, 2023
Ā
ā¢
Ā
Apart Research
3
Apart Newsletter #27
ML safety research in interpretability and model sharing, game-playing language models, and critiques of AGI risk research
Apr 18, 2023
Ā
ā¢
Ā
Apart Research
3
Identifying semantic neurons, mechanistic circuits & interpretability web apps
Results from the 4th Alignment Jam
Apr 13, 2023
Ā
ā¢
Ā
CC
2
Ethics or Reward?
This week we take a look at LLMs that need therapists, governance of machine learning hardware, and benchmarks for dangerous behaviour. Read to the endā¦
Apr 9, 2023
Ā
ā¢
Ā
CC
5
Governing AI & Evaluating Danger
We might need to shut it all down, AI governance seems more important than ever and technical research is challenged. Welcome to this week's updateā¦
Apr 3, 2023
March 2023
What a Week! GPT-4 & Japanese Alignment
What a week. There was already a lot to cover Monday when I came in for work and I was going to do a special feature on the Japan Alignment Conferenceā¦
Mar 15, 2023
Perspectives on AI Safety
This week, we take a look at interpretability used on a Go-playing neural network, glitchy tokens and the opinions and actions of top AI labs andā¦
Mar 6, 2023
February 2023
Bing Wants to Kill Humanity W07
Welcome to this weekās ML & AI safety update where we look at Bing going bananas, see that certification mechanisms can be exploited and that scalingā¦
Feb 21, 2023
Will Microsoft and Google start an AI arms race? W06
We would not be an AI newsletter without covering the past weekās releases from Google and Microsoft but we will use this chance to introduce theā¦
Feb 10, 2023
Extreme AI Risk W05
In this week's newsletter, we explore the topic of modern large modelsā alignment and examine criticisms of extreme AI risk arguments. Of course, don'tā¦
Feb 6, 2023
January 2023
Was ChatGPT a good idea? W04
In this weekās ML & AI Safety Update, we hear Paul Christianoās take on one of OpenAIās main alignment strategies, dive into the second round winners ofā¦
Jan 28, 2023
2
Compiling code to neural networks? W03
Welcome to this weekās ML & AI Safety Report where we dive into overfitting and look at a compiler for Transformer architectures! This week is a bitā¦
Jan 20, 2023
2
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts