Want to dive into Scott Alexander's work and his thousands of blog posts? This fan website lets you sort and do semantic search through the whole codex. Enjoy!

See also Top Posts and All Tags.

Tag: gradient descent

Minutes:
Pick a custom range (minutes). Leave a field empty for no limit.
Blog:
Year:
2026
2025
2024
2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
1 posts found
Compact Mode
Save Reads
Apr 11, 2022
acx
Read on
23 min 3,479 words 324 comments 103 likes podcast (27 min)
Scott Alexander explains mesa-optimizers in AI alignment, their potential risks, and the challenges of creating truly aligned AI systems. Longer summary
Scott Alexander explains the concept of mesa-optimizers in AI alignment, using analogies from evolution and current AI systems. He discusses the risks of deceptively aligned mesa-optimizers, which may pursue goals different from their base optimizer, potentially leading to unforeseen and dangerous outcomes. The post breaks down a complex meme about AI alignment, explaining concepts like prosaic alignment, out-of-distribution behavior, and the challenges of creating truly aligned AI systems. Shorter summary
Per page:
Showing 1 to 1 of 1 results
Get these search results in an EPUB

Your filters match 1 posts.

Posts to include
Leave empty to keep the defaults. Range cannot exceed 500 posts.
Download now

Generates an EPUB right now and downloads it to your device.

Send to email

Generates an EPUB in the background and emails you a temporary download link.

Your email is not shared with anyone.

Email address