Scott Alexander / Astral Codex Ten

Independent Analyst

Ecosystem shaper. 100K readers. Rationalist community.

Founded: 2013
HQ: Oakland, CA
Team: 1
Structure: other
Model: Mixed

Theory of Change

Scott Alexander's theory of change is implicit rather than formally stated, but reconstructible from his writing and actions:

Write accessible, rigorous long-form analysis about important topics, especially AI risk
Build a large, engaged readership of intelligent people who might not otherwise encounter these ideas
Shift their priors toward taking AI safety seriously through the accumulated weight of well-argued posts
Provide community infrastructure (meetups, comments, contests) that turns isolated readers into a networked community
Fund promising projects across cause areas through ACX Grants, seeding new organizations and research programs

The quantitative evidence for this pipeline: Slate Star Codex ranked 4th in the 2017 EA entry survey for how people first heard about effective altruism. Scott's "Nobody Is Perfect" post led approximately 100 people to take the Giving What We Can pledge. ACX meetups now operate in ~180 cities worldwide.

His most famous articulation of why AI safety matters is "Meditations on Moloch" (2014), which frames misaligned AI as the ultimate coordination failure: "The implicit question is -- if everyone hates the current system, who perpetuates it? And Ginsberg answers: 'Moloch.'" The essay argues that aligned superintelligent AI could be the "gardener" that solves these failures, but misaligned AI would be the final, inescapable Moloch.

What They Do

Writing. Scott has published AI safety content since at least 2015. The blog spans 9,000+ PDF pages. Key AI safety posts include "No Time Like The Present For AI Safety Work" (2015), "AI As Profoundly Abnormal Technology" (2025, on AI Futures Project blog), and the AI 2027 scenario co-authored with Daniel Kokotajlo. His personal p(doom) is approximately 33%.

AI 2027. Co-wrote a month-by-month scenario of AI progress through 2028 with the AI Futures Project team (Daniel Kokotajlo, Eli Lifland, Thomas Larsen, Jonas Vollmer). The project went viral: almost 1 million website visitors, 166K Dwarkesh podcast views, talks at Harvard, the Federation of American Scientists, and OpenAI.

ACX Grants. Three rounds (2021, 2024, 2025), totaling approximately $3M distributed. The 2025 round gave $1.5M to 42 projects from 654 applications. AI-related grants include introspection benchmarks ($15K), LLM bias/truth-seeking research with Philip Tetlock ($50K), and Australian AI safety policy ($65K). Scott personally contributes $200-250K per round.

Community infrastructure. ACX Meetups Everywhere operates in ~180 cities worldwide, twice per year. Annual Book Review Contest cultivates new writers. ACX Survey (5,975 respondents in 2025) provides community demographics data.

EA defense. Scott is the most prominent public defender of effective altruism. Post-FTX, he published "In Continued Defense of Effective Altruism," arguing EA had saved approximately 200,000 lives through malaria prevention alone.

Key People

Scott Alexander Siskind. Born 1984. Psychiatrist (MD, University College Cork, 2012). Practices at Lorien Psychiatry in Oakland, CA, specializing in treatment-resistant depression. Former LessWrong writer (username "Yvain"). Extreme introvert; writes at 4:30 AM. The Dwarkesh podcast (2025) was his first-ever podcast appearance. On AI 2027: "I have always wanted to get more involved in the actual attempt to make AI go well. Right now, I just write about it."

Key collaborator: Daniel Kokotajlo. Executive Director of AI Futures Project. Resigned from OpenAI in 2024, rejecting a non-disparagement clause and risking millions in equity to speak out about AI safety. Co-author of AI 2027. Scott describes working with him as a turning point in his own AI views.

Intellectual network. Readers and defenders include Sam Altman ("essential reading among the people inventing the future"), Patrick Collison (Stripe CEO), Steven Pinker, Scott Aaronson, and Peter Singer. Debate partners include Tyler Cowen, Gary Marcus, Freddie deBoer, and Eliezer Yudkowsky.

Money and Incentives

Substack income. "Mid six figures" annually (plausibly $400-600K/year). Substack Pro deal in 2021; organic revenue exceeded the advance. Matt Yglesias, with a comparable deal, earned $775K/year. Scott subscribes to ~20K new unpaid readers/year but lost ~500 paid subscribers (as of 2024). Subscription: $10/month.

Grants budget source. ACX Grants personal contributions (~$200-250K per round) come from crypto and AI stock investment gains, not Substack income. "Some generous readers sent me crypto during the crypto boom... Some of the crypto went up. Then I reinvested it into AI stocks, and those went up too." He estimates ~5 more rounds are affordable.

Co-funders and institutional connections. ACX Grants co-funders include Craig Falls, Calvin French-Owen, Shauna Kravec, Anton Makiievskyi, Geoff Price, Adam Winkel, and anonymous donors. Retroactive prize funders include SFF, Long-Term Future Fund, Animal Welfare Fund, and EA Infrastructure Fund.

Financial independence. No evidence of direct financial ties to AI labs. Scott maintains a separate psychiatry practice. He is not employed by any AI company or funded by any AI lab. The AI Futures Project, where he contributes blog posts, is a separate nonprofit.

Potential conflicts. In 2025, ACX Grants experimented with SAFEs (equity stakes in startup grantees). Scott acknowledged "complicated conflicts of interest" regarding AI safety grants. Some evaluators are also grantees, with conflicts noted and those grants covered by outside funders. No formal conflict-of-interest policy exists for the program.

What Others Say

The strongest critique -- epistemic trustworthiness. "Why don't I trust Scott Alexander?" (muireall.space) argues that in a 2014 email, Scott stated "HBD [human biodiversity] is probably partially correct" and described a strategic approach to engaging with neoreactionaries, including that his blog gets "5x more hits" on controversial topics. The critic contrasts this with Scott's later public presentation as agonized and uncertain about the same questions. Key claim: "Alexander is by various means spreading parts of their thought that he agrees with while denying the extent of his agreement, where his motivation for advocating these ideas resides." Scott has not publicly responded to this critique.

The NYT controversy. In 2020, Scott deleted Slate Star Codex after learning a NYT reporter planned to publish his real name. 6,000+ petition signatures defended him. The resulting NYT article (Feb 2021) described the blog as "Silicon Valley's Safe Space" and noted that while SSC attracted "an unusually wide range of voices," the only people who "struggled to be heard" were "social justice warriors." The article quoted Elizabeth Sandifer: "The contrarian nature of these ideas makes them appealing to people who maybe don't think enough about the consequences."

AI 2027 critique. Titotal's detailed technical critique of the AI 2027 timeline models argues the "superexponential" curve has no empirical validation, produces mathematical infinities, and "the particular curve in their model does not match empirically with data." The critique also notes that a graph shared by Scott as "AI 2027's prediction" was "not the curve from any version of their actual model." The AI 2027 team acknowledged some criticisms and agreed to make changes.

Inclusivity critique. Ben Kuhn (2014) argued LessWrong's rationalist community drives away people from underrepresented groups, citing "rampant evo-bio-determinism" and a community that was ~10% female despite drawing from fields that are 20-30% female.

Defenders. Matt Yglesias: "the role of 'blogger who will present a contrarian take on a range of different kinds of subjects' has a lot of inherent value." Scott Aaronson: "It must have taken incredible guts for Scott to express his thoughts, misgivings and questions about some major ideological pillars of the modern world so openly." Sam Altman: described a specific essay as "an inflection point for Silicon Valley."

What's Absent

No formal accountability mechanism -- no board, no advisory committee, no external audit -- despite distributing ~$3M in grants and wielding substantial influence on the AI safety talent pipeline.

No systematic measurement of the theory of change's effectiveness. The 2017 EA entry survey (ranking 4th) is the best data, but it is 9 years old and measures "discovering EA" rather than entering AI safety careers.

No response to the most substantive epistemic critique. The "Why Don't I Trust" author emailed Scott about survey methodology issues and "never heard back."

No 80,000 Hours podcast appearance despite being a major influence on the AI safety talent pipeline. The Dwarkesh podcast (2025) was his first and only podcast.

No systematic cost-effectiveness analysis of the ACX Grants program, despite the community's emphasis on evidence-based giving.

Stated Theory of Change

Scott Alexander does not formally state a theory of change, but it can be reconstructed: accessible, rigorous, entertaining long-form writing about important topics draws intelligent people into an intellectual community where they update their views toward taking AI safety (and other cause areas) seriously, eventually working on these problems or funding them. The blog is the engine; the community infrastructure (meetups, contests, grants) is the flywheel.

His most direct statement of the case for AI safety work is from 2015: "If humanity creates human-level AI, technological progress will continue and eventually reach far-above-human-level AI. If far-above-human-level AI comes into existence, eventually it will so overpower humanity that our existence will depend on its goals being aligned with ours." He placed >95% confidence on these premises.

The implicit logic: if this is the most important problem, then anything that increases the number of intelligent people who take it seriously — including writing a popular blog — is among the highest-leverage activities available.

Revealed Theory of Change

Scott's actions are remarkably consistent with his stated views, which is itself notable in a landscape where many AI safety actors show significant gaps between words and deeds.

What he actually does: writes prolifically about AI risk (10+ years), runs a grantmaking program (~$3M distributed), maintains global community infrastructure (~180 meetup cities), collaborates with credible AI safety researchers (Kokotajlo, Lifland), and publicly defends the EA movement that supports much AI safety work.

Where actions add nuance to stated views: Scott's ACX Grants span well beyond AI safety — genetically modified corn, screwworm eradication, kidney donation policy, lead-acid battery recycling. This reveals either (a) genuine uncertainty about cause prioritization, or (b) a belief that building a broad intellectual community is itself the theory of change, not narrowly optimizing for AI safety output. Given his statements, (b) seems more likely.

One notable divergence: Scott expresses "complicated conflicts of interest" about funding AI safety directly and has funded relatively few explicitly AI safety projects through ACX Grants. This is surprising for someone with a ~33% p(doom) and suggests either genuine caution about his epistemic authority in technical areas, or a judgment that the AI safety funding landscape is already well-served by larger funders.

His recent shift from pure commentary to active participation (AI 2027, AI Futures Project blog) represents a meaningful evolution. He went from "I write about AI safety" to "I collaborate with serious researchers on concrete forecasts," which is a higher-risk, higher-reward position.

Key Assumptions

Assumption 1: Accessible writing about AI risk actually moves people into safety careers.

Evidence for: SSC ranked 4th in EA entry survey (2017). ~100 GWWC pledges from one post. The Dwarkesh podcast on AI 2027 got 166K views. Multiple ACX grantees have gone on to work in AI safety.
Evidence against: No systematic tracking of the pipeline. The 2017 survey is 9 years old and measured "discovering EA" not "doing AI safety work." The community may attract people who enjoy thinking about AI risk more than people who do something about it.
Testable? Yes, with a survey of current AI safety workers asking what influenced them to enter the field.
What changes if wrong? The theory of change collapses to "entertaining writing that has no downstream impact on AI risk."

Assumption 2: The rationalist epistemic methodology produces genuinely better thinking.

Evidence for: Scott's COVID mask analysis (March 2020) was prescient. The rationalist community's early attention to AI risk has been largely vindicated by mainstream adoption of these concerns. Prediction market enthusiasm has been partially validated.
Evidence against: The "Why Don't I Trust" critique argues Scott demonstrates motivated reasoning on sensitive topics while claiming Bayesian objectivity. The HBD email reveals strategic ambiguity rather than genuine uncertainty. The AI 2027 timeline model had significant methodological issues (Titotal critique). The community's demographic narrowness (~10% female) suggests something is systematically filtering who can benefit from the methodology.
Testable? Partially — through prediction calibration, replication of claims, etc.
What changes if wrong? If rationalist epistemics are a sophisticated form of confirmation bias, the pipeline may produce articulate AI safety advocates who share blind spots rather than genuinely well-calibrated thinkers.

Assumption 3: Community infrastructure (meetups, grants, contests) meaningfully amplifies the writing's impact.

Evidence for: 180 cities with meetups is remarkable. ACX Grants have seeded at least one nationally significant AI safety org (Good Ancestors Australia). The Book Review Contest has cultivated new writers.
Evidence against: No data on meetup attendance, retention, or downstream effects. Grant retrospectives show mixed results.
Testable? Yes, with systematic meetup surveys and grant outcome tracking.
What changes if wrong? Scott's impact reduces to his direct readership influence, which is still substantial but less unique.

Assumption 4: Financial independence preserves intellectual integrity.

Evidence for: Scott has no AI lab funding, no institutional constraints, and a diversified income (Substack + psychiatry + investments). He can write whatever he wants.
Evidence against: Subscriber incentives may subtly push toward engaging controversy and maintaining audience interest. The private email reveals that controversial topics generate "5x more hits," creating a financial incentive for engagement with incendiary material.
Testable? Through analysis of post topic choices and engagement metrics.
What changes if wrong? Financial independence becomes a fig leaf for incentive structures that reward controversy over truth-seeking.

Strengths

Unique reach and accessibility. No other writer in the AI safety ecosystem reaches a comparable audience with comparable intellectual depth. Scott bridges the gap between technical alignment researchers and the general public in a way that academic papers, podcasts, and Twitter threads do not. The blog is genuinely enjoyable to read, which is an underappreciated competitive advantage for a cause area that often sounds like science fiction to outsiders.

Financial independence. Scott's Substack income and investment gains make him genuinely independent. He is not beholden to any funder, employer, or institution. This is rare in the AI safety ecosystem, where most actors depend on a small number of funders (principally Open Philanthropy/Coefficient Giving).

Consistency over time. Scott has been writing about AI safety since 2015, consistently, at high volume, while maintaining readability. This is not a bandwagon effect — he was writing about these issues when they were deeply niche.

Intellectual honesty about uncertainty. His p(doom) of ~33% is neither the "nothing to see here" of AI optimists nor the "we're all going to die" of Yudkowsky-adjacent doomers. He models uncertainty in a way that is more credible to mainstream audiences than either extreme. On the Dwarkesh podcast, he is genuinely candid about what he does and does not know.

The Moloch framework. "Meditations on Moloch" is genuinely one of the most important intellectual contributions to the AI safety discourse. It provides a philosophical foundation for why alignment matters that goes beyond narrow technical arguments and connects to deep intuitions about civilizational dysfunction.

Weaknesses and Risks

The epistemic trustworthiness problem. The HBD email and the "Why Don't I Trust" critique raise genuine questions about whether Scott's apparent balance on sensitive topics is strategic rather than sincere. If the most prominent rationalist writer demonstrates the failure mode of motivated reasoning dressed in rationalist language, this poisons the well for the entire epistemic methodology he champions. This is not about whether his views on any particular topic are correct — it is about whether his readers can trust his method.

Demographic narrowness of the pipeline. If the community systematically excludes or marginalizes certain demographics (women, racial minorities), then the talent pipeline is leaving value on the table. AI safety needs diverse perspectives, and a pipeline that selects for a particular demographic profile may produce blind spots in the field.

No accountability structure. ~$3M in grants, enormous influence on a critical cause area, and zero formal governance. If Scott makes a bad judgment call — about whom to fund, what to write, or how to handle controversy — there is no mechanism to catch or correct it except public criticism and subscriber churn.

AI timeline views may be systematically biased toward drama. The Titotal critique reveals significant methodological problems with AI 2027's forecasting. Scott's engagement incentives (controversial topics -> more hits) may pull him toward more dramatic AI scenarios. Freddie deBoer's bet challenge represents a real risk: if AI progress is slower than predicted, the urgency narrative that drives the pipeline weakens.

EA entanglement. Scott is so identified with the EA movement that EA's reputational problems become his problems. Post-FTX, post-OpenAI boardroom drama, any future EA scandal will splash on ACX. This makes his influence fragile to events he cannot control.

Cross-References

Complementary to: MIRI (Yudkowsky's technical alignment work — Scott makes these ideas accessible), 80,000 Hours (career guidance — Scott provides the intellectual foundation), Open Philanthropy/Coefficient Giving (institutional funding — Scott is informal/individual funding), LessWrong (community platform — Scott is the most prominent individual contributor).

In tension with: PauseAI (Scott argues against quitting AI companies and is skeptical of coordinated pause strategies), academic AI ethics (different framing, different politics, different methodology).

Overlapping with: AI Futures Project (active collaboration via AI 2027), Manifund/Manifold Markets (grants infrastructure), Zvi Mowshowitz's Substack (similar audience, complementary analysis style).

Unique position: No other individual occupies Scott's exact niche — a genuinely popular writer who bridges rationalist philosophy, EA cause prioritization, AI safety argumentation, and practical grantmaking. The closest analogues (Zvi, Kelsey Piper at Vox) reach smaller audiences or operate within institutional constraints.

What Would Change This Assessment

Evidence that the talent pipeline works. A systematic survey showing that a significant fraction of current AI safety workers cite SSC/ACX as a major influence would substantially upgrade this assessment.
Evidence that the talent pipeline fails. Data showing that ACX readers mostly stay passive consumers, or that ACX-pipeline researchers systematically underperform on AI safety work, would substantially downgrade it.
Scott addresses the HBD/epistemic trustworthiness critique directly. A candid, detailed response would either resolve the concern or deepen it, but either way would be more informative than the current silence.
AI 2027 predictions prove accurate or inaccurate. If the scenario's 2025-2026 predictions track reality, Scott's credibility as a forecaster rises substantially. If they do not, the "research-backed" framing of his AI views weakens.
Major expansion or formalization of ACX Grants. If the program develops formal governance, external evaluation, and systematic outcome tracking, the grantmaking theory of change becomes more credible.
Continued demographic narrowness of the community. If ACX meetups and readership remain heavily skewed (white, male, tech-adjacent) despite the community's growth, this constrains the theory of change's ceiling.

Self-Critique

What's weakest: The assessment of the talent pipeline's effectiveness relies heavily on one 9-year-old survey data point and qualitative impressions. I cannot quantify the most important claim — how many people entered AI safety careers because of Scott's writing.

Potential bias: I may be giving too much weight to the HBD/epistemic trustworthiness critique because it is the most intellectually interesting criticism. Many readers may find it irrelevant to Scott's AI safety contributions.

What a thoughtful disagreer would say: "You're treating Scott as if he's an organization with formal obligations. He's a blogger. The fact that he writes well about AI safety, maintains a community, and gives away millions in grants is already far more than most people do. Demanding formal accountability and systematic measurement from an individual is unreasonable."

Single weakest claim: My suggestion that Scott's engagement incentives (more hits from controversy) may bias his AI timeline views toward drama. This is speculative and not well-evidenced.

What information would most change my view: Rigorous data on whether SSC/ACX readers actually enter AI safety careers at higher rates than comparable populations. If the pipeline demonstrably works, Scott's theory of change becomes one of the most cost-effective interventions in the entire AI safety landscape — one person's blog output generating hundreds of safety-focused careers.

Connected to (5)

AI Futures Projectcollaborator · Scott Alexander

Good Ancestors Australiacollaborator · Nathan Ashby

Manifold Marketscollaborator · Austin Chen

Effective Altruismcollaborator · Scott Alexander

LessWrongstaff from · Scott Alexander

Sources (60)

Every URL that was read during research.

1.Slate Star Codex - Wikipediaen.wikipedia.org
2.About - Astral Codex Tenastralcodexten.com
3.Updated Look At Long-Term AI Risksastralcodexten.com
4.Why Not Slow AI Progress?astralcodexten.com
5.Introducing AI 2027astralcodexten.com
6.AI Futures: Blogging And AMAastralcodexten.com
7.Apply For An ACX Grant (2025)astralcodexten.com
8.ACX Grants Results 2025astralcodexten.com
9.Apply For An ACX Grant (2024)astralcodexten.com
10.About - AI 2027ai-2027.com
11.AI 2027: month-by-month model of intelligence explosion — Scott Alexander & Daniel Kokotajlodwarkesh.com
12.AI 2027: Dwarkesh's Podcast with Daniel Kokotajlo and Scott Alexanderthezvi.substack.com
13.Thoughts on Scott Alexander on the Dwarkesh Podcasteregis.github.io
14.Scott Alexander, Philosopher King of the Weird Peoplequillette.com
15.In defense of interesting writing on controversial topicsslowboring.com
16.A grand anticlimax: the New York Times on Scott Alexanderscottaaronson.blog
17.What The New York Times' Hit Piece on Slate Star Codex Says About Media Gatekeepingreason.com
18.Statement on New York Times Articleastralcodexten.com
19.ACX Grants Resultsastralcodexten.com
20.ACX Grants Results 2024astralcodexten.com
21.ACX Grants 1-3 Year Updatesastralcodexten.com
22.My Takeaways From AI 2027astralcodexten.com
23.AI As Profoundly Abnormal Technologyblog.ai-futures.org
24.The Extinction Tournamentastralcodexten.com
25.Why AI Safety Won't Make America Lose The Race With Chinaastralcodexten.com
26.Subscrive Drive 2024 + Free Unlocked Postsastralcodexten.com
27.Prediction Market FAQastralcodexten.com
28.About / Top Postsslatestarcodex.com
29.About Dr. Siskindlorienpsych.com
30.Who is Scott Alexander and what is he about?jasoncrawford.org
31.Guess I’m A Rationalist Nowscottaaronson.blog
32.An Interview with Scott Alexanderneonarrative.us
33.Beyond The Last Horizonblog.ai-futures.org
34.Practically-A-Book Review: Yudkowsky Contra Ngo On Agentsastralcodexten.com
35.Every Debate On Pausing AIastralcodexten.com
36.Why Slate Star Codex is Silicon Valley’s safe spacedeccanherald.com
37.Review of Scott Alexander's book review of "If Anyone Builds It, Everyone Dies"blog.ninapanickssery.com
38.In Continued Defense Of Effective Altruismastralcodexten.com
39.Adding My Data Point To The Discussion Of Substack Advancesastralcodexten.com
40.Logisticsastralcodexten.com
41.ACX Survey Results 2025astralcodexten.com
42.2023 Subscription Drive + Free Unlocked Postsastralcodexten.com
43.AI 2027: Media, Reactions, Criticismblog.ai-futures.org
44.A deep critique of AI 2027’s bad timeline modelstitotal.substack.com
45.My response to AI 2027vitalik.eth.limo
46.Meetups Everywhere 2025: Times and Placesastralcodexten.com
47.Book Review Contest 2024 Winnersastralcodexten.com
48.I'm Offering Scott Alexander a Wager About AI's Effects Over the Next Three Yearsfreddiedeboer.substack.com
49.Meditations On Molochslatestarcodex.com
50.AI Researchers On AI Riskslatestarcodex.com
51.Has Effective Altruism done anything good for the world?earthlyidealism.com
52.No Time Like The Present For AI Safety Workslatestarcodex.com
53.Why don't I trust Scott Alexander? | Ages and orbsmuireall.space
54.On inclusivity in Less Wrong: a response to Scott Alexanderbenkuhn.net
55.Somewhat Contra Marcus On AI Scalingastralcodexten.com
56.The selfishness of effective altruismthenewatlantis.com
57.Pause For Thought: The AI Pause Debateastralcodexten.com
58.The Pentagon Threatens Anthropicastralcodexten.com
59.MR Tries The Safe Uncertainty Fallacyastralcodexten.com
60.Contra Stone On EAastralcodexten.com