← AI Safety Orgs

Forethought

Field-Building

Academic research support.

Founded
2024
HQ
Oxford, UK
Team
9
Structure
limited company (UK)
Model
Donations

Theory of Change

Forethought's theory of change rests on three premises:

  1. AGI is likely within 2-8 years and may trigger an "intelligence explosion" -- a century or more of technological progress compressed into a decade.
  2. This explosion creates many "grand challenges" beyond alignment: AI-enabled coups, new WMDs, digital beings' rights, space resource governance, epistemic disruption, and institutional lock-in.
  3. Early foundational research on neglected grand challenges can have outsized impact, just as early work on AI alignment (Bostrom, Shulman, Yudkowsky, Christiano) proved enormously influential despite seeming speculative at the time.

In MacAskill's words: "We don't yet have a good understanding of what this change might look like or how to navigate it. Society is not prepared."

The org also argues that making the future truly great ("Flourishing") may be as important as avoiding extinction ("Survival"). Using a simple framework: if probability of survival is 80% and the quality of the surviving future is only 10% of what's possible, the problem of non-Flourishing is 36x the scale of the extinction problem.

Forethought researches these neglected topics, then shares findings with AI companies, policymakers, and the EA/rationalist community to catalyze action.

What They Do

Forethought published 20+ research papers in its first year (March 2025 - March 2026) and runs a weekly podcast ("ForeCast") and Substack newsletter ("ForeWord," ~2.9K subscribers).

Key research outputs:

  • "Preparing for the Intelligence Explosion" (March 2025): Flagship paper making the case for a century-in-a-decade scenario and systematically cataloging grand challenges beyond alignment.
  • "AI-Enabled Coups" (April 2025): Identified a novel threat model for how AI lowers barriers to power seizure. Became 80,000 Hours' #2 problem profile. Concrete policy recommendations.
  • "Software Intelligence Explosion": Technical analysis estimating ~60% probability of 3+ years of AI progress compressed into under 1 year.
  • "Better Futures" series: Makes the case that achieving a near-best future (not just surviving) deserves equal priority. Introduces "viatopia" -- a waypoint state likely to lead to truly good outcomes.
  • "AI Tools for Existential Security": Argues for accelerating beneficial AI applications (epistemic tools, coordination tools, risk-targeted tools) rather than only focusing on slowing dangerous capabilities.

The org also works on translating research into action: Stefan Torges (Head of Strategic Initiatives) develops policy proposals on AI-enabled coup prevention and works with AI companies on "secret loyalty" detection in AI systems.

Impact claims: influenced 80K Hours' problem ranking (verified); advising frontier AI companies (unverified); inspired creation of SAIF VC fund by ex-YC president Geoff Ralston (tenuous causal claim); researchers contributed to International AI Safety Report.

Key People

Max Dalton (Director): Ex-ED of Centre for Effective Altruism. PPE Oxford, MSc Economics York. Provides operational leadership. Co-founded Forethought with MacAskill after both experienced burnout from prior roles.

William MacAskill (Senior Research Fellow, Board member): Co-founder of EA movement, co-founded CEA, 80K Hours, GWWC. Author of "What We Owe the Future." The intellectual engine of the org. Provided the most candid articulation of Forethought's worldview in a 40K-word 80K Hours podcast. Acknowledged being "too trusting" in the FTX era and missing warning signs about SBF. Post-FTX, focused on governance structures over character assessment. His AI existential risk estimate: 1%-10% (optimistic by current community standards).

Tom Davidson (Senior Research Fellow): Ex-Open Philanthropy. Author of the most detailed economic modeling of intelligence explosion dynamics. His explosive growth report for Open Phil and coups paper are Forethought's most technically rigorous outputs.

Team size: ~9 staff (6 researchers, 1 director, 1 ops director, 1 head of strategic initiatives). Backgrounds span philosophy, physics, CS, economics, history. Most came from dissolved or disrupted EA/x-risk orgs: FHI, CLR, CEA, Open Phil. Research affiliates include Toby Ord. Collaborators include Ryan Greenblatt (Redwood Research, also on board), Joe Carlsmith, Eric Drexler.

Owen Cotton-Barratt is an active co-author on multiple Forethought papers but is not listed as staff. He resigned from the EV UK board after a 2023 sexual harassment investigation and was given a 2-year ban from EV activities. His formal relationship with Forethought has not been publicly clarified.

Money and Incentives

Total budget: Estimated $1.6M-$2.6M annually at current size. Seeking $2.4M (baseline) to $3.9M (expansion) through June 2027.

Revenue sources: Two unnamed high-net-worth donors funded the org through ~March 2026. One $349K Open Phil grant in 2023 was to the old Foundation (pre-pivot). The new Forethought has been endorsed by Longview and Coefficient Giving but appears not to have received major grants from either as of the fundraiser.

Business model: Grant-funded nonprofit. Fundraising from "highly-aligned private donors" -- a small number of wealthy individuals who share longtermist values. Max Dalton describes it as "a high variance bet" with "quite hits-based" research. The org is currently in a hiring round but "might want to end up making offers to more people than we can currently fund."

Legal structure: UK Private Limited Company by guarantee (not a registered charity). This means fewer public disclosure requirements than a charity. Incorporated December 2024. Donations routed through Giving What We Can and Every.org for tax deductibility.

Funding concentration risk: Two anonymous HNW donors controlling virtually all early funding is extremely high concentration. If either withdraws, the org has limited runway.

Economic ties to AI labs: No direct lab funding or compute dependencies. Board member Ryan Greenblatt is Chief Scientist at Redwood Research. The org claims to advise frontier AI companies but specifics are not disclosed. No known revenue from AI labs.

Incentive analysis: Forethought's incentives are reasonably aligned with its mission. No product revenue to distort priorities. No lab funding creating dependency. The main incentive risk is that the org's value proposition depends on short AI timelines being correct -- if timelines are long, the urgency claim weakens and donor interest may fade. There is also a risk that "communicate to the nerds" becomes an insularity trap where research is validated only within the EA community rather than by domain experts.

What Others Say

Strongest endorsements:

  • Daniel Kokotajlo (AI 2027 author, ex-OpenAI): "currently the best institution in the world doing full-throated, scope-sensitive macrostrategy"
  • Joe Carlsmith (ex-OP Senior Advisor, now Anthropic): "tackling the crucial but neglected question of how to ensure transformative AI leads to genuinely good outcomes -- not just human survival"
  • Zach Freitas-Groff (Longview Philanthropy): "exactly the kind of foundational thinking we need"

Strongest criticisms:

  • Internal (Lizka Vaintrob): "The set of worldviews & perspectives represented on the team feels too narrow to me." Also notes tension between solving problems vs. understanding confusion, and bouts of "unproductive perfectionism."
  • Philosophical (Hugg, Masrani): Longtermism is "almost tailor-made to disable the mechanisms by which we make progress" -- it is irrefutable by construction because you can always posit more future people. Focus on long-term effects "destroys the means by which we correct our mistakes."
  • Structural (AlgorithmWatch): Focus on hypothetical AGI diverts from present-day AI harms. Longtermist calculations "trade on false certainty." Mathematical frameworks downweight rights-based approaches.
  • FTX shadow (Crooked Timber): MacAskill's framing of FTX as a character failure rather than a systemic problem with utilitarian reasoning means "he will undoubtedly manage to learn no serious lesson at all."

Notably absent: No published external critique specifically evaluates Forethought itself (rather than longtermism in general). This is likely because the org is only ~1 year old.

What's Absent

  • Financial transparency: No published financial statements. As a UK limited company (not charity), minimal disclosure is required. Actual revenue, salary ranges, and budget breakdown are unknown.
  • Owen Cotton-Barratt's role: Active co-author on Forethought papers but not listed as staff. Sexual harassment controversy and 2-year EV ban not publicly addressed by Forethought.
  • External quality validation: All testimonials from within the EA ecosystem. No engagement with mainstream governance researchers, IR scholars, or policy academics.
  • Contingency thinking: No published analysis of what happens if their 2-8 year timeline assumption is wrong.
  • Advisory specifics: Claims to advise frontier AI companies but provides no names, policies influenced, or verifiable details.
  • Co-founder departure: Amrit Sidhu-Brar resigned from the board within 9 months, with no public explanation.

Recommended Reading

  1. 80K Hours podcast with MacAskill, "AI causing a century in a decade" (Oct 2025) -- The most candid source. 40K words where MacAskill discusses the full worldview, admits past errors on timelines, and gets into the weeds of grand challenges. Start here. https://80000hours.org/podcast/episodes/will-macaskill-century-in-a-decade-navigating-intelligence-explosion/

  2. "Strong Longtermism, Irrefutability, and Moral Progress" by Ben Chugg (Dec 2020) -- The strongest philosophical counterargument to the entire framework. Argues longtermism is irrefutable and disables error correction. https://forum.effectivealtruism.org/posts/2NJszbnBTwibfdpo7/

  3. Lizka Vaintrob, "A personal take on why you should work at Forethought (maybe)" (Oct 2025) -- Internal criticism: narrow worldviews, intellectual tension, perfectionism. The most honest picture of the org's culture. https://forum.effectivealtruism.org/posts/znFgboZ62NM7Cnsx9/

  4. "Preparing for the Intelligence Explosion" -- The theory of change document. 21K words on grand challenges. https://www.forethought.org/research/preparing-for-the-intelligence-explosion

  5. 80K Hours podcast with Tom Davidson on AI-enabled coups -- The most impactful concrete output. https://80000hours.org/podcast/episodes/tom-davidson-ai-enabled-human-power-grabs/

Show Claude’s analysis
An opinionated read. Read the brief first to form your own view.

Stated Theory of Change

Forethought claims that AI will likely trigger an intelligence explosion within the next decade, creating a "century of progress in a decade" scenario with many grand challenges beyond alignment. These challenges -- AI-enabled coups, WMD governance, digital rights, space resources, epistemic disruption, institutional lock-in -- are critically neglected relative to their importance. Early foundational research on these topics can have outsized impact, just as Bostrom's "Superintelligence" in 2013-14 catalyzed the entire field of AI safety. Forethought positions itself as the successor to FHI and Open Phil's Worldview Investigations team: a small group of exceptional thinkers tackling big-picture questions that nobody else is asking.

Additionally, they argue that "Flourishing" (making the surviving future truly great) is at least as important as "Survival" (avoiding extinction), and that work on Flourishing is far more neglected.

The causal chain is: Forethought researches neglected grand challenges --> findings reach AI companies, policymakers, and the EA/rationalist community --> this catalyzes new research agendas, policy proposals, and organizational activity --> society is better prepared for the intelligence explosion.

Revealed Theory of Change

Forethought's actions are broadly consistent with their stated theory, with some interesting emphases:

What they actually produce: High-volume, medium-depth research papers self-published on their website. 20+ papers in year one. The most impactful outputs -- the coups paper and the IE modeling -- are the most technically grounded. The "Better Futures" series is more philosophical and speculative. They also produce a weekly podcast and newsletter.

How they actually influence: Through the EA/rationalist network. The 80K Hours problem ranking change is their clearest documented impact. The SAIF VC fund creation and AI company advising claims are weaker. They "communicate to the nerds" -- think tank researchers, AI company staff, EA-aligned government officials -- rather than broader audiences.

Where the stated and revealed theories diverge slightly: The stated theory emphasizes working "unrestricted by the current Overton window" on genuinely novel ideas. The revealed theory shows substantial overlap with existing EA/rationalist concerns (power concentration, lock-in, digital minds). These are indeed underresearched relative to alignment, but they are not entirely new to the community. The most genuinely novel contribution is the "Better Futures" framework and the specific grand challenges catalog.

What they optimize for: Intellectual prestige within the EA/rationalist ecosystem. Publication speed. Attracting exceptional researchers from the same talent pool (FHI alumni, ex-OP, CLR). This is a reasonable strategy for a young org but has insularity risks.

Key Assumptions

Assumption 1: Short AI timelines (2-8 years to mostly-automated AI R&D)

  • Evidence for: Current scaling trends, benchmarks approaching human-level on key tasks, unprecedented investment. MacAskill cites the track record of "inside view" predictions outperforming outside-view skepticism.
  • Evidence against: Consistent history of AI timeline overconfidence. Current models still fail on many tasks. Scaling may hit diminishing returns.
  • Testable? Yes, and rapidly. This assumption will be clearly confirmed or refuted within 3-5 years.
  • If wrong: The urgency evaporates. The grand challenges remain important in the long run but the "prepare NOW" framing loses force. Forethought's fundraising pitch weakens.

Assumption 2: The intelligence explosion creates challenges beyond alignment that are currently neglected and tractable

  • Evidence for: The coups paper identified a genuinely novel threat model that gained traction. Historical analogies (nuclear weapons governance took decades to develop) suggest pre-planning matters.
  • Evidence against: The AlgorithmWatch critique -- these speculative challenges may crowd out work on immediate, concrete AI harms. Also: if alignment is unsolved, most other challenges are moot (you can't govern space resources if a misaligned AI has already taken over).
  • Testable? Partially. If AI-enabled coups become a real concern (even attempted), that validates the research. If the intelligence explosion unfolds differently than modeled, the specific challenges may be wrong.
  • If wrong: Forethought's research portfolio is misallocated. But even wrong macrostrategy research has option value -- it trains people to think about these questions.

Assumption 3: Small teams doing foundational research can have outsized impact

  • Evidence for: Strong historical precedent. Bostrom's "Superintelligence" with a small FHI team arguably catalyzed the entire AI safety field. Early nuclear strategy thinkers (RAND) shaped decades of policy.
  • Evidence against: Survivorship bias. Many small teams doing foundational research had zero impact. The current policy environment may be too competitive for a 9-person Oxford org to influence.
  • Testable? Only retrospectively, over years.
  • If wrong: Forethought's research is intellectually interesting but practically irrelevant. This is the nightmare scenario they're trying to avoid.

Assumption 4: "Communicate to the nerds" is the right dissemination strategy

  • Evidence for: The FHI model worked -- ideas percolated from a small academic group into mainstream policy. The EA/rationalist network includes people in key positions at AI labs and government.
  • Evidence against: The EA network is smaller and less trusted post-FTX. Ideas validated only within this community may have blind spots. Mainstream policy researchers, IR scholars, and governance academics might reject or ignore work that doesn't engage with their existing frameworks.
  • Testable? Yes, by whether Forethought's ideas gain traction beyond the EA community within 2-3 years.
  • If wrong: Forethought becomes an intellectual echo chamber producing smart research that nobody outside the community reads.

Strengths

  1. Genuinely novel intellectual contributions. The AI-enabled coups paper identified a real, previously under-analyzed threat. The Better Futures framework is a genuine philosophical innovation. The three-types-of-intelligence-explosion taxonomy is useful.

  2. Exceptional talent density. Davidson's IE modeling is the most rigorous in the field. MacAskill's ability to synthesize across philosophy, economics, and AI strategy is unusual. The FHI/CLR/OP alumni bring institutional knowledge from organizations that no longer exist.

  3. Speed and volume. 20+ papers in year one is remarkable for a 9-person org. By publishing on their own website rather than going through academic peer review, they sacrifice external validation but gain speed -- which matters if timelines are short.

  4. Low incentive misalignment. No product revenue, no lab funding, no compute dependencies. The org's financial incentives are about as clean as you can get in the AI safety space.

  5. Demonstrated policy impact. The 80K Hours problem ranking change is real and significant -- it redirects career advice to thousands of people. Whether the coups paper influences actual government policy remains to be seen, but the early signs (advising companies, MATS stream, International AI Safety Report contributions) are positive.

  6. Intellectual honesty. MacAskill is transparently willing to admit past errors (timelines, FTX). Lizka's self-critical post was published with organizational approval. The founding announcement uses "galaxy-brained" unironically as a self-description.

Weaknesses and Risks

  1. Timeline dependency. If short timelines are wrong, the entire urgency narrative collapses. The org has no published contingency plan for longer timelines.

  2. Worldview homogeneity. Lizka's criticism is damning: "The set of worldviews & perspectives represented on the team feels too narrow." All staff share EA/rationalist backgrounds. All board members have direct ties to OP, CEA, FHI, or Redwood. There is no institutional mechanism to introduce genuinely outside perspectives.

  3. Owen Cotton-Barratt governance concern. An active contributor with a sexual harassment history whose role is deliberately left ambiguous represents a real reputational and ethical risk. The silence about his status is inconsistent with post-FTX governance commitments.

  4. The "galaxy-brain" trap. Macrostrategy research is inherently difficult to evaluate. Space governance and "deals with AIs" and "viatopia" may be genuinely important neglected topics -- or they may be intellectually entertaining exercises that distract from more concrete work. The absence of external peer review means there is no quality check beyond internal disagreement.

  5. MacAskill reputational risk. MacAskill is simultaneously Forethought's greatest asset (intellectual leadership, network, name recognition) and greatest liability (FTX association, EA polarization). The org's identity is deeply intertwined with his, which creates single-point-of-failure risk.

  6. Financial fragility. Two anonymous HNW donors controlling most early funding, with runway only through December 2026, creates real organizational risk. If the fundraiser underperforms, research plans must be cut. The org has no endowment, no diversified revenue, and falls outside major funders' core areas.

  7. Impact measurement problem. Beyond the 80K Hours ranking change, impact claims are hard to verify. "Advising AI companies" is vague. The SAIF VC attribution is tenuous. For an org that espouses scope sensitivity and evidence-based reasoning, the impact evidence is largely anecdotal.

Cross-References

Most similar to: Future of Humanity Institute (intellectual lineage, Oxford base, big-picture thinking, small team), Open Phil's Worldview Investigations team (macrostrategy research), and to a lesser extent GovAI (AI governance research from Oxford).

Complementary to: Technical alignment orgs (MIRI, Redwood, Anthropic's alignment team) -- Forethought works on the problems alignment alone can't solve. Also complementary to policy-focused orgs (CSET, RAND AI) that need foundational thinking to build on.

Tension with: Orgs focused on near-term AI harms (AlgorithmWatch, AI Now Institute) that see longtermist/AGI focus as a distraction. Also tension with high-p(doom) alignment researchers who might see non-alignment grand challenges as premature if alignment itself is unsolved.

Gap-filling: Forethought uniquely occupies the "macrostrategy beyond alignment" niche. Nobody else is doing systematic research on AI-enabled coups, intelligence explosion dynamics, Better Futures, or space governance from an x-risk perspective at this level of depth.

What Would Change This Assessment

Positive updates:

  • A Forethought-originated policy proposal is adopted by a government or AI company (moves impact from theoretical to demonstrated)
  • An external academic review validates the coups paper or IE modeling as high quality (breaks the echo chamber concern)
  • A major non-EA funder invests, suggesting the ideas have traction beyond the community
  • The org publicly addresses OCB's role and establishes transparent governance documents

Negative updates:

  • AI timelines extend significantly beyond 8 years (weakens urgency narrative)
  • A former Forethought researcher departs with public criticism of the research quality or culture
  • The fundraiser significantly underperforms, forcing layoffs (existential risk to the org itself)
  • A specific policy recommendation turns out to be badly wrong or harmful
  • Evidence emerges that the AI company advisory work is minimal or performative

Self-Critique

Weakest claim: My assessment of Forethought's impact beyond the 80K Hours ranking change. The org may be doing significant work advising AI companies that I simply can't verify due to confidentiality. I may be underestimating the importance of this unobservable advisory work.

Potential biases: I may be overly critical of the OCB situation relative to its actual importance. I may also be biased by the post-FTX narrative -- holding MacAskill to a higher standard than I would hold other researchers because of the FTX association. The FTX failures were not MacAskill's direct doing, and his governance lessons may be genuinely internalized.

What a thoughtful person who disagrees would say: "You're holding a 1-year-old, 9-person research org to standards appropriate for a major institution. Of course they don't have external peer review, financial transparency, or governance documents yet. The fact that they've published 20+ papers and changed 80K Hours' problem ranking in year one is remarkable. Give them time."

What I might have missed: The ForeCast podcast may contain important material. Manifund donor comments were inaccessible. I also have limited visibility into how the ideas are being received by government officials and AI company staff behind closed doors.

What would most change my view: Seeing a concrete, verifiable policy change at a frontier AI company or government that was directly influenced by Forethought's research. That would move them from "promising but unproven" to "demonstrably impactful."

Connected to (12)

Sources (47)
Every URL that was read during research.
  1. 1.Forethoughtforethought.org
  2. 2.Researchforethought.org
  3. 3.Forethought: A new AI macrostrategy groupea.greaterwrong.com
  4. 4.Aboutforethought.org
  5. 5.Forethought has room for more fundingea.greaterwrong.com
  6. 6.Max Dalton | Forethoughtforethought.org
  7. 7.Tom Davidson | Forethoughtforethought.org
  8. 8.Tom Davidson on how quickly AI could transform the world | 80,000 Hours80000hours.org
  9. 9.A personal take on why you should work at Forethought (maybe)ea.greaterwrong.com
  10. 10.Will MacAskill on AI causing a “century in a decade” — and how we're completely unprepared | 80,000 Hours80000hours.org
  11. 11.We're Not Ready for AGI (with Will MacAskill)podcast.futureoflife.org
  12. 12.Human Takeover Might be Worse than AI Takeovernewsletter.forethought.org
  13. 13.Preparing for the Intelligence Explosionforethought.org
  14. 14.Preparing for the Intelligence Explosionea.greaterwrong.com
  15. 15.AI-Enabled Coups: How a Small Group Could Use AI to Seize Powerforethought.org
  16. 16.Macrostrategy | GovAIgovernance.ai
  17. 17.46 - Tom Davidson on AI-enabled Coupsaxrp.net
  18. 18.Tom Davidson on how AI-enabled coups could allow a tiny group to seize power | 80,000 Hours80000hours.org
  19. 19.Rose Hadshar on why automating human labour will break our political system | 80,000 Hours80000hours.org
  20. 20.Goodbye to FHIfinmoorhouse.com
  21. 21.About mefinmoorhouse.com
  22. 22.Fin Moorhouse | Why We Need to Aim Higher Than Survivalexistentialhope.com
  23. 23.Announcing the new Forethought Foundation for Global Priorities Researchea.greaterwrong.com
  24. 24.William MacAskill - Wikipediaen.wikipedia.org
  25. 25.Stefan Torges | Forethoughtforethought.org
  26. 26.ForeWord | Forethought | Substacknewsletter.forethought.org
  27. 27.Fundsopenphilanthropy.org
  28. 28.Forethought Foundation for Global Priorities Research - Security & Sustainabilitysecuresustain.org
  29. 29.FORETHOUGHT RESEARCHfind-and-update.company-information.service.gov.uk
  30. 30.FORETHOUGHT RESEARCHfind-and-update.company-information.service.gov.uk
  31. 31.Nonprofits — William MacAskillwilliammacaskill.com
  32. 32.A Case Against Strong Longtermismea.greaterwrong.com
  33. 33.Focus Attention on Accountability for AI − not on AGI and Longtermist Abstractions - AlgorithmWatchalgorithmwatch.org
  34. 34.Will AI R&D Automation Cause a Software Intelligence Explosion?forethought.org
  35. 35.Three Types of Intelligence Explosionforethought.org
  36. 36.(Senior) Research Fellowforethought.org
  37. 37.Donate | Forethoughtforethought.org
  38. 38.2025 fundraiser | Forethoughtforethought.org
  39. 39.Introducing Better Futuresforethought.org
  40. 40.Former Y Combinator president Geoff Ralston launches new AI ‘safety’ fund | TechCrunchtechcrunch.com
  41. 41.No Easy Eutopia: Why Great Futures Are Hard to Achieveforethought.org
  42. 42.AI Tools for Existential Securityforethought.org
  43. 43.Strong Longtermism, Irrefutability, and Moral Progressea.greaterwrong.com
  44. 44.How to Make the Future Better: Concrete Actions for Flourishingforethought.org
  45. 45.A rant on FTX, William MacAskill, and Utilitarianismcrookedtimber.org
  46. 46.AGI and Lock-inforethought.org
  47. 47.Careersforethought.org