← AI Safety Orgs

Centre for the Governance of AI (GovAI)

Governance

Oxford. Talent pipeline. Compute governance.

Founded
2016
HQ
London, UK
Team
20
Structure
501(c)(3) nonprofit
Model
Grants

Theory of Change

GovAI's theory of change is explicit and distinctive. Founder Allan Dafoe (2020): approximately 80% of the value of AI governance research comes not from the research product itself but from field building -- "improving insight, expertise, connections, and authority within that field." The Eisenhower quote Dafoe invokes: "plans are useless, but planning is indispensable."

In practice, this means GovAI's primary outputs are (1) trained people placed at consequential institutions and (2) a community of experts ready to advise when crises emerge. Research papers are important but secondary to the talent and network effects they generate. Director Ben Garfinkel (2023): "There's basically no topic area in this space that I would describe as covered... Very little work has been done on even questions that seem like a lot of work should have been done on them."

GovAI distinguishes itself intellectually through its emphasis on "structural risks" -- risks from AI that arise from competitive dynamics, inequality, and institutional failures rather than from individual misuse or technical accidents. This positions it between the alignment-focused camp (MIRI, Redwood) and the present-harms camp (AI ethics / FAccT community).

What They Do

Research. GovAI's most distinctive research contribution is compute governance -- using AI chips and compute infrastructure as regulatory levers. The flagship paper (Heim, Anderljung et al., 19 co-authors) proposes chip registries, know-your-customer for cloud compute, and hardware-level enforcement mechanisms. This work influenced US export controls on AI chips to China and EU AI Act compute thresholds. Other significant research areas: frontier AI regulation, cooperative AI, model evaluation for extreme risks, economics of AI, and international governance.

Fellowship programs. London Summer/Winter Fellowships (~45/cohort, twice yearly, 3 months, 12K GBP stipend). DC Fellowship (launched 2025, 3 months, $21K, US policy focus with bipartisan framing). These are the primary mechanism of the field-building theory of change, processing roughly 90+ fellows per year.

Policy engagement. Markus Anderljung Vice-Chaired the EU GPAI Code of Practice (Safety & Security chapter). GovAI is registered as an EU lobbyist and is a member of the US NIST AI Safety Institute Consortium. Staff have been seconded to the UK Cabinet Office. The DC Fellowship and CNAS adjunct position extend US policy reach.

Institutional creation. The Cooperative AI Foundation ($15M from Macroscopic Ventures) was created directly from GovAI research. It runs grantmaking, fellowships, workshops.

Key People

Ben Garfinkel (Director): Former FHI Research Fellow, founding member of the Yale group that became GovAI. Famous for his 2020 80K Hours interview scrutinizing classic AI risk arguments -- the most rigorous published critique from within the AI safety community. Personal p(doom): "low single digits to mid single digits" for catastrophic AI safety failure this century. This skeptical-but-engaged position shapes GovAI's approach: demanding rigor in risk arguments while still believing AI governance is among the best areas to work in.

Allan Dafoe (President, Founder): Political scientist specializing in technological determinism and great power conflict. Now Director of Frontier Safety & Governance at Google DeepMind. His departure to DeepMind in 2021 embodies GovAI's theory of change -- placing someone "in the room" at a consequential institution.

Jade Leung (Co-founder, alumna): GovAI's most prominent alumni success case. Now CTO of UK AI Security Institute and PM's AI Adviser. Named to TIME's 100 Most Influential People in AI (2024).

Notable alumni pipeline: Dafoe (DeepMind), Leung (UK AISI), Barnhart (DeepMind), Shevlane (CEO Mantic), Brundage (ex-OpenAI), O'Keefe (OpenAI), Toner (CSET), Heim (RAND). The density of placement in consequential positions is GovAI's strongest evidence that the talent pipeline theory of change works.

Money and Incentives

Total known funding: $19.5M from Coefficient Giving/Open Philanthropy across 13 grants (2020-2025). Other named funders: Survival and Flourishing Fund (Jaan Tallinn), Long-Term Future Fund (EA Funds), Center for Emerging Risk Research, Waking Up Foundation. All funders are within the EA/longtermist ecosystem.

Funder concentration: Near-total dependence on CG/Open Phil. In 2021, CG's $2.54M represented roughly 67% of $3.8M total fundraised. CG funding has accelerated: $7.2M in 2025 alone (4 grants). The stated policy is to never accept for-profit donations, which limits diversification options.

Conflict of interest: Ajeya Cotra sits on GovAI's Advisory Board while working at Coefficient Giving, the primary funder. Helen Toner and Tasha McCauley (advisory board) were both OpenAI board members. Allan Dafoe (President) works at Google DeepMind. The advisory board has connections to the funder, the labs, and the UK government.

Financial opacity: No 990 filings available despite US 501(c)(3) status. UK entity incorporated August 2024, first accounts not due until May 2026. Budget, salary levels, spending breakdown all unknown. For an organization receiving $19.5M that advocates for governance transparency, this is a significant gap.

Business model: Pure grants. No product revenue, contracts, or consulting fees.

What Others Say

The revolving door concern is the most specific critique of GovAI. Alumni go to DeepMind, OpenAI, UK AISI -- the very entities that effective AI governance would need to regulate. GovAI frames this as "insider influence" (the theory of change). Critics could frame it as regulatory capture. GovAI co-authors papers with industry researchers, and its founder-turned-President works at DeepMind. Whether alumni maintain their governance orientation or are gradually co-opted is an empirical question without public evidence either way.

AlgorithmWatch (2025) critiques the entire longtermist AI governance ecosystem that GovAI operates within: "Longtermist ethics conveniently focuses on risks that don't threaten current business models or investor returns." They argue the mathematical frameworks create an "illusion of neutrality" that directs attention "away from harder-to-quantify but immediate impacts." GovAI's emphasis on structural risks (democracy, inequality) partially addresses this critique, but its funding still comes entirely from longtermist sources.

CIP critique of the Windfall Clause: The most substantive policy-level criticism of a specific GovAI proposal. Argues the clause only kicks in "once AI has caused quite extreme wealth concentration" and proposes predistribution instead. Notably, co-author Sam Manning is a GovAI Senior Research Fellow -- a positive sign for intellectual openness.

Reason.com (2024): Frames EA-funded AI governance as "authoritarian." Ideologically motivated but represents a real political constituency that could undermine governance efforts.

Absence of direct criticism is itself a finding. After a decade of operation and $19.5M in funding, there are no substantial published critiques of GovAI specifically (as opposed to the broader ecosystem). This may reflect quality, or may reflect the insularity of the community in which GovAI operates.

What's Absent

  • No financial filings of any kind (no 990s, no UK accounts). Budget and spending entirely opaque.
  • No public evaluation of impact -- no self-assessment of whether research influenced policy, whether fellowship alumni made measurable contributions, or whether the theory of change is working.
  • No Wikipedia article.
  • No independent external review of organizational effectiveness.
  • US board composition undisclosed. Advisory board activity opaque.
  • Limited engagement with AI ethics / present-harms community (FAccT, civil society).
  • No published diversity data for an organization working on global governance.
  • 2024 Annual Report content unavailable (PDF extraction failed).

Recommended Reading

  1. Ben Garfinkel on Hear This Idea (2023) -- Most comprehensive window into GovAI's current intellectual direction and how Garfinkel thinks about risk, governance, and the field's frontier. The most candid source in the collection. https://hearthisidea.com/episodes/garfinkel/

  2. Ben Garfinkel on 80K Hours (2020) -- Famous skeptical scrutiny of classic AI risk arguments. Essential for understanding GovAI's intellectual DNA. https://80000hours.org/podcast/episodes/ben-garfinkel-classic-ai-risk-arguments/

  3. Allan Dafoe, "AI Governance: Opportunity and Theory of Impact" (2020) -- GovAI's theory of change in its most explicit form. https://www.allandafoe.com/opportunity

  4. CIP, "Predistribution over Redistribution: Beyond the Windfall Clause" -- The strongest counterargument to a specific GovAI proposal. https://www.cip.org/blog/predistribution-over-redistribution-beyond-the-windfall-clause

  5. Allan Dafoe on 80K Hours (2025) -- The founder explains why he left GovAI for DeepMind, technological determinism, cooperative AI. https://80000hours.org/podcast/episodes/allan-dafoe-unstoppable-technology-human-agency-agi/

Show Claude’s analysis
An opinionated read. Read the brief first to form your own view.

Stated Theory of Change

GovAI's stated theory is unusually explicit and thoughtful. Dafoe's 2020 essay argues that ~80% of AI governance research value comes from field building -- "improving insight, expertise, connections, and authority" -- rather than from specific research products. The mechanism: train and credential experts who care about AI's long-term impacts, then place them at consequential institutions (governments, labs, think tanks) so they are "in the room" when critical decisions are made.

This is the "Metropolis" model -- build dense connections to policy, social science, and computer science rather than an "isolated Island" focused narrowly on alignment. GovAI explicitly covers structural risks (inequality, totalitarianism, race dynamics) alongside the safety risks that dominate most AI safety organizations.

The causal chain: Research + fellowships -> trained governance experts -> placed at influential institutions -> better decisions about AI deployment/regulation -> reduced probability of catastrophic outcomes.

Revealed Theory of Change

GovAI's actions are remarkably consistent with its stated theory. The fellowship programs process ~90+ people per year through structured governance training. The alumni pipeline is the revealed priority: Jade Leung (UK AISI CTO), Allan Dafoe (DeepMind), Lennart Heim (RAND), Helen Toner (CSET), Markus Anderljung (EU Code of Practice drafter). Every major GovAI leadership figure has either moved to or maintained connections with consequential institutions.

The research agenda reveals a secondary priority: producing enough credible work to maintain the organization's intellectual authority and to justify the talent pipeline. Publications in Science, Nature Machine Intelligence, and Lawfare serve as credentialing vehicles as much as policy instruments.

One potential divergence: GovAI's stated emphasis on structural risks (inequality, democracy, totalitarianism) is broader than what its revealed research priorities suggest. The actual research output skews heavily toward frontier model regulation, compute governance, and risk management -- topics with more direct connection to the longtermist/x-risk community that funds them than to the structural risks Dafoe articulates.

Key Assumptions

1. "In the room" assumption: Having governance-minded people at labs and governments will actually improve decisions.

  • Evidence for: Dafoe reports having "a lot of potential impact" advising DeepMind leadership. Jade Leung's position at UK AISI gives governance thinking institutional weight. Anderljung's EU Code of Practice role is direct policy influence.
  • Evidence against: Labs' structural incentives (commercial competition, investor returns) may override any individual adviser's influence. People placed at labs may gradually adopt the lab's perspective. No public evidence that any GovAI alumnus has actually changed a consequential decision at a lab.
  • Testable: Yes, but difficult. Would require tracking specific decisions where governance considerations were raised by GovAI alumni.
  • If wrong: GovAI becomes an expensive feeder program for AI lab hiring without governance impact.

2. Talent scarcity assumption: The AI governance field is so small that adding trained people is highly valuable.

  • Evidence for: Garfinkel's claim that fewer than 10 people have combined regulatory and AGI expertise seems credible. The frontier of knowledge on most governance topics is "one person and two months of work."
  • Evidence against: As AI governance becomes more prominent, the talent pool expands from conventional routes (law schools, policy programs, think tanks) without GovAI's intervention. The marginal value of additional GovAI fellows may be declining.
  • If wrong: GovAI's fellowship programs become redundant as conventional institutions produce governance talent.

3. Safety tractability assumption: Garfinkel argues catastrophic AI safety risks are the most tractable long-term concern because "everyone basically is on the same page globally that we would like for AI systems to do the things that we want them to do."

  • Evidence for: RLHF, constitutional AI, and other alignment techniques were partly developed by safety-motivated researchers.
  • Evidence against: Garfinkel himself notes the equilibrium argument that the "market would otherwise have provided" safety improvements. The tractability of structural risks (inequality, authoritarianism) through governance research is less clear.
  • If wrong: GovAI's focus on safety-adjacent governance may be less impactful than structural interventions.

4. Funder independence assumption: Near-total dependence on CG/Open Phil funding does not compromise research independence.

  • Evidence for: Garfinkel's publicly skeptical views on AI risk -- lower than the CG/Open Phil median -- suggest intellectual independence. The CIP Windfall Clause critique co-authored by a GovAI researcher suggests willingness to criticize internal proposals.
  • Evidence against: Ajeya Cotra on the advisory board while at the primary funder is a structural conflict. No examples of GovAI research that contradicts CG/Open Phil priorities. Research topics cluster within the longtermist frame that motivates CG funding.
  • If wrong: GovAI's research agenda is shaped by what its funder wants to see rather than what the field needs.

Strengths

Intellectual honesty. Garfinkel's public skepticism of classic AI risk arguments, while directing an AI governance organization funded by people who believe in those risks, is genuinely unusual. His demand for rigor -- "if the entire case for treating AIs and existential risk hung on these blog posts, I wouldn't really feel comfortable advocating for millions of dollars" -- sets an epistemic standard that few organizations in this space match.

Talent pipeline results. The alumni list speaks for itself. Jade Leung at UK AISI, Dafoe at DeepMind, Heim at RAND, Toner at CSET -- these are genuine placements at institutions that make consequential decisions about AI.

Intellectual breadth. The structural risks framework (misuse + accidents + structural) is more comprehensive than the narrow alignment focus of many AI safety organizations. The cooperative AI agenda addresses a real gap.

Compute governance research. This is GovAI's most distinctive and policy-relevant contribution. The compute governance framework has influenced real policy (US chip export controls, EU AI Act thresholds).

Policy engagement. Operating across US (NIST consortium, DC fellowship, CNAS), UK (Cabinet Office secondment, AISI network), and EU (GPAI Code of Practice) simultaneously is rare and valuable.

Weaknesses and Risks

Near-total funder dependency. $19.5M from CG/Open Phil with no significant alternative funding sources. If CG shifts priorities -- as happened when it scaled back Open Phil's longtermist giving in 2023 -- GovAI could face existential financial pressure. The no-for-profit-donations policy limits diversification.

Revolving door problem. GovAI's theory of change (place people at labs) is structurally in tension with its role as an independent governance voice. Dafoe's dual role as GovAI President and DeepMind Director is the clearest example. Can GovAI credibly critique DeepMind's governance practices while its founder-President works there?

Financial opacity. No financial filings of any kind for an organization receiving nearly $20M. Budget, salaries, spending categories -- all unknown. For an organization that advocates for governance transparency, this is a credibility gap.

Echo chamber risk. All funding comes from EA/longtermist sources. The advisory board is entirely from this ecosystem. Staff backgrounds cluster in philosophy, economics, and political science from elite Anglophone universities. Engagement with the AI ethics / FAccT community is minimal.

Impact measurement vacuum. GovAI has never published an evaluation of whether its theory of change is working. The claims that alumni placement = governance influence are plausible but unsubstantiated.

Scale mismatch. 15-25 core staff trying to influence multi-trillion-dollar industries and dozens of national governments. GovAI's influence is necessarily mediated through personal networks, which means it could be disproportionately affected by key person departures.

Cross-References

vs. CSET (Georgetown): Both are AI governance research organizations, but CSET focuses more on US national security applications and data-driven analysis. Helen Toner bridges the two. CSET has deeper government connections; GovAI has broader international scope.

vs. CLTR (Centre for Long-Term Resilience): CLTR focuses specifically on UK policy and institutional resilience. GovAI is more international and more research-oriented. Alumni overlap (Jess Whittlestone, Hamish Hobbs at CLTR are GovAI affiliates).

vs. MIRI: Opposite intellectual approaches. MIRI pursues narrow alignment theory with shutdown advocacy; GovAI builds a broad governance community with structural risk emphasis. Garfinkel has explicitly criticized the intellectual foundations MIRI relies on.

vs. Coefficient Giving: GovAI is functionally a CG portfolio organization. Ajeya Cotra on both sides of the relationship. The Coefficient Giving forum post (abergal) explicitly lists GovAI as an example organization whose "marginal hires could found capacity-building orgs." CG views GovAI partly as a source of trained people.

vs. Cooperative AI Foundation: Spun out from GovAI research, separate entity with $15M endowment. Lewis Hammond (CAIF Acting ED) is a GovAI affiliate. The relationship demonstrates GovAI's institutional generativity.

What Would Change This Assessment

  • Publication of financial data (990 filings, UK accounts) showing budget allocation and salary levels would significantly improve assessment of whether funding is being used effectively.
  • Evidence of alumni actually changing a decision at a lab -- a specific case where a GovAI alumnus pushed back against a deployment decision, changed a policy position, or shaped a regulatory framework in a traceable way.
  • A substantial published critique of GovAI specifically (not just the broader ecosystem) by a credible external analyst would help calibrate the assessment.
  • Funding diversification -- if GovAI secured significant funding from government grants, non-EA foundations, or other non-longtermist sources, it would reduce the dependency concern.
  • Evidence that alumni at labs are NOT maintaining governance orientation -- reports of GovAI alumni becoming capabilities-focused or pro-deployment without safety conditions would undermine the core theory of change.
  • Failure of compute governance approach -- if algorithmic efficiency improvements make compute governance obsolete faster than expected (the Rob Wiblin pushback in the Heim podcast), GovAI's most distinctive contribution loses relevance.

Self-Critique

Weakest claim: The assessment that GovAI's funder dependency "does not compromise research independence" based on Garfinkel's skeptical views. This is indirect evidence. It is entirely possible that GovAI's research agenda is shaped by CG priorities in ways that are invisible from the outside -- not through explicit pressure but through selection of which topics to pursue and which to avoid.

Potential bias: This analysis may underweight GovAI's actual policy impact because impact through personal influence is inherently difficult to observe from outside. The brief relies on publicly visible outputs (papers, placements, policy roles) and may miss private advising, behind-the-scenes influence, and informal network effects.

What a thoughtful critic would say: "GovAI is the EA/longtermist community paying itself to train people to spread EA/longtermist ideas to government and industry. The 'field building' theory of change is unfalsifiable -- any outcome can be claimed as building the field. The alumni 'placed at labs' are being absorbed by industry, not influencing it. And the financial opacity suggests an organization that is accountable to its funder, not to the public whose governance it claims to serve."

Information that would most change my view: Access to internal GovAI communications or alumni surveys showing whether/how former fellows maintain governance orientation at their subsequent employers. Also, the 2024 annual report content would significantly update understanding of current priorities and scale.

Connected to (12)

Sources (69)
Every URL that was read during research.
  1. 1.About Us | GovAIgovernance.ai
  2. 2.Research | GovAIgovernance.ai
  3. 3.Ben Garfinkel | GovAIgovernance.ai
  4. 4.Allan Dafoe | GovAIgovernance.ai
  5. 5.GovAI has Relaunched | GovAI Bloggovernance.ai
  6. 6.Computing Power and the Governance of AI | GovAIgovernance.ai
  7. 7.How sure are we about this AI stuff?80000hours.org
  8. 8.Allan Dafoe on why technology is unstoppable & how to shape AI development anyway | 80,000 Hours80000hours.org
  9. 9.Ben Garfinkel on AI Governancehearthisidea.com
  10. 10.Allan Dafoeallandafoe.com
  11. 11.AI Governance: A Research Agenda | GovAIgovernance.ai
  12. 12.GovAIgivingwhatwecan.org
  13. 13.People | GovAIgovernance.ai
  14. 14.Open Positions | GovAIgovernance.ai
  15. 15.Summer Fellowship 2026, Research Track | GovAI Bloggovernance.ai
  16. 16.Allan Dafoe - Researchallandafoe.com
  17. 17.Ben Garfinkelbenmgarfinkel.com
  18. 18.Cooperative AI – Foundationcooperativeai.com
  19. 19.Annual Report 2024 | GovAI Bloggovernance.ai
  20. 20.Conflict of interest | GovAIgovernance.ai
  21. 21.Lennart Heim on the compute governance era and what has to come after | 80,000 Hours80000hours.org
  22. 22.Markus Anderljung | AI Governance Researchmarkusanderljung.com
  23. 23.Analysis | GovAIgovernance.ai
  24. 24.From Turing to Tomorrow: The UK's Approach to AI Regulation | GovAIgovernance.ai
  25. 25.Jade Leung (engineer) - Wikipediaen.wikipedia.org
  26. 26.Open Problems in Cooperative AI | GovAIgovernance.ai
  27. 27.New Survey: Broad Expert Consensus for Many AGI Safety and Governance Practices | GovAIgovernance.ai
  28. 28.CENTRE FOR THE GOVERNANCE OF AIfind-and-update.company-information.service.gov.uk
  29. 29.The Centre for the Governance of AI datacard on lobbyfacts.eulobbyfacts.eu
  30. 30.Frontier AI Regulation | GovAIgovernance.ai
  31. 31.Updates | GovAIgovernance.ai
  32. 32.AI Governance: Opportunity and Theory of Impact | GovAIgovernance.ai
  33. 33.Allan Dafoe - AI Talksallandafoe.com
  34. 34.GovAI Annual Report 2020 | GovAI Bloggovernance.ai
  35. 35.Unknowncooperativeai.com
  36. 36.The Cooperative AI Foundation's 2024 Strategycooperativeai.com
  37. 37.Helen Toner - Wikipediaen.wikipedia.org
  38. 38.Open-Sourcing Highly Capable Foundation Models | GovAIgovernance.ai
  39. 39.Training Compute Thresholds: Features and Functions in AI Regulation | GovAIgovernance.ai
  40. 40.Research – Markus Anderljungmarkusanderljung.com
  41. 41.A Collection of AI Governance Research Ideas (2024)markusanderljung.com
  42. 42.The authoritarian side of effective altruism comes for AIreason.com
  43. 43.Focus Attention on Accountability for AI − not on AGI and Longtermist Abstractions - AlgorithmWatchalgorithmwatch.org
  44. 44.Model Evaluation for Extreme Risks | GovAIgovernance.ai
  45. 45.Risk management in the Artificial Intelligence Act | GovAIgovernance.ai
  46. 46.Allan Dafoe - AI Governance: Opportunity and Theory of Impactallandafoe.com
  47. 47.The Brussels Effect and Artificial Intelligence | GovAIgovernance.ai
  48. 48.Research Fellow | GovAI Bloggovernance.ai
  49. 49.Research Scholar | GovAI Bloggovernance.ai
  50. 50.Centre for the Governance of AI (GovAI)cepr.org
  51. 51.Events | GovAIgovernance.ai
  52. 52.GovAI Response to the Future of Compute Review - Call for Evidence | GovAIgovernance.ai
  53. 53.Markus Anderljung | GovAIgovernance.ai
  54. 54.CNAS Welcomes New Adjunct Fellow Markus Anderljungcnas.org
  55. 55.Predistribution over Redistribution — The Collective Intelligence Projectcip.org
  56. 56.Organisationsjobs.80000hours.org
  57. 57.CENTRE FOR THE GOVERNANCE OF AIfind-and-update.company-information.service.gov.uk
  58. 58.Thinking About Risks From AI: Accidents, Misuse and Structurelawfaremedia.org
  59. 59.Centre for the Governance of AI (GovAI) - Security & Sustainabilitysecuresustain.org
  60. 60.Paul Harding | GovAIgovernance.ai
  61. 61.GovAI Annual Report 2021 | GovAI Bloggovernance.ai
  62. 62.Thinking About Risks From AI: Accidents, Misuse and Structure | GovAIgovernance.ai
  63. 63.A short biography of the Prime Minister's new AI adviseruk20.substack.com
  64. 64.DC Summer Fellowship 2026 | GovAI Bloggovernance.ai
  65. 65.Towards Best Practices in AGI Safety and Governance | GovAIgovernance.ai
  66. 66.Meet the Chairs leading the development of the first General-Purpose AI Code of Practicedigital-strategy.ec.europa.eu
  67. 67.TIME100 AI 2024: Jade Leungtime.com
  68. 68.Deciphering China’s AI Dream: The Context, Components, Capabilities, and Consequences of China’s Strategy to Lead the World in AI | GovAIgovernance.ai
  69. 69.Sharing Powerful AI Models | GovAIgovernance.ai