Theory of Change
CG's AI safety theory of change operates through three pillars articulated by CEO Alexander Berger and President Emily Oehlsen:
Visibility: "Visibility into cutting-edge AI R&D is essential for understanding the trajectory of AI capabilities and their risks and benefits." Fund evaluations, benchmarks (Cybench, LAB-Bench), public data (Epoch AI), and high-quality journalism about AI development.
Safeguards: "To make worst-case scenarios less likely, it's important to design and implement technological and policy safeguards." Fund technical safety research (alignment, control, interpretability), policy frameworks (RSP development, bipartisan legislation), and security measures.
Capacity: "The field of AI safety and security faces a severe talent and infrastructure bottleneck." Fund fellowships (MATS, BlueDot), career transition grants, academic programs, and institutional capacity.
The broader organizational theory of change rests on "hits-based giving" -- a VC-like approach where most grants may fail but a few successes justify the portfolio. CG selects cause areas using an Importance-Neglectedness-Tractability framework and allocates across worldviews using a "worldview diversification" model that treats different ethical frameworks as negotiating agents.
Berger and Oehlsen state the specific risks they focus on: "Extreme misuse of AI technology, e.g. in designing bioweapons or enabling authoritarian control," "Scenarios where humanity loses control of advanced AI agents," and "Rapid changes that strain institutions."
What They Do
CG has directed >$4B in grants across all cause areas since 2011, with >$1B in 2025 alone. In AI safety specifically:
$580M cumulative since 2015 (when CG began AI safety work, seven years before ChatGPT)
- ~$130M+ in 2025 through the Navigating Transformative AI fund (>440 total grants)
- ~$50M on technical AI safety in 2024, including ~$25M on benchmarks/evals adopted by US/UK governments
- $40M+ Technical AI Safety RFP (2025) covering 21 research areas
- AI governance team (led by Luke Muehlhauser) aiming for >$100M/year
- Partnership team (10 people) advising 20+ individual donors on AI safety giving
Key grantees in AI safety include Epoch AI, CSET Georgetown, Redwood Research, MATS, BlueDot Impact, FAR.AI ($30M+), the Center for AI Safety, and researchers at RAND, CNAS, and multiple universities.
CG launched the $30M grant to OpenAI in 2017 that included a board seat for co-founder Holden Karnofsky (resigned 2021). It funded development of the RSP framework now used by frontier AI labs. In 2025, it launched RFPs for both technical AI safety and AI governance.
Non-AI achievements include >100,000 lives saved through global health interventions, the R21 malaria vaccine, the YIMBY movement, cage-free commitments improving 250M+ birds annually, the LEAF lead-poisoning fund ($100M+), and support for David Baker's Nobel Prize-winning protein design research.
Key People
Alexander Berger (CEO, sole since June 2023): Joined GiveWell as one of its first hires in 2011 (Stanford grad). Led GHW portfolio before becoming co-CEO then sole CEO. Donated a kidney to a stranger based on utilitarian calculus. In a 42K-word 80K Hours podcast, he describes himself as more skeptical of some longtermist reasoning than his predecessor, acknowledges structural biases in grant evaluation, and values multiple ethical frameworks simultaneously.
Holden Karnofsky (co-founder, left April 2024, now at Anthropic): Co-founded GiveWell and Open Philanthropy. Married to Daniela Amodei (Anthropic President) since August 2017. Was on OpenAI board 2017-2021. Joined Anthropic January 2025 to work on RSP and safety planning. Remains on CG's 501(c)(3) board.
Ajeya Cotra (Senior Advisor): Author of the bio-anchors AI timelines framework (2020). In a 2026 80K Hours podcast (30K words), she argues "probably in the early 2030s we are going to see... top-human-expert-dominating AI" and that current safety techniques will be "increasingly inadequate as AIs become more capable."
Team patterns: 150+ total staff. Technical AI safety team has only ~3 grant investigators (Peter Favaloro, Rossa O'Keeffe-O'Donovan, plus associates) managing $130M+/year. Notable departures to Anthropic: Karnofsky, Catherine Olsson, Daniel Dewey. Departure to FTX: Nick Beckstead (2021, then to Secure AI Project).
Money and Incentives
Total budget: >$1B directed in grants in 2025. AI safety: ~$130M+ in 2025.
Revenue breakdown: Overwhelmingly from Good Ventures Foundation (~$7.9B assets, funded by Dustin Moskovitz's Facebook/Meta fortune, ~$12-14B net worth). Non-Good Ventures money: >$100M in 2024, >$200M in 2025, but still a minority share. Moskovitz's $1.9B gift to Good Ventures in 2024 was the largest single contribution.
Business model: Philanthropic advisory and grantmaking. CG staff (at the LLC) investigate, evaluate, and recommend grants. Grants are made by multiple entities (CG Advisors 501(c)(3), Good Ventures Foundation, donor-advised funds, CG Action Fund 501(c)(4)). The 990-filing entity (EIN 81-0737472) shows only ~$3-5M/yr in operational costs -- the real money flows through Good Ventures.
Funding concentration in AI safety: CG + SFF account for an estimated 80% of philanthropic AI safety funding. CG itself: "Open Philanthropy and Good Ventures are currently a concentrated share of AI safety philanthropic funding." Climate philanthropy (~$9-15B/yr) is estimated at roughly 20x AI safety philanthropy.
Economic ties to Anthropic: (1) Dustin Moskovitz is an angel investor in Anthropic. (2) Holden Karnofsky (CG co-founder, 501(c)(3) board member) now works at Anthropic and is married to Daniela Amodei (Anthropic President). (3) Multiple CG staff (Olsson, Dewey) moved to Anthropic. (4) Anthropic co-founders pledged 80% of wealth ($37.8B combined at $380B valuation), much of which may flow through EA-aligned vehicles including CG. (5) CG funds Transformer, the journalism outlet that investigated these entanglements.
Incentive dynamics: Nonprofits in the AI safety space depend heavily on CG funding. CG's "Need More Funders" post acknowledges this and notes that external funders can find "2-5x as cost-effective" opportunities vs. CG's marginal dollar. Holly Elmore (PauseAI US) has publicly accused CG of not funding PauseAI "because they serve Anthropic's interests." CG has not publicly responded to this specific accusation.
What Others Say
Structural critique (monopsony): Jason Hausenloy: "Nonprofits will be optimized toward providing the best 'safety services' for those organizations. By default, this means almost all strategic thinking must come from that funder." He recommends CG publish goals, decision processes, funding flows, and use more RFPs.
CG-Anthropic entanglement: Transformer News: "Paying people to hold you accountable always carries inherent risks" (quoting Stanford's Leif Wenar). Holly Elmore (PauseAI): publicly accused CG of not funding her org "because they serve Anthropic's interests." Karnofsky himself wrote at Anthropic that "a lot of our employees are socially integrated into the AI safety community" and could be "a major source of donations for the kinds of non-profits that could be potential external reviewers."
Ecosystem critique: AI Panic newsletter maps CG as the primary funder of "hundreds of organizations" in the AI risk ecosystem, calling it "a well-orchestrated top-down movement." effektiv-spenden: 80% of AI safety philanthropy from CG + SFF. Luke Kemp (CSER): "There's a small number of key funders who have a very particular ideology, and either consciously or unconsciously select for the ideas that most resonate with what they want."
Defense: SSIR describes CG as "arguably more transparent than any other big foundation." Rob Reich (Stanford): "You come to understand what they're doing by reading their preposterously long and complex blog posts. I find that a virtue." CG publishes annual reviews, notable lessons, and detailed grant writeups. It funded a contest for essays challenging its AI risk views and awarded the top prize to research that contradicted its thinking.
CG's own acknowledgment: "It's very likely that there are high-impact funding gaps that other philanthropists could fill. Historically, there have been cases where we wished other funders had taken different bets from ours." CG says it was "slow to act" on scaling technical AI safety and is "now playing catch-up."
What's Absent
- No independent board members with AI safety expertise (2 of ~8 board members across entities are external)
- No independent evaluation of whether $580M+ in AI safety grantmaking has been effective
- Good Ventures Foundation (~$7.9B assets) has essentially no public governance information
- No formal whistleblower policy
- No published data on grant rejection rates or reasons
- No public accounting of the CG-to-Anthropic career pipeline
- No public disclosure of Moskovitz's Anthropic investment size
- CG stopped publishing relationship disclosures in grant writeups in August 2017 — the same month Karnofsky married Amodei — and removed historical disclosures from the website. Stated reason: "some disclosures seemed to unnecessarily infringe on the privacy of their staff and grantees."
- No specific public statement on how the Anthropic conflict is managed at the board level
- No evidence CG funds organizations fundamentally opposed to continued frontier AI development (e.g., PauseAI, ControlAI, MIRI's recent pause advocacy)
Recommended Reading
Ajeya Cotra on 80,000 Hours podcast (https://80000hours.org/podcast/episodes/ajeya-cotra-transformative-ai-crunch-time/) -- CG's most sophisticated AI risk thinker speaks at length about intelligence explosion, why safety plans may fail, and the narrowing window for safety work. 30K words. The best window into how CG's AI safety team actually thinks.
Transformer News: "Anthropic employees say they'll give away billions" (https://www.transformernews.ai/p/anthropic-employees-philanthropy-billions-donations-effective-altruism-coefficient-giving-ai-safety) -- The most thorough investigation of CG-Anthropic entanglement. Discloses that CG funds Transformer. Maps revolving door, conflicts, and the coming Anthropic wealth wave.
Jason Hausenloy: "The AI Safety Ecosystem Is A Monopsony" (https://firstscattering.com/p/the-ai-safety-ecosystem-is-a-monopsony) -- Short, precise structural critique of funding concentration. The version of the criticism that should worry CG most.
CG: "Our Approach to AI Safety and Security" (https://coefficientgiving.org/research/our-approach-to-ai-safety-and-security/) -- CG's definitive AI safety strategy. Three pillars, risk pathways, specific research priorities. The official theory of change.
Alexander Berger on 80,000 Hours podcast (https://80000hours.org/podcast/episodes/alexander-berger-improving-global-health-and-wellbeing-with-open-philanthropy/) -- 42K words from the CEO. Focus is GHW, but the intellectual framework, philosophical orientation, and organizational culture are fully on display.