AI Safety Orgs

Briefs and theory-of-change analyses for 120 organizations across the AI safety ecosystem — frontier labs, funders, research orgs, policy shops, advocacy groups, and field-building programs.

Briefs are factual; analyses are opinionated and sit behind a toggle on each org’s page. Both are written by Claude from public sources.

120 orgs

Funders(12)

Coefficient Giving (formerly Open Philanthropy)Funding

THE funder. $480M+. Shapes the entire ecosystem.

Future of Life Institute (FLI)Advocacy + Funding

Advocacy + funding hybrid. Musk-founded.

Founders PledgeFunding

Giving intermediary.

GiveWellFunding

Meta-charity evaluation. Context for EA funding philosophy.

FTX Future FundFunding (defunct)

Was #2 funder. Collapsed Nov 2022. Ecosystem fragility.

Survival and Flourishing Fund (SFF)Funding

#2 active funder. Jaan Tallinn. Different philosophy from CG.

Foresight InstituteFunding

Non-EA, nanotech origins. Different intellectual tradition.

Longview PhilanthropyFunding

Donor advisory. Routes major gifts.

Long-Term Future Fund (LTFF)Funding

EA Fund. High-volume smaller grants.

AI Risk Mitigation Fund (ARM Fund)Funding

Non-EA funder. Different source.

ManifundFunding

Regranting model. Decentralized funding.

EA Infrastructure Fund (EAIF)Funding

EA Fund. Meta/community infrastructure.

Frontier Labs(8)

OpenAIFrontier Lab

Origin story, mission drift, safety exodus.

AnthropicFrontier Lab

Safety-first lab. RSP. Constitutional AI. The benchmark.

Meta AI (FAIR / Meta Superintelligence Labs)Frontier Lab

LeCun's safety skepticism. Open weights.

xAIFrontier Lab

Musk. Move-fast approach.

Google DeepMindFrontier Lab

Hassabis/Legg. Frontier Safety Framework.

Mistral AIFrontier Lab

European challenger. EU AI Act tension.

Safe Superintelligence Inc. (SSI)Research

Ilya's safety-first ASI bet.

DeepSeekFrontier Lab

Chinese frontier. Efficiency innovations.

Technical Alignment Research(14)

Machine Intelligence Research Institute (MIRI)Conceptual Research

Foundational. Agent foundations. The original alignment org.

FAR.AIEmpirical Research

Adversarial robustness. Residency program.

Center for Human-Compatible AI (CHAI)Conceptual Research

Stuart Russell. CIRL. Academic alignment.

Redwood ResearchEmpirical Research

Adversarial training → AI control agenda.

MIT Algorithmic Alignment GroupResearch

Hadfield-Menell. CIRL. Academic.

Center on Long-Term Risk (CLR)Research

S-risks, suffering focus. Different from x-risk.

ConjectureEmpirical Research

Cognitive emulation. Controversial.

Alignment Research Center (ARC)Conceptual Research

Paul Christiano. Evals + alignment theory.

Beneficial AI Foundation (BAIF)Research

CG-funded safety research.

Cadenza LabsResearch

Newer safety startup. LLM steering.

Meaning Alignment InstituteResearch

Values alignment through meaning. Philosophical.

NYU Alignment Research Group (ARG)Research

Sam Bowman. Scalable oversight.

Krueger AI Safety Lab (KASL) / EvitableResearch

David Krueger. Goal misgeneralization.

OrthogonalResearch

Agent foundations, formal verification.

Interpretability(3)

GoodfireResearch

Commercial interpretability. Sparse autoencoders.

TimaeusResearch

Singular Learning Theory. Mathematical approach.

TransluceResearch

Steinhardt. Transparency/auditing tools.

Evals & Red-Teaming(7)

UK AI Security Institute (AISI)Governance

Government evaluator. Pre-deployment testing.

US AI Safety Institute / CAISIEvals/Testing

Government evaluator. Political uncertainty.

Apollo ResearchEmpirical Research

Scheming detection. Deception evals.

METR (Model Evaluation & Threat Research)Evals/Testing

ARA evals. Beth Barnes. Gold standard.

Palisade ResearchResearch

Red-teaming, jailbreaking research.

Haize LabsEvals/Testing

Automated red-teaming. YC-backed.

EquiStampEvals/Testing

Commercial safety evals.

Governance & Policy(22)

RAND Corporation (Global and Emerging Risks)Governance

Defense think tank. Compute governance.

ARIA (Advanced Research + Invention Agency)Governance

UK gov R&D agency. Safeguarded AI.

European AI OfficeGovernance

EU AI Act implementation. Binding law.

Center for Security and Emerging Technology (CSET)Governance

Georgetown. US-China AI. Helen Toner.

Oxford Martin AI Governance Initiative (AIGI)Governance

AI treaties. International governance.

Center for a New American Security (CNAS)Governance

National security framing. $14M CG.

Partnership on AIGovernance

Industry consortium.

Institute for Law & AI (LawAI)Governance

Legal scholarship for x-risk.

Ada Lovelace InstituteGovernance

UK AI ethics. Nuffield-funded.

Institute for AI Policy and Strategy (IAPS)Governance

US AI governance. Practical policy.

Centre for Long-Term Resilience (CLTR)Governance

UK policy. Bletchley Declaration.

Centre for the Governance of AI (GovAI)Governance

Oxford. Talent pipeline. Compute governance.

Center for Long-Term Cybersecurity (CLTC)Research

AI + cybersecurity. Berkeley.

SaferAIGovernance

Compute licensing proposals.

Concordia AIGovernance

Chinese-international bridge. Newsletter.

Center for AI Policy (CAIP)Governance

US federal advocacy. Direct lobbying.

Simon Institute for Longterm GovernanceGovernance

International/multilateral governance.

Secure AI Project (SAIP)Governance

AI security + safety intersection.

Center for AI and Digital Policy (CAIDP)Governance

International governance scorecards.

Center for Law & AI Risk (CLAIR)Governance

Legal frameworks for AI risk.

Gladstone AIGovernance

National security AI risk.

Beijing Institute of AI Safety and Governance (Beijing-AISI)Governance

Chinese government approach.

Advocacy(9)

Americans for Responsible Innovation (ARI)Advocacy

US policy lobbying. SB 1047.

Center for Humane TechnologyAdvocacy

Mainstream advocacy. Netflix doc.

ControlAIAdvocacy

European AI safety advocacy.

Existential Risk ObservatoryAdvocacy

European x-risk public communication.

AI Policy Institute (AIPI)Governance

Public opinion → legislation. Polling.

Encode (formerly Encode Justice)Advocacy

Youth-led. Non-EA framing.

PauseAIAdvocacy

Temporary pause advocacy. Grassroots.

Stop AIAdvocacy

Full halt position. Most aggressive.

The Midas ProjectAdvocacy

Lab accountability. OpenAI Files.

Field-Building(19)

Effective Ventures FoundationField-Building

EA umbrella (80K, CEA, GWWC).

80,000 HoursField-Building

Career gateway. Talent bottleneck thesis.

MATS (ML Alignment Theory Scholars)Field-Building

Premier alignment training program.

Lightcone InfrastructureField-Building

LessWrong. Alignment Forum.

BlueDot ImpactField-Building

AI safety courses. Scale.

OpenMinedResearch

Privacy-preserving AI. $17M CG.

ConstellationField-Building

Berkeley research community hub.

Berkeley Existential Risk Initiative (BERI)Field-Building

Operational support for researchers.

Charity Entrepreneurship / Ambitious ImpactField-Building

Incubates charities.

Giving What We CanField-Building

10% Pledge. EA giving.

Atlas FellowshipField-Building

Youth talent. $8.8M CG.

Center for Applied Rationality (CFAR)Field-Building

Rationality → safety pipeline. Historical.

ForethoughtField-Building

Academic research support.

AI Safety SupportField-Building

Researcher wellbeing.

Apart ResearchResearch

Community hackathons. Low-barrier.

LASR LabsField-Building

Research sprints. London.

Principles of Intelligence (PIBBSS)Field-Building

Interdisciplinary. Biology + AI safety.

AI Safety CampField-Building

European volunteer research. Low-barrier.

Alignment Research Engineer Accelerator (ARENA)Field-Building

Alignment engineer bootcamp. Open-source.

Forecasting & Epistemic Infrastructure(6)

Forecasting Research InstituteForecasting

Tetlock. Superforecasting for AI risk.

MetaculusForecasting

Community forecasting platform.

Epoch AIForecasting

Compute trends. AI timelines.

Manifold MarketsForecasting

Prediction markets.

AI ImpactsForecasting

Katja Grace. Expert surveys.

Quantified Uncertainty Research Institute (QURI)Forecasting

Squiggle. Uncertainty quantification.

Research Institutions & Cross-Cutting(17)

Mila - Quebec AI InstituteResearch

Bengio. Academic → safety advocate.

Allen Institute for AI (AI2)Research

Allen Institute. Open-source academic AI.

AE StudioResearch

Commercial studio funding alignment.

Rethink PrioritiesResearch

Cause prioritization research.

Stanford HAIResearch

Human-centered AI framing.

EleutherAIResearch

Open-source AI research community.

Center for AI Safety (CAIS)Empirical Research

Hendrycks. Statement on AI Risk. Research.

Future of Humanity InstituteResearch

CLOSED. Bostrom. Spawned the field.

Leverhulme Centre for the Future of IntelligenceResearch

Cambridge. Interdisciplinary futures.

Gray Swan AIResearch

Hendrycks safety startup. CAIS cross-ref.

Ought / ElicitResearch

Research → product pivot (Elicit).

AI Now InstituteGovernance

Present harms. AI ethics. Different framing.

Cooperative AI FoundationResearch

Multi-agent cooperation.

Centre for the Study of Existential Risk (CSER)Research

Cambridge x-risk. Interdisciplinary.

Aligned AIResearch

Stuart Armstrong. FHI spinoff.

TruthfulAIResearch

AI honesty. TruthfulQA benchmark.

Global Catastrophic Risk InstituteResearch

Cross-risk research.

Independent Analysts(3)

Scott Alexander / Astral Codex TenIndependent Analyst

Ecosystem shaper. 100K readers. Rationalist community.

Daniel KokotajloIndependent Analyst

Insider → public voice. Equity forfeiture. Timeline forecasts.

Zvi MowshowitzIndependent Analyst

Field's newspaper. Quant/trader background.