MATS (ML Alignment Theory Scholars)

Field-Building

Premier alignment training program.

Founded: 2021
HQ: Berkeley, CA
Team: 8
Structure: 501(c)(3) nonprofit
Model: Grants

Theory of Change

MATS positions itself as a diversified portfolio across AI safety research agendas. Co-ED Ryan Kidd: "We are somewhat like an index fund... we have a broad portfolio, we adopt a bunch of different theories of change as valid, and we try and have our thumb in 100 pies."

The formal theory of change: MATS expands the talent pipeline for AI safety research by connecting promising scholars with senior mentors, reducing barriers for mentors to take on mentees, and developing scholars on three dimensions -- technical depth, field breadth, and research taste. Graduates are intended to work at safety organizations, found new ones, or conduct independent research.

The operating assumption: if solving AI safety requires effort on the scale of the Apollo program (~400K people), the current field of ~500-1000 technical researchers needs to grow by orders of magnitude. MATS is the largest single accelerant in this pipeline, running a 12-week Berkeley-based fellowship pairing scholars with senior mentors from labs, nonprofits, and academia.

Kidd's candid framing of the meta-strategy: "The more we do to raise the waterline of understanding on these different scenarios, the easier it will be to hand off to AI assistants" -- explicitly linking MATS's field-building to the Paul Christiano / Jan Leike "alignment MVP" strategy where AI systems accelerate alignment research.

What They Do

The program. 12-week intensive research fellowship in Berkeley. Scholars receive $15K stipend, $12K compute budget, housing, meals, travel support, a dedicated research manager, and a community of peers. ~75% of scholars continue into a funded 6-12 month extension phase where the deeper research happens. Summer 2026 will be the largest cohort: 120 fellows, 100 mentors. MATS plans to run 3 programs per year (summer, fall, winter).

Scale and output. 446+ alumni, 100+ mentors, 170+ publications, 9,500+ citations, h-index 44. Acceptance rate 4-7% overall, with extreme variance across mentor streams (2.6% to 33%).

Research portfolio. Current track breakdown: 27% evaluations, 26% interpretability, 18% oversight/control, 12% agency, 10% governance, 9% security. This represents deliberate rebalancing from earlier cohorts where interpretability dominated (52% of presentations in Winter 2023-24).

Alumni impact. 80% of alumni work in AI safety. ~10% co-founded AI safety organizations. Notable alumni-founded orgs: Apollo Research (Marius Hobbhahn), Timaeus (Jesse Hoogland), Center for AI Policy (Thomas Larsen), ARENA, Athena, Aether, Cadenza Labs, PRISM Eval. Apollo's CEO states: "Apollo Research would counterfactually not exist without MATS."

Key innovation. The Research Management model -- mandatory weekly check-ins with scholars, distilled into reports for mentors. Replaced the earlier "Scholar Support" model. Mentor Ethan Perez: "[Research Management check-in notes are] adding like... almost all of the value of my 1:1 check-ins with [scholars]." This intermediary layer scales mentorship by reducing the coordination burden on senior researchers.

New in 2026. The Anthropic/OpenAI "Megastream" -- a coalition of safety researchers from both labs (including Ethan Perez, Fabien Roger, Sam Bowman, Nicholas Carlini, Micah Carroll) who co-mentor MATS scholars directly on frontier lab safety research. Also: planning a 1-2 year residency program for senior researchers.

Key People

Ryan Kidd -- Co-Executive Director. PhD Physics (U of Queensland). Was one of five scholars in MATS's 2021 pilot program. Did not found MATS but has been "the driving force behind strategies since mid-2022." Co-founded LISA (London office). Manifund regrantor ($250K allocation). Unusually candid public communicator. In the Cognitive Revolution podcast, he is honest about uncertainty ("I'm pretty confused"), acknowledges dual-use tensions ("all safety work is capabilities work"), and frames MATS's approach in terms of portfolio diversification under deep uncertainty.

Evan Hubinger -- Original founder (2021). Provided all mentorship in the pilot. Known for Risks from Learned Optimization (mesa-optimizers). Now at Anthropic Alignment Science. Kidd describes him as "probably the most dominant connector driving force at MATS over our time." No longer a mentor as of Summer 2026 due to time constraints.

Team size: ~8 full-time (Berkeley) + London team + 20+ research managers. Only confirmed board member: Michael Aird (AI Program Director at Longview Philanthropy, IAPS co-founder). Full board not publicly disclosed.

Money and Incentives

Total known funding: $39.6M. Of this, $39.3M (99.2%) is from Coefficient Giving / Open Philanthropy across 14 grants (2021-2025). The remaining <1% comes from SFF ($289K in 2025), LTFF ($316K historically), Foresight Institute, and small donations.

Funding trajectory:

2021: $195K
2022: $2.5M
2023: $5.3M
2024: $7.1M
2025: $24.3M (including $23.6M general support grant for 2 years)

The $23.6M grant in May 2025 was a 3.4x increase over the prior year and one of Open Philanthropy's largest single AI safety grants. This is an enormous bet on MATS specifically.

Cost structure. Program cost per scholar: ~$35K (not including staff time). Scholar stipend: $15K per cohort. Compute: $12K budget (most don't use it fully). No public budget breakdown, no salary data, no overhead rate disclosed. At planned scale (3 programs/year, 120 fellows each), annual burn could approach $8-12M before staff costs.

Business model. Pure grants. No earned revenue, no endowment, no product. If Open Philanthropy changed priorities, MATS would face existential funding risk within 1-2 years. This extreme funder concentration (~99%) is the single most important structural fact about MATS's financial position.

Complex fiscal history. Before incorporating as MATS Research Inc. (501(c)(3), March 2024), grants flowed through BERI (2021-2023), AI Safety Support (2022-present), and Conjecture (2022-2023 London extension). No 990 filings yet available.

Lab pipeline incentives. MATS trains talent, frontier labs employ it. The new Anthropic/OpenAI Megastream formalizes this relationship. Labs benefit from an externally-funded training pipeline (paid for by OP, not by Anthropic or OpenAI). MATS benefits from lab prestige and mentor access. Whether this pipeline serves safety depends on whether lab safety teams actually reduce risk vs. provide legitimacy for scaling.

What Others Say

The pipeline mismatch (MATS's own strongest critique). Kidd: "Mentor applications are increasing 2.2x/year and fellow applications are increasing 1.8x/year, but deployed research talent is only increasing at 1.25x/year." MATS is training people faster than the ecosystem can absorb them.

The institutional capacity bottleneck. Marius Hobbhahn (Apollo Research CEO, MATS alumnus): "The level of talent applying to AI safety organizations and getting rejected is too high... you could probably start a handful of new orgs" from the rejected applicants. His argument: the problem isn't talent supply, it's that there aren't enough orgs to employ them. Hobbhahn estimates a 2-20x talent-to-capacity gap depending on the bar.

Noisy evaluations. Acceptance rates vary 13x across mentor streams within the same cohort (2.6% to 33%). Beth Barnes (ARC Evals founder) describes the evaluation challenge from a hiring manager's perspective: "Various people I chose not to continue with had significantly better technical skills than (I think) I did at their age, which feels confusing."

Post-MATS obstacles persist. After the program, 60%+ of scholars still report "publication record" as an obstacle to their alignment career. "Funding increased as an obstacle over the course of MATS."

Dual-use tensions. Kidd is unusually frank: "All safety work is capabilities work. Fundamentally... I actually don't know if you can avoid this." He acknowledges this applies to MATS-supported research including interpretability and RLHF-adjacent work.

The scaling-vs-quality debate. A 2022 memo argued against "mass movement building" in AI safety on grounds it would dilute field quality. MATS's response: most scholars already have ML backgrounds, and "1 more median MATS scholar focused on AI safety is worth 5-10 more median capabilities researchers." But at 360 fellows/year (planned), the dilution question intensifies.

What's Absent

No independent evaluation. $39M+ received, zero external assessments. All impact metrics are self-reported. No GiveWell-style analysis, no third-party audit, no external review of counterfactual impact claims.

Opaque governance. Board composition unknown beyond one member (Michael Aird). No published conflict of interest policy. No budget transparency. No salary disclosures.

Publication gap. No retrospectives published since Spring 2024, despite running 3-4 cohorts since then. The most detailed self-evaluation (Winter 2023-24) is now 2+ years old.

No external critics. All criticism comes from within the EA/rationalist community. No mainstream academic, policy, or media scrutiny.

Capabilities ambiguity. The 80% "working in AI safety" stat includes people at frontier labs doing work that blurs the safety/capabilities line. Kidd himself says these can't be cleanly separated.

Stated Theory of Change

MATS aims to reduce catastrophic risk from AI by expanding the supply of capable alignment researchers. The mechanism: pair promising scholars with senior mentors for a 12-week intensive, develop their technical depth, field breadth, and research taste, then funnel them into permanent roles at safety organizations, or support them in founding new ones.

The meta-strategy is explicitly portfolio-based. Rather than betting on a single research direction, MATS diversifies across interpretability, evaluations, control, governance, agency, and security -- treating alignment research agendas like an index fund. Kidd: "We adopt a bunch of different theories of change as valid, and we try and have our thumb in 100 pies."

The implicit assumption: the field needs massive scaling (orders of magnitude more researchers) before AGI arrives (median estimate ~2033), and mentor-quality research training is the bottleneck that MATS can unlock.

Revealed Theory of Change

The actions largely match the stated theory. MATS does genuinely operate as a portfolio manager across research agendas. The advisory board governance, the deliberate rebalancing away from interpretability dominance, and the track diversification all reflect a real commitment to portfolio thinking.

However, there are some divergences worth noting:

1. Pipeline to labs, not just pipeline to safety. The Anthropic/OpenAI Megastream, the prominence of lab safety team mentors, and the 80% placement stat all point to MATS functioning primarily as an outsourced training program for frontier lab safety teams -- funded by Open Philanthropy rather than by the labs themselves. This is the revealed theory of change: accelerate the hiring pipeline for Anthropic, DeepMind, OpenAI, and Redwood.

2. Researcher production, not org production. Despite Kidd's emphasis on founders, MATS's operational structure is optimized for producing researchers, not founders. The mentor selection process, the research-focused evaluation, and the extension phase all build toward papers and publications, not toward organizational capacity. The ~10% org-founding rate is impressive but appears to be an emergent outcome, not a designed one.

3. Scaling ambitions may outpace quality control. The jump from 63 scholars (Winter 2023-24) to 120 (Summer 2026) in two years, with plans for 3 programs per year, means MATS is approaching 360 fellows annually. At this scale, maintaining the mentor-to-scholar ratio that makes MATS valuable becomes harder. The retrospective publication gap (none since Spring 2024) coincides with this acceleration.

Key Assumptions

1. More alignment researchers reduces AI risk.

Evidence for: The field is clearly talent-constrained at the senior level. Specific technical problems (interpretability, evals, control) benefit from more hands.
Evidence against: If the core alignment problem is conceptual rather than labor-intensive, more researchers working on potentially wrong directions could be net negative through capabilities externalities and false confidence.
Testable: Track whether MATS alumni research leads to concrete safety improvements in deployed systems, not just publications.
If wrong: MATS is training capable ML researchers who end up advancing capabilities under a safety label.

2. Portfolio diversification across agendas is the right strategy under uncertainty.

Evidence for: Expert disagreement is genuinely high. No single agenda has a clear track record of success. Diversification protects against being wrong about which approach matters.
Evidence against: If timelines are very short (AGI by 2028), concentrating resources on the most promising 2-3 agendas could be more efficient than spreading across 6+ tracks.
Testable: Compare outcomes of MATS's diversified portfolio vs. focused programs (e.g., Redwood's control research).
If wrong: Resources are spread too thin, and MATS produces lots of "okay" researchers instead of a few "great" ones in critical areas.

3. 12 weeks of intensive mentorship is sufficient to transform promising people into safety researchers.

Evidence for: 80% alumni placement rate, 170+ publications, org-founding stories.
Evidence against: 60%+ of scholars still report "publication record" as an obstacle after MATS. 25% don't continue to the extension phase. The counterfactual analysis shows 24% would have done alignment research anyway.
Testable: Track 3-5 year career outcomes of MATS alumni vs. matched comparisons who were rejected or didn't apply.
If wrong: MATS is primarily a selection filter (identifying people who would have succeeded anyway) rather than a genuine accelerant.

4. Open Philanthropy's continued massive investment is reliable.

Evidence for: OP has been increasing funding at 2-3x per year, culminating in a $23.6M 2-year grant.
Evidence against: OP has changed priorities before. A single funder providing 99% of revenue is structurally fragile. OP leadership changes could shift priorities away from field-building toward direct research or governance.
If wrong: MATS faces existential crisis with 12-18 months of runway from current reserves.

Strengths

Network position. MATS has achieved a unique structural position in the AI safety ecosystem. It is the primary nexus connecting senior researchers (as mentors), promising newcomers (as scholars), frontier labs (as employers), and OP (as funder). This network position is genuinely hard to replicate.

Org-spawning track record. The ~10% founder rate among alumni is remarkable. Apollo Research, Timaeus, Center for AI Policy, and several other orgs trace their founding directly to MATS connections and community. No other field-building program has this track record.

Intellectual honesty. Kidd's public candor is unusual. Acknowledging that "all safety work is capabilities work," publishing detailed retrospectives with NPS scores and obstacles, and writing about the supply-demand mismatch all suggest an organization that takes self-assessment seriously.

Infrastructure innovation. The Research Management model -- intermediaries who distill scholar progress into mentor-readable summaries -- is a genuine organizational innovation that scales mentorship. The advisory board for mentor selection adds legitimacy to portfolio allocation decisions.

Self-correction. The deliberate rebalancing away from interpretability dominance (52% -> 26%), the introduction of advisory board governance, and the shift from Scholar Support to Research Management all demonstrate willingness to change course based on evidence.

Weaknesses and Risks

Extreme funder concentration. 99% dependence on Open Philanthropy is the single largest risk factor. MATS has negligible revenue diversification. A change in OP leadership, priorities, or financial position could be existential.

Scaling vs. quality tension. The jump to 360 fellows/year pushes against the mentorship bottleneck that MATS itself identifies. If mentor quality or attention degrades, MATS loses its core value proposition and becomes indistinguishable from a bootcamp.

Lab pipeline capture. The deepening relationship with frontier labs (Megastream, lab mentors, lab employment as the primary outcome) raises the question of whether MATS serves safety or serves labs' hiring needs. If lab safety teams primarily provide legitimacy for scaling rather than actual safety improvements, MATS's pipeline accelerates the wrong thing.

Governance opacity. Unknown board, no published COI policy, no budget transparency, no independent evaluation. For $39M+ in grants, this governance standard is below what should be expected.

The absorption problem. MATS trains ~360 people/year into a field where deployed research talent grows at 25%/year (~125-250 new positions). The arithmetic doesn't work. Either MATS needs to scale down, the field needs far more absorptive capacity (more orgs, more funding), or many MATS graduates won't find safety-specific positions.

Post-program gap. 60%+ of scholars still face "publication record" obstacles after MATS. 25% don't continue to the extension phase. Funding increases as an obstacle during the program. The bridge from "MATS graduate" to "permanently employed safety researcher" has structural gaps.

Cross-References

Complementary: BlueDot Impact (broader introductory funnel that feeds into MATS), ARENA (technical upskilling), PIBBSS (moonshot interpretability), Constellation/FAR Labs (physical infrastructure MATS uses), Open Philanthropy (near-sole funder).

Competing/overlapping: Anthropic's own fellows program (2% acceptance rate, launched 2024), university PhD programs in AI safety (slower but deeper), other fellowship programs (Astra, ERA, SPAR, LASR Labs).

Dependent: MATS's impact depends on downstream absorptive capacity -- the existence of organizations like Apollo Research, Redwood, METR, UK AISI that hire its graduates. If these orgs shrink or don't scale, MATS output has nowhere to go.

Pipeline partner: Anthropic and OpenAI benefit from MATS as an externally-funded training pipeline. They provide mentors and prestige; MATS provides pre-trained talent. This is a genuine symbiosis, but the question of who captures more value is important.

What Would Change This Assessment

An independent evaluation finding that MATS alumni are no better than matched comparison groups at producing safety-relevant research would significantly downgrade the theory of change. (Currently untested.)
Open Philanthropy reducing MATS funding by >50% would force existential restructuring and test whether the program can attract alternative funding.
3-5 year outcome tracking showing MATS alumni migrating to capabilities work at rates above 10-15% would undermine the safety pipeline claim.
Published 990 filings showing disproportionate administrative overhead (e.g., >40% of spending on non-program activities) would raise efficiency concerns.
Evidence that MATS-trained researchers at frontier labs have demonstrably improved safety outcomes (rather than just working at labs) would significantly upgrade the theory of change.

Self-Critique

Weakest claim: My assertion that MATS functions primarily as "an outsourced training program for frontier lab safety teams" may overstate the lab-pipeline dynamic. Many MATS alumni go to nonprofits, government, or independent research. The lab pipeline is the most visible outcome, but it's not the only one.

Potential biases: The evidence base is heavily self-reported (MATS retrospectives, Kidd's podcast, alumni surveys). I may be over-crediting MATS's self-assessments because they are unusually transparent and candid compared to most organizations. Candor does not guarantee accuracy.

What I wish I had: (1) An independent evaluation of MATS alumni outcomes. (2) The 990 filing with financial details. (3) Published retrospectives for the 2024-2025 cohorts. (4) A critic from outside the EA/rationalist community. (5) Data on how MATS alumni at frontier labs assess the safety-relevance of their own work 2-3 years out.

What a thoughtful disagreer would say: "MATS is optimizing for volume at the expense of depth. The original program was 5 scholars under Evan Hubinger -- that's a PhD-caliber mentorship ratio. Now it's 120 scholars across 100 mentors, many of whom are providing remote or part-time mentorship. You're celebrating the scaling but the thing that made MATS valuable -- deep, transformative mentorship -- may not scale." I think this is partly right, and the lack of recent retrospectives makes it hard to assess.

Single weakest claim: That 80% of alumni "work in AI safety" is a meaningful indicator of impact. Given Kidd's own admission that "all safety work is capabilities work," and that many of these roles are at frontier labs doing work that blurs the line, the 80% stat may measure career placement more than safety impact.

Connected to (20)

Anthropiccollaborator · Ethan Perez, Fabien Roger, Sam Bowman, Sam Marks, Nicholas Carlini, Kyle Fish

OpenAIcollaborator · Micah Carroll

FAR.AIcollaborator

Goodfirecollaborator · Lee Sharkey

Google DeepMindcollaborator · Neel Nanda, Arthur Conmy, Erik Jenner, David Lindner

Longview Philanthropyboard overlap · Michael Aird

METRcollaborator

Redwood Researchcollaborator · Buck Shlegeris

AI Safety Supportcollaborator

BlueDot Impactcollaborator

Lightcone Infrastructurecollaborator

UK AI Safety Institutestaff to

Alignment Research Engineer Acceleratorstaff to

Apollo Researchstaff to · Marius Hobbhahn

Berkeley Existential Risk Initiativecollaborator

Center for AI Policystaff to · Thomas Larsen

Conjecturecollaborator

London Initiative for Safe AIspun off from · Ryan Kidd, Christian Smith

Timaeusstaff to · Jesse Hoogland

Center for AI Safetystaff to · Oliver Zhang

Sources (49)

Every URL that was read during research.

1.MATS Researchmatsprogram.org
2.MATS Teammatsprogram.org
3.MATS Summer 2026matsprogram.org
4.MATS Applicationmatsprogram.org
5.Empowering humans for a flourishing future with AI.ryankidd.ai
6.Building & Scaling the AI Safety Research Community, with Ryan Kidd of MATScognitiverevolution.ai
7.Ep 10 - Accelerated training to become an AI safety researcher w/ Ryan Kidd (Co-Director, MATS) - Artificial General Intelligence (AGI) Show with Soroush Pourtheagishow.com
8.MATS Frequently Asked Questionsmatsprogram.org
9.An open letter to SERI MATS program organisersgreaterwrong.com
10.How MATS addresses “mass movement building” concernsgreaterwrong.com
11.Applying to MATS: What the Program Is Like, and Who It’s Forgreaterwrong.com
12.MATS Summer 2023 Retrospectivegreaterwrong.com
13.MATS Winter 2023-24 Retrospectivegreaterwrong.com
14.Talk: AI safety fieldbuilding at MATSgreaterwrong.com
15.MATS Spring 2024 Extension Retrospectivegreaterwrong.com
16.MATS Alumni Impact Analysisgreaterwrong.com
17.MATS Summer 2024matsprogram.org
18.MATS Summer 2025matsprogram.org
19.AI safety undervalues foundersgreaterwrong.com
20.ML Alignment Theory Program under Evan Hubingergreaterwrong.com
21.SERI ML Alignment Theory Scholars Program 2022greaterwrong.com
22.MATS Alumnimatsprogram.org
23.MATS Mentorsmatsprogram.org
24.MATS Careersmatsprogram.org
25.MATS About usmatsprogram.org
26.MATS Donatematsprogram.org
27.Mats Research Inc - Nonprofit Explorer - ProPublicaprojects.propublica.org
28.Neel Nanda at MATS: Summer 2026matsprogram.org
29.Neel Nanda - MATS Mentormatsprogram.org
30.The case for AI safety capacity-building workgreaterwrong.com
31.MATS AI Safety Strategy Curriculum v2greaterwrong.com
32.There should be more AI safety orgsgreaterwrong.com
33.Announcing the London Initiative for Safe AI (LISA)greaterwrong.com
34.I Reviewed Hundreds of AI Safety Applications. Here's What Actually Matters | Georg Langegeorglange.com
35.An Overview of the AI Safety Funding Situationgreaterwrong.com
36.MATS Winter 2025matsprogram.org
37.Can AI Do Our Alignment Homework? (with Ryan Kidd)podcast.futureoflife.org
38.AI Safety has a scaling problemgreaterwrong.com
39.Anthropic and OpenAI Megastream at MATS: Summer 2026matsprogram.org
40.Experiences and learnings from both sides of the AI safety job marketgreaterwrong.com
41.MATS AI Safety Strategy Curriculumgreaterwrong.com
42.SERI MATS Program - Winter 2022 Cohortgreaterwrong.com
43.MATS mentor selectiongreaterwrong.com
44.My MATS Summer 2023 experiencegreaterwrong.com
45.Evaluations (of new AI Safety researchers) can be noisygreaterwrong.com
46.Introducing the Anthropic Fellows Program for AI Safety Researchalignment.anthropic.com
47.MATS Researchmatsprogram.org
48.MATS Winter 2024matsprogram.org
49.MATS Board: Michael Airdmatsprogram.org