Theory of Change
LASR Labs (London AI Safety Research) is a 13-week full-time research programme that converts technically-skilled people into productive AI safety researchers through structured team-based sprints. In their own words: "a technical AI Safety research programme focussed on reducing the risk of loss of control to advanced AI. We focus on action-relevant questions tackling concrete threat models."
The causal chain: talented people who could contribute to AI safety lack entry points -> a structured programme with strong supervisors gives them a viable research project -> they produce a publishable paper in 13 weeks -> they transition into full-time AI safety roles. The programme explicitly targets the "loss of control" risk scenario rather than broader AI ethics.
Week 0 focuses on "research prioritisation" -- developing participants' ability to evaluate which research agendas are most promising, not just executing on a given agenda. This is a higher-order skill that distinguishes LASR from pure paper-production programmes.
What They Do
LASR runs ~3 cohorts per year at the LISA coworking space in London. Teams of 3-4 work under one experienced supervisor for 12 weeks after a Week 0 orientation. Participants receive a GBP 11,000 stipend plus food, office space, and travel.
Publication record shows clear improvement: 2023 cohort had 4/5 papers at NeurIPS workshops or ICLR. Summer 2024: 5/5 at NeurIPS workshops. Spring 2025: papers at NeurIPS 2025 main conference including one oral presentation, plus a "best paper" award at an ICML workshop. Total: 50+ researchers supported, 14+ papers across AI control, interpretability, deception detection, scalable oversight, collusion, data poisoning, scheming propensity, and CoT monitoring.
Research topics have converged from diverse exploration (2023) toward the AI control agenda (2024-2025), reflecting the influence of supervisors from AISI who work on control.
Notable papers: "A is for Absorption" (NeurIPS 2025 oral, identifies fundamental SAE failure mode), "CoT Red-Handed" (NeurIPS 2025, proposes hybrid monitoring protocol), "Hidden in Plain Text" (NeurIPS 2024 workshop, steganographic collusion).
Alumni outcomes: "90% have gone on to work in AI safety/security" including at UK AISI, Apollo Research, OpenAI dangerous capabilities evals, Coefficient Giving, and PhD programmes. NPS +75 from participants. These figures are self-reported with no independent verification.
New in 2026: the AISI Alignment Project grant funds a year-long research programme hiring 7-9 researchers, a significant expansion beyond the sprint model.
LASR is one programme within Arcadia Impact, which also operates: ASET (manages Inspect Evals for UK AISI), AI Governance Taskforce (12-week part-time policy research), Orion AI Governance Initiative (student policy development, 57 participants, 2 placed), and Impact First (student career support).
Key People
Erin Robertson -- Programme Director of LASR Labs and Co-Director of Arcadia Impact. Background in Maths & Philosophy (Bristol). Named lead on the AISI Alignment Project grant. Appears to be the intellectual driver of LASR's research direction but has almost no public writing, interviews, or podcast appearances. Her thinking is largely invisible from outside.
Joe Hardie -- Co-Director of Arcadia Impact, handles operations. Background in CS with AI (Sussex), previously Microsoft, Wipro. The more public-facing co-director. Started an EA group at Sussex University, co-founded what became Arcadia. Candidly acknowledges CG funding dependency.
Charlie Griffin -- Advisor, not staff. Researcher at UK AI Security Institute working on AI Control. "Helped with the first round" of LASR (then AI Safety Hub Labs). His AISI connections are a key institutional asset, likely facilitating the Alignment Project grant and strong supervisor recruitment.
Supervisors from recent cohorts include Joseph Bloom (Neuronpedia), Mary Phuong (DeepMind), Jacob Pfau, Dmitrii Krasheninnikov, David Lindner, Stefan Heimersheim. These external researchers -- not LASR's internal team -- drive research direction.
Team: 7 FTEs across all Arcadia, with only 2-3 primarily on LASR. Hiring two new operations roles (GBP 44-55k range).
Money and Incentives
Total budget: ~$1.6M/year for all of Arcadia Impact (per Joe Hardie, 2025). No public financial accounts yet filed with the Charity Commission.
Funder concentration is extreme. Coefficient Giving (formerly Open Philanthropy) "has awarded over GBP 8 million since we were founded (this includes secured multi-year grants not yet received)." This appears to be essentially all of Arcadia's funding. Joe Hardie acknowledges: "Open Philanthropy is definitely the main funder... The main downside is the risk of relying too much on one funder."
Specific known grants:
- CG/Open Phil: GBP 230,010 (~$289K), April 2024, for LASR Labs Fellowship Program
- CG/Open Phil: >GBP 8M total to Arcadia Impact (multi-year committed, not all disbursed)
- UK AISI Alignment Project: up to GBP 1M, 2026, for year-long research capacity building
- CG/Open Phil to LISA: GBP 1,268,000 for the coworking space where LASR operates (separate entity, but effectively subsidizes LASR's infrastructure)
Per-participant cost: GBP 11,000 stipend. At ~20 participants per cohort and 3 cohorts/year, annual stipend costs alone are GBP 500-800K. This is cost-effective compared to MATS (~$32K per person over 4 months) -- LASR's team model is roughly half the per-person cost.
Supervisor compensation is undisclosed. It is unknown whether supervisors are paid by LASR or volunteer their time from their primary employers.
The AISI Alignment Project grant is the first meaningful funder diversification -- government funding that reduces CG dependency and adds institutional legitimacy. This is a significant development.
Business model: Pure grants. No product revenue, no consulting, no earned income. Entirely dependent on philanthropic and government grants.
Incentive analysis: CG funding concentration creates a strong incentive to align with CG's priorities. CG classifies LASR under "Global Catastrophic Risks Capacity Building" -- field-building, not technical research per se. This may push LASR to optimize for throughput (number of researchers trained) over depth (quality of individual research contributions). The AISI grant partially mitigates this by creating a separate funder with potentially different priorities.
What Others Say
Strongest counterargument (ecosystem-level): Will Aldred's "AI Safety's Talent Pipeline is Over-optimised for Researchers" (Aug 2025) argues that >20 fellowship programmes all select for researchers when surveys of 25 AI safety org leaders found "leadership, policy expertise and media engagement" are the most needed skills. Aldred notes that <5% acceptance rates create a "bycatch" problem: the 95% rejected could be excellent at non-research roles but receive no institutional support. He argues that "non-research roles are more important to recruit for at this time" -- a direct challenge to LASR's model. Notably, Arcadia trustee Adam Jones and staff member Alicia Pollard are acknowledged in the post, suggesting Arcadia is aware of this critique.
Scaling critique: Boyd Kane ("AI Safety has a scaling problem," Dec 2025) argues the mentor bottleneck limits fellowship scaling and proposes research bounties as an alternative. LASR's team model (1 supervisor per 3-4 participants) partially addresses this relative to 1-on-1 mentorship, but the fundamental supervisor supply constraint remains.
Funder endorsement: CG's 2024 progress report names LASR alongside MATS, Astra, and ERA as programmes that "produce some of the top technical talent in AI safety and security." CG specifically highlights ASET's work on Inspect Evals.
No direct criticism of LASR Labs exists in publicly searchable sources. Extensive searching (38 queries across multiple platforms) yielded zero LASR-specific critiques. This is likely because LASR is too small and new (~3 years, <100 alumni) to attract focused criticism.
What's Absent
- Independent outcome verification. The 90% placement rate is self-reported. No external tracking exists.
- Public financial accounts. No charity accounts filed yet. All budget figures from self-reporting.
- Erin Robertson's public voice. The Programme Director has minimal public writing or interviews despite being the key research decision-maker.
- Supervisor model details. Compensation, recruitment, and retention of supervisors -- arguably LASR's most important asset -- is opaque.
- Budget breakdown across Arcadia's programmes. How much of the GBP 8M goes to LASR specifically vs. other projects?
- Failure rates. No information about abandoned projects, participant dropouts, or unsuccessful research.
- Compute resources. What computational resources are available to participants?
Recommended Reading
Joe Hardie on Arcadia Impact's Projects -- The most candid source on LASR/Arcadia. Hardie discusses origin story, CG funding dependency, $1.6M budget, team of 7, and the honest tension between serving students and professionals. Lead with this for the unfiltered view.
AI Safety's Talent Pipeline is Over-optimised for Researchers -- The strongest counterargument to LASR's theory of change. Will Aldred argues the field needs non-research talent more urgently, and the fellowship model creates perverse selection effects.
LASR Labs Past Projects -- Complete list of all research with abstracts. Essential for judging whether the research is actually good and whether topics are well-chosen.
CoT Red-Handed (arXiv) -- LASR's best paper (NeurIPS 2025). Representative of research quality at its peak. Tests whether chain-of-thought monitoring can catch untrusted model sabotage.
Arcadia Impact About page -- The single most data-dense source: full team, trustees, all projects, history, and the GBP 8M funding disclosure.