← AI Safety Orgs

LASR Labs

Field-Building

Research sprints. London.

Founded
2023
HQ
London, UK
Team
7
Structure
charity (UK)
Model
Grants

Theory of Change

LASR Labs (London AI Safety Research) is a 13-week full-time research programme that converts technically-skilled people into productive AI safety researchers through structured team-based sprints. In their own words: "a technical AI Safety research programme focussed on reducing the risk of loss of control to advanced AI. We focus on action-relevant questions tackling concrete threat models."

The causal chain: talented people who could contribute to AI safety lack entry points -> a structured programme with strong supervisors gives them a viable research project -> they produce a publishable paper in 13 weeks -> they transition into full-time AI safety roles. The programme explicitly targets the "loss of control" risk scenario rather than broader AI ethics.

Week 0 focuses on "research prioritisation" -- developing participants' ability to evaluate which research agendas are most promising, not just executing on a given agenda. This is a higher-order skill that distinguishes LASR from pure paper-production programmes.

What They Do

LASR runs ~3 cohorts per year at the LISA coworking space in London. Teams of 3-4 work under one experienced supervisor for 12 weeks after a Week 0 orientation. Participants receive a GBP 11,000 stipend plus food, office space, and travel.

Publication record shows clear improvement: 2023 cohort had 4/5 papers at NeurIPS workshops or ICLR. Summer 2024: 5/5 at NeurIPS workshops. Spring 2025: papers at NeurIPS 2025 main conference including one oral presentation, plus a "best paper" award at an ICML workshop. Total: 50+ researchers supported, 14+ papers across AI control, interpretability, deception detection, scalable oversight, collusion, data poisoning, scheming propensity, and CoT monitoring.

Research topics have converged from diverse exploration (2023) toward the AI control agenda (2024-2025), reflecting the influence of supervisors from AISI who work on control.

Notable papers: "A is for Absorption" (NeurIPS 2025 oral, identifies fundamental SAE failure mode), "CoT Red-Handed" (NeurIPS 2025, proposes hybrid monitoring protocol), "Hidden in Plain Text" (NeurIPS 2024 workshop, steganographic collusion).

Alumni outcomes: "90% have gone on to work in AI safety/security" including at UK AISI, Apollo Research, OpenAI dangerous capabilities evals, Coefficient Giving, and PhD programmes. NPS +75 from participants. These figures are self-reported with no independent verification.

New in 2026: the AISI Alignment Project grant funds a year-long research programme hiring 7-9 researchers, a significant expansion beyond the sprint model.

LASR is one programme within Arcadia Impact, which also operates: ASET (manages Inspect Evals for UK AISI), AI Governance Taskforce (12-week part-time policy research), Orion AI Governance Initiative (student policy development, 57 participants, 2 placed), and Impact First (student career support).

Key People

Erin Robertson -- Programme Director of LASR Labs and Co-Director of Arcadia Impact. Background in Maths & Philosophy (Bristol). Named lead on the AISI Alignment Project grant. Appears to be the intellectual driver of LASR's research direction but has almost no public writing, interviews, or podcast appearances. Her thinking is largely invisible from outside.

Joe Hardie -- Co-Director of Arcadia Impact, handles operations. Background in CS with AI (Sussex), previously Microsoft, Wipro. The more public-facing co-director. Started an EA group at Sussex University, co-founded what became Arcadia. Candidly acknowledges CG funding dependency.

Charlie Griffin -- Advisor, not staff. Researcher at UK AI Security Institute working on AI Control. "Helped with the first round" of LASR (then AI Safety Hub Labs). His AISI connections are a key institutional asset, likely facilitating the Alignment Project grant and strong supervisor recruitment.

Supervisors from recent cohorts include Joseph Bloom (Neuronpedia), Mary Phuong (DeepMind), Jacob Pfau, Dmitrii Krasheninnikov, David Lindner, Stefan Heimersheim. These external researchers -- not LASR's internal team -- drive research direction.

Team: 7 FTEs across all Arcadia, with only 2-3 primarily on LASR. Hiring two new operations roles (GBP 44-55k range).

Money and Incentives

Total budget: ~$1.6M/year for all of Arcadia Impact (per Joe Hardie, 2025). No public financial accounts yet filed with the Charity Commission.

Funder concentration is extreme. Coefficient Giving (formerly Open Philanthropy) "has awarded over GBP 8 million since we were founded (this includes secured multi-year grants not yet received)." This appears to be essentially all of Arcadia's funding. Joe Hardie acknowledges: "Open Philanthropy is definitely the main funder... The main downside is the risk of relying too much on one funder."

Specific known grants:

  • CG/Open Phil: GBP 230,010 (~$289K), April 2024, for LASR Labs Fellowship Program
  • CG/Open Phil: >GBP 8M total to Arcadia Impact (multi-year committed, not all disbursed)
  • UK AISI Alignment Project: up to GBP 1M, 2026, for year-long research capacity building
  • CG/Open Phil to LISA: GBP 1,268,000 for the coworking space where LASR operates (separate entity, but effectively subsidizes LASR's infrastructure)

Per-participant cost: GBP 11,000 stipend. At ~20 participants per cohort and 3 cohorts/year, annual stipend costs alone are GBP 500-800K. This is cost-effective compared to MATS (~$32K per person over 4 months) -- LASR's team model is roughly half the per-person cost.

Supervisor compensation is undisclosed. It is unknown whether supervisors are paid by LASR or volunteer their time from their primary employers.

The AISI Alignment Project grant is the first meaningful funder diversification -- government funding that reduces CG dependency and adds institutional legitimacy. This is a significant development.

Business model: Pure grants. No product revenue, no consulting, no earned income. Entirely dependent on philanthropic and government grants.

Incentive analysis: CG funding concentration creates a strong incentive to align with CG's priorities. CG classifies LASR under "Global Catastrophic Risks Capacity Building" -- field-building, not technical research per se. This may push LASR to optimize for throughput (number of researchers trained) over depth (quality of individual research contributions). The AISI grant partially mitigates this by creating a separate funder with potentially different priorities.

What Others Say

Strongest counterargument (ecosystem-level): Will Aldred's "AI Safety's Talent Pipeline is Over-optimised for Researchers" (Aug 2025) argues that >20 fellowship programmes all select for researchers when surveys of 25 AI safety org leaders found "leadership, policy expertise and media engagement" are the most needed skills. Aldred notes that <5% acceptance rates create a "bycatch" problem: the 95% rejected could be excellent at non-research roles but receive no institutional support. He argues that "non-research roles are more important to recruit for at this time" -- a direct challenge to LASR's model. Notably, Arcadia trustee Adam Jones and staff member Alicia Pollard are acknowledged in the post, suggesting Arcadia is aware of this critique.

Scaling critique: Boyd Kane ("AI Safety has a scaling problem," Dec 2025) argues the mentor bottleneck limits fellowship scaling and proposes research bounties as an alternative. LASR's team model (1 supervisor per 3-4 participants) partially addresses this relative to 1-on-1 mentorship, but the fundamental supervisor supply constraint remains.

Funder endorsement: CG's 2024 progress report names LASR alongside MATS, Astra, and ERA as programmes that "produce some of the top technical talent in AI safety and security." CG specifically highlights ASET's work on Inspect Evals.

No direct criticism of LASR Labs exists in publicly searchable sources. Extensive searching (38 queries across multiple platforms) yielded zero LASR-specific critiques. This is likely because LASR is too small and new (~3 years, <100 alumni) to attract focused criticism.

What's Absent

  • Independent outcome verification. The 90% placement rate is self-reported. No external tracking exists.
  • Public financial accounts. No charity accounts filed yet. All budget figures from self-reporting.
  • Erin Robertson's public voice. The Programme Director has minimal public writing or interviews despite being the key research decision-maker.
  • Supervisor model details. Compensation, recruitment, and retention of supervisors -- arguably LASR's most important asset -- is opaque.
  • Budget breakdown across Arcadia's programmes. How much of the GBP 8M goes to LASR specifically vs. other projects?
  • Failure rates. No information about abandoned projects, participant dropouts, or unsuccessful research.
  • Compute resources. What computational resources are available to participants?

Recommended Reading

  1. Joe Hardie on Arcadia Impact's Projects -- The most candid source on LASR/Arcadia. Hardie discusses origin story, CG funding dependency, $1.6M budget, team of 7, and the honest tension between serving students and professionals. Lead with this for the unfiltered view.

  2. AI Safety's Talent Pipeline is Over-optimised for Researchers -- The strongest counterargument to LASR's theory of change. Will Aldred argues the field needs non-research talent more urgently, and the fellowship model creates perverse selection effects.

  3. LASR Labs Past Projects -- Complete list of all research with abstracts. Essential for judging whether the research is actually good and whether topics are well-chosen.

  4. CoT Red-Handed (arXiv) -- LASR's best paper (NeurIPS 2025). Representative of research quality at its peak. Tests whether chain-of-thought monitoring can catch untrusted model sabotage.

  5. Arcadia Impact About page -- The single most data-dense source: full team, trustees, all projects, history, and the GBP 8M funding disclosure.

Show Claude’s analysis
An opinionated read. Read the brief first to form your own view.

Stated Theory of Change

LASR Labs' stated theory: AI safety needs more researchers, and the bottleneck is not talent availability but structured entry points. There are technically-skilled people who would make strong AI safety researchers but lack the institutional context, research direction, and peer support to begin contributing. LASR solves this by providing 13-week structured sprints with top supervisors where participants learn AI safety research by doing it -- taking a project "from proposal all the way to publication." The output is twofold: publishable research papers that advance the field, and researchers who transition into full-time safety roles.

The mechanism is specific and testable: produce papers at top venues AND place alumni into safety positions. Their focus on "loss of control" risk via "concrete threat models" scopes the research agenda to the most actionable safety work rather than broad theoretical concerns.

Revealed Theory of Change

LASR's actions largely match its stated theory, with some notable additions and tensions:

Research direction is externally driven. LASR's core team (Erin, Joe, Brandon) are operators and community builders, not researchers. Research direction comes entirely from external supervisors -- Charlie Griffin (AISI), Joseph Bloom (Neuronpedia), Mary Phuong (DeepMind), etc. This is not a criticism; it is a structural feature. LASR is a platform that connects supervisors with participants and provides the infrastructure for research to happen. The revealed theory is closer to "research programme management" than "research leadership."

Topical convergence toward AI control. The progression from diverse 2023 topics to control-focused 2024-2025 projects reveals that Charlie Griffin's influence (as advisor and AISI researcher) has been decisive in shaping LASR's research agenda. This is good in that AI control is a high-priority agenda, but creates a dependency on one person's research taste.

The AISI grant signals a pivot. Hiring 7-9 researchers for a year-long programme is qualitatively different from running 13-week sprints. It suggests Arcadia/LASR is evolving from pure field-building toward becoming a research organisation. Whether this is a strategic expansion or mission drift depends on execution.

Scale matters more than depth to the parent org. Arcadia Impact runs 5+ programmes simultaneously with 7 FTEs. This is an organisation optimizing for breadth of talent development, not depth of research. LASR is the flagship programme, but it competes for management attention with the Governance Taskforce, Orion, ASET, and Impact First.

Key Assumptions

1. More researchers is the bottleneck.

  • Evidence for: LASR's 90% alumni placement rate suggests demand exists for people with LASR-level training. CG continues to fund research capacity building, implying they believe more researchers are needed.
  • Evidence against: Surveys of 25 AI safety org leaders found "leadership, policy expertise and media engagement" are the most needed skills, not research. >20 fellowship programmes already target researchers. Aldred's "over-optimised for researchers" argument directly challenges this assumption.
  • Testable? Yes, in principle -- track whether LASR alumni fill genuinely new roles vs. competing with existing researchers for the same positions.
  • If wrong: LASR's marginal impact is much lower than it appears. Producing the 101st good AI safety researcher matters less than producing the 2nd good AI safety advocacy director.

2. 13 weeks is sufficient to produce meaningful research.

  • Evidence for: NeurIPS main conference acceptances, an oral presentation, and a best paper award. These are objective signals of quality.
  • Evidence against: Most truly novel research directions emerge from years of deep engagement, not 13-week sprints. LASR papers generally extend existing agendas (control, interpretability) rather than opening new ones.
  • Testable? Yes -- compare citation counts and downstream impact of LASR papers vs. papers from longer programmes.
  • If wrong: LASR produces credential signals (conference acceptances) without generating the kind of deep insight that moves the field forward.

3. Supervisor quality can be sustained.

  • Evidence for: LASR has attracted consistently strong supervisors from DeepMind, AISI, and top universities across multiple cohorts.
  • Evidence against: Supervisor compensation is unknown. If they volunteer their time, this depends on goodwill that could evaporate if supervisors get busy or disillusioned.
  • Testable? Track supervisor retention across cohorts.
  • If wrong: LASR's research quality collapses without strong supervisors, regardless of participant quality.

4. CG funding will continue.

  • Evidence for: CG has committed >GBP 8M multi-year, and Joe Hardie says the relationship is strong.
  • Evidence against: CG priorities can shift. CG has never rejected Arcadia but has funded "less than we asked for" on specific items.
  • If wrong: Arcadia would need to dramatically cut programmes or find alternative funding quickly. The AISI grant provides some buffer.

Strengths

Efficient conversion of talent into research output. At roughly $14K per participant (vs. MATS' ~$32K), LASR is the most cost-effective way to produce AI safety researchers with conference-quality publications. The team model (3-4 per supervisor) is more scalable than 1-on-1 mentorship.

Genuinely strong research quality. The progression from workshop papers to NeurIPS main conference, plus an oral presentation, is objective evidence that LASR is not just producing quantity. The feature absorption paper and CoT monitoring paper address real open questions.

Well-chosen research focus. The convergence on AI control and monitoring aligns with what many in the field consider the most actionable near-term safety research. LASR is not working on toy problems or theoretical curiosities.

Strategic location and colocation. Operating from LISA alongside Apollo Research, ARENA, Pivotal, and other safety organisations creates natural collaboration and recruitment channels for alumni.

Funder diversification beginning. The AISI Alignment Project grant (up to GBP 1M) is the first significant non-CG funding, and comes from a government-backed source that adds institutional credibility.

Concrete infrastructure contributions. ASET's management of Inspect Evals for UK AISI provides tangible value beyond research papers -- this is safety infrastructure that many organisations use.

Weaknesses and Risks

Extreme funder concentration. Near-total dependence on Coefficient Giving. If CG shifts priorities, reduces AI safety spending, or loses confidence in Arcadia, the organisation faces existential risk. The AISI grant helps but is not sufficient to sustain operations independently.

Core team lacks research credentials. Erin, Joe, and Brandon are community builders and operators. Research quality depends entirely on external supervisors who have no formal obligations to LASR. This creates a fragile dependency.

Marginal value in a crowded fellowship landscape. MATS, Astra, ERA, PIBBSS, Pivotal, and ARENA already serve overlapping populations. It is unclear how many of LASR's 50+ alumni would not have entered AI safety through another programme. The counterfactual impact may be lower than the headline numbers suggest.

No independent outcome verification. The 90% placement claim is the strongest argument for LASR's impact, and it is entirely self-reported. Without external tracking, this figure cannot be evaluated.

The "bycatch" problem. With ~500 applications and ~20 accepted per cohort, ~480 people per round are rejected from LASR. Most of these apply to multiple programmes and receive no institutional support if rejected from all. LASR contributes to this ecosystem problem without addressing it.

Programme Director is invisible. Erin Robertson has almost no public voice. For an organisation whose value depends on research direction, this opacity makes it difficult for outsiders to evaluate whether LASR is well-led intellectually.

Sprint model limits depth. 13 weeks is enough to produce a paper but not to develop deep research taste or open genuinely new research directions. LASR may be optimising for publication count over intellectual contribution.

Cross-References

MATS -- The most direct comparison. MATS is larger (100+ per year), runs longer (4 months), uses 1-on-1 mentorship, and is better known. LASR is cheaper per participant, team-based, and London-based. They serve overlapping but not identical populations (LASR emphasizes team work and research prioritisation; MATS emphasizes deep individual mentorship).

ARENA -- Curriculum-based ML engineering programme, also at LISA. More introductory than LASR; could function as a feeder pipeline (learn ML at ARENA, do research at LASR).

UK AISI -- LASR's most important institutional relationship. Charlie Griffin's advisory role, supervisor recruitment from AISI, the Alignment Project grant, and ASET's Inspect Evals work all create deep ties. LASR is effectively part of the AISI talent ecosystem.

Pivotal Research -- Also at LISA, also does research sprints. Potential redundancy in the London AI safety coworking space.

Coefficient Giving -- LASR's lifeline. CG's "Global Catastrophic Risks Capacity Building" programme is the primary funder of field-building work like LASR. CG's strategic direction determines LASR's funding future.

What Would Change This Assessment

Evidence that would significantly upgrade my view:

  • Independent verification showing >80% of LASR alumni are in safety-relevant roles 2+ years post-programme (not just immediately after)
  • A LASR paper that opens a genuinely new research direction rather than extending existing ones
  • Successful funder diversification (e.g., >30% of revenue from non-CG sources within 2 years)
  • Erin Robertson publishing a clear articulation of LASR's research strategy and prioritisation framework

Evidence that would significantly downgrade my view:

  • Alumni tracking showing most LASR graduates would have entered AI safety through other programmes (low counterfactual impact)
  • Loss of key supervisors leading to visible research quality decline
  • CG funding cut or reduction signaling loss of confidence
  • Evidence that LASR optimises for CG reporting metrics (throughput, acceptance rates) over research quality

Self-Critique

Weakest claim: My assessment that LASR's marginal impact may be lower due to a crowded fellowship landscape is inferential. I do not have counterfactual data on what LASR alumni would have done otherwise. Some alumni specifically cited the team-based format as what attracted them, suggesting LASR reaches people other programmes do not.

Potential biases: (1) I may overweight the "over-optimised for researchers" critique because it is the most intellectually interesting argument, even though the specific survey evidence (25 org leaders) is thin. (2) I may underweight the simple value of producing more researchers -- even if the marginal researcher is less impactful than the marginal policy person, the research pipeline produces concrete outputs (papers, tools) that are easier to evaluate than policy impacts. (3) My assessment is shaped by the evidence scout's searches, which may have missed relevant sources.

What a thoughtful disagreer would say: "You are overthinking this. LASR takes $14K and 13 weeks per person and converts them into published AI safety researchers at top venues. The cost-effectiveness is unmatched. The researcher bottleneck is real even if not the only bottleneck. Perfect is the enemy of good, and producing more safety researchers is clearly positive-EV even if it is not the most neglected intervention."

Single weakest claim: That supervisor quality is a "fragile dependency." In practice, LASR has attracted strong supervisors for 4+ cohorts, the colocation at LISA creates natural recruitment channels, and the AISI relationship provides institutional backing. The fragility is theoretical; empirically, it has been robust so far.

Information that would most change my view: Independent counterfactual data on LASR alumni. If a credible study showed that, say, 50% of LASR alumni would not have entered AI safety without the programme, that would dramatically increase my assessment of LASR's impact. If it showed 90% would have entered through other channels, the opposite.

Connected to (10)

80,000 Hoursboard overlap · Jorgen Ljoenes
Anthropicboard overlap · Adam Jones
Apollo Researchstaff to
Arcadia Impactspun off from
BlueDot Impactadvisor at · Adam Jones
Coefficient Givingstaff to
Google DeepMindcollaborator · Mary Phuong
London Initiative for Safe AIcollaborator
OpenAIstaff to
UK AI Security Institutecollaborator · Charlie Griffin
Sources (35)
Every URL that was read during research.
  1. 1.LASR Labslasrlabs.org
  2. 2.About — LASR Labslasrlabs.org
  3. 3.Careers — LASR Labslasrlabs.org
  4. 4.About — Arcadia Impactarcadiaimpact.org
  5. 5.Arcadia Impactarcadiaimpact.org
  6. 6.FAQs — LASR Labslasrlabs.org
  7. 7.LASR Labs opens applications for 2025 AI safety summer research program in Londongetcoai.com
  8. 8.Navigating Transformative AIopenphilanthropy.org
  9. 9.London AI Safety Research (LASR)cybersecurityintelligence.com
  10. 10.Joe Hardie on Arcadia Impact's Projects (FBB #7)fieldbuilding.substack.com
  11. 11.The London Initiative for Safe AIsafeai.org.uk
  12. 12.About — The London Initiative for Safe AIsafeai.org.uk
  13. 13.ARCADIA IMPACT CICfind-and-update.company-information.service.gov.uk
  14. 14.About — Safe AI Londonsafeailondon.org
  15. 15.Past Projects — LASR Labslasrlabs.org
  16. 16.Hidden in Plain Text: Emergence & Mitigation of Steganographic Collusion in LLMsarxiv.org
  17. 17.CoT Red-Handed: Stress Testing Chain-of-Thought Monitoringarxiv.org
  18. 18.Search the register of charities - prd-ds-register-of-charities.charitycommission.gov.ukregister-of-charities.charitycommission.gov.uk
  19. 19.AI Governance Taskforce — Arcadia Impactarcadiaimpact.org
  20. 20.About | Orion AI Governance Initiativeorionaigov.org
  21. 21.The Alignment Project by AISI — The AI Security Institutealignmentproject.aisi.gov.uk
  22. 22.Navigating Transformative AIopenphilanthropy.org
  23. 23.JavaScript is not available.x.com
  24. 24.LASR Labgithub.com
  25. 25.AI Safety’s Talent Pipeline is Over-optimised for Researchers — EA Forumforum.effectivealtruism.org
  26. 26.AI Safety has a scaling problemboydkane.com
  27. 27.Fundsopenphilanthropy.org
  28. 28.Funding 60 projects to advance AI alignment research | AISI Workaisi.gov.uk
  29. 29.LASR Labs Programme (Summer 2026) | Effective Altruismeffectivealtruism.org
  30. 30.GitHub - UKGovernmentBEIS/inspect_evals: Collection of evals for Inspect AIgithub.com
  31. 31.Operations Associate — Arcadia Impactarcadiaimpact.org
  32. 32.Business Operations Manager — Arcadia Impactarcadiaimpact.org
  33. 33.Announcing Arcadia Impact — EA Forumforum.effectivealtruism.org
  34. 34.Our Progress in 2024 and Plans for 2025coefficientgiving.org
  35. 35.nowadamjones.me