Theory of Change
ARIA's overarching mission is metascience, not AI safety: "fund breakthrough R&D in underexplored areas to catalyse new paths to prosperity." Founding CEO Ilan Gur framed ARIA as a "catalyst" whose job is "wave creation, not wave riding" -- injecting energy into flat waters before trends form, unlike VCs who surf existing waves.
Within ARIA, the AI safety theory of change is concentrated in one programme: Safeguarded AI (£59M, ~$80M), led by David "davidad" Dalrymple. His theory of change targets "the critical period" -- a window (estimated 6-7 year median) between current AI and superintelligence, where systems are too powerful to deploy openly but possible to contain. The plan:
- Keep a superintelligent-class AI in a formally verified containment system (verified hardware, verified OS, controlled I/O channels).
- Use that contained system to develop special-purpose controllers for specific real-world domains (energy grids, drug manufacturing, autonomous vehicles).
- Formally verify each controller against mathematical specifications before deployment.
- Deploy only the verified controllers -- not the frontier model itself.
davidad (FLI podcast, Jan 2025): "We create a workflow around this contained system which is a general purpose way of using it to do autonomous R&D of special purpose agents that have quantitative safety guarantees relative to specific contexts of use."
The paper "Towards Guaranteed Safe AI" (2024, co-authored by davidad, Bengio, Russell, Tegmark, and others) is the intellectual manifesto: world model + safety specification + formal verifier.
What They Do
Safeguarded AI Programme (~£59M):
- TA1.1 Theory: 57+ funded mathematicians building a formal language for representing real-world problems. Teams at Oxford, Edinburgh, UCL, York, Topos UK, ALTER (Vanessa Kosoy), and others.
- TA1.2/1.3 Platform (£14.2M): Software implementation of the theoretical frameworks plus human-computer interfaces.
- TA1.4 Sociotechnical (£3.4M): Tools for multi-stakeholder deliberation on risk thresholds.
- TA2 Machine Learning (£18M): Originally planned as a single grant to a new nonprofit entity with "unprecedented governance robustness." Cancelled October 2025 -- davidad concluded frontier AI capabilities were advancing faster than expected, making dedicated ML R&D less necessary.
- TA3 Applications (£5.4M Phase 1): Domain-specific applications. Phase 2 cancelled October 2025, pivoted toward cybersecurity (formally verified firewalls for critical infrastructure).
The October 2025 pivot is significant. davidad: "every frontier model has more capability than I expected at that point in time." The revised strategy broadens TA1 from cyber-physical system control to "software and hardware verification, auditable multi-agent systems, and more informal knowledge." The toolkit will be released open source for frontier labs to use.
Other AI-relevant programmes:
- Scaling Trust (~£50M, Alex Obadia): Trust infrastructure for AI agents to securely coordinate. Scaling Trust Arena launching Q3 2026.
- Scaling Compute (~£100M, Suraj Bramhavar): Reduce AI hardware costs 1000x. Includes £50M Scaling Inference Lab.
- Co-funder of AISI Alignment Project alongside OpenAI, Anthropic, Microsoft, AWS, Schmidt Sciences.
Non-AI portfolio (majority of ARIA): Programmable Plants, Precision Neurotechnologies (£69M), climate monitoring, geoengineering (£46M), innate immunity, ecosystem resilience.
Operational model: 3-page applications, 3-week decisions. Seed funding up to £500K. 40% of funding goes outside academia. 15 companies already spun out. "Activation Partners" including Convergent Research (FROs), Google DeepMind, Pillar VC.
Key People
Kathleen Fisher (CEO from Feb 2026): ACM Fellow, Tufts CS professor. Ran DARPA's HACMS programme (~$50M), which successfully used formally verified software to defend a military helicopter from hackers. Directed DARPA's Information Innovation Office ($500M portfolio, 50+ programmes). Her HACMS background is directly relevant to the Safeguarded AI pivot toward cybersecurity.
David "davidad" Dalrymple (Programme Director, Safeguarded AI): Youngest MIT master's graduate (age 14). Harvard biophysics (funded by Larry Page and Peter Thiel). Co-invented Filecoin. Visiting researcher at MIRI. Research fellow at FHI (Oxford). The intellectual architect of ARIA's AI safety programme and co-author of the "Guaranteed Safe AI" paper. On the FLI podcast, discusses mind uploading and existential risk with a seriousness unusual for a government programme director.
Ilan Gur (founding CEO, Jan 2023 - Feb 2026, now advisor): Materials scientist, ARPA-E programme director, Activate.org founder. Designed ARIA's PD cohort recruitment model. Farewell letter warned against "normalcy creep" as ARIA's greatest risk.
Matt Clifford (Board Chair): Co-founder of Entrepreneur First. Vice Chair of UK AISI Advisory Board. Was PM's representative for the AI Safety Summit at Bletchley Park. Creates a direct governance link between ARIA and AISI.
Board: 8 Non-Executive Directors. Notable: Max Jaderberg (Chief AI Officer, Isomorphic Labs/Alphabet), Dame Angela McLean (UK Chief Scientific Adviser), Dame Kate Bingham (Vaccine Taskforce). Advisors: Demis Hassabis (DeepMind), Patrick Collison (Stripe/Arc), Arun Majumdar (founding ARPA-E director).
Team: ~53 staff (March 2025), 16 Programme Directors across 2 cohorts. Extremely lean for a £1B budget.
Money and Incentives
Budget: £1B+ through 2028-29. Annual budget reaching ~£400M/year by 2030. Entirely UK government-funded via DSIT.
Spending pace: Only £16.5M actually spent on research in first 2 years, with ~£600M unspent. £219M under contract by FY 2024-25 end. Gur defends this as expected for multi-year programmes, but critics note the slow ramp-up.
AI safety allocation: ~£210M+ across Safeguarded AI, Scaling Trust, and Scaling Compute -- roughly 20% of total budget. The rest covers biology, climate, neurotech, and other areas.
Incentive structure: ARIA is rare among AI safety-adjacent organizations in having zero financial ties to AI labs. No lab funding, no compute credits, no industry revenue. This removes the conflict of interest that plagues lab-affiliated safety organizations. The tradeoff: ARIA is subject to political risk (government changes) rather than market risk (lab partnerships).
Comparison to DARPA: £400M/year is roughly one DARPA office. DARPA spends ~$4B/year. But DARPA has the Department of Defense as a built-in customer; ARIA has no equivalent procurement power. This is the structural gap that most concerns ARIA's supporters.
Exempt from Treasury savings squeeze. ARIA was not asked to find savings during the 2025 Spending Review when other departments faced 5% cuts.
What Others Say
Ben Goertzel (May 2024): "Provably Safe AGI is Potentially a Very Dangerous Concept." He acknowledges the technical value of formal verification research but argues: (1) the paper downplays the "extreme practical difficulty" of applying formal methods to anything near AGI; (2) world models can never be fully correct -- "a superintelligence might be able to easily poke holes in our attempts to formally describe various aspects of our world"; (3) the concept serves as "a stalking horse for heavy AI regulation restricting AGI R&D to government/corporate labs." He identifies a "disturbing convergence" between safety researchers, megacorporations, and governments all benefiting from restricting who can do AGI research.
George Freeman (former UK science minister, friendly critic): "Whilst Aria was inspired by Darpa, we have to recognise that it doesn't have the US defence procurement budget behind it." Worried about the "diffuse" portfolio and wants parliamentary accountability: "its chair and board must come before the Public Accounts and Science committees and explain their strategy."
House of Commons Science Committee (2021): Called ARIA "a brand in search of a product." This preceded actual programme creation.
Terence Kealey (former University of Buckingham VC): "DARPA has refused to be audited and now Aria has been set up without even an attempt at a cost-benefit model -- it's quite extraordinary." Warns ARIA's grants to private companies may "crowd out" rather than "crowd in" industry R&D.
Steven Chu (Nobel laureate, ARPA-E founder): Defends the model. "The only thing we could point to, in spades, was whenever Arpa-E money flowed, a lot more private sector money came behind it within months." Key metric: private follow-on investment.
davidad himself is candid about limitations: "most of the safety benefits would be compromised by needing to make a Bayesian prior assumption" -- he acknowledges that non-Bayesian imprecise probability (not yet fully developed) is needed for the strongest safety guarantees. The TA2 pivot was an honest admission that frontier AI was advancing faster than expected.
What's Absent
- No published research outputs from the 57+ funded TA1.1 mathematicians after ~2 years of funding.
- No external evaluation framework for the Safeguarded AI programme's success or failure.
- No criticism specific to ARIA's execution of Safeguarded AI (all critique targets the theoretical framework, not implementation).
- No discussion in the AI safety community of ARIA as an institution -- davidad's work is discussed, but ARIA's governance, portfolio balance, and institutional design are largely invisible to LessWrong/EA Forum.
- No public IP governance framework for the formal verification toolkit that ARIA plans to release open source.
- Zero programme director departures (first cohort ~2 years in, second cohort ~1 year).
Recommended Reading
Ilan Gur oral history (FreakTakes/Asimov Press, Aug 2025) -- 21K-word two-hour interview. The single most candid, detailed source on ARIA's philosophy. Gur on incentives: "I think that is the only theme that matters." https://goodscienceproject.org/articles/an-oral-history-interview-with-aria-ceo-ilan-gur/
davidad on FLI podcast (Jan 2025) -- Full transcript of the safeguarded AI stack: containment, verification layers, the critical period, imprecise probability, and mind uploading. Reveals davidad as deeply motivated by existential risk. https://futureoflife.org/podcast/david-dalrymple-on-safeguarded-transformative-ai/
Goertzel: "Provably Safe AGI is Potentially a Very Dangerous Concept" (May 2024) -- The strongest independent critique. https://bengoertzel.substack.com/p/provably-safe-agi-is-potentially
"How will we know if Aria is hitting the right notes?" (THE, Feb 2026) -- Most balanced critical analysis. Features George Freeman, Steven Chu, the Turing Institute cautionary tale. https://www.timeshighereducation.com/depth/amid-funding-discord-how-will-we-know-if-aria-hitting-right-notes
How to Build the British ARPA (Statecraft, Sep 2024) -- Insider account of ARIA's political creation. https://www.statecraft.pub/p/how-to-build-the-british-arpa