ARIA (Advanced Research + Invention Agency)

Governance

UK gov R&D agency. Safeguarded AI.

Founded: 2023
HQ: London, UK
Team: 53
Structure: non-departmental public body (UK)
Model: Government Contracts

Theory of Change

ARIA's overarching mission is metascience, not AI safety: "fund breakthrough R&D in underexplored areas to catalyse new paths to prosperity." Founding CEO Ilan Gur framed ARIA as a "catalyst" whose job is "wave creation, not wave riding" -- injecting energy into flat waters before trends form, unlike VCs who surf existing waves.

Within ARIA, the AI safety theory of change is concentrated in one programme: Safeguarded AI (£59M, ~$80M), led by David "davidad" Dalrymple. His theory of change targets "the critical period" -- a window (estimated 6-7 year median) between current AI and superintelligence, where systems are too powerful to deploy openly but possible to contain. The plan:

Keep a superintelligent-class AI in a formally verified containment system (verified hardware, verified OS, controlled I/O channels).
Use that contained system to develop special-purpose controllers for specific real-world domains (energy grids, drug manufacturing, autonomous vehicles).
Formally verify each controller against mathematical specifications before deployment.
Deploy only the verified controllers -- not the frontier model itself.

davidad (FLI podcast, Jan 2025): "We create a workflow around this contained system which is a general purpose way of using it to do autonomous R&D of special purpose agents that have quantitative safety guarantees relative to specific contexts of use."

The paper "Towards Guaranteed Safe AI" (2024, co-authored by davidad, Bengio, Russell, Tegmark, and others) is the intellectual manifesto: world model + safety specification + formal verifier.

What They Do

Safeguarded AI Programme (~£59M):

TA1.1 Theory: 57+ funded mathematicians building a formal language for representing real-world problems. Teams at Oxford, Edinburgh, UCL, York, Topos UK, ALTER (Vanessa Kosoy), and others.
TA1.2/1.3 Platform (£14.2M): Software implementation of the theoretical frameworks plus human-computer interfaces.
TA1.4 Sociotechnical (£3.4M): Tools for multi-stakeholder deliberation on risk thresholds.
TA2 Machine Learning (£18M): Originally planned as a single grant to a new nonprofit entity with "unprecedented governance robustness." Cancelled October 2025 -- davidad concluded frontier AI capabilities were advancing faster than expected, making dedicated ML R&D less necessary.
TA3 Applications (£5.4M Phase 1): Domain-specific applications. Phase 2 cancelled October 2025, pivoted toward cybersecurity (formally verified firewalls for critical infrastructure).

The October 2025 pivot is significant. davidad: "every frontier model has more capability than I expected at that point in time." The revised strategy broadens TA1 from cyber-physical system control to "software and hardware verification, auditable multi-agent systems, and more informal knowledge." The toolkit will be released open source for frontier labs to use.

Other AI-relevant programmes:

Scaling Trust (~£50M, Alex Obadia): Trust infrastructure for AI agents to securely coordinate. Scaling Trust Arena launching Q3 2026.
Scaling Compute (~£100M, Suraj Bramhavar): Reduce AI hardware costs 1000x. Includes £50M Scaling Inference Lab.
Co-funder of AISI Alignment Project alongside OpenAI, Anthropic, Microsoft, AWS, Schmidt Sciences.

Non-AI portfolio (majority of ARIA): Programmable Plants, Precision Neurotechnologies (£69M), climate monitoring, geoengineering (£46M), innate immunity, ecosystem resilience.

Operational model: 3-page applications, 3-week decisions. Seed funding up to £500K. 40% of funding goes outside academia. 15 companies already spun out. "Activation Partners" including Convergent Research (FROs), Google DeepMind, Pillar VC.

Key People

Kathleen Fisher (CEO from Feb 2026): ACM Fellow, Tufts CS professor. Ran DARPA's HACMS programme (~$50M), which successfully used formally verified software to defend a military helicopter from hackers. Directed DARPA's Information Innovation Office ($500M portfolio, 50+ programmes). Her HACMS background is directly relevant to the Safeguarded AI pivot toward cybersecurity.

David "davidad" Dalrymple (Programme Director, Safeguarded AI): Youngest MIT master's graduate (age 14). Harvard biophysics (funded by Larry Page and Peter Thiel). Co-invented Filecoin. Visiting researcher at MIRI. Research fellow at FHI (Oxford). The intellectual architect of ARIA's AI safety programme and co-author of the "Guaranteed Safe AI" paper. On the FLI podcast, discusses mind uploading and existential risk with a seriousness unusual for a government programme director.

Ilan Gur (founding CEO, Jan 2023 - Feb 2026, now advisor): Materials scientist, ARPA-E programme director, Activate.org founder. Designed ARIA's PD cohort recruitment model. Farewell letter warned against "normalcy creep" as ARIA's greatest risk.

Matt Clifford (Board Chair): Co-founder of Entrepreneur First. Vice Chair of UK AISI Advisory Board. Was PM's representative for the AI Safety Summit at Bletchley Park. Creates a direct governance link between ARIA and AISI.

Board: 8 Non-Executive Directors. Notable: Max Jaderberg (Chief AI Officer, Isomorphic Labs/Alphabet), Dame Angela McLean (UK Chief Scientific Adviser), Dame Kate Bingham (Vaccine Taskforce). Advisors: Demis Hassabis (DeepMind), Patrick Collison (Stripe/Arc), Arun Majumdar (founding ARPA-E director).

Team: ~53 staff (March 2025), 16 Programme Directors across 2 cohorts. Extremely lean for a £1B budget.

Money and Incentives

Budget: £1B+ through 2028-29. Annual budget reaching ~£400M/year by 2030. Entirely UK government-funded via DSIT.

Spending pace: Only £16.5M actually spent on research in first 2 years, with ~£600M unspent. £219M under contract by FY 2024-25 end. Gur defends this as expected for multi-year programmes, but critics note the slow ramp-up.

AI safety allocation: ~£210M+ across Safeguarded AI, Scaling Trust, and Scaling Compute -- roughly 20% of total budget. The rest covers biology, climate, neurotech, and other areas.

Incentive structure: ARIA is rare among AI safety-adjacent organizations in having zero financial ties to AI labs. No lab funding, no compute credits, no industry revenue. This removes the conflict of interest that plagues lab-affiliated safety organizations. The tradeoff: ARIA is subject to political risk (government changes) rather than market risk (lab partnerships).

Comparison to DARPA: £400M/year is roughly one DARPA office. DARPA spends ~$4B/year. But DARPA has the Department of Defense as a built-in customer; ARIA has no equivalent procurement power. This is the structural gap that most concerns ARIA's supporters.

Exempt from Treasury savings squeeze. ARIA was not asked to find savings during the 2025 Spending Review when other departments faced 5% cuts.

What Others Say

Ben Goertzel (May 2024): "Provably Safe AGI is Potentially a Very Dangerous Concept." He acknowledges the technical value of formal verification research but argues: (1) the paper downplays the "extreme practical difficulty" of applying formal methods to anything near AGI; (2) world models can never be fully correct -- "a superintelligence might be able to easily poke holes in our attempts to formally describe various aspects of our world"; (3) the concept serves as "a stalking horse for heavy AI regulation restricting AGI R&D to government/corporate labs." He identifies a "disturbing convergence" between safety researchers, megacorporations, and governments all benefiting from restricting who can do AGI research.

George Freeman (former UK science minister, friendly critic): "Whilst Aria was inspired by Darpa, we have to recognise that it doesn't have the US defence procurement budget behind it." Worried about the "diffuse" portfolio and wants parliamentary accountability: "its chair and board must come before the Public Accounts and Science committees and explain their strategy."

House of Commons Science Committee (2021): Called ARIA "a brand in search of a product." This preceded actual programme creation.

Terence Kealey (former University of Buckingham VC): "DARPA has refused to be audited and now Aria has been set up without even an attempt at a cost-benefit model -- it's quite extraordinary." Warns ARIA's grants to private companies may "crowd out" rather than "crowd in" industry R&D.

Steven Chu (Nobel laureate, ARPA-E founder): Defends the model. "The only thing we could point to, in spades, was whenever Arpa-E money flowed, a lot more private sector money came behind it within months." Key metric: private follow-on investment.

davidad himself is candid about limitations: "most of the safety benefits would be compromised by needing to make a Bayesian prior assumption" -- he acknowledges that non-Bayesian imprecise probability (not yet fully developed) is needed for the strongest safety guarantees. The TA2 pivot was an honest admission that frontier AI was advancing faster than expected.

What's Absent

No published research outputs from the 57+ funded TA1.1 mathematicians after ~2 years of funding.
No external evaluation framework for the Safeguarded AI programme's success or failure.
No criticism specific to ARIA's execution of Safeguarded AI (all critique targets the theoretical framework, not implementation).
No discussion in the AI safety community of ARIA as an institution -- davidad's work is discussed, but ARIA's governance, portfolio balance, and institutional design are largely invisible to LessWrong/EA Forum.
No public IP governance framework for the formal verification toolkit that ARIA plans to release open source.
Zero programme director departures (first cohort ~2 years in, second cohort ~1 year).

Stated Theory of Change

Institutional level: ARIA exists to "catalyze new paths to prosperity" by funding high-risk research that consensus-based funding systems would reject. The mechanism is the programme director model: recruit exceptional individuals, give them £50M and 3-5 years, let them shape opportunity spaces and fund "constellations of teams" pursuing paradigm-shifting goals.

AI safety level (Safeguarded AI): davidad's theory of change targets the "critical period" between current AI and superintelligence. The mechanism: use formally verified containment to safely harness superintelligent-class AI for generating domain-specific controllers with mathematical safety guarantees. These verified controllers (not the frontier models themselves) would be deployed in safety-critical applications. The programme aims to demonstrate by ~2028 that this workflow produces economic value on the order of £1B/year, creating an equilibrium where it is rational for actors to delay pursuing full superintelligence.

The critical insight: davidad is not trying to solve alignment. He is trying to solve a deployment problem for a specific capability window, creating economic incentives to stay in that window longer. This is a game-theoretic intervention, not just a technical one.

Revealed Theory of Change

ARIA's actions are largely consistent with its stated mission, with some important caveats:

What it actually does: Funds 9+ programmes across biology, climate, neurotech, AI, and other areas. About 20% of its budget is AI-related. The Safeguarded AI programme is genuinely ambitious and technically coherent. The October 2025 pivot (cancelling TA2 and pivoting TA3 to cybersecurity) demonstrates real adaptability -- the organisation updated on evidence rather than persisting with an outdated plan.

Where actions diverge from rhetoric:

The pivot narrows near-term ambition. The original Safeguarded AI vision was breathtaking: a new nonprofit entity running a contained superintelligence to produce verified controllers for energy grids, drug manufacturing, and autonomous vehicles. After the pivot, the near-term focus is formally verified firewalls. This is still valuable but dramatically more modest. The grand vision remains aspirational rather than operational.
The AI safety programme is small relative to ARIA's total portfolio. £59M for Safeguarded AI within a £1B+ agency means AI safety is one of many bets, not the core mission. ARIA is institutionally committed to portfolio diversification, not to AI safety specifically.
The new CEO's background signals a further pivot toward cybersecurity and defense applications. Kathleen Fisher's HACMS programme is the most successful formal verification project in DARPA history -- but it was a defense cybersecurity programme. Her appointment, combined with the TA3 pivot to cybersecurity, suggests ARIA's Safeguarded AI programme may increasingly emphasize near-term cybersecurity over long-term AI containment.

Key Assumptions

Assumption 1: Formal verification can scale to complex real-world systems.

Evidence for: DARPA's HACMS successfully verified a helicopter control system. AlphaProof demonstrates AI-assisted proof generation. The formal methods community has made steady progress on verified compilers (CompCert, seL4).
Evidence against: Goertzel: formal methods are "notoriously difficult to apply to neural network-based AI systems." Current verification handles narrow, well-specified domains. The gap between verifying a firewall and verifying an energy grid controller operating in a complex, uncertain physical world is enormous.
Testable: Yes -- the programme targets a 2028 demonstration.
If wrong: The formal verification toolkit becomes a useful academic contribution but fails to produce the deployment-ready workflow davidad envisions. AI safety loses a rare government-funded technical programme.

Assumption 2: A "critical period" exists where containment is feasible.

Evidence for: Current systems are powerful but containable. METR capability evaluations show steep capability curves that suggest intermediate capabilities persist for years.
Evidence against: If capability gains are discontinuous (e.g., a sudden jump past containment), the critical period could be very short. If capable systems are widely available open-source (as they increasingly are), containment of any one system is moot.
Testable: Only in retrospect.
If wrong: The entire Safeguarded AI programme architecture is obsolete.

Assumption 3: Economic incentives can create an equilibrium to delay superintelligence.

Evidence for: davidad's game-theoretic argument: if safeguarded AI captures 15% of economic value with strong safety guarantees, rational actors may accept this rather than rushing to full AGI. Companies have made frontier safety commitments that would (in theory) cause them to pause.
Evidence against: Competitive pressure between labs. First-mover advantage in AGI is perceived as enormous. "Provable" safety from Safeguarded AI covers only narrow domains, not the full economic potential of AGI.
Testable: Only if the programme succeeds technically.
If wrong: Safeguarded AI becomes a useful set of tools for specific applications but fails to alter the trajectory of AI development.

Assumption 4: ARIA can maintain institutional quality long-term.

Evidence for: Bipartisan survival through four PMs. Budget increase under Labour. The Gur-to-Fisher transition was smooth.
Evidence against: The Turing Institute precedent (mission creep, fragmented portfolio). Gur's own warning about "normalcy creep." The 10-year kill switch means ARIA must demonstrate value by ~2033. PD term limits create knowledge loss.
Testable: Watch for mission creep, hiring patterns, and whether bold bets persist under Fisher.
If wrong: ARIA becomes another well-funded but unfocused government R&D body.

Strengths

Structural independence from AI labs. ARIA is the only significant AI safety funder with zero financial ties to frontier labs. No lab funding, no compute credits, no revolving door (yet). This is a genuine advantage for credibility and independent evaluation.
davidad is a rare combination of technical depth and institutional positioning. A visiting researcher at MIRI and FHI, co-inventor of a major cryptocurrency, youngest MIT master's graduate, now running a £59M government programme. He is perhaps uniquely positioned to bridge alignment research and formal methods.
The PD model enables genuine adaptability. The October 2025 pivot -- cancelling a major procurement process mid-stream because the evidence changed -- is almost unheard of in government. Gur: "One of the things I appreciate about ARIA is that they back Programme Directors to steer into new directions before the picture is complete."
Government funding means long time horizons. Unlike philanthropy-funded safety orgs that must justify each grant cycle, ARIA's 5-year programmes with guaranteed budgets allow genuinely long-term research planning. The 57 mathematicians working on formal language foundations could not be funded this way through most philanthropic channels.
Kathleen Fisher's HACMS background. The new CEO literally ran the most successful formal verification programme in DARPA history. If there is anyone who can assess whether Safeguarded AI is technically feasible, it is her.

Weaknesses and Risks

The programme may be too early. 57 mathematicians building formal language foundations is the right research, but it may need a decade to mature, not the 3-5 years of a PD term. The disconnect between the theoretical foundations (TA1) and the demo timeline (~2028) is concerning.
The pivot significantly narrowed near-term ambition. From "contained superintelligence producing verified controllers for energy grids" to "formally verified firewalls" is a large step down in ambition. This may be appropriate given the evidence, but it also means the programme's flagship demonstration will be less impressive than the original vision.
ARIA's AI safety commitment is proportionally small. £59M for Safeguarded AI within a £1B+ agency. If the political winds shift -- say, a new government wants to focus ARIA on defense or economic growth -- the AI safety programme could be deprioritized without anyone outside the programme noticing.
No accountability mechanism for programme success. ARIA is FOI-exempt, has no published evaluation framework for Safeguarded AI, and its parliamentary scrutiny is light. Who decides whether the 2028 demonstration succeeded? There is no external review panel, no published milestones, and no public consequences for missing targets.
Goertzel's political critique has merit. The "provably safe AI" concept can be weaponized to argue that only organizations with formal verification capabilities should be allowed to develop advanced AI. Even if davidad does not intend this, the paper he co-authored with Bengio, Russell, and Tegmark can be cited to justify regulatory capture.
The UK lacks DARPA's customer base. DARPA's successes were amplified by Department of Defense procurement. ARIA has no built-in customer for its AI safety technology. If Safeguarded AI produces a useful toolkit, who deploys it? Frontier AI labs have no obligation to use ARIA's tools, and ARIA has no regulatory authority to require them.

Cross-References

AISI (UK AI Safety Institute): ARIA's closest institutional partner. Matt Clifford chairs ARIA and vice-chairs AISI's advisory board. davidad works with AISI on safety cases. ARIA develops technology; AISI evaluates and advises. They are complementary, not competing.
MIRI: davidad was a frequent visitor. His programme builds on the formal verification tradition that MIRI helped develop, but with a pragmatic "critical period" focus rather than MIRI's more ambitious alignment approach.
Anthropic/OpenAI/DeepMind: ARIA co-funds the AISI Alignment Project alongside these labs. But ARIA has no financial dependency on them. The relationship is collaborative, not subservient.
DARPA: ARIA's explicit model. HACMS (Fisher's programme) is the direct precedent for Safeguarded AI. The institutional design -- PDs, term limits, high-risk tolerance -- is directly copied.
Coefficient Giving/Open Phil: No relationship. ARIA is government-funded and does not receive philanthropic grants. This is a meaningful structural difference from most AI safety organizations.

What Would Change This Assessment

Published research outputs from TA1.1 teams showing genuine progress on a unified formal language for real-world systems would significantly strengthen confidence.
A frontier lab adopting the formal verification toolkit would validate the deployment pathway.
A programme director departure (especially davidad) would signal problems.
ARIA budget cuts or mission redirection under political pressure would threaten the entire AI safety portfolio.
A successful 2028 demonstration of economically valuable verified controllers would be transformative for the field.
Evidence that the "critical period" is much shorter than expected would make the programme's timeline untenable.

Self-Critique

What sources should I have checked but didn't?

davidad's LessWrong/Alignment Forum posts (blocked by bot detection). These contain his most detailed technical writing and community engagement.
The Safeguarded AI programme thesis PDF (50+ pages, returned binary data). This is the most detailed articulation of the technical vision.
Parliament hearing transcript (September 2025) -- only available through inaccessible parliament.uk.

Where is this analysis potentially biased?

I may be overly impressed by davidad's technical vision because it is articulated so clearly and ambitiously. Clear articulation is not the same as feasibility.
I may underweight the political risks because ARIA has survived government changes so far. Past survival does not guarantee future survival.
ARIA's AI safety work is a small fraction of a much larger agency, but I have given it disproportionate attention because of the brief's focus. A more balanced view would note that ARIA's primary impact will likely be outside AI safety.

What would a thoughtful person who disagrees say?

"Formal verification for AI is a dead end. The gap between verifying a firewall and verifying meaningful real-world AI systems is unbridgeable. ARIA is throwing £59M at an academic exercise."
"Government agencies cannot innovate in AI safety. The pace of AI development is too fast for a 3-page-application-3-week-turnaround government funder to keep up."
"ARIA's FOI exemption and lack of evaluation framework means we have no way to know if the money is being well spent."

What's my single weakest claim? My claim that davidad's game-theoretic argument (rational actors delay superintelligence if they can capture value through verified controllers) is plausible. This requires frontier labs to voluntarily limit their most profitable capabilities, which conflicts with intense competitive pressure. The history of technology regulation suggests that "rational equilibria" rarely hold when billions of dollars are at stake.

What information would most change my view? Published technical results from the TA1.1 teams. If 57 funded mathematicians have been working for 2 years and cannot show even preliminary results toward a unified formal language, the programme may be more aspiration than execution. Conversely, if there are genuine breakthroughs, this could be the most important AI safety programme in the world.

Connected to (11)

Anthropiccollaborator Convergent Researchcollaborator OpenAIcollaborator Future of Humanity Institutestaff from · David Dalrymple Google DeepMindadvisor at · Demis Hassabis UK AI Safety Institutecollaborator · Matt Clifford

DARPAstaff from · Kathleen Fisher

ARPA-Estaff from · Ilan Gur

Entrepreneur Firstboard overlap · Matt Clifford

Machine Intelligence Research Institutestaff from · David Dalrymple

Stripeadvisor at · Patrick Collison

Sources (67)

Every URL that was read during research.

1.Advanced Research and Invention Agency - Wikipediaen.wikipedia.org
2.About ARIAaria.org.uk
3.Safeguarded AIaria.org.uk
4.AI progress and a Safeguarded AI pivotaria.org.uk
5.Can AI be used to control safety critical systems? A U.K.-funded research program aims to find out | Fortunefortune.com
6.David Dalrymple on Safeguarded, Transformative AI - Future of Life Institutefutureoflife.org
7.Davidad Dalrymple: Towards Provably Safe AIthegradientpub.substack.com
8.Inside ARIA, the UK's Bet to Build Scientific Revolutionsasimov.press
9.An Oral History Interview with ARIA CEO Ilan Gurgoodscienceproject.org
10.Introducing ARIA’s next CEOaria.org.uk
11.Stay bold, ARIA: A note from Ilan Guraria.org.uk
12.Is Aria, the UK’s new R&D agency, the answer to low growth?newstatesman.com
13.Pressure on ARIA to juggle risk in the pursuit of breakthroughsnature.com
14.Programme Directorsaria.org.uk
15.Directorsaria.org.uk
16.Our teamaria.org.uk
17.ARIA's Ilan Gur on backing the big breakthroughs barely anyone thinks are possiblesifted.eu
18.Uncovering the secrets of ARIAprospectmagazine.co.uk
19.ARIA: Failure to launch | Ayming UKayming.co.uk
20.Funding | Safeguarded AIaria.org.uk
21.Meet the Creators | Safeguarded AIaria.org.uk
22.David Dalrymple — Grokipediagrokipedia.com
23.Oxford researchers awarded ARIA funding to develop safety-first AI for businesscs.ox.ac.uk
24.Safe autonomous driving moves a step closer thanks to new ARIA fundingyork.ac.uk
25.Kathleen Fisher - Wikipediaen.wikipedia.org
26.Trust Everything, Everywherearia.org.uk
27.Factcheck: How the UK is – and is not – studying solar geoengineering - Carbon Briefcarbonbrief.org
28.ARIA’s ‘Exploring Climate Cooling’ Folly - Solar Geoengineering Non-Use Agreementsolargeoeng.org
29.Computer scientist Kathleen Fisher to become next head of UK’s advanced research agencychemistryworld.com
30.Ex-DARPA director to lead UK moonshot factory ARIAsifted.eu
31.Safeguarded AIaria.org.uk
32.Opportunity spacesaria.org.uk
33.Amid funding discord, how will we know if Aria is hitting the right notes?timeshighereducation.com
34.UK science agency ARIA spared Treasury savings squeezetheregister.com
35.Towards Guaranteed Safe AI: A Framework for Ensuring Robust and Reliable AI Systemsarxiv.org
36.Opportunity spacesaria.org.uk
37.The n-Category Cafégolem.ph.utexas.edu
38.Provably Safe AGI is Potentially a Very Dangerous Conceptbengoertzel.substack.com
39.Will ARIA bite the dust? 'UK DARPA' faces uncertain future - Tech Monitortechmonitor.ai
40.ARIA: Does UK Really Need its Own DARPA?granttree.co.uk
41.UK's Darpa clone faces tough test as govt considers fundingtheregister.com
42.ARIA's annual report and accounts 2024 to 2025gov.uk
43.Scaling Trustaria.org.uk
44.Scaling Computearia.org.uk
45.UK moonshot factory ARIA loses CEO Ilan Gur after three yearssifted.eu
46.David Dalrymple on Safeguarded, Transformative AI | Future of Life Instituteblog.biocomm.ai
47.David 'davidad' Dalrymple | AI Frontiersai-frontiers.org
48.What is the Advanced Research and Invention Agency (ARIA)?itpro.com
49.Is this Dominic Cummings' biggest legacy?spectator.co.uk
50.How to Build the British ARPAstatecraft.pub
51.About the Alignment Project — by AISI (The AI Security Institute)alignmentproject.aisi.gov.uk
52.Dominic Cummings on ARIA and research fundingwonkhe.com
53.Advanced Research and Invention Agency (ARIA): policy statementgov.uk
54.Board and Advisorsaria.org.uk
55.Reeves uplifts Aria budget to £1 billion and funds AI coursestimeshighereducation.com
56.November update: A Safeguarded AI pivot + real-time climate monitoringariaresearch.substack.com
57.Safeguarded AIaria.org.uk
58.How we workaria.org.uk
59.#11 David Dalrymple (davidad) on Mathematics for Safeguarded AI program at ARIA — ALIGNaialign.net
60.A look back at 2025aria.org.uk
61.Contactaria.org.uk
62.ARIA. A conversation on Safeguarded AI: A deep dive on TA2blog.biocomm.ai
63.March-April 2025 Progress in Guaranteed Safe AIgsai.substack.com
64.Oxford researchers awarded ARIA funding to develop safety-first AI |ox.ac.uk
65.Guaranteed Safe AI? World Models, Safety Specs, & Verifiers, with Nora Ammann & Ben Goldhabercognitiverevolution.ai
66.David Dalrymple - Foresight Instituteforesight.org
67.Category Theorists in AIjohncarlosbaez.wordpress.com