← AI Safety Orgs

US AI Safety Institute / CAISI

Evals/Testing

Government evaluator. Political uncertainty.

Founded
2023
HQ
Gaithersburg, MD
Team
71
Structure
federal_agency_division
Model
Government Appropriations

Theory of Change

The original AISI (Nov 2023 - June 2025) had a clear three-pillar theory: test frontier models before deployment, issue guidance and standards, conduct fundamental AI safety research. Elizabeth Kelly: "Our mission at the AI Safety Institute is really to advance the science of AI safety." She framed the three pillars as a "virtuous cycle" -- research informing testing, testing informing guidance, guidance informing research -- and explicitly rejected the safety-vs-innovation framing: "Safety promotes trust, which promotes adoption, which drives innovation."

The current CAISI (June 2025 - present) has a different theory. Commerce Secretary Lutnick: "For far too long, censorship and regulations have been used under the guise of national security. Innovators will no longer be limited by these standards." OSTP Director Kratsios: Biden "hijacked" NIST and "turned it into a safety model evaluation agency"; NIST should "go back to basics" on standards and measurement science, and then "the industry can do all the evals that they ever imagined."

The new mission centers on: (1) developing standards "free from ideological bias," (2) evaluating US vs adversary AI capabilities for national security, (3) serving as "industry's primary point of contact" for AI within the federal government.

What They Do

Evaluations. CAISI evaluates AI models, but the focus has shifted from pre-deployment safety testing of American frontier models to competitive intelligence against Chinese models. The publicly visible outputs since the rebrand are the DeepSeek evaluation (3 Chinese models vs 4 US models, finding DeepSeek lags in performance and has 94% jailbreak compliance vs 8% for US models) and the Kimi K2 evaluation. The joint US-UK pre-deployment evaluation of OpenAI o1 (Dec 2024) was the last publicly known evaluation of an American frontier model.

MOUs. Non-binding agreements with OpenAI and Anthropic for pre- and post-deployment model testing, first signed August 2024. These continued under the new administration (confirmed September 2025). However, no safety evaluation results from MOU testing have been published under the Trump administration. The full MOU text has never been made public.

Standards and guidance. NIST guidance documents: AI 800-1 (Managing Misuse Risk), AI 600-1, AI 100-4 (synthetic content), SP 800-218A (secure software development). The AI Risk Management Framework (created by Tabassi before AISI's founding) is used globally. The AI Action Plan directs CAISI to revise the AI RMF to remove references to "misinformation, DEI, and climate change."

Agent security. AI Agent Standards Initiative launched Feb 2026; RFI on securing AI agents issued Jan 2026; red-teaming competition on agent hijacking (jointly with Gray Swan + UK AISI, Mar 2026). This is technically substantive, forward-looking work.

Consortium. 280+ member organizations (renamed "NIST AI Consortium" from AISIC). Five working groups on biosecurity, safeguards, risk management, synthetic content, and red-teaming. Specific deliverables are largely unpublished.

International. Founded the International Network of AISIs with 10 countries (Nov 2024). The network was renamed to drop "safety" under US-UK pressure (resisted by other members). International consensus on automated evaluation best practices published Feb 2026.

Partnerships. NIST-MITRE $20M AI cybersecurity centers. CRADA with OpenMined for privacy-preserving evaluations. MOU with GSA for federal AI procurement evaluations. Signature Science $3M contract for CBRN AI testing. TRAINS taskforce for inter-agency national security AI testing (Commerce, Defense, Energy, DHS, NSA, NIH).

Key People

Elizabeth Kelly (First Director, Feb 2024 - Feb 2025). Co-authored Biden's AI Executive Order at the White House NEC. TIME 100 Most Influential in AI 2024. Built the original AISI from scratch in under a year, secured MOUs with OpenAI and Anthropic, launched the International Network, and attracted top researchers including Paul Christiano. Resigned shortly after Trump took office. In her farewell: "There is no other group with the technical skill or subject matter expertise to match AISI across the entire U.S. government."

Paul Christiano (Head of AI Safety, Apr 2024 - present). Invented RLHF at OpenAI. Founded ARC (Alignment Research Center). PhD UC Berkeley. Publicly estimates 50/50 chance of doom from human-level AI. His appointment caused a revolt among NIST staff (March 2024) who feared his EA ties would compromise NIST's objectivity. He remains at CAISI under the Trump administration but has been completely silent publicly -- no statements about the rebrand, mission change, or his work.

Austin Mayron (Acting Director, 2025 - present). Lawyer, not an AI specialist. Former Senior Legal Advisor at USPTO, former Deputy Associate Counsel at the White House. Replacing a tech policy leader with a lawyer signals administrative orientation rather than technical leadership.

Elham Tabassi (CTO, Feb 2024 - Mar 2025). Created NIST's globally adopted AI Risk Management Framework. 25+ years at NIST. TIME 100 Most Influential in AI 2023. Departed for Brookings in March 2025 -- removing the person who built NIST's most important AI governance tool.

Staff: ~71 currently listed (division 5701). 73 NIST probationary employees were fired in March 2025 DOGE-driven layoffs. CAISI was hiring for an AI Research Scientist as of December 2025.

Money and Incentives

Budget: ~$10M/year. FY2026 appropriations: $10M for CAISI (flat from prior years). Biden requested $47.7M for AI at NIST for FY2025; Congress provided far less. Actual spending was reportedly ~$6M in FY2024. The gap between requested and received funding reveals that Congress -- not just the Trump administration -- has been unwilling to fund AI safety at scale.

Comparison with UK AISI: 10x budget gap. UK AISI operates on ~$65M/year (GBP 50M), with GBP 240M committed for 2026-2030. UK AISI also disbursed GBP 40M in alignment research grants and receives in-kind compute from AWS, OpenAI, Anthropic, and Microsoft. The US -- which hosts all the major frontier labs and spends orders of magnitude more on AI R&D -- invests roughly one-tenth what the UK does on AI safety.

Comparison with expert recommendations. FAS proposes a "CAISI+" needing $67-155M/year operating budget and $155-275M in setup costs. Current CAISI funding is 7-15% of the minimum recommended level.

Adjacent funding. NIST-MITRE partnership: $20M (2x CAISI's entire annual budget, directed at cybersecurity). NIST overall: $1.847B in FY2026 (21% increase). NIST has resources for AI when it is framed as cybersecurity, manufacturing, or competitiveness -- not safety.

No external funding. As a federal agency, CAISI has no philanthropic, lab, or industry funding. This eliminates financial conflicts of interest but creates total dependence on political will. No grant-making authority or budget.

Talent competition. Private sector AI researchers can earn near $1M. Government salaries are a fraction of this. Biden's AI Talent Surge aimed for 500+ experts but hired only 250. Only 205 individuals received AI PhDs in 2022. CAISI cannot compete on compensation.

Incentive structure. CAISI's current incentives are to produce work that (1) is politically acceptable to the Trump administration (national security framing, US-vs-China evaluations), (2) does not antagonize frontier AI labs (voluntary cooperation, non-binding standards), and (3) can be cited by Congress as evidence that the US has AI governance capacity (justifying continued appropriations). There is no incentive to produce findings that would slow AI deployment or embarrass American companies.

What Others Say

Academic critique: An arXiv paper argues that AISIs and voluntary commitments function as substitutes for regulation rather than precursors to it. The feedback loop: labs adopt voluntary commitments, governments create AISIs to conduct evaluations, and the existence of this infrastructure is cited as evidence that binding regulation is unnecessary. "AISIs may actively prevent the binding regulation that would be more effective."

Safety org critique (CAIS): "Voluntary commitments are insufficient. Despite even the best intentions, AI companies are susceptible to pressures from profit motives that can erode safety practices." Helen Toner and Tasha McCauley (former OpenAI board members): "AI companies can't be trusted to govern themselves."

From within the administration (Kratsios): Biden "hijacked" NIST for "x-risk evals" and the agency "lost its way." NIST should produce standards; industry should do evaluations.

National security analysts (Lawfare): The DOGE layoffs constitute "a self-imposed, likely long-term brain drain" that "cuts against Trump's AI aims." The talent pool is tiny (205 AI PhDs in 2022). "Supporters of the layoffs may contend that by removing staff with a more risk-averse approach, Trump is increasing odds of rapid AI progress. But that argument assumes NIST sets the nation's AI priorities rather than what NIST actually does, which is technical implementation."

Equity critics (Data & Society): "Years of research aimed at addressing well-documented AI harms are being cast to the wayside as innovation is being framed as the only concept that matters." The shift from multi-stakeholder collaboration to "industry's primary point of contact" excludes civil society and academia.

Pro-acceleration critics (Reason): EA "doomsayers" have been "lobbying to create agencies with utterly alarming authority." CAIP's proposed bill would allow the government to "seize and destroy hardware and software." This critique explains the ideological opposition driving CAISI's direction.

Industry (mixed): Amazon, Meta, Microsoft, OpenAI, and Scale AI all signed letters urging Congress to authorize CAISI -- because it produces voluntary standards they can influence rather than regulations they cannot. This paradoxical support validates the anti-regulatory critique.

What's Absent

  • No public evaluation of American frontier models under Trump administration. DeepSeek and Kimi evaluations were Chinese models. The o1 joint evaluation was Biden-era. If CAISI is testing GPT-5, Claude Opus 4, or Gemini under MOUs, there is no public evidence.
  • Paul Christiano's voice. The person with arguably the most relevant technical expertise in any government AI role has said nothing publicly since joining NIST. No statements on the rebrand, the mission change, or alignment research.
  • No open-source tools. UK AISI released Inspect (widely adopted), ControlArena, and RepliBench. CAISI has released no equivalent evaluation tools.
  • No alignment research program. UK AISI runs the Alignment Project (GBP 27M, 60 grantees, advisory board including Bengio and Shlegeris). CAISI has no visible alignment research.
  • No grant-making function. CAISI cannot fund independent safety research.
  • No secure compute infrastructure. FAS notes CAISI cannot host model weights for deep evaluations without dedicated SL-5 compute.
  • No Frontier AI Trends Report equivalent. No comprehensive public assessment of AI capability and risk trends.
  • No published AISIC consortium deliverables. 280+ members, 5 working groups, but publicly visible outputs are minimal.

Recommended Reading

  1. Elizabeth Kelly, CSIS interview (July 2024) -- 9,400-word transcript where Kelly speaks candidly about AISI's origin, the "nimble startup in government" framing, Paul Christiano's RLHF work, international coordination, and why NIST (not a regulator) is the right home. The most detailed public articulation of the original theory of change. Essential for understanding what was lost in the transition. Watch/read at CSIS

  2. "Anti-Regulatory AI" (arXiv, 2025) -- Academic paper analyzing how AISIs and voluntary commitments may function as substitutes for regulation. The strongest argument that CAISI's existence is counterproductive: it provides the appearance of governance without the substance, and its mere existence is cited as evidence that binding regulation is unnecessary.

  3. FAS CAISI+ Proposal (June 2025) -- What a serious US AI safety body would look like: $67-155M/year, secure compute, emergency NSC reporting. Reveals the scale of under-investment in current CAISI. Read at FAS

  4. Kratsios "back to basics" (FedScoop, July 2025) -- OSTP Director explains the Trump administration's view: Biden "hijacked" NIST for "x-risk evals." NIST should do standards; industry should evaluate. Read at FedScoop

  5. UK AISI brief (for comparison) -- Read the companion analysis of the UK counterpart. Same founding moment, same rebrand away from "safety," but 10x the budget, stronger technical capability, open-source tools, and a GBP 40M grant-making function. The comparison is the most illuminating lens on CAISI's structural limitations.

Show Claude’s analysis
An opinionated read. Read the brief first to form your own view.

Stated Theory of Change

Biden-era AISI (Nov 2023 - June 2025): Advance the science of AI safety through three reinforcing pillars: testing frontier models before deployment, issuing guidance/standards, and conducting fundamental research. The causal chain: AISI evaluates models -> identifies risks -> shares findings with labs (who fix vulnerabilities) and government (which can act on the intelligence) -> AI development becomes safer. Safety enables trust, trust enables adoption, adoption drives innovation. NIST's non-regulatory status was a feature: it allowed cooperation with labs who would resist a regulator.

Trump-era CAISI (June 2025 - present): Develop standards for AI that ensure American competitiveness and national security. Evaluate US models vs adversary models. Serve as "industry's primary point of contact" within the federal government. Guard against "burdensome and unnecessary regulation" of American AI by foreign governments. Ensure US dominance of international AI standards. The causal chain: CAISI produces standards -> industry coalesces around them -> American AI remains globally dominant -> national security is protected.

Revealed Theory of Change

CAISI's actions under the new administration suggest a theory of change that is neither the original safety vision nor the stated competitiveness vision, but something more pragmatic:

What CAISI actually optimizes for:

  1. Institutional survival (continuing to exist despite hostile political environment)
  2. Producing politically acceptable outputs (Chinese model evaluations, cybersecurity standards, agent security work)
  3. Maintaining voluntary lab relationships (MOUs with OpenAI/Anthropic continued)
  4. Avoiding confrontation on anything the administration considers "ideological" (no bias research, no x-risk framing, no deployment recommendations)
  5. Building bipartisan legislative support for codification (the Cantwell-Young bill)

Where actions diverge from both stated theories:

  • The original theory assumed CAISI would test American frontier models and publish findings. Under the new administration, publicly visible evaluations are exclusively of Chinese models.
  • The current theory assumes CAISI enables American competitiveness. But $10M/year (1/10th of the UK budget) cannot meaningfully contribute to US AI leadership when private AI investment exceeds $100B/year.
  • Both theories assume CAISI produces standards. But the AI Action Plan directs CAISI to remove risk categories (misinformation, DEI, climate change) from the AI Risk Management Framework -- which is political interference in standards-setting, not standards development.
  • Paul Christiano -- the person whose entire career is about alignment and x-risk -- remains employed as Head of AI Safety at an organization that has deleted "safety" from its name. Either he is doing meaningful safety work that is invisible to the public, or his position is vestigial.

The revealed theory of change is closest to: CAISI maintains institutional continuity and lab relationships through a politically hostile period, hoping that either (a) the political environment shifts, (b) Congressional codification provides durability, or (c) a crisis creates demand for the expertise that currently has no outlet. This is a bet on surviving to be useful later, not on being useful now.

Key Assumptions

1. Standards without enforcement can meaningfully reduce AI risk.

  • Evidence for: NIST's cybersecurity framework (widely adopted voluntarily) demonstrates that well-crafted standards can shape industry behavior without regulation. Kelly cited NIST's 120-year track record.
  • Evidence against: Voluntary AI safety commitments have already frayed. The arXiv paper argues that voluntary standards substitute for rather than precede binding rules. CAIS: "AI companies are susceptible to pressures from profit motives." Former OpenAI board members: "AI companies can't be trusted to govern themselves."
  • Testable: Track whether CAISI-published standards are adopted by labs, and whether adoption correlates with safer outcomes.
  • If wrong: CAISI is producing compliance theater -- standards that labs cite for PR but do not meaningfully constrain behavior.

2. A $10M/year budget with ~71 staff can keep pace with frontier AI development.

  • Evidence for: Kelly argued CAISI could "do a lot with our budget because of the talent we're attracting" and the 280-member consortium as a force multiplier.
  • Evidence against: UK AISI has 10x the budget and ~250 staff. FAS argues $67-155M/year is the minimum for adequacy. CAISI has no secure compute, no grant-making, and no open-source evaluation tools. Private labs spend billions annually on development; $10M for safety is rounding error.
  • Testable: Compare CAISI's publication output, tool releases, and evaluation cadence with UK AISI's.
  • If wrong: CAISI is structurally incapable of its mission regardless of political environment. This is the most likely failure mode.

3. Chinese model evaluations serve national security.

  • Evidence for: DeepSeek evaluation found genuine security gaps (94% jailbreak compliance). Understanding adversary AI capabilities is a legitimate national security function.
  • Evidence against: Evaluating Chinese models is the politically safe version of AI safety. It lets CAISI demonstrate competence without confronting American labs. The real national security AI risk may be from insufficiently safe American models deployed at scale, not from Chinese models Americans choose to use.
  • Testable: Does CAISI also evaluate American frontier models for national security risks? If not, the national security framing is selective.
  • If wrong: CAISI becomes a competitive intelligence tool masquerading as a safety body.

4. Congressional codification will provide institutional durability.

  • Evidence for: The Cantwell-Young bill has bipartisan support (Cantwell-Young-Hickenlooper-Blackburn) and broad industry backing (Amazon, Meta, Microsoft, OpenAI, SIIA, BSA, ITI). Cantwell: "It is unhealthy for each administration to restart."
  • Evidence against: The bill has been introduced multiple times without passing. Even if codified, Congressional authorization does not guarantee adequate funding (see: NIST's chronic underfunding).
  • Testable: Track legislative progress.
  • If wrong: CAISI remains a political football, subject to redirection with each administration change.

5. Paul Christiano's continued presence means meaningful safety work is happening internally.

  • Evidence for: Christiano is the most technically credentialed alignment researcher in any government worldwide. His continued employment suggests he is doing something.
  • Evidence against: Federal employees can be effectively sidelined without being fired. His public silence is total. The organization he leads within CAISI ("AI Safety") has had "safety" removed from the parent organization's name.
  • Testable: Watch for any CAISI publication or evaluation that addresses alignment, x-risk, or advanced safety concerns.
  • If wrong: Christiano's presence is residual, and the alignment expertise he represents is unused.

Strengths

NIST's institutional legacy. NIST has 120 years of credibility in measurement science and standards. The AI Risk Management Framework (created by Tabassi) is used globally. This institutional brand cannot be easily replicated and provides a foundation for future expansion.

Lab relationships survived the transition. MOUs with OpenAI and Anthropic were renewed under the Trump administration. Whatever happens publicly, there appears to be an ongoing working relationship between CAISI and the major labs. This continuity is valuable.

Christiano's presence. Having the inventor of RLHF -- someone who understands AI alignment at a fundamental level -- inside the US government is significant even if his work is currently constrained. If a political window opens, the expertise is in place.

Bipartisan codification momentum. The Cantwell-Young bill has genuine cross-party and cross-industry support. If passed, it would be the most important structural improvement possible for CAISI.

Real technical work continues. Agent security standards, red-teaming competitions (with UK AISI and Gray Swan), privacy-preserving evaluations (OpenMined CRADA), and evaluation methodology development are substantive technical contributions regardless of political framing.

National security framing as camouflage. The DeepSeek evaluation, TRAINS taskforce, and cybersecurity partnerships allow CAISI to do safety-adjacent work under a politically acceptable label. National security is the language this administration speaks; using it to maintain technical capacity is pragmatic.

Weaknesses and Risks

Catastrophic underfunding. $10M/year is not a serious budget for evaluating frontier AI systems. The UK spends 10x more. FAS recommends 7-15x more as the minimum. At current funding, CAISI is structurally incapable of its stated mission -- whether that mission is safety evaluation or standards development.

No enforcement power, no deployment authority. Same fundamental weakness as UK AISI. CAISI can identify problems but cannot compel action. Non-binding MOUs with non-public terms and no published evaluation results means there is no way to verify impact.

Political redirection. The rebrand from "safety" to "standards and innovation" is not cosmetic. Directing CAISI to remove risk categories from the AI RMF (misinformation, DEI, climate change) is political interference in what should be scientific standards-setting. This damages NIST's 120-year reputation for neutrality.

Brain drain. Kelly (director), Tabassi (CTO, 25-year NIST veteran), and an unknown number of DOGE-driven layoffs have hollowed out leadership and institutional memory. Replacing them with a lawyer (Mayron) as acting director signals a shift from technical to administrative orientation.

Chinese-models-only evaluation pattern. If CAISI only evaluates Chinese models and never publishes safety evaluations of American frontier models, it becomes a competitive intelligence tool rather than a safety body. This would validate the anti-regulatory critique that AISIs exist to provide the appearance of governance without the substance.

Invisible outputs. AISIC consortium has 280+ members but few published deliverables. MOU evaluations are non-public. Christiano's work is invisible. Without transparency, CAISI's impact is unverifiable.

Substitution effect. The strongest critique: CAISI's existence may prevent binding regulation. Labs cite their cooperation with CAISI as evidence that they are being responsible, reducing pressure for mandatory rules. If this substitution effect is real, CAISI is net negative for AI safety.

Cross-References

UK AI Security Institute (AISI): Founded the same week at the same summit. Both renamed from "safety" in 2025. But the UK has ~10x the budget, ~3x the staff, open-source tools (Inspect), GBP 40M in alignment research grants, and a chief scientist (Irving) who speaks publicly with extraordinary candor. CAISI is the lesser version in every measurable dimension except one: it is located in the country where all the frontier labs are headquartered.

ARC (Alignment Research Center): Christiano founded ARC before joining NIST. ARC continues as an independent alignment research organization (now led by others). The question is whether Christiano's work at CAISI has any relationship to ARC's research agenda.

METR (Model Evaluation and Threat Research): Independent evaluation org that Christiano helped launch (spun out of ARC Evals). Does similar frontier model testing outside government. Can be more adversarial because it doesn't depend on lab goodwill.

OpenAI / Anthropic: The labs CAISI evaluates under MOUs. The power asymmetry is extreme: labs control access, MOUs are voluntary, and CAISI provides evaluation services. Whether CAISI can identify and communicate problems the labs don't want to hear about is the central open question.

Center for AI Policy (CAIP): Was the main US AI safety policy advocacy organization. Called DOGE layoffs evidence of "defy[ing] common sense." Then shut down in May 2025. Its closure reduces the external pressure for CAISI to do more.

FAS / CSIS / Brookings: Think tanks that have produced the most substantive external analysis of CAISI. FAS's CAISI+ proposal, CSIS's network analysis, and Brookings (where Tabassi now works) provide the policy infrastructure for future CAISI expansion.

What Would Change This Assessment

Upward:

  • CAISI publishes a safety evaluation of an American frontier model (not just Chinese models). This would demonstrate that the safety evaluation function survived the political transition.
  • Congressional codification passes with meaningful funding authorization ($50M+/year).
  • Christiano publishes research or speaks publicly about alignment work at CAISI. This would indicate that the safety mission persists internally.
  • CAISI releases open-source evaluation tools comparable to UK AISI's Inspect.
  • Evidence emerges that CAISI evaluations have privately influenced a deployment decision.

Downward:

  • Christiano leaves CAISI. This would confirm that the safety function has been fully abandoned.
  • CAISI's budget is cut or the codification bill fails.
  • AI Action Plan directions are implemented to gut the AI Risk Management Framework (removing entire risk categories).
  • A model evaluated (or not evaluated) by CAISI causes significant real-world harm.
  • Other countries withdraw from the International Network due to US politicization.

Self-Critique

Weakest claim: That CAISI is doing nothing meaningful internally. I do not have visibility into CAISI's day-to-day work. Christiano's presence and the continued MOU relationships suggest at least some safety work persists. My assessment may overweight the visible political narrative (rebrand, layoffs, Kratsios) and underweight invisible internal work. If CAISI is quietly evaluating American frontier models and sharing findings with labs, the practical impact is significantly higher than the public record suggests.

Potential bias: I may be excessively influenced by the UK AISI comparison. The UK is a much smaller country with a different political system; its AISI model may not be directly transferable to the US context. The US has NIST's broader institutional infrastructure, inter-agency coordination (TRAINS taskforce), and a more developed legislative pipeline. These structural differences may matter more than the budget gap.

What I should have checked but could not: Paul Christiano's current work (he is completely silent). Internal CAISI documents. Unpublished MOU evaluation results. Congressional briefing transcripts. Detailed FY2026 budget allocation within NIST. Forum posts on EA Forum and LessWrong about the Christiano appointment and CAISI transition (blocked domains).

What a thoughtful disagreer would say: "You are comparing CAISI to an idealized version of what a safety institute should be, rather than evaluating it within the constraints of American politics. The US has never had a binding AI safety regulator, and CAISI -- even in its weakened state -- represents more government technical capacity on AI than existed before November 2023. The codification bill, the continued lab MOUs, and the national security framing are pragmatic adaptations that keep the institutional infrastructure alive for a future administration that may re-empower it. Christiano staying is the tell: if the safety mission were truly dead, he would have left."

This is a reasonable argument. But it requires accepting that the entire purpose of a $10M/year federal body with 71 staff is to exist as a placeholder for a future political moment -- and that in the meantime, the world's most advanced AI systems are being developed and deployed without meaningful government evaluation. Whether that bet on future relevance is justified depends on how fast AI capabilities advance. If transformative AI arrives before the next political window opens, CAISI will have been an expensive observer of a catastrophe it lacked the power to prevent.

Single weakest claim: That the anti-regulatory substitution effect is real. I lean on the arXiv paper's argument that CAISI's existence crowds out binding regulation, but this is a theoretical claim about counterfactual policy dynamics. It is possible that without CAISI, there would be even less AI governance, not more. The claim is important but unverified.

Information that would most change my view: Any evidence that CAISI has conducted a safety evaluation of an American frontier model under the Trump administration and communicated findings to the lab. This would show the core safety function persists despite the political narrative.

Connected to (12)

Signature SciencecollaboratorMITRE Corporationcollaborator
General Services Administrationcollaborator
Gray Swancollaborator
OpenMinedcollaborator
Brookings Institutionstaff to · Elham Tabassi
Alignment Research Centerstaff from · Paul Christiano
Anthropicevaluates
OpenAIevaluates
OpenAIstaff from · Paul Christiano
UK AI Security Institutecollaborator
METRcollaborator · Paul Christiano
Sources (80)
Every URL that was read during research.
  1. 1.Artificial intelligence safety institute - Wikipediaen.wikipedia.org
  2. 2.Center for AI Standards and Innovation (CAISI)nist.gov
  3. 3.Renaming the US AI Safety Institute Is About Priorities, Not Semanticstechpolicy.press
  4. 4.A Self-Imposed AI Brain Drainlawfaremedia.org
  5. 5.AI safety advocates slam Trump administration’s reported targeting of standards agency | Fortunefortune.com
  6. 6.The U.S. Vision for AI Safety: A Conversation with Elizabeth Kelly, Director of the U.S. AI Safety Institutecsis.org
  7. 7.The US Government's AI Safety Gambit: A Step Forward or Just Another Voluntary Commitment?techpolicy.press
  8. 8.U.S. AI Safety Institute Signs Agreements Regarding AI Safety Research, Testing and Evaluation With Anthropic and OpenAInist.gov
  9. 9.Pre-Deployment Evaluation of OpenAI's o1 Modelnist.gov
  10. 10.Commerce Transforms Center for AI Standards and Innovation - Americans for Responsible Innovationari.us
  11. 11.CAIP Responds to Reported Mass Layoffs at NIST's AI Safety Institute | Center for AI Policy | CAIPcenteraipolicy.org
  12. 12.U.S. Commerce Secretary Gina Raimondo Announces Expansion of U.S. AI Safety Institute Leadership Teamnist.gov
  13. 13.Trump administration rebrands AI Safety Institutefedscoop.com
  14. 14.U.S. Commerce Secretary Gina Raimondo Announces Key Executive Leadership at U.S. AI Safety Institutenist.gov
  15. 15.Announcing the "AI Agent Standards Initiative" for Interoperable and Secure Innovationnist.gov
  16. 16.Insights into AI Agent Security from a Large-Scale Red-Teaming Competitionnist.gov
  17. 17.Announcement: CAISI signs CRADA with OpenMined to Enable Secure AI Evaluationsnist.gov
  18. 18.NIST Launches Centers for AI in Manufacturing and Critical Infrastructurenist.gov
  19. 19.Executive Order 14110 - Wikipediaen.wikipedia.org
  20. 20.NIST, MITRE announce $20 million research effort on AI cybersecuritycyberscoop.com
  21. 21.NIST AI Consortiumnist.gov
  22. 22.Elizabeth Kellytime.com
  23. 23.Paul Christiano - Wikipediaen.wikipedia.org
  24. 24.The AI Safety Institute International Network: Next Steps and Recommendationscsis.org
  25. 25.US AI Safety Institute (USAISI) - Future of Life Institutefutureoflife.org
  26. 26.The authoritarian side of effective altruism comes for AIreason.com
  27. 27.Lawmaker eyes bill to codify NIST AI center | Federal News Networkfederalnewsnetwork.com
  28. 28.Bipartisan Senators Revive Bill to Codify Federal AI Standards Centermeritalk.com
  29. 29.Legislation to codify NAIRR, authorize safety body among nine AI bills passed by House panelfedscoop.com
  30. 30.US Dept. of Commerce nixes ‘safety’ in NIST AI institute rebrandtechnical.ly
  31. 31.FACT SHEET: U.S. Department of Commerce & U.S. Department of State Launch the International Network of AI Safety Institutes at Inaugural Convening in San Francisconist.gov
  32. 32.U.S. AI Safety Institute Establishes New U.S. Government Taskforce to Collaborate on Research and Testing of AI Models to Manage National Security Capabilities & Risksnist.gov
  33. 33.NIST adds 5 new members to its AI Safety Institutenextgov.com
  34. 34.National AI safety group and CHIPS for America at risk with latest Trump administration firingstechnical.ly
  35. 35.OpenAI and Anthropic agree to let U.S. AI Safety Institute test and evaluate new modelscnbc.com
  36. 36.Technical Blog: Strengthening AI Agent Hijacking Evaluationsnist.gov
  37. 37.Cantwell, Young, Hickenlooper and Blackburn Reintroduce Bill to Ensure U.S. Leads Global AI Innovationcommerce.senate.gov
  38. 38.Why the U.S. Launched an International Network of AI Safety Institutestime.com
  39. 39.CAISI Issues Request for Information About Securing AI Agent Systemsnist.gov
  40. 40.Paul Christianonist.gov
  41. 41.Staffnist.gov
  42. 42.OSTP’s Kratsios touts ‘incredible’ interest in Tech Force, defends Trump science and tech cutsfedscoop.com
  43. 43.Michael Kratsios - Wikipediaen.wikipedia.org
  44. 44.CAISI Evaluation of DeepSeek AI Models Finds Shortcomings and Risksnist.gov
  45. 45.Anti-Regulatory AI: How “AI Safety” is Leveraged Against Regulatory Oversightarxiv.org
  46. 46.AI Safety Institutes Crop Up Around the World: Helpful or Hype?techpolicy.press
  47. 47.CAISI signs MOU with GSA to boost AI evaluation science in federal procurement through USAinist.gov
  48. 48.Exclusive Interview: U.S. AI Safety Institute’s Elizabeth Kelly on U.S. Leadership in AI Innovation and Safetywashingtonainetwork.com
  49. 49.CAISI Evaluation of Kimi K2 Thinkingnist.gov
  50. 50.AI Safety Newsletter #36: Voluntary Commitments are Insufficientnewsletter.safe.ai
  51. 51.Apply on USAJobs: Open CAISI Position for an AI Research Scientistnist.gov
  52. 52.The Global Landscape of AI Safety Institutes — All Tech Is Humanalltechishuman.org
  53. 53.U.S. AI Safety Institute Consortium Holds First Plenary Meeting to Reflect on Progress in 2024 & Outline Research Priorities for 2025nist.gov
  54. 54.Unknownnist.gov
  55. 55.FY2025 National Institute of Standards and Technologyaip.org
  56. 56.Unpacking the White House AI Action Plan with OSTP Director Michael Kratsios | The AI Policy Podcast | CSIS Podcastscsis.org
  57. 57.Regulating Artificial Intelligence Must Not Undermine NIST’s Integritytechpolicy.press
  58. 58.International Network for Advanced AI Measurement, Evaluation, and Science Publishes Consensus Areas on Practices for Automated Evaluationsnist.gov
  59. 59.New Report: Expanding the AI Evaluation Toolbox with Statistical Modelsnist.gov
  60. 60.NIST would ‘have to consider’ workforce reductions if appropriations cut goes throughfedscoop.com
  61. 61.NIST cuts would put US behind AI eightball, tech groups warn Commerce secretaryfedscoop.com
  62. 62.Unknownnist.gov
  63. 63.Seven Key Features of Trump’s AI Plan - Americans for Responsible Innovationari.us
  64. 64.CAISI Works with OpenAI and Anthropic to Promote Secure AI Innovationnist.gov
  65. 65.NIST Center for AI Standards and Innovation Selects Signature Science, LLC for AI Safety Basic Ordering Agreementsignaturescience.com
  66. 66.America's AI Action Plan: What's In, What's Out, What's Next | Insights | Holland & Knighthklaw.com
  67. 67.AI Agent Standards Initiativenist.gov
  68. 68.The US Has Committed to Spend Far Less Than Peers on AI Safety | Center for AI Policy | CAIPcenteraipolicy.org
  69. 69.A National Center for Advanced AI Reliability and Securityfas.org
  70. 70.Understanding the First Wave of AI Safety Institutes: Characteristics, Functions, and Challenges — Institute for AI Policy and Strategyiaps.ai
  71. 71.Elham Tabassi to join Brookings as the new Director of Artificial Intelligence and Emerging Technology Initiative | Brookingsbrookings.edu
  72. 72.Shaping AI Standards to Protect America’s Most Vulnerable: Tech Innovatorstechpolicy.press
  73. 73.Unpacking New NIST Guidance on Artificial Intelligencetechpolicy.press
  74. 74.NIST AISI Signs First-Ever AI Testing Pacts With Anthropic, OpenAImeritalk.com
  75. 75.News and Updatesnist.gov
  76. 76.U.S. Secretary of Commerce Gina Raimondo Releases Strategic Vision on AI Safety, Announces Plan for Global Cooperation Among AI Safety Institutesnist.gov
  77. 77.NIST unveils strategic vision for AI safety worknextgov.com
  78. 78.NIST AI Consortiumnist.gov
  79. 79.Kratsios: NIST needs ‘to go back to basics’ on standards for AI, not safety evaluationfedscoop.com
  80. 80.White House holds back on national AI framework specificsrollcall.com