← AI Safety Orgs

Allen Institute for AI (AI2)

Research

Allen Institute. Open-source academic AI.

Founded
2014
HQ
Seattle, WA
Team
275
Structure
501(c)(3) nonprofit
Model
Grants

Theory of Change

AI2's theory of change is that radical openness in AI development is the primary mechanism for safety. CEO Ali Farhadi (before departing March 2026): "If we keep AI closed, or push more towards closing AI, this would be the most dangerous thing that could actually happen to AI and also to humanity." The causal chain: fully open models (weights + data + code + checkpoints) enable external scrutiny, community-driven safety research, reproducible science, and prevent dangerous concentration of AI power in a few corporations.

Farhadi explicitly framed safety as a technical problem: "All of them are technical problems more than policy or regulation problems because we're having those conversations because we still don't have enough understanding of these models... policy won't solve the problems that we have today." He argued that having the wrong policy "would be way worse than not having a policy."

Nathan Lambert, AI2's post-training lead, frames the mission more strategically: "We are trying to accelerate the science of language models so academics, regulators, and everyone else can keep up with the many parties trying to accelerate the commercialization of AI." He describes AI2 as "what OpenAI originally seemed like they were going to be."

AI2's founding CEO Oren Etzioni (2013-2022) published "AI Won't Exterminate Us -- It Will Empower Us" in 2014, which Luke Muehlhauser (then MIRI executive director) publicly rebutted. This skepticism of existential risk is part of AI2's intellectual DNA -- the organization has never operated from an x-risk framework.

What They Do

Fully open language models. OLMo is AI2's flagship model family. OLMo 3 (Nov 2025) released 7B and 32B parameter models with the complete "model flow" -- training data (Dolma 3, 9.3T tokens), code, weights, and all intermediate checkpoints. Claimed to be "the strongest fully open thinking model" at its scale. Trained on up to 1,024 H100 GPUs.

Open post-training recipes. Tulu 3 (Nov 2024) is an open post-training pipeline (SFT, DPO, RLVR, safety evaluation) that works on any base model including Meta's Llama. This may be more impactful than OLMo itself -- it democratizes the process that makes models useful while most labs keep it secret.

Safety toolkit. WildTeaming identifies 4.5x more jailbreak attacks than prior systems. WildJailbreak provides 262K safety training examples. WildGuard moderation tool outperforms GPT-4 on adversarial harmfulness detection by 4.8%. All open-source. These address the misuse/moderation layer of safety, not alignment or x-risk.

Training data transparency. OlmoTrace traces model outputs back to training data across 4.6T tokens in real time -- genuinely novel, impossible for closed models. Dolma is the largest fully open training dataset for LLMs.

Multimodal and vision. Molmo (Sep 2024) multimodal models outperform models 12x their size using dramatically less training data. MolmoWeb (Mar 2026) open-source web agent outperforms GPT-4o on web navigation.

AI for science. Semantic Scholar (200M+ papers, 8M monthly users). Asta platform for agentic scientific research at 194 institutions. Cancer AI Alliance with Fred Hutch ($20M commitment).

Conservation. EarthRanger wildlife monitoring at 500+ sites in 60 countries. Skylight anti-illegal-fishing. OlmoEarth for satellite/climate AI.

Publication volume. 1,000+ total papers, 111 models released in 2024. "More best paper awards than any other institutes of this size." Historic: ELMo (2018, 8,000+ citations), AllenNLP, Aristo.

Key People

Peter Clark (interim CEO, second stint). Founding AI2 member, led the Aristo project. Caretaker leader, not a visionary hire. Compensation: $1M+ (2024). His first interim stint was Oct 2022 - Jul 2023.

Nathan Lambert (post-training lead). The most publicly visible AI2 researcher. RLHF expert, author of the RLHF Book, writes the Interconnects newsletter. Drives the ATOM Project (American Truly Open Models). Candidly admits: "I mostly use closed models" and that AI2 needs to "actually dogfood the models" to take the next step.

Notable departures. Ali Farhadi (CEO), Hanna Hajishirzi (OLMo co-lead, OMAI co-PI), Ranjay Krishna (Molmo creator), Sophie Lebrecht (COO) -- all left for Microsoft Superintelligence team in March 2026. Yejin Choi (MacArthur Fellow, common-sense reasoning) left in 2024. Oren Etzioni (founding CEO) left in 2022 for TrueMedia.org. The pattern of serial leadership departures is the organization's most visible vulnerability.

~250-300 employees. Salaries doubled from $22.8M (2021) to $47.3M (2022).

Money and Incentives

Revenue: $204M (2024), nearly quadrupled from $53M in 2021. Virtually 100% from contributions (Allen estate/FFST). Zero product revenue, zero Coefficient Giving/Open Phil funding.

Primary funder: Fund for Science and Technology (FFST). $3.1B endowment from Paul Allen's estate, launched 2025. Led by Dr. Lynda Stuart (physician-scientist). FFST board includes Dr. Eric Horvitz (Microsoft Chief Scientific Officer). FFST is shifting from annual grants to proposal-based funding and favoring applied AI over frontier model research. A spokesperson said "broader program strategies are still under development."

The funding shift drove the brain drain. As Complete AI reported: "This funding shift explains why researchers focused on model development chose to move to Microsoft, where resources for that work are substantially larger." AI2 confirmed all 2026 programs remain fully funded, but the future of frontier model work is uncertain.

$152M NSF/NVIDIA OMAI grant. $75M NSF + $77M NVIDIA, 5-year, for building fully open AI models for science. PI: Noah Smith. This partially insulates open model work from FFST shifts, but the departure of co-PI Hajishirzi raises continuity questions.

Compensation: Farhadi $2.08M (2024), Clark $1.0M, Dumas (GC) $910K. Competitive for Seattle nonprofit but cannot match Microsoft/Google signing bonuses. The compensation gap is a structural talent retention vulnerability.

Incentive analysis. AI2 is structurally unusual: a private foundation funded almost entirely by a single estate, with no commercial revenue, no advertising platform, and no shareholders. This means its research priorities are set by the funder (FFST/Jody Allen), not the market. The key incentive misalignment is between AI2's research ambitions (frontier open models) and FFST's emerging preference (applied AI for science and conservation). AI2 cannot simply "find other funders" -- its entire operating model depends on the Allen estate.

What Others Say

The structural critique (from AI2's own people): Lambert in the Turing Post interview: "Open models have fewer resources, and resources normally determine the outcome... There's no reason to think open models will catch up." He estimates open models are 6-9 months behind closed ones. Epoch AI finds a consistent ~15-month lag in training compute. This challenges AI2's core claim that openness keeps pace with the frontier.

The security critique: IEEE Spectrum article by David Evan Harris (former Meta employee, UC Berkeley): "Open-Source AI Is Uniquely Dangerous." Calls for licensing, liability, and pausing releases of open models. Argues "unsecured AI poses an enormous risk that we are not yet able to contain." Global Center for AI identifies open models as vectors for deepfakes, cyberattacks, and terrorism recruitment.

The brain drain as verdict: The simultaneous departure of CEO, top researcher, COO, and a director for Microsoft's Superintelligence team is the market's judgment on AI2's ability to compete. Microsoft offered the compute and resources that FFST's funding shift was removing.

The democratization critique: Cornell research: open-source AI "do not themselves 'democratize' access to AI, or enable outside scrutiny, as both require resources concentrated in the hands of a few large companies."

Defense: The ai-frontiers.org argument that "precaution shouldn't keep open-source AI behind the frontier" and that restricting open models creates "digital feudalism" where users depend on a few tech firms. Lambert's honest case: open models are the engine for the next decade of AI research, even if they never reach the frontier.

What's Absent

No formal AI safety strategy document -- nothing comparable to Anthropic's RSP or DeepMind's Frontier Safety Framework. No engagement with core alignment concerns (deceptive alignment, mesa-optimization, power-seeking). Safety work is entirely at the misuse/moderation layer. No pre-release safety evaluation by external organizations (METR, Apollo, UK AISI). No assessment of catastrophic capabilities in OLMo models. No published conflict of interest policy despite Microsoft board members at both AI2 and FFST. No permanent CEO or announced search. No stated threshold at which AI2 would restrict model releases. Very little direct criticism from the AI safety community -- AI2 appears to be simply not on the x-risk radar.

Recommended Reading

  1. Nathan Lambert -- Turing Post interview (Feb 2026) -- The most candid source. Lambert admits open models won't catch up, that he uses closed models himself, and that AI2 needs to dogfood its own models. The honest case for open models as research infrastructure, not frontier competition. turingpost.com/p/nathanlambert

  2. David Evan Harris -- "Open-Source AI Is Uniquely Dangerous" (IEEE Spectrum, Jan 2024) -- The strongest counterargument to AI2's core approach. Calls for licensing and liability. spectrum.ieee.org/open-source-ai-2666932122

  3. Ali Farhadi -- The Letter Two profile (Nov 2024) -- Farhadi's most candid interview before departing. Covers the mission, the evaluation crisis, and why he thinks AGI is "marketing jargon." thelettertwo.com

  4. Epoch AI -- "Open vs. closed AI" (Nov 2024) -- Rigorous data on the capability gap between open and closed models. Essential context. epoch.ai/blog/open-models-report

Show Claude’s analysis
An opinionated read. Read the brief first to form your own view.

Stated Theory of Change

AI2's stated theory is that radical openness in AI development is the most important mechanism for ensuring AI safety and benefiting humanity. The specific causal chain:

  1. Build fully open language models (weights + training data + code + checkpoints)
  2. This enables external scrutiny, reproducible science, and community-driven safety research
  3. Open models prevent dangerous power concentration in a few corporations
  4. Safety problems are fundamentally technical, not policy problems
  5. Therefore, solving safety requires understanding models deeply, which requires openness
  6. Result: a safer, more democratic AI ecosystem

This is complemented by a secondary theory: AI2's conservation and science applications (EarthRanger, Asta, Cancer AI Alliance) demonstrate AI's positive potential and produce direct social good.

Revealed Theory of Change

AI2's actions are largely consistent with its stated theory, which is unusual. They genuinely release everything -- data, code, weights, checkpoints, intermediate artifacts. Tulu 3 and the Safety Toolkit are real contributions to open safety. OlmoTrace is genuinely novel. The publication volume (1,000+ papers, 111 models in 2024) is exceptional.

Where actions diverge from stated theory:

Safety work is narrow. AI2's safety research addresses misuse (jailbreaking, content moderation, adversarial attacks) but not alignment. There is no work on deceptive alignment, mesa-optimization, reward hacking at scale, power-seeking, or any of the failure modes that the alignment community considers most dangerous. AI2 treats "safety" as "preventing bad actors from misusing current models" rather than "ensuring advanced systems remain aligned with human values."

No threshold for restricting releases. AI2's stated position is that openness is always better. They have not articulated any capability level at which they would restrict model access. This means their theory of change has no built-in response to a scenario where open models become genuinely dangerous.

The people who built the vision left. Farhadi, who articulated the "openness as safety" philosophy most forcefully, departed for Microsoft's Superintelligence team. The departure of the CEO, top researcher, COO, and a director for a lab that will likely keep much of its work closed undermines the credibility of the stated theory.

Lambert's candid admissions. AI2's most visible researcher admits he uses closed models for his own work, that open models will never catch up, and that AI2 doesn't adequately "dogfood" its own models. This is an honest assessment that reveals the gap between the open-model ideal and the current reality.

Key Assumptions

Assumption 1: Openness enables effective safety research.

  • Evidence for: OlmoTrace demonstrates capabilities impossible with closed models. Open training data allows contamination analysis. WildTeaming/WildGuard were developed using open model access.
  • Evidence against: The most significant AI safety research (Anthropic's interpretability, DeepMind's alignment work) happens at closed labs with proprietary access. No major alignment breakthrough has come from studying open models.
  • Testable: Has any safety-critical discovery been made because OLMo was fully open that couldn't have been made with API access?
  • If wrong: AI2's core theory of change loses its safety justification and becomes a pure public-good argument.

Assumption 2: Open models will remain near the frontier.

  • Evidence for: Epoch AI data shows ~15-month lag has been relatively stable. The OMAI grant and compute from NVIDIA/NSF provide resources.
  • Evidence against: Lambert himself says "there's no reason to think open models will catch up" and estimates the gap at 6-9 months. Meta, the largest open-model producer, is pulling back from frontier releases.
  • Testable: Track benchmark performance and compute usage of OLMo relative to GPT/Claude/Gemini.
  • If wrong: AI2's models become increasingly irrelevant as the frontier moves away. Safety through openness only works if the open models are capable enough to study.

Assumption 3: AI2's funding will sustain frontier model research.

  • Evidence for: All 2026 programs fully funded. $152M OMAI grant provides 5 years of guaranteed compute.
  • Evidence against: FFST explicitly shifting to applied AI. The brain drain was driven by this funding shift. The OMAI grant covers a specific project, not AI2's general model development.
  • Testable: Track FFST's annual grants to AI2 vs. total FFST deployment.
  • If wrong: AI2 becomes primarily an applied AI organization (science, conservation) and abandons frontier open model development.

Assumption 4: Safety is primarily a technical problem solved through model understanding.

  • Evidence for: Many current safety failures (jailbreaks, hallucinations, biases) are indeed technical. OlmoTrace-type tools provide genuine insight.
  • Evidence against: The most concerning AI risks (misuse by state actors, power concentration, loss of control over advanced systems) have large governance and coordination components. Technical understanding alone does not prevent deliberate misuse of open models.
  • If wrong: AI2's approach of "understand the models better" is necessary but not sufficient. Policy, governance, and access control become essential complements.

Strengths

Genuine commitment to openness. Unlike Meta (which used open-source as a competitive strategy and is now pulling back), AI2 has no commercial incentive for openness. They release everything because they believe it's right. This is credible and rare.

Exceptional research output. 1,000+ papers, 111 models in 2024, multiple best-paper awards. ELMo (8,000+ citations), Semantic Scholar (8M monthly users), OlmoTrace (genuinely novel). The quality is real, not inflated.

Practical safety tools. The Safety Toolkit (WildTeaming, WildGuard, WildJailbreak) addresses a genuine gap in the open-model ecosystem. No one else is producing open safety tools at this quality.

Financial independence from commercial pressure. As a private foundation funded by an endowment, AI2 does not optimize for quarterly earnings, advertising revenue, or market share. Research priorities are set by mission, not by revenue.

Strong institutional partnerships. UW Allen School, NSF, NVIDIA, Fred Hutch. The $152M OMAI grant demonstrates federal confidence in AI2's mission.

Nathan Lambert's public communication. His newsletter (Interconnects), RLHF Book, and public speaking make AI2's work accessible and honest. His candid admissions about open model limitations build more credibility than they undermine.

Weaknesses and Risks

Existential funding risk. FFST is shifting priorities. AI2 is almost entirely dependent on a single funding source (Allen estate). If FFST fully deprioritizes frontier models, AI2's core mission collapses. The $152M OMAI grant is a 5-year buffer, not a permanent solution.

Catastrophic brain drain. The March 2026 departures are devastating. CEO + top researcher + COO + director in a single month, all to one competitor. This suggests structural problems (compensation, resources, strategic direction) that will recur.

No interim-to-permanent leadership path. Peter Clark is serving as interim CEO for the second time. AI2 has never successfully recruited a permanent CEO who stayed -- Etzioni left, Farhadi left. The organization may be structurally unable to retain a CEO because the role is constrained by the Allen estate's governance.

Safety work is too narrow. Misuse-layer safety (jailbreaks, content moderation) is necessary but does not address the alignment and x-risk concerns that most of the AI safety community considers most important. AI2 is not working on the problems that would matter most if AI systems become transformatively capable.

Microsoft governance conflicts. Peter Lee (Microsoft Research VP) sits on AI2's board. Eric Horvitz (Microsoft CSO) sits on FFST's board. Microsoft just hired AI2's CEO and top researchers. This creates an extraordinary conflict of interest that is not publicly addressed.

Open models may not enable meaningful safety research. The strongest alignment research happens at labs with access to frontier closed models (Anthropic, DeepMind). If the most important safety problems require frontier-scale models to study, AI2's open but sub-frontier models may not be relevant to the safety research that matters most.

Cross-References

Meta AI: Both pursue "open" AI but with fundamentally different motivations. Meta's openness was a competitive strategy ("commoditize your complement") and is being abandoned as the business case weakened. AI2's openness is philosophically motivated and faces financial rather than strategic pressure. AI2 is what Meta pretended to be.

Anthropic/OpenAI/DeepMind: The frontier labs that AI2's models trail by 6-15 months. Their safety work (RSPs, alignment research, interpretability) is fundamentally different from AI2's safety toolkit. AI2 addresses misuse; they address alignment. The approaches are complementary in theory, but there is almost no interaction between them.

Hugging Face/EleutherAI: Fellow travelers in the open-model ecosystem. Hugging Face provides the platform; EleutherAI provides research; AI2 provides the highest-quality fully open models. These organizations form a loose coalition with AI2.

NVIDIA: Both funder ($77M OMAI grant) and infrastructure provider. NVIDIA's Nemotron open models are competitors to OLMo, but NVIDIA's incentive (sell more GPUs) is served by a thriving open model ecosystem.

What Would Change This Assessment

  • FFST publicly commits to long-term frontier model funding. This would remove the existential financial risk and signal that AI2's open model work has a sustainable future.
  • AI2 hires a permanent CEO with strong AI safety credentials. A leader from the alignment community would signal a genuine expansion of AI2's safety vision beyond misuse.
  • An alignment breakthrough using open models. If studying OLMo's training data or internals leads to a discovery about alignment that closed models couldn't have produced, this would validate AI2's core theory of change.
  • Open models reach dangerous capability levels. If OLMo or a derivative model is used for a significant harm that closed models prevented, this would challenge AI2's unconditional openness.
  • Meta permanently exits open models. This would make AI2 and NVIDIA the primary open-model producers, dramatically increasing AI2's strategic importance.

Self-Critique

What sources should I have checked but didn't:

  • Glassdoor reviews in detail (especially post-March 2026)
  • GeekWire's blocked articles (Farhadi departure, broader impact) -- only search snippets available
  • Fast Company's blocked Farhadi interviews
  • Internal documents about the FFST transition
  • Any communications between AI2 and AI safety organizations (if they exist)

Where this analysis is potentially biased:

  • I may be too harsh on AI2's safety work because I'm evaluating it against an x-risk standard that AI2 never claimed to meet. AI2's theory of change is about democratizing AI, not preventing existential catastrophe. By their own standard, they may be succeeding.
  • I may underweight the value of practical safety tools (WildGuard, WildTeaming) relative to theoretical alignment work. In the short term, preventing misuse of current models may prevent more actual harm than speculative alignment research.

What a thoughtful person who disagrees would say: "AI2's approach is correct precisely because it's different from the x-risk framing. The actual near-term risks of AI are misuse, power concentration, and lack of transparency -- not misaligned superintelligence. AI2 is addressing these real risks while the alignment community chases speculative ones. Open models have already enabled more safety research than any amount of RSP compliance."

Single weakest claim: That the Microsoft governance conflicts (Lee on AI2 board, Horvitz on FFST board) represent a meaningful problem rather than typical nonprofit board composition in the Seattle tech community. These connections may be more benign than I've characterized them.

What information would most change my view: Details of FFST's actual 5-year plan for AI2 funding. If FFST commits to sustained frontier model research funding, the existential financial risk disappears and AI2's position becomes much stronger.

Connected to (9)

Fred Hutch Cancer Centercollaborator
Madrona Venture Groupboard overlap · Hope Cochran
Microsoftstaff to · Ali Farhadi, Hanna Hajishirzi, Ranjay Krishna, Sophie Lebrecht
Microsoftboard overlap · Peter Lee
Hugging Facecollaborator
University of Washingtoncollaborator
AI2 Incubatorspun off from
TrueMedia.orgstaff to · Oren Etzioni
Applestaff to · Ali Farhadi
Sources (60)
Every URL that was read during research.
  1. 1.Allen Institute for AI - Wikipediaen.wikipedia.org
  2. 2.About us | Ai2allenai.org
  3. 3.Research principles | Ai2allenai.org
  4. 4.Olmo from Ai2allenai.org
  5. 5.AI2: The Pioneering Open AI Research House Paul Allen Builtthelettertwo.com
  6. 6.IA Summit 2023 Ali Farhadi: The State of Open-Source Models & Importance of an Open AI Ecosystemmadrona.com
  7. 7.Aboutff-st.org
  8. 8.NSF and NVIDIA award Ai2 a combined $152M to support building a national level fully open AI ecosystem | Ai2allenai.org
  9. 9.NSF and NVIDIA partnership enables Ai2 to develop fully open AI models to fuel U.S. scientific innovationnsf.gov
  10. 10.Open-Source AI Is Uniquely Dangerousspectrum.ieee.org
  11. 11.Microsoft hires top AI researchers from Allen Institute for AI for Suleyman's Superintelligence teamthe-decoder.com
  12. 12.About us | Ai2allenai.org
  13. 13.Why I build open language modelsinterconnects.ai
  14. 14.Semantic Scholar - Wikipediaen.wikipedia.org
  15. 15.Open Language Models (OLMos) and the LLM landscapeinterconnects.ai
  16. 16.AI for the planet | Ai2allenai.org
  17. 17.Tulu | Ai2allenai.org
  18. 18.Tülu 3 opens language model post-training up to more tasks and more people | Ai2allenai.org
  19. 19.AI for science | Ai2allenai.org
  20. 20.AI2 Incubatorai2incubator.com
  21. 21.Tülu 3: The next era in open post-training | Ai2allenai.org
  22. 22.Responsible use guidelines | Ai2allenai.org
  23. 23.Going beyond open data – increasing transparency and trust in language models with OLMoTrace | Ai2allenai.org
  24. 24.Ai2 releases new language models competitive with Meta's Llama | TechCrunchtechcrunch.com
  25. 25.Molmo | Ai2allenai.org
  26. 26.Who We Are | Allen Family Philanthropiesallenphilanthropies.org
  27. 27.Olmo 3: Charting a path through the model flow to lead open-source AI | Ai2allenai.org
  28. 28.Peter Clarkpclark425.github.io
  29. 29.Latest research | Ai2allenai.org
  30. 30.Team Directory | Ai2allenai.org
  31. 31.Research - The Global Security Risks of Open-Source AI Modelsglobalcenter.ai
  32. 32.Gallery Headingnews.cornell.edu
  33. 33.Nathan Lambertnatolambert.com
  34. 34.What comes next with open modelsinterconnects.ai
  35. 35.Nathan Lambert's ATOM Project Seeks American Open Source AI Modelsthenewstack.io
  36. 36.Careers | Ai2allenai.org
  37. 37.Olmo from Ai2allenai.org
  38. 38.EarthRanger | Ai2allenai.org
  39. 39.Microsoft hires former Ai2 CEO Ali Farhadi and top researchers for Suleyman's superintelligence teamcompleteaitraining.com
  40. 40.OLMo 2: The best fully open language model to date | Ai2allenai.org
  41. 41.Luke Muehlhauserlukemuehlhauser.com
  42. 42.The Ai2 Safety Toolkit: Datasets and models for safe and responsible LLMs development | Ai2allenai.org
  43. 43.Evaluation frameworks | Ai2allenai.org
  44. 44.Allen School partners with Ai2 to advance open AI for sciencenews.cs.washington.edu
  45. 45.Ai2 and Google Cloud commit $20M to advance AI-powered research for the Cancer AI Alliance | Ai2allenai.org
  46. 46.Open vs. closed AI: How behind are open models?epoch.ai
  47. 47.The Allen Institute For Artificial Intelligence - Nonprofit Explorer - ProPublicaprojects.propublica.org
  48. 48.Open-source LLMs' harmlessness gapinterconnects.ai
  49. 49.The koan of an open-source LLMinterconnects.ai
  50. 50.RLHF 201 - with Nathan Lambert of AI2 and Interconnectslatent.space
  51. 51.Asta: Accelerating science through trustworthy agentic AI | Ai2allenai.org
  52. 52.🎙️Nathan Lambert: Open Models Will Never Catch Upturingpost.com
  53. 53.MolmoWeb: An open agent for automating web tasks | Ai2allenai.org
  54. 54.PRIORprior.allenai.org
  55. 55.What’s next for the Allen Institute for AI: CEO Ali Farhadi charts a course for broader impacthalloftechnologie.blogspot.com
  56. 56.Precaution Shouldn't Keep Open-Source AI Behind the Frontierai-frontiers.org
  57. 57.The Allen Institute for Artificial Intelligence | Charity Navigator Profilecharitynavigator.org
  58. 58.OlmoEarth | Ai2allenai.org
  59. 59.Climate modeling | Ai2allenai.org
  60. 60.Transcript for State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490 - Lex Fridmanlexfridman.com