Theory of Change
AI2's theory of change is that radical openness in AI development is the primary mechanism for safety. CEO Ali Farhadi (before departing March 2026): "If we keep AI closed, or push more towards closing AI, this would be the most dangerous thing that could actually happen to AI and also to humanity." The causal chain: fully open models (weights + data + code + checkpoints) enable external scrutiny, community-driven safety research, reproducible science, and prevent dangerous concentration of AI power in a few corporations.
Farhadi explicitly framed safety as a technical problem: "All of them are technical problems more than policy or regulation problems because we're having those conversations because we still don't have enough understanding of these models... policy won't solve the problems that we have today." He argued that having the wrong policy "would be way worse than not having a policy."
Nathan Lambert, AI2's post-training lead, frames the mission more strategically: "We are trying to accelerate the science of language models so academics, regulators, and everyone else can keep up with the many parties trying to accelerate the commercialization of AI." He describes AI2 as "what OpenAI originally seemed like they were going to be."
AI2's founding CEO Oren Etzioni (2013-2022) published "AI Won't Exterminate Us -- It Will Empower Us" in 2014, which Luke Muehlhauser (then MIRI executive director) publicly rebutted. This skepticism of existential risk is part of AI2's intellectual DNA -- the organization has never operated from an x-risk framework.
What They Do
Fully open language models. OLMo is AI2's flagship model family. OLMo 3 (Nov 2025) released 7B and 32B parameter models with the complete "model flow" -- training data (Dolma 3, 9.3T tokens), code, weights, and all intermediate checkpoints. Claimed to be "the strongest fully open thinking model" at its scale. Trained on up to 1,024 H100 GPUs.
Open post-training recipes. Tulu 3 (Nov 2024) is an open post-training pipeline (SFT, DPO, RLVR, safety evaluation) that works on any base model including Meta's Llama. This may be more impactful than OLMo itself -- it democratizes the process that makes models useful while most labs keep it secret.
Safety toolkit. WildTeaming identifies 4.5x more jailbreak attacks than prior systems. WildJailbreak provides 262K safety training examples. WildGuard moderation tool outperforms GPT-4 on adversarial harmfulness detection by 4.8%. All open-source. These address the misuse/moderation layer of safety, not alignment or x-risk.
Training data transparency. OlmoTrace traces model outputs back to training data across 4.6T tokens in real time -- genuinely novel, impossible for closed models. Dolma is the largest fully open training dataset for LLMs.
Multimodal and vision. Molmo (Sep 2024) multimodal models outperform models 12x their size using dramatically less training data. MolmoWeb (Mar 2026) open-source web agent outperforms GPT-4o on web navigation.
AI for science. Semantic Scholar (200M+ papers, 8M monthly users). Asta platform for agentic scientific research at 194 institutions. Cancer AI Alliance with Fred Hutch ($20M commitment).
Conservation. EarthRanger wildlife monitoring at 500+ sites in 60 countries. Skylight anti-illegal-fishing. OlmoEarth for satellite/climate AI.
Publication volume. 1,000+ total papers, 111 models released in 2024. "More best paper awards than any other institutes of this size." Historic: ELMo (2018, 8,000+ citations), AllenNLP, Aristo.
Key People
Peter Clark (interim CEO, second stint). Founding AI2 member, led the Aristo project. Caretaker leader, not a visionary hire. Compensation: $1M+ (2024). His first interim stint was Oct 2022 - Jul 2023.
Nathan Lambert (post-training lead). The most publicly visible AI2 researcher. RLHF expert, author of the RLHF Book, writes the Interconnects newsletter. Drives the ATOM Project (American Truly Open Models). Candidly admits: "I mostly use closed models" and that AI2 needs to "actually dogfood the models" to take the next step.
Notable departures. Ali Farhadi (CEO), Hanna Hajishirzi (OLMo co-lead, OMAI co-PI), Ranjay Krishna (Molmo creator), Sophie Lebrecht (COO) -- all left for Microsoft Superintelligence team in March 2026. Yejin Choi (MacArthur Fellow, common-sense reasoning) left in 2024. Oren Etzioni (founding CEO) left in 2022 for TrueMedia.org. The pattern of serial leadership departures is the organization's most visible vulnerability.
~250-300 employees. Salaries doubled from $22.8M (2021) to $47.3M (2022).
Money and Incentives
Revenue: $204M (2024), nearly quadrupled from $53M in 2021. Virtually 100% from contributions (Allen estate/FFST). Zero product revenue, zero Coefficient Giving/Open Phil funding.
Primary funder: Fund for Science and Technology (FFST). $3.1B endowment from Paul Allen's estate, launched 2025. Led by Dr. Lynda Stuart (physician-scientist). FFST board includes Dr. Eric Horvitz (Microsoft Chief Scientific Officer). FFST is shifting from annual grants to proposal-based funding and favoring applied AI over frontier model research. A spokesperson said "broader program strategies are still under development."
The funding shift drove the brain drain. As Complete AI reported: "This funding shift explains why researchers focused on model development chose to move to Microsoft, where resources for that work are substantially larger." AI2 confirmed all 2026 programs remain fully funded, but the future of frontier model work is uncertain.
$152M NSF/NVIDIA OMAI grant. $75M NSF + $77M NVIDIA, 5-year, for building fully open AI models for science. PI: Noah Smith. This partially insulates open model work from FFST shifts, but the departure of co-PI Hajishirzi raises continuity questions.
Compensation: Farhadi $2.08M (2024), Clark $1.0M, Dumas (GC) $910K. Competitive for Seattle nonprofit but cannot match Microsoft/Google signing bonuses. The compensation gap is a structural talent retention vulnerability.
Incentive analysis. AI2 is structurally unusual: a private foundation funded almost entirely by a single estate, with no commercial revenue, no advertising platform, and no shareholders. This means its research priorities are set by the funder (FFST/Jody Allen), not the market. The key incentive misalignment is between AI2's research ambitions (frontier open models) and FFST's emerging preference (applied AI for science and conservation). AI2 cannot simply "find other funders" -- its entire operating model depends on the Allen estate.
What Others Say
The structural critique (from AI2's own people): Lambert in the Turing Post interview: "Open models have fewer resources, and resources normally determine the outcome... There's no reason to think open models will catch up." He estimates open models are 6-9 months behind closed ones. Epoch AI finds a consistent ~15-month lag in training compute. This challenges AI2's core claim that openness keeps pace with the frontier.
The security critique: IEEE Spectrum article by David Evan Harris (former Meta employee, UC Berkeley): "Open-Source AI Is Uniquely Dangerous." Calls for licensing, liability, and pausing releases of open models. Argues "unsecured AI poses an enormous risk that we are not yet able to contain." Global Center for AI identifies open models as vectors for deepfakes, cyberattacks, and terrorism recruitment.
The brain drain as verdict: The simultaneous departure of CEO, top researcher, COO, and a director for Microsoft's Superintelligence team is the market's judgment on AI2's ability to compete. Microsoft offered the compute and resources that FFST's funding shift was removing.
The democratization critique: Cornell research: open-source AI "do not themselves 'democratize' access to AI, or enable outside scrutiny, as both require resources concentrated in the hands of a few large companies."
Defense: The ai-frontiers.org argument that "precaution shouldn't keep open-source AI behind the frontier" and that restricting open models creates "digital feudalism" where users depend on a few tech firms. Lambert's honest case: open models are the engine for the next decade of AI research, even if they never reach the frontier.
What's Absent
No formal AI safety strategy document -- nothing comparable to Anthropic's RSP or DeepMind's Frontier Safety Framework. No engagement with core alignment concerns (deceptive alignment, mesa-optimization, power-seeking). Safety work is entirely at the misuse/moderation layer. No pre-release safety evaluation by external organizations (METR, Apollo, UK AISI). No assessment of catastrophic capabilities in OLMo models. No published conflict of interest policy despite Microsoft board members at both AI2 and FFST. No permanent CEO or announced search. No stated threshold at which AI2 would restrict model releases. Very little direct criticism from the AI safety community -- AI2 appears to be simply not on the x-risk radar.
Recommended Reading
Nathan Lambert -- Turing Post interview (Feb 2026) -- The most candid source. Lambert admits open models won't catch up, that he uses closed models himself, and that AI2 needs to dogfood its own models. The honest case for open models as research infrastructure, not frontier competition. turingpost.com/p/nathanlambert
David Evan Harris -- "Open-Source AI Is Uniquely Dangerous" (IEEE Spectrum, Jan 2024) -- The strongest counterargument to AI2's core approach. Calls for licensing and liability. spectrum.ieee.org/open-source-ai-2666932122
Ali Farhadi -- The Letter Two profile (Nov 2024) -- Farhadi's most candid interview before departing. Covers the mission, the evaluation crisis, and why he thinks AGI is "marketing jargon." thelettertwo.com
Epoch AI -- "Open vs. closed AI" (Nov 2024) -- Rigorous data on the capability gap between open and closed models. Essential context. epoch.ai/blog/open-models-report