Theory of Change
Apart Research's theory of change is that AI safety research can be accelerated by making it radically accessible. Their causal chain: weekend hackathons draw thousands of participants worldwide -> the best projects are invited into an 8-week Studio -> the best of those advance to a 12-24 week Fellowship -> fellows produce peer-reviewed papers and transition into AI safety careers. The mechanism is progressive filtering from a wide initial funnel.
In their own words: "We are a non-profit research and community-building AI safety lab with a strategic target on high-volume frontier technical research." The explicit emphasis on "high-volume" distinguishes Apart from orgs pursuing deep, focused research.
Esben Kran (founder): "Non-profits lack an engine for growth and don't benefit from the compound interest that drives for-profit ventures. They remain perpetually at the mercy of their funders -- indefinitely." He has publicly argued the non-profit model is broken and advocates for-profit AI safety as a complement.
Richard Ngo's endorsement captures the external view: "Building a culture of hands-on experimentation is probably the best way to do AI safety outreach, and Apart seems to have executed on it really well." The word "outreach" is notable -- the strongest endorsement frames Apart as community infrastructure, not a research lab.
What They Do
Research output: 22 peer-reviewed publications across NeurIPS, ICLR, ACL, ICML, EMNLP (13 peer-reviewed, 6 main conference, 9 workshop). The standout result is DarkBench (ICLR 2025 Oral Spotlight, top 1.8%), a benchmark for dark design patterns in LLMs across 660 prompts. Min-p Sampling also received an ICLR 2025 Oral Spotlight. Two oral spotlights at a top venue in one year is an extraordinary result for an org of this size.
Other notable work: Catastrophic Cyber Capabilities Benchmark (3CB), contributions to Anthropic's Sleeper Agents paper (via former CTO Fazl Barez), sandbox attack research on SAEs, and benchmark inflation detection.
Scale: 3,000-3,500+ participants across 42-55+ sprints, 50+ global locations, 26+ countries. 485 research reports submitted. 100+ research fellows. The pipeline is genuinely global and low-barrier.
METR partnership (Code Red Hackathon): 128 participants, 231 task ideas, 108 specifications, 28 full implementations following METR's task standard. METR confirmed the collaboration was "useful to METR's work." One participant joined METR full-time. Some tasks are now used by UK AISI. This is the strongest evidence of real-world safety impact.
Recent hackathon quality: March 2026 sprint outputs include multi-agent red-teaming of AI control evaluations, sandbagging detection via steering vectors (closing 85% of the gap), and HYDRA (88% jailbreak success on Claude Sonnet 4). The quality of recent hackathon work appears to be improving toward the AI control agenda.
Placements: Claims 30 placements at 20+ organizations. Named recipients include METR, Martian, Palisade Research, Anthropic, Redwood Research, AISI. Self-reported survey of 25 fellows: 10 now work primarily in AI safety, with Apart's contribution rated 5.6/10.
Policy engagement: Claims EU AI Act Code of Practice expert consultant role, IASEAI presentations, DC policy hackathon.
Key People
Esben Kran (Founder, Board Chairman). Left grad school at 22 to start Apart. Now co-running Seldon Lab, a for-profit AI safety startup accelerator in SF. P(doom) 20-40%. Candid critic of the non-profit model who argues for-profit AI safety companies are necessary. His primary attention has visibly shifted to Seldon.
Jaime Raldua Veuthey (CEO). Entered AI safety through an Apart hackathon, became CTO, then CEO when Esben stepped back. Co-authored DarkBench. His trajectory embodies the pipeline but also raises the question of whether hackathon-to-CEO is sufficient depth for leading a research organization.
Jason Hoelscher-Obermaier (Director of Research). PhD quantum physics, 9 years AI research engineering. Most research-credentialed person on the core team. Co-authored the Cooperative AI Foundation multi-agent risks report. Low public visibility.
Former CTO Fazl Barez (co-author on Sleeper Agents) departed for Oxford Martin AIGI -- a notable loss of research credibility.
Board: Esben Kran, Buck Shlegeris (Redwood Research CEO), Mathias Kirk Bonde (ControlAI), Finn Metz (Seldon Lab co-founder). Buck's involvement is the strongest credibility signal from technical AI safety.
Money and Incentives
Total funding: ~$1.5-2M lifetime (estimate). $700K raised in 2024 ("biggest year"). $250K total from CG/Open Phil (2 grants: $89K in Dec 2022, $161K in Jan 2025 -- both narrowly scoped to hackathons). Remaining funding from LTFF, SFF, Manifund regranters, and individual donors.
2025 funding crisis: Sought $955K (12 months' runway). Budget: Staff $691K (73%), Programs $156K (16%), Indirect $108K (11%). Mid-campaign raised $597K. Campaign completed July 2025, securing "funding into 2026." Root cause: concentrated funder dependency, insufficient funder relationship management, and scaling assumptions that didn't materialize.
Cost efficiency: ~$12.5K per hackathon, ~$5K per fellowship participant, ~$1K per studio participant. These are among the lowest costs in the AI safety talent pipeline -- roughly 6x cheaper per participant than LASR Labs, ~6x cheaper than MATS.
Funder revealed preference: CG/Open Phil has given Apart $250K over 3 years (scoped to hackathons) vs. GBP 8M+ to LASR Labs/Arcadia Impact (for research fellowships). This 30:1 funding ratio despite Apart having 10x more participants is the clearest signal of how major funders assess Apart's per-participant impact relative to more selective programs.
Manifund signal: One regranter gave $100K but later indicated willingness to fund at ~$50K, citing quality concerns about hackathon output (though caveated positively with conference feedback).
Fiscal structure: Danish non-profit using Ashgro Inc (US 501(c)(3)) as fiscal sponsor. No independently auditable financials.
Compute partnership: Lambda provides $5K per Lab team, $400 per hackathon team.
Key incentive tensions:
- The founder publicly argues the non-profit model is broken while running a non-profit. His co-founding of Seldon Lab (for-profit accelerator) is the logical consequence.
- Two of four board members (Esben, Finn) co-founded Seldon Lab, which explicitly uses Apart's non-profit hackathon pipeline as a talent source ("Access to Apart Research's hackathon pipeline, 3,000+ annual participants" -- from Seldon's batch 2 RFS).
- No public conflict-of-interest policy governs the Apart/Seldon relationship.
- 73% of budget going to staff salaries for 5-8 people means the org's survival is essentially a payroll question; program costs are only $156K.
What Others Say
Strongest endorsement: Richard Ngo: "Building a culture of hands-on experimentation is probably the best way to do AI safety outreach, and Apart seems to have executed on it really well."
Alumni testimonials (named, specific):
- Philip Quirke (Martian Research Lead): "Apart Research transformed my career and life trajectory... In less than two years, they helped me transition from an IT professional with zero AI experience to a Research Lead."
- Amir Abdullah (Thoughtworks AI Safety Lead): "I found myself out of research positions for the next several years due to location, time and family constraints... As a consequence of Apart's support and the 3 papers I published with them, I received my first full time offer."
- Sami Jawhar joined METR full-time via Code Red hackathon.
Institutional endorsement: METR confirmed the Code Red hackathon was "useful to METR's work." UK AISI uses some hackathon-produced evaluation tasks. Tyler Tracy (Redwood Research) credits Apart hackathon with generating interest in AI control.
Strongest counterargument: The hackathon model optimizes for accessibility and volume, not depth or novelty. Of 3,500+ participants producing 485+ research reports, 22 reached peer review (~4.5% yield). The per-dollar question is whether scaling research sprints produces more safety-relevant work than funding fewer, deeper programs. CG/OP's revealed preference (funding LASR at 30x the level of Apart) suggests major funders believe selective programs produce more value per dollar, even if they reach fewer people.
Criticism scarcity: Despite 46+ searches, no substantive published criticism of Apart exists on LW/EA Forum. This is likely because Apart is too small to attract critical attention, not because it's above reproach.
Fellowship ecosystem data (independent): Chris Clay's analysis of 600+ alumni across 9 programs found Apart Labs had 52 tracked alumni, with 17.3% doing another fellowship afterward (vs. 11.1% average). This positions Apart as more of a feeder program than an endpoint.
What's Absent
- No independent evaluation of impact. The 30 placements claim and all outcome data are self-reported.
- No audited financials or published annual reports.
- No public research strategy document explaining how research areas are prioritized.
- No citation or adoption metrics for published papers (DarkBench adoption by labs would be key evidence).
- No conflict-of-interest policy governing the Apart/Seldon Lab relationship.
- No data on hackathon participant retention (repeat participation rates).
- No explanation for Fazl Barez's departure (most credentialed early researcher).
- No public record of hackathon topic selection process.
Recommended Reading
Esben Kran, "Where we are on for-profit AI safety" -- The most candid source. Esben directly critiques the non-profit funding model, cites the SBF lesson, and argues for-profit safety ventures are necessary. Reveals the thinking that led to Seldon Lab and the tensions at the heart of Apart's identity.
"Why is Apart Research suddenly in dire need of funding?" (EA Forum) -- The strongest critical lens on Apart. Community discussion about why major funders weren't renewing, with insights about funder relationship failures and whether the hackathon model warranted scaling.
Code Red Hackathon Wrap Up (METR + Apart) -- The best evidence of the model working: specific numbers, METR's direct endorsement, a concrete placement outcome, and tasks now used by UK AISI.
Seldon Lab Mission -- Essential for understanding the for-profit sister org, Esben's evolving vision, and why he believes non-profit safety research needs for-profit infrastructure to scale.
"Where do AI Safety Fellows go?" (EA Forum, Chris Clay) -- Independent analysis of 600+ fellowship alumni across 9 programs. Shows Apart as an earlier-stage feeder program with above-average rates of participants going on to other fellowships.