Theory of Change
PIBBSS's theory of change centers on what they call the "epistemic access problem": we want to understand future AI systems that don't yet exist, but science depends on empirical access to phenomena. Their proposed solution is to study intelligent behavior "in the wild" -- in biological systems, brains, ecosystems, economies, social structures -- and transfer insights to AI alignment.
Nora Ammann (co-founder): "The premise is that intelligent behavior as a phenomena in the world is governed to some extent by the same or similar principles, irrespective of what system or specific substrate or specific scale it's implemented in. If this assumption is true, then it warrants the idea that we can look at currently existing systems that implement intelligent behavior... and insights about how intelligent behavior is implemented in the wild, can that help us transfer insights to the AI question specifically?"
She positions this as a "third way" between ML-centric safety research (assumes future systems resemble current ones) and agent-foundations research (based on idealized rational agents). The operative principle is "epistemic pluralism" -- triangulating understanding through multiple independent perspectives, acknowledging that no single analogy is true but convergence across many is informative.
The founding document explicitly pre-morts the risk of shallow analogies, citing Yudkowsky's criticism of biology-based timelines, and argues that mentor-fellow pairs with deep domain expertise serve as "the first line of defense against epistemic pollution."
What They Do
PIBBSS Summer Fellowship (core program): 3-month research fellowship pairing PhD/postdoc researchers from non-CS fields with AI alignment mentors. Three completed cohorts: 2022 (20 fellows, Czech Republic), 2023 (18 fellows, Prague), 2024 (~12 fellows, London). Stipend $3,000-$4,000/month plus housing. Total ~50+ fellows. Now running twice per year (summer + winter). New themed tracks for 2025: Cooperative AI (up to 6 fellows) and Gradual Disempowerment (up to 4 fellows).
Affiliate Program (launched January 2024): 6-12 month funding ($6,000-$10,000/month FTE) for senior researchers to develop their own agendas. Key outcome: Adam Shai's Simplex was incubated here. Currently 6+ affiliates.
Iliad (mathematical alignment arm): Research group under PrincInt umbrella. Runs a 6-week research residency (evolving into 3-month fellowship in 2026) focused on SLT, computational mechanics, and agent foundations. Connected to Timaeus (SLT org) through board member Alexander Gietelink Oldenziel.
AMI -- Ambitious Mechanistic Interpretability (launched 2026): New in-house research division sprinting for NeurIPS submission. Hiring Research Engineers and Research Scientists. Represents a major strategic shift from pure field-building to direct research.
Notable research outputs: Shai et al., "Transformers Represent Belief State Geometry in Their Residual Stream" (NeurIPS 2024, most upvoted AF post 2024). Weil, "Tort Law as a Tool for Mitigating Catastrophic AI Risk" (covered by Vox). Vallinder, "Cultural Evolution of Cooperation among LLM Agents" (AAMAS 2025). Kulveit et al., "Gradual Disempowerment" (ICML 2025). Corlouer, information-theoretic study of lying in LLMs (ICML 2024 workshop).
Key People
Nora Ammann (co-founder, Board President): Complex systems, philosophy of science. Led PIBBSS 2021-2024. Now Technical Specialist at ARIA Safeguarded AI under davidad. Co-author of "Guaranteed Safe AI" paper with Bengio, Russell, Tegmark. Remains the most publicly documented intellectual voice of the org across 3+ podcasts, but has moved to a board-only role.
Lucas Teixeira (Executive Director, Research): Philosophy, anthropology, computer science background. Previously at Conjecture. Joined ~8 months before taking over. Has virtually no public intellectual footprint -- one blog post on renormalization. This is the most significant information gap about the org's future direction.
Team size: ~5 core staff, 6+ research affiliates, ~20 fellows per summer cohort. Hiring actively: 4 positions open as of March 2026.
Mentor network is disproportionately strong: Abram Demski (MIRI), John Wentworth, David Dalrymple (davidad), Jan Kulveit, Vanessa Kosoy (CORAL), Joseph Bloom (SAEs), Tsvi Benson-Tilsen (MIRI).
Money and Incentives
Total known funding: ~$2.5M over 5 years
- Coefficient Giving/Open Philanthropy: $1,925,290 (77% of known total)
- $239,000 (March 2023)
- $1,686,290 (May 2025, three sub-grants)
- LTFF: $305,000 (December 2021)
- SFF: ~$186,000 (2023)
- CAIF: $34,147 (2024-2025)
- Various smaller: AISTOF, Foresight Institute, Manifund donors
Financial precariousness: In late 2024, PIBBSS described having "5-6 months of runway" during Marginal Funding Week. Monthly running cost ~$20,000. The 2025 CG grant of $1.69M significantly improved the picture, but the org was genuinely close to crisis.
Funder concentration: Open Philanthropy provides ~77% of known funding. PIBBSS was NOT listed in SFF's 2025 S-process recommendations (previously funded ~$186K in 2023).
Business model: Pure grants. No product revenue, no contracts, no endowment. Marginal fellow cost ~$20K per 3 months. Marginal affiliate cost ~$35K per 6 months.
Incentive tension: The MATS director notes "mainstream technical AI safety funders have pivoted more towards applied research." PIBBSS's original blue-sky positioning is increasingly hard to fund, which may explain the pivot toward mechanistic interpretability (AMI division). The question is whether this pivot preserves or undermines the org's distinctive value.
No 990 financial data available. All figures are from self-reporting and known grant records.
What Others Say
Ryan Kidd (MATS Director), the most informed external assessor, funds PIBBSS $25K but notes: "PIBBSS might be pivoting away from higher variance blue sky research to focus on more mainstream AI interpretability. While this might create more opportunities for funding, I think this would be a mistake. The AI safety ecosystem needs a home for 'weird ideas' and PIBBSS seems the most reputable, competent, EA-aligned place for this!" He positions PIBBSS as essential for developing "Connector" archetype researchers who "bridge exploratory theory and empirical science" with development time "on the order of years."
LTFF (Evan Hubinger) funded PIBBSS's first fellowship with a specific concern: "by targeting candidates with strong interdisciplinary backgrounds but not necessarily much background in EA or AI safety, we were somewhat concerned that such candidates might not stick around."
PIBBSS's own 2022 retrospective is remarkably candid: only 6-10 of 20 fellows "made interesting progress"; concrete output was "insufficient"; prosaic alignment transfer was the hardest domain; academic incentive conflicts curtailed value. By 2024, mentor satisfaction was 7/10+ for 6 of 8 mentors, and output quality had improved significantly.
Mentor testimonials are consistently strong. One alignment researcher said of their fellow: "20% [they] will clearly surpass everyone else in AI alignment before we all die."
Independent criticism is virtually absent. No one has published a substantive argument against the interdisciplinary-to-alignment pipeline. This likely reflects the org's small size rather than consensus endorsement.
What's Absent
- No public financial statements, budget breakdown, or annual report despite multi-million dollar funding
- No documented fellowship selection criteria, despite MATS director specifically recommending publication
- No public intellectual profile for Lucas Teixeira (ED Research) -- the person now leading research direction
- No independent impact evaluation or counterfactual analysis of fellow outcomes
- No explicit conflict-of-interest policy despite board members with dual roles (Timaeus, ARIA, alumnus)
- No single theory-of-change document -- must be reconstructed from scattered sources
Recommended Reading
Cognitive Revolution Podcast with Nora Ammann (2023) -- The most candid, unfiltered view of how PIBBSS thinks. Nora explains the epistemic access problem, "third way" approach, and research agenda in her own words. Start here. https://www.cognitiverevolution.ai/what-biological-social-systems-can-teach-us-about-ai-with-nora-ammann-cofounder-of-pibbss-research/
"Why I Funded PIBBSS" by Ryan Kidd, MATS Director (2024) -- The strongest counterargument bundled with endorsement. "Embrace the weird" vs. interpretability pivot tension clearly articulated. https://www.lesswrong.com/posts/Yjiw5rnu4mJsYN8Xc/why-i-funded-pibbss-1
Reflections on the PIBBSS Fellowship 2022 -- Unusually honest self-assessment of what worked and what didn't. https://www.lesswrong.com/posts/gbeyjALdjdoCGayc6/reflections-on-the-pibbss-fellowship-2022
Gradual Disempowerment paper (Kulveit et al., ICML 2025) -- The most distinctive intellectual output of the PIBBSS network. Shows the kind of original risk framing the interdisciplinary approach can produce. https://gradual-disempowerment.ai/