Principles of Intelligence (PIBBSS)

Field-Building

Interdisciplinary. Biology + AI safety.

Founded: 2021
HQ: London, UK / Bay Area, CA
Team: 5
Structure: 501(c)(3) nonprofit
Model: Grants

Theory of Change

PIBBSS's theory of change centers on what they call the "epistemic access problem": we want to understand future AI systems that don't yet exist, but science depends on empirical access to phenomena. Their proposed solution is to study intelligent behavior "in the wild" -- in biological systems, brains, ecosystems, economies, social structures -- and transfer insights to AI alignment.

Nora Ammann (co-founder): "The premise is that intelligent behavior as a phenomena in the world is governed to some extent by the same or similar principles, irrespective of what system or specific substrate or specific scale it's implemented in. If this assumption is true, then it warrants the idea that we can look at currently existing systems that implement intelligent behavior... and insights about how intelligent behavior is implemented in the wild, can that help us transfer insights to the AI question specifically?"

She positions this as a "third way" between ML-centric safety research (assumes future systems resemble current ones) and agent-foundations research (based on idealized rational agents). The operative principle is "epistemic pluralism" -- triangulating understanding through multiple independent perspectives, acknowledging that no single analogy is true but convergence across many is informative.

The founding document explicitly pre-morts the risk of shallow analogies, citing Yudkowsky's criticism of biology-based timelines, and argues that mentor-fellow pairs with deep domain expertise serve as "the first line of defense against epistemic pollution."

What They Do

PIBBSS Summer Fellowship (core program): 3-month research fellowship pairing PhD/postdoc researchers from non-CS fields with AI alignment mentors. Three completed cohorts: 2022 (20 fellows, Czech Republic), 2023 (18 fellows, Prague), 2024 (~12 fellows, London). Stipend $3,000-$4,000/month plus housing. Total ~50+ fellows. Now running twice per year (summer + winter). New themed tracks for 2025: Cooperative AI (up to 6 fellows) and Gradual Disempowerment (up to 4 fellows).

Affiliate Program (launched January 2024): 6-12 month funding ($6,000-$10,000/month FTE) for senior researchers to develop their own agendas. Key outcome: Adam Shai's Simplex was incubated here. Currently 6+ affiliates.

Iliad (mathematical alignment arm): Research group under PrincInt umbrella. Runs a 6-week research residency (evolving into 3-month fellowship in 2026) focused on SLT, computational mechanics, and agent foundations. Connected to Timaeus (SLT org) through board member Alexander Gietelink Oldenziel.

AMI -- Ambitious Mechanistic Interpretability (launched 2026): New in-house research division sprinting for NeurIPS submission. Hiring Research Engineers and Research Scientists. Represents a major strategic shift from pure field-building to direct research.

Notable research outputs: Shai et al., "Transformers Represent Belief State Geometry in Their Residual Stream" (NeurIPS 2024, most upvoted AF post 2024). Weil, "Tort Law as a Tool for Mitigating Catastrophic AI Risk" (covered by Vox). Vallinder, "Cultural Evolution of Cooperation among LLM Agents" (AAMAS 2025). Kulveit et al., "Gradual Disempowerment" (ICML 2025). Corlouer, information-theoretic study of lying in LLMs (ICML 2024 workshop).

Key People

Nora Ammann (co-founder, Board President): Complex systems, philosophy of science. Led PIBBSS 2021-2024. Now Technical Specialist at ARIA Safeguarded AI under davidad. Co-author of "Guaranteed Safe AI" paper with Bengio, Russell, Tegmark. Remains the most publicly documented intellectual voice of the org across 3+ podcasts, but has moved to a board-only role.

Lucas Teixeira (Executive Director, Research): Philosophy, anthropology, computer science background. Previously at Conjecture. Joined ~8 months before taking over. Has virtually no public intellectual footprint -- one blog post on renormalization. This is the most significant information gap about the org's future direction.

Team size: ~5 core staff, 6+ research affiliates, ~20 fellows per summer cohort. Hiring actively: 4 positions open as of March 2026.

Mentor network is disproportionately strong: Abram Demski (MIRI), John Wentworth, David Dalrymple (davidad), Jan Kulveit, Vanessa Kosoy (CORAL), Joseph Bloom (SAEs), Tsvi Benson-Tilsen (MIRI).

Money and Incentives

Total known funding: ~$2.5M over 5 years

Coefficient Giving/Open Philanthropy: $1,925,290 (77% of known total)
- $239,000 (March 2023)
- $1,686,290 (May 2025, three sub-grants)
LTFF: $305,000 (December 2021)
SFF: ~$186,000 (2023)
CAIF: $34,147 (2024-2025)
Various smaller: AISTOF, Foresight Institute, Manifund donors

Financial precariousness: In late 2024, PIBBSS described having "5-6 months of runway" during Marginal Funding Week. Monthly running cost ~$20,000. The 2025 CG grant of $1.69M significantly improved the picture, but the org was genuinely close to crisis.

Funder concentration: Open Philanthropy provides ~77% of known funding. PIBBSS was NOT listed in SFF's 2025 S-process recommendations (previously funded ~$186K in 2023).

Business model: Pure grants. No product revenue, no contracts, no endowment. Marginal fellow cost ~$20K per 3 months. Marginal affiliate cost ~$35K per 6 months.

Incentive tension: The MATS director notes "mainstream technical AI safety funders have pivoted more towards applied research." PIBBSS's original blue-sky positioning is increasingly hard to fund, which may explain the pivot toward mechanistic interpretability (AMI division). The question is whether this pivot preserves or undermines the org's distinctive value.

No 990 financial data available. All figures are from self-reporting and known grant records.

What Others Say

Ryan Kidd (MATS Director), the most informed external assessor, funds PIBBSS $25K but notes: "PIBBSS might be pivoting away from higher variance blue sky research to focus on more mainstream AI interpretability. While this might create more opportunities for funding, I think this would be a mistake. The AI safety ecosystem needs a home for 'weird ideas' and PIBBSS seems the most reputable, competent, EA-aligned place for this!" He positions PIBBSS as essential for developing "Connector" archetype researchers who "bridge exploratory theory and empirical science" with development time "on the order of years."

LTFF (Evan Hubinger) funded PIBBSS's first fellowship with a specific concern: "by targeting candidates with strong interdisciplinary backgrounds but not necessarily much background in EA or AI safety, we were somewhat concerned that such candidates might not stick around."

PIBBSS's own 2022 retrospective is remarkably candid: only 6-10 of 20 fellows "made interesting progress"; concrete output was "insufficient"; prosaic alignment transfer was the hardest domain; academic incentive conflicts curtailed value. By 2024, mentor satisfaction was 7/10+ for 6 of 8 mentors, and output quality had improved significantly.

Mentor testimonials are consistently strong. One alignment researcher said of their fellow: "20% [they] will clearly surpass everyone else in AI alignment before we all die."

Independent criticism is virtually absent. No one has published a substantive argument against the interdisciplinary-to-alignment pipeline. This likely reflects the org's small size rather than consensus endorsement.

What's Absent

No public financial statements, budget breakdown, or annual report despite multi-million dollar funding
No documented fellowship selection criteria, despite MATS director specifically recommending publication
No public intellectual profile for Lucas Teixeira (ED Research) -- the person now leading research direction
No independent impact evaluation or counterfactual analysis of fellow outcomes
No explicit conflict-of-interest policy despite board members with dual roles (Timaeus, ARIA, alumnus)
No single theory-of-change document -- must be reconstructed from scattered sources

Stated Theory of Change

PIBBSS's stated theory of change has three layers:

Layer 1 -- Epistemic: Future AI systems may be sufficiently different from current ones that studying present-day ML alone won't prepare us. Intelligence is a natural phenomenon that appears across substrates (brains, ecosystems, economies, cells). By studying how intelligent behavior works in nature, we can develop substrate-independent principles that apply to AI regardless of its architecture.

Layer 2 -- Talent pipeline: The AI safety field needs people with deep expertise in complex systems, neuroscience, evolutionary biology, social science, and other domains. These experts exist in academia but face career barriers to engaging with AI safety. PIBBSS bridges this gap by pairing them with alignment researchers for structured collaboration.

Layer 3 -- Institutional: The field needs an institutional home for "blue sky" research that is too speculative for AI labs and too interdisciplinary for traditional academia. PIBBSS aims to be "a Bell Labs of AI Safety" -- a place where weird, high-variance ideas get serious support and can eventually spin off into focused research programs.

The causal chain: recruit domain experts -> pair with alignment mentors -> generate cross-domain insights -> spin off promising research lines into independent organizations or lab positions -> field grows in both breadth and quality.

Revealed Theory of Change

Where actions diverge from stated theory:

The interpretability pivot: The most significant divergence is the move toward mechanistic interpretability via AMI (Ambitious Mechanistic Interpretability) and the Iliad research group's SLT focus. These are high-quality, mathematically rigorous research directions -- but they are not the "blue sky" interdisciplinary exploration that made PIBBSS distinctive. AMI is essentially a mech interp lab. The MATS director specifically flagged this: "PIBBSS might be pivoting away from higher variance blue sky research to focus on more mainstream AI interpretability. I think this would be a mistake." This pivot is likely driven by funding incentives -- funders have shifted toward applied research, and interpretability is more legible and fundable than "what can ant armies teach us about multi-agent AI."

Scaling via structure, not serendipity: The original fellowship was radically open-ended -- find brilliant people, match them with mentors, see what happens. The new themed tracks (Cooperative AI, Gradual Disempowerment) and the move to twice-yearly fellowships with a dedicated Program Director suggest a shift toward more structured, repeatable program delivery. This is good for institutional sustainability but may reduce the high-variance hits the original model was optimized for.

From field-builder to research org: The AMI division with full-time hires represents a fundamental identity shift. A field-builder hosts other people's research; a research org does its own. The question is whether PrincInt can do both, or whether in-house research will crowd out the more speculative fellowship projects.

Nora's departure: The intellectual driving force left for a government position. The remaining leadership (Lucas Teixeira) has a thin public research identity. In practice, the intellectual vision is now distributed across the board (Ammann, Zhi-Xuan, Oldenziel, Goldhaber) and the mentor network rather than concentrated in an executive.

Key Assumptions

1. Intelligence has substrate-independent principles worth discovering.

Evidence for: Convergent evolution in biology, universality in statistical physics, information-theoretic invariants. The Simplex results (transformers representing belief state geometry predicted by computational mechanics) are direct evidence that physics-derived theory can explain ML behavior.
Evidence against: Many proposed analogies between natural and artificial systems have been shallow or misleading (Yudkowsky's critique of biology-based timelines). Deep learning may be "its own thing" with principles that don't map well to biological intelligence.
Testable: Yes -- each PIBBSS-style project is a test of whether a specific cross-domain transfer works.
If wrong: PIBBSS's value reduces to a generic talent pipeline (still valuable but not distinctively so).

2. The right talent exists in academia and can be recruited.

Evidence for: Three cohorts successfully recruited PhDs/postdocs from physics, neuroscience, philosophy, biology. The 2023 cohort was more senior (3 years more experience than 2022). The "100+ academics" post suggests growing openness to AI safety.
Evidence against: Social sciences remain underrecruited. Academic career incentives actively discourage AI safety work. The most senior academics (professors) are most dismissive. Retention after the program is uncertain.
Testable: Yes -- long-term tracking of fellow careers.
If wrong: PIBBSS becomes a brief exposure program rather than a genuine talent transition mechanism.

3. Three months is enough to produce meaningful cross-domain research.

Evidence for: Some fellows (Gabriel Weil, Adam Shai) produced genuinely significant work. The 2024 cohort's output was more publication-ready than earlier years.
Evidence against: PIBBSS's own 2022 retrospective: only 30-50% made "interesting progress." Prosaic alignment transfer was especially hard. The MATS director notes Connector development takes "on the order of years."
Testable: Yes -- compare output quality across cohorts.
If wrong: The affiliate program (6-12 months) becomes the real value creator, and the fellowship is primarily a screening/exposure tool.

4. Hits-based outcomes justify the program.

Evidence for: Simplex alone may justify a significant portion of PIBBSS's total funding. Gabriel Weil's tort law framework. The Gradual Disempowerment paper (ICML 2025). If even 1-2 fellows per cohort produce genuinely important work, the expected value is high.
Evidence against: No rigorous counterfactual analysis exists. Would Shai have found computational mechanics for transformers without PIBBSS? Possibly -- he was already adjacent. The hits may be cases where talented people were going to produce important work regardless.
If wrong: PIBBSS is a nice community-building exercise with less counterfactual impact than claimed.

5. The org can survive and thrive after founder departure.

Evidence for: Board includes several intellectually strong figures. New CG grant of $1.69M suggests funder confidence in post-Nora leadership. Nora remains on board.
Evidence against: Lucas has no public intellectual profile. The original vision was Nora's. The mentor network was built on Nora's relationships. The "embrace the weird" positioning requires charismatic intellectual leadership.
If wrong: PIBBSS becomes a competent but generic fellowship program, losing its distinctive epistemic identity.

Strengths

Genuinely distinctive intellectual niche: No other AI safety org does exactly what PIBBSS does -- bringing in deep domain experts from outside CS/ML. MATS, ERA, and other fellowships train people within the alignment paradigm; PIBBSS recruits from outside it. This is valuable regardless of any individual project's success.
Disproportionately strong mentor network: Having Abram Demski, John Wentworth, davidad, Jan Kulveit, and Vanessa Kosoy as mentors for a tiny org is extraordinary. This network is hard to replicate.
Honest self-assessment culture: The retrospective publications are unusually candid about failures. The 2022 post's admission that "concrete output was insufficient" and prosaic transfer was harder than expected builds genuine credibility.
Proof of concept: Simplex is a real company doing real research that genuinely emerged from the PIBBSS pipeline. Adam Shai's work on computational mechanics and transformers demonstrates the interdisciplinary transfer working at its best.
Nimble and adaptive: The org has evolved significantly in 4 years -- from one-off summer fellowship to affiliate program, Horizon Scanning, Iliad, AMI, twice-yearly fellowships. This suggests a leadership team that learns and adjusts.

Weaknesses and Risks

Extreme funder concentration: Open Philanthropy provides ~77% of known funding. If CG changes priorities, PIBBSS faces existential risk. The "5-6 months runway" crisis in late 2024 shows this is not hypothetical.
Identity crisis between "weird ideas" and legibility: The tension between the original blue-sky positioning and the interpretability pivot is unresolved. If PIBBSS becomes another mech interp lab, it loses its distinctive value. If it stays fully blue-sky, it may not be fundable. The org hasn't articulated a clear resolution.
Post-founder leadership vacuum: Lucas Teixeira has no public intellectual footprint. The vision that attracted funders, mentors, and fellows was Nora's. The org needs to develop a public intellectual identity for its current leadership or it risks becoming a "fellowship factory" without a unifying intellectual vision.
Uncertain impact measurement: No rigorous counterfactual analysis of fellow outcomes. The "73 alumni placed" claim is plausible but unverified. Without this data, the org is making claims it can't fully support.
Scale mismatch with aspiration: "Bell Labs of AI Safety" with ~5 full-time staff and $2.5M lifetime funding is a dramatic mismatch. Bell Labs had thousands of researchers and a captive corporate funding stream. The aspiration is inspiring but the path from here to there is not articulated.
Governance gaps: Board formed only in 2024 (year 3). Multiple board members have dual roles (Timaeus, ARIA, alumnus). No public conflict-of-interest policy. No documented selection process despite funder requests.

Cross-References

MATS: Closest comparator. Explicitly positioned as complementary: MATS trains "Iterators" on established mentors' agendas (10 weeks); PIBBSS develops "Connectors" who bridge fields (3-12 months). MATS is larger, more mainstream, and better-funded.
ACS Research (Jan Kulveit): Deeply interconnected. ACS's "gradual disempowerment" and "AI sociology" agendas overlap with PIBBSS fellowship tracks. Jan Kulveit is both PIBBSS mentor and ACS co-founder. These appear to be sister organizations.
Timaeus: Connected through board member Alexander Gietelink Oldenziel. Timaeus focuses on SLT; PIBBSS's Iliad program runs SLT-adjacent work. Complementary but potential for resource competition.
Simplex: PIBBSS's most successful spinoff. Demonstrates the pipeline working. Simplex's computational mechanics approach to interpretability is exactly the kind of cross-domain transfer PIBBSS was designed to enable.
ARIA Safeguarded AI: Nora's current employer. The "Guaranteed Safe AI" framework that davidad leads at ARIA has intellectual roots in the PIBBSS network. This connection could channel UK government resources toward PIBBSS-style research.

What Would Change This Assessment

Lucas Teixeira gives a substantive public talk or writes a detailed vision document -- would significantly reduce the leadership-vacuum concern.
PIBBSS publishes audited financials or a detailed annual report -- would address transparency concerns.
A rigorous counterfactual impact study of fellowship alumni -- would validate or challenge the talent pipeline claim.
The AMI division produces significant interpretability results -- would validate the pivot, even if it comes at the cost of blue-sky research.
Another fellow/affiliate produces a Simplex-caliber outcome -- would strengthen the hits-based case.
Open Philanthropy reduces or eliminates funding -- would test whether PIBBSS can survive on diversified funding.
A credible external critic publishes a substantive argument against the interdisciplinary approach -- would test the intellectual foundations.

Self-Critique

What sources should I have checked but didn't?

The full Manifund page with donor comments (rate-limited during scouting)
LinkedIn profiles of all 50+ fellows to verify placement claims
The SFF internal evaluation that led to PIBBSS not being funded in 2025
Any private evaluations from Open Philanthropy staff

Where is this analysis potentially biased?

I may be overweighting the intellectual framework's elegance versus its practical impact. "Epistemic pluralism" sounds impressive but could be a sophisticated justification for unfocused research.
I may be too charitable about the Simplex success story -- attributing it heavily to PIBBSS when Shai/Riechers may have found their approach independently.
I may be too critical of the interpretability pivot. If the field genuinely needs more mech interp capacity, PIBBSS adding it is valuable regardless of whether it's "distinctive."

What would a thoughtful person who disagrees say? "PIBBSS is a nice community-building exercise that produces a few interesting papers per year but has minimal counterfactual impact on AI safety. The interdisciplinary transfer thesis is mostly wrong -- the most impactful safety work requires deep ML expertise, not biology analogies. The few successes (Simplex, Weil) are talented people who would have found their way regardless. The org's financial fragility is a feature revealing its marginal value, not a bug."

My single weakest claim? That the post-Nora leadership can maintain the intellectual vision and mentor network that made PIBBSS distinctive. I have essentially no evidence about Lucas Teixeira's intellectual capabilities because he hasn't made them public.

What information would most change my view? A rigorous counterfactual impact study showing that PIBBSS fellows would have entered AI safety at similar rates without the program. This would turn the assessment from "valuable hits-based approach" to "nice community with modest impact."

Connected to (12)

Anthropicstaff to

ARIA Safeguarded AIstaff to · Nora Ammann

Conjecturestaff from · Lucas Teixeira

Cooperative AI Foundationcollaborator

DeepMindstaff to

Future of Life Foundationboard overlap · Ben Goldhaber

MATScollaborator

Simplexspun off from · Adam Shai

Timaeusboard overlap · Alexander Gietelink Oldenziel

ACS Researchcollaborator · Jan Kulveit

MIRIadvisor at · Abram Demski

UK AI Safety Institutestaff to

Sources (56)

Every URL that was read during research.

1.About – Principles of Intelligenceprincint.ai
2.Programs – Principles of Intelligenceprincint.ai
3.PIBBSS Fellowship – Principles of Intelligenceprincint.ai
4.Horizon Scanning – Principles of Intelligenceprincint.ai
5.Research Residency – Principles of Intelligenceprincint.ai
6.Research Highlights – Principles of Intelligenceprincint.ai
7.News – Principles of Intelligenceprincint.ai
8.Principles of Intelligenceprincint.ai
9.What Biological & Social Systems Can Teach us About AI with Nora Ammann, Cofounder of PIBBSS Researchcognitiverevolution.ai
10.A Look At Our Recent Partnerships To Support Early-Career Researcherscooperativeai.com
11.Nora Ammannnora-ammann.replit.app
12.Nora Ammann - Foresight Instituteforesight.org
13.Retrospective: PIBBSS Fellowship 2024greaterwrong.com
14.Retrospective: PIBBSS Fellowship 2023greaterwrong.com
15.Reflections on the PIBBSS Fellowship 2022greaterwrong.com
16.Why I funded PIBBSSgreaterwrong.com
17.New Executive Team & Board — PIBBSSgreaterwrong.com
18.PIBBSS Fellowship 2025 Research Management Postpibbss.ai
19.Research Residency – Principles of Intelligencepibbss.ai
20.ILIAD Conferenceiliadconference.com
21.Proceedings — ILIAD Conferenceiliadconference.com
22.Fellows & Alumni – Principles of Intelligenceprincint.ai
23.Simplexsimplexaisafety.com
24.Symposium ’23pibbss.ai
25.News – Principles of Intelligencepibbss.ai
26.How to Avoid Two AI Catastrophes: Domination and Chaos (with Nora Ammann)podcast.futureoflife.org
27.Guaranteed Safe AI? World Models, Safety Specs, & Verifiers, with Nora Ammann & Ben Goldhabercognitiverevolution.ai
28.How to Avoid Two AI Catastrophes: Domination and Chaos (with Nora Ammann) - Future of Life Institutefutureoflife.org
29.SFF-2025 S-Process Recommendations Announcement | Survival and Flourishing Fundsurvivalandflourishing.fund
30.Now Hiring: Research Engineers and Research Scientists at Principles of Intelligence – Principles of Intelligenceprincint.ai
31.Now Hiring: PIBBSS Fellowship Program Director (PD) at Principles of Intelligence – Principles of Intelligenceprincint.ai
32.Technical Research Manager (Iliad Fellowship 2026 & PIBBSS Fellowship 2026) – Principles of Intelligenceprincint.ai
33.Affiliateship – Principles of Intelligencepibbss.ai
34.Systemic Existential Risks from Incremental AI Developmentgradual-disempowerment.ai
35.Where do AI Safety Fellows go? Analyzing a dataset of 600+ alumnigreaterwrong.com
36.Team & Advisors – Principles of Intelligencepibbss.ai
37.32 - Understanding Agency with Jan Kulveitaxrp.net
38.ACS Research Programacsresearch.org
39.[missing post]greaterwrong.com
40.PIBBSS Fellowship 2025: Bounties and Cooperative AI Track Announcementgreaterwrong.com
41.Horizon Scanning – Principles of Intelligencepibbss.ai
42.Introducing the Principles of Intelligent Behaviour in Biological and Social Systems (PIBBSS) Fellowshipgreaterwrong.com
43.PIBBSS taggreaterwrong.com
44.Lessons learned from talking to >100 academics about AI safetygreaterwrong.com
45.Safeguarded AIaria.org.uk
46.Opportunity Space: Renormalization for AI Safetygreaterwrong.com
47.Symposium ’24pibbss.ai
48.December 2021: Long-Term Future Fund Grants | Effective Altruism Fundsfunds.effectivealtruism.org
49.Iliadiliad.ac
50.Announcement: Iliad Intensive + Iliad Fellowshipgreaterwrong.com
51.Principles Of Intelligence - Nonprofit Explorer - ProPublicaprojects.propublica.org
52.36 - Adam Shai and Paul Riechers on Computational Mechanicsaxrp.net
53.How would your project use extra funding? (Marginal Funding Week 2024)ea.greaterwrong.com
54.Grants Awarded by the Cooperative AI Foundationcooperativeai.com
55.Mechanistic Interpretability for AI Safety — A Reviewleonardbereska.github.io
56.Apply to the 2025 PIBBSS Summer Research Fellowshipgreaterwrong.com