Theory of Change
Timaeus claims that Singular Learning Theory (SLT) -- a mathematical theory of Bayesian statistics using algebraic geometry -- provides fundamental tools for understanding how training data shapes model structure. The causal chain: (1) training data determines the geometry of the loss landscape, (2) loss landscape geometry determines which algorithms models learn, (3) understanding this chain enables new tools for interpretability ("reading" what models have learned) and alignment ("writing" desired properties into models more reliably).
Jesse Hoogland: "Currently all of our existing techniques for trying to align models look like this: Train the model on examples of the kind of behavior you would like to see. It's a very indirect process... We don't understand how it works and we don't know that the way it's actually changing models is deep or significant or robust or lasting."
Daniel Murfet frames the alignment relevance through Nate Soares's "capabilities generalize further than alignment" argument: capabilities are deeply inscribed by patterns pervasive in training data, while alignment interventions (RLHF, constitutional AI) may produce shallower structures that break under distribution shift. SLT's contribution would be tools to measure which structures are "deep" vs "shallow."
What They Do
Timaeus conducts fundamental research on SLT applications to AI safety, organized into three prongs: (1) near-term safety applications (UK AISI partnership on singular psychometrics and weight exfiltration), (2) interpretability via developmental interpretability -- tracking how models change during training using SLT-derived observables, and (3) longer-term alignment foundations.
Key research outputs:
- ICLR 2025 Spotlight: "Differentiation and Specialization of Attention Heads" -- used refined LLC to track how attention heads develop distinct roles during training
- Best Paper HiLD ICML 2024: "Loss Landscape Degeneracy and Stagewise Development in Transformers" -- demonstrated that small transformers progress through distinct developmental stages, detectable by the Local Learning Coefficient even when hidden from the loss curve
- AISTATS 2025: "The Local Learning Coefficient" -- foundational method paper for estimating the key geometric invariant
- Scaled techniques from toy models to billions of parameters, finding that sampling scales sublinearly with model size
- Discovered the "multigram circuit" -- a new circuit type for nested pattern matching, found using SLT tools
Open-source devinterp library (127 GitHub stars). 20+ talks at frontier labs and academic venues. Organized 6+ major conferences/workshops since 2023. MATS Summer 2026 mentoring stream.
Key People
Jesse Hoogland (Executive Director, co-founder): MSc Theoretical Physics (U Amsterdam), MATS 3.0/3.1 under Evan Hubinger, prior health-tech CTO. Responsible for outreach, operations, and research engineering. 20+ talks at OpenAI, DeepMind, Anthropic and other major venues. Raised $3.5M+ in grants.
Daniel Murfet (Research Director, co-founder): Algebraic geometer who left a tenured position at U Melbourne in early 2025 to join full-time. Established the SLT for AI safety research agenda. The core mathematical expertise behind Timaeus's approach. "A very robust argument supports the view that AGI will be dangerous."
Team: Grew from 4 at founding (Oct 2023) to ~20+ by March 2026 (16 FTE by Jul 2025). Includes ~10 researchers, 5 engineers. Three advisors: Evan Hubinger (Anthropic), Davidad (ARIA), Adam Gleave (FAR AI). No public departures.
Money and Incentives
Total funding: ~$3.5M+ raised cumulatively through July 2025. 2025 budget: ~$2.5M.
Funding breakdown:
- Open Philanthropy: $1,557,000 (Jan 2025, ~44% of cumulative total) -- two grants for operating expenses
- Survival and Flourishing Fund: ~$1.05M across all rounds (estimated: ~$500K pre-2025, $276K S-Process 2025, $206K matching pledge, $70K speculation grant)
- Manifund: ~$255K (initial $143K regrant + ~$112K additional)
- LTFF, AISTOF: mentioned but amounts unknown
- Jesse Hoogland: personal FTX Future Fund grant (amount unknown, for career transition)
Business model: 100% grants. No product revenue, no compute credits from labs, no venture investment.
Salaries: $70K-$140K/year full-time; $50-100/hour contract. Below-market for AI safety research in early period, likely improved after Open Phil grant.
Incentive structure: Pure nonprofit with no commercial pressures. No lab affiliations beyond the advisory relationships (Hubinger at Anthropic, Gleave at FAR AI). UK AISI partnership is the only documented institutional relationship. The absence of lab compute dependencies or revenue is a structural advantage for independence.
Potential concerns: Open Phil is now the single largest funder. Advisor-funder overlap (Hubinger as both advisor and first major funder). No board of directors separate from leadership. No public financial filings (EIN unknown).
What Others Say
Strongest technical critique -- Joar Skalse (Nov 2023): SLT cannot explain generalization because it abstracts away the parameter-function map and Bayesian prior. "To understand the generalisation behaviour of a learning machine, we must understand its inductive bias. SLT abstracts away from both the prior and the parameter-function map. Hence, SLT is at its core unable to explain generalisation." Skalse concedes SLT may help with phase transitions and grokking, but denies it can serve as a unified theory of deep learning.
Even the advisor is skeptical -- Evan Hubinger (Timaeus advisor): "My modal outcome is that it's mostly just wrong and doesn't explain machine learning inductive biases that well. Inductive biases are very complex and most theories like this in the past have failed." Also: "I think Singular Learning Theory is a real contender for a theory that has a chance of effectively explaining and predicting the mechanisms behind machine learning inductive biases."
Constructive critique -- Beren Millidge (Dec 2025): SLT's insights may be more general than SLT proves. The real action may be in "pseudo-singularities" from SGD noise and finite training, not true mathematical singularities. SLT practitioners "should study the noisy stochastic optimization regime much more closely."
Community endorsement -- Zvi Mowshowitz (2025): Rates Timaeus "High" confidence with "High" funding need. "Slam dunk for interpretability funding." Notes excellent advisors and that "Evan, John Wentworth and Vanessa Kosoy have offered high praise, and there is evidence they have impacted top lab research agendas."
Theoretical gap acknowledged by founders -- Murfet: the bridge between Bayesian statistics (where SLT theorems are proven) and SGD training (how models are actually trained) is "not a theoretically justified step at this point in some rigorous sense." The empirical results are encouraging but the theoretical bridge is incomplete.
What's Absent
- No 990 financial filings available; EIN unknown. Cannot verify financial details independently.
- No board of directors separate from leadership team. Governance is opaque for a $2.5M+ org.
- No specific documented examples of frontier labs adopting SLT techniques, despite talks at all major labs. The claimed "impact on lab research agendas" is unverifiable.
- No published results from UK AISI partnership projects (due April 2025 per Feb 2025 update). Status unknown as of March 2026.
- No formal published response to Skalse's generalization critique.
- No documented institutional safeguards for capabilities risk, despite promising them in Oct 2023.
- Alexander Gietelink Oldenziel's (co-founder) current status at Timaeus is unclear.
Recommended Reading
AXRP Episode 31: Singular Learning Theory with Daniel Murfet (https://axrp.net/episode/2024/05/07/episode-31-singular-learning-theory-dan-murfet.html) -- 21K-word deep dive where Murfet explains SLT's foundations, honestly acknowledges theoretical gaps, and articulates alignment relevance. The most technically candid source available.
Joar Skalse: "My Criticism of Singular Learning Theory" (https://www.lesswrong.com/posts/ALJYj4PpkqyseL7kZ/my-criticism-of-singular-learning-theory) -- The strongest technical critique. Essential counterpoint to the SLT enthusiasm.
Hyper-Exponential: "Safe AI with Singular Learning Theory" (https://www.hyper-exponential.com/p/safe-ai-with-singular-learning-theory) -- Jesse tells the founding story and explains SLT in plain language. Best accessible introduction to what Timaeus is actually doing and why.
"Timaeus in 2024" (https://www.lesswrong.com/posts/gGAXSfQaiGBCwBJH5/timaeus-in-2024) -- Most recent organizational update covering research progress, UK AISI partnership, and strategic direction.
Cognitive Revolution: "Embryology of AI" (https://www.cognitiverevolution.ai/embryology-of-ai-how-training-data-shapes-ai-development-w-timaeus-jesse-hoogland-daniel-murfet/) -- Joint interview where Jesse and Daniel explain the developmental biology analogy and contrast with SAE-based interpretability.