Theory of Change
Aligned AI was founded on the theory that concept extrapolation -- teaching AIs to generalize human concepts correctly across changing environments -- is "the very center" of AI alignment. Co-founder Stuart Armstrong:
"At some point, it'll become much easier to generalize capabilities than to generalize values, goals, those kind of objects. If the goals change as the world model changes, then you are naturally -- well not naturally, artificially -- solving part of the alignment problem." (AXRP #18, Sep 2022)
The mechanism: develop algorithms (ACE) that notice when an AI's world model changes, generate multiple candidate extrapolations of its goals, and select safe ones. Armstrong gave his agenda a 10% chance of directly achieving alignment and a 95% chance that "some of these ideas will be very useful for other methods of alignment."
By 2025-2026, the theory of change has shifted fundamentally. CEO Rebecca Gorman now argues that LLMs are "glorified autocomplete," scaling laws are dead, and AGI via chatbots is impossible. The company has pivoted to Canvas, an offline family-safe hardware+software ecosystem for children, with a Wikimedia Enterprise partnership. The new theory of change addresses near-term AI harms (parasocial relationships, cognitive deskilling, content safety) rather than existential risk from misaligned superintelligence.
The company's mission-lock clause is broad enough to encompass both: "To create technology (particularly AI and machine learning) that increases human safety, agency, ability, well-being, self-actualisation, and understanding."
What They Do
Technical alignment research (2022-2023):
- ACE (Algorithm for Concept Extrapolation) beat the CoinRun misgeneralization benchmark: 72% vs 59% prior best (55% random baseline). Patent pending.
- Content moderation tool scored 97% vs OpenAI's 32% on Google Jigsaw's benchmark.
- faAIr: gender bias measurement tool tested across 14 LLMs. Won CogX 2023 "Best Innovation: Algorithmic Bias Mitigation."
- GPT-Eliezer/DATDP: jailbreak defense method blocking 99.5-100% of augmented jailbreak attempts. Open-sourced.
Near-term safety and consumer product (2025-2026):
- AI Chaperones: framework for preventing parasocial relationships with chatbots (Aug 2025 paper).
- Emergent misalignment analysis: proposed "broken superego" hypothesis for finetuned models (Mar 2025).
- Canvas: offline digital ecosystem with e-reader, "Privacy Pod" (family cloud), offline Wikipedia, non-chatbot AI, and coding tools for children. Announced Feb 2026, waitlist only.
- Wikimedia Enterprise partnership for offline Wikipedia integration (Mar 2026).
Policy engagement: Gorman has advised EU, UN, OECD, and UK Parliament. Papers on EU AI Act manipulation mechanisms (27 citations).
Publication record: 7 formal papers (arXiv), ~15 blog posts on technical topics. No publications in top ML venues (NeurIPS, ICML, ICLR). Most-cited paper: "Recognising the importance of preference change" (36 citations, co-authored with Franklin and Ashton).
Key People
Rebecca Gorman -- Co-Founder & CEO. American, grew up in Silicon Valley, programmed since age 8, Oxford-educated. 20+ years AI experience. Strongly contrarian worldview: dismisses LLMs as "glorified autocomplete" and predicts an AI winter. REWork Top 100 Women Advancing AI, Fortune 50 AI Innovators (both 2023). SERI MATS mentor.
Stuart Armstrong -- Co-Founder & Chief Mathematician. British/Canadian. Decade at FHI Oxford. Pioneer of safely interruptible agents (with DeepMind's Orseau), corrigibility, low-impact agents. Author of "Smarter Than Us" (MIRI, 2014). Co-authors include Bostrom, Orseau, Leike, Soares, Sandberg. DPhil in Cartan geometry.
Team: 2-10 employees per LinkedIn. Social anthropologist Jocelyn Kelley joined for Canvas market research. Advisors include Dylan Hadfield-Menell (MIT), Adam Gleave (FAR.AI), Anders Sandberg.
Notable departure: Dr. Adam Bell served as director for only 2 months (May-Jul 2022). Described as "IP Advisor" -- likely a limited-scope engagement.
Money and Incentives
Total funding: ~$730K angel round (Sep 2023) at $24M post-money valuation. Investors anonymous. Additional small raises suggested by share allotments in Jun 2022, Mar 2024, and Jan 2025. Funded by "bootstrapping and angel investments."
Revenue: No evidence of product revenue from any source. EquitAI, faAIr, and ClassifAI have no visible customers. Canvas is pre-launch (waitlist). Gorman claimed "more inbound sales inquiries than we can service" in Aug 2023, but no visible adoption followed.
No Coefficient Giving/Open Philanthropy funding. This is notable: CG/Open Phil is the dominant funder of alignment research organizations. The absence suggests either non-application, rejection, or deliberate avoidance.
Financial health is opaque. UK "total exemption full accounts" provide minimal detail. A ~$730K raise in Sep 2023 should be nearly exhausted by March 2026 even at minimal burn. The Jan 2025 share allotment may represent bridge funding.
Compute constraints: Gorman reveals they investigated matching OpenAI's compute but found it infeasible: "we put a ridiculous amount of time and energy dragging up every piece of free and paid and back-room intel on how much compute was used to train GPT-3 and GPT-4."
Business model: Currently unclear. Canvas appears to be the intended revenue path -- a hardware+software consumer product -- but no pricing, ship date, or hardware specs are public. The company has no visible government contracts, lab partnerships, or consulting revenue.
Incentive analysis: As a for-profit with mission-lock, the company needs to generate revenue or raise more capital. The pivot from alignment research (hard to monetize) to consumer product (addressable market exists) is consistent with survival pressure. The risk: consumer product demands diverge from alignment research priorities.
What Others Say
External criticism is essentially absent. No alignment researcher has published a focused critique of Aligned AI's approach, concept extrapolation's viability, or the company's commercial strategy. This reflects the company's small size rather than the quality of its work -- it is below the radar of most commentators.
The alignment research community has not engaged deeply. No Open Phil funding, no mentions in major alignment roundups, limited citations. Armstrong went from being a widely-cited FHI researcher to running a company whose alignment work the community largely ignores.
Gorman's "state of AI" views are themselves controversial. Her claim that scaling inference "can't work" because errors compound oversimplifies -- it ignores verification steps, error correction, and the distinction between serial and parallel chains. "Glorified autocomplete" is reductive about transformer capabilities. A thoughtful critic would note that dismissing the entire LLM paradigm conveniently aligns with having failed to raise compute-scale funding.
The pivot can be read charitably or uncharitably. Charitably: Gorman genuinely assessed the AI landscape, concluded near-term harms matter more, and built a product to address them. Uncharitably: the company couldn't compete in alignment research, couldn't raise significant funding, and pivoted to a consumer market where small teams can compete.
Fortune coverage (2023) positioned the company as a credible small player: "These small startups are making headway on A.I.'s biggest challenges."
What's Absent
- No concept extrapolation applied beyond toy benchmarks (image classification, video games) to real-world systems after 4 years.
- No publications in top ML venues despite Armstrong's prior academic standing.
- No visible product revenue or customer base from any product.
- No engagement from major alignment organizations with the company's research.
- Angel investor identities undisclosed.
- EA Forum launch post and AMA (Feb/Mar 2022) inaccessible -- likely contain the most detailed original theory of change.
- No articulation of how Canvas connects to alignment research expertise.
- No evidence Armstrong is continuing to advance concept extrapolation theory post-2023.
Recommended Reading
Rebecca Gorman, "A letter on the state of AI" (Oct 2025) -- https://buildaligned.ai/blog/a-letter-on-the-state-of-ai -- The most candid source. Gorman's unfiltered contrarian worldview: LLMs are glorified autocomplete, scaling laws are dead, an AI winter is coming. Reveals the intellectual foundation for the Canvas pivot.
Stuart Armstrong on AXRP #18 (Sep 2022) -- https://axrp.net/episode/2022/09/03/episode-18-concept-extrapolation-stuart-armstrong.html -- The deepest technical articulation of concept extrapolation as "the very center" of alignment. Essential for understanding what the company set out to do.
Fortune: CoinRun breakthrough (Sep 2023) -- https://fortune.com/2023/09/28/u-k-startup-aligned-ai-claims-coinrun-ai-safety-breakthrough/ -- Best external account of the technical achievement, funding, and commercial positioning.
Wikimedia Enterprise: Canvas partnership (Mar 2026) -- https://enterprise.wikimedia.com/blog/aligned-ai-is-developing-ethical-ai/ -- The clearest picture of what the company looks like now: an offline family AI ecosystem, not an alignment research lab.