ThermlTHERML

Est. 2026  ·  Stealth

The world needs 10× more
inference than it can build.

// First principles

The shift

Two ways to make a machine think.

A short film. Click any chapter to jump straight to it.

Chapter 1 · Intro

How a machine thinks.

A short film, in two substrates and three architectures.

/ 01. The Cost

AI is outpacing
what we can build.

< 3W
LLM inference
To match an NVIDIA B200 burning a kilowatt for the same job.
500K+
tokens / second
An NVIDIA H100 manages roughly fifteen thousand.
0 W
on the memory bus
Because there isn't one.

For seventy years, digital silicon solved every hard problem, from weather and fluid flow to option prices and protein folding, by running numerical approximations of stochastic math on deterministic transistors. It was brilliant, and it led us blind into the inference era. Modern AI is stochastic too, but the silicon still pretends it isn't.

Every GPU on earth is paying a tax to do an arithmetic impression of what a physical system would do for free, and roughly eighty percent of its energy moves data rather than computes with it, which works out to six hundred billion dollars this decade. The ten-times shortfall above is not a resource problem; it sits on the wrong assumption, that stochastic math belongs on deterministic silicon, and you cannot out-spend an assumption.

There is no version of this that ends with enough GPUs.

/ 02. The Answer

We built
the other computer.

So we changed the assumption. Instead of bending physics to squeeze more performance out of the wrong substrate, we asked the physics to do the work for us. We built silicon where the equations are the circuit, and where inference is the natural physics of the system.

~18,000×
energy efficiency vs. H100
58M
inferences per heartbeat

/ 03. Access

Investor
portal.

Data room · investor materials · partnership inquiries