Models on
your metal.

Frontier-grade capability doesn't have to live behind a meter. We serve open-weight and fine-tuned models on your own compute — owned, private, and free of per-token billing.

models behind a meter→models on your metal

Book the Diagnostic → What we build

The shadow

Capability you rent by the token.

A hosted model is capability you rent — metered, monitored, and gone the moment you stop paying. Your prompts and your data ride someone else's wire to get there.

The reality

Capability you host.

Open-weight models, optionally fine-tuned, served on the compute layer you own. The data stays in-perimeter, the cost is your hardware, and the capability is yours to keep.

What we build

Models, owned.

Open weights

Inspectable models

Open models you can run, inspect, and modify — no black box you're only allowed to query.

On your compute

Your region, your metal

Served on the owned compute layer (OpenStack / Ubuntu), in a region you define.

No meter

Priced as hardware

Capability scales with your infrastructure, not a per-token bill that grows with success.

Private

In-perimeter by default

Prompts and data never leave your perimeter to be answered — nothing to leak, nothing to train.

Where it fits

The serving tier of Intelligence.

Self-Hosted Models are how L6 — Intelligence — actually runs. TL-MoA orchestrates them, Custom AI Development tunes them, and they sit on the Compute layer (L2).

L6 · Intelligence TL-MoA Custom AI Development L2 · Compute EDEN ★

Start here