Cost & Efficiency

Token Margin Tracker

Estimate the gap between public token prices and rough raw hosting costs for open-weight model serving.

How to use this dashboard

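Use this tracker to estimate the spread between public API token prices and assumed raw hosting costs. The spread is the public price divided by the estimated raw cost (for example, 0.18 / 0.04 = 4.5x). Treat it as a pricing signal, not a profit-margin claim: the raw-cost estimates leave out items such as engineering, support, redundancy, and idle time.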
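As a rough illustration of how a spread figure can be derived, the sketch below divides a public price by a raw-cost estimate and reports "Unknown" when either value is missing. The function names `spread_multiple` and `format_spread` are hypothetical, and treating zero or negative prices as "not meaningful" is an assumption made for this example, not a documented rule of the dashboard.

```python
from typing import Optional


def spread_multiple(public_price: Optional[float],
                    raw_cost: Optional[float]) -> Optional[float]:
    """Return public_price / raw_cost, or None when no ratio can be formed.

    Assumption: missing, zero, or negative inputs are treated as
    "not knowable" rather than producing a misleading number.
    """
    if public_price is None or raw_cost is None:
        return None
    if public_price <= 0 or raw_cost <= 0:
        return None
    return public_price / raw_cost


def format_spread(value: Optional[float]) -> str:
    """Render a spread as a one-decimal multiple such as '4.5x'."""
    return "Unknown" if value is None else f"{value:.1f}x"


# Price and raw-cost pairs taken from the tracker rows; None marks an unknown cost.
rows = [
    ("Open-weight 8B chat model", 0.18, 0.04),
    ("Open-weight 70B chat model", 0.9, 0.28),
    ("Frontier reasoning model", 15, None),
    ("Self-hosted embedding model", 0.1, 0.015),
]

for model, price, cost in rows:
    print(f"{model}: {format_spread(spread_multiple(price, cost))}")
```

With the values above, the three rows that have a raw-cost estimate come out at 4.5x, 3.2x, and 6.7x, matching the tracker, and the closed frontier model is reported as Unknown.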

Token Margin Tracker

11 records
Provider | Model | Public price | Est. raw cost | Spread | Assumptions / notes | Confidence
--- | --- | --- | --- | --- | --- | ---
API wrapper / router | Open-weight 8B chat model | 0.18 | 0.04 | 4.5x | High utilization on low-cost GPU; excludes engineering, support, margin, failed generations, and idle time | Low / estimate
Hosted open model API | Open-weight 70B chat model | 0.9 | 0.28 | 3.2x | Steady batch traffic on marketplace GPU; excludes redundancy and orchestration cost | Low / estimate
Premium proprietary API | Frontier reasoning model | 15 | Unknown | Not knowable | Closed model. Raw cost cannot be verified from public data. | Unknown
GPU marketplace self-host | Self-hosted embedding model | 0.1 | 0.015 | 6.7x | Embeddings are easy to batch; utilization matters more than peak GPU speed | Medium estimate
Serverless inference provider | Open-weight mixture-of-experts model | 0.65 | 0.18 | 3.6x | MoE serving has routing/VRAM complexity; raw GPU cost is not the full cost | Low / estimate
OpenRouter | Public model and pricing catalog | 0 | Not knowable from public data | Unknown | Closed/private cost stack. Public API price can be shown, but true raw serving cost cannot be verified. | Public price only
OpenRouter | Pareto Code Router | -1,000,000 | Not knowable from public data | Unknown | Closed/private cost stack. Public API price can be shown, but true raw serving cost cannot be verified. | Public price only
OpenRouter | Body Builder (beta) | -1,000,000 | Not knowable from public data | Unknown | Closed/private cost stack. Public API price can be shown, but true raw serving cost cannot be verified. | Public price only
Nvidia | NVIDIA: Nemotron 3 Nano Omni (free) | 0 | Not knowable from public data | Unknown | Closed/private cost stack. Public API price can be shown, but true raw serving cost cannot be verified. | Public price only
Poolside | Poolside: Laguna XS.2 (free) | 0 | Not knowable from public data | Unknown | Closed/private cost stack. Public API price can be shown, but true raw serving cost cannot be verified. | Public price only
Poolside | Poolside: Laguna M.1 (free) | 0 | Not knowable from public data | Unknown | Closed/private cost stack. Public API price can be shown, but true raw serving cost cannot be verified. | Public price only