The Cost and Platform Landscape of Enterprise AI
The headline cost of model inference is a small and shrinking fraction of what enterprises actually spend to run generative AI in production. The production evidence reviewed here consistently shows inference accounting for 20–40% of run-rate cost for mature deployments. Retrieval, evaluation, observability, governance, and human review consume the remainder — and are largely invisible at planning time.
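The 20–40% range implies a simple multiplier: if inference is a known share of run-rate cost, the total is the inference bill divided by that share, so 2.5 to 5 times the bill. The sketch below works through that arithmetic. The function name and the monthly inference figure are hypothetical and chosen only for illustration; they do not come from the evidence reviewed here.

```python
def implied_total_cost(inference_spend: float, inference_share: float) -> float:
    """Total run-rate cost implied by an inference bill and its share of the total."""
    if not 0 < inference_share <= 1:
        raise ValueError("inference_share must be in (0, 1]")
    return inference_spend / inference_share


# Hypothetical monthly inference bill (an assumption, not a sourced figure).
monthly_inference = 100_000.0

# Bounds of the 20-40% share range quoted in the text.
low_total = implied_total_cost(monthly_inference, 0.40)   # inference is 40% of total
high_total = implied_total_cost(monthly_inference, 0.20)  # inference is 20% of total

# Everything that is not inference: retrieval, evaluation, observability,
# governance, and human review.
low_other = low_total - monthly_inference
high_other = high_total - monthly_inference

print(f"Implied total run-rate cost: {low_total:,.0f} to {high_total:,.0f}")
print(f"Implied non-inference cost:  {low_other:,.0f} to {high_other:,.0f}")
```

Under these assumptions, a budget sized from the inference bill alone would cover between one fifth and two fifths of the run-rate cost.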
