Today2h ago
Project update
OpenAI Halves Its Inference Costs With Software Alone

OpenAI Halves Its Inference Costs With Software Alone

OpenAI engineers found a software optimization that cuts the cost of running its models by more than 50% — squeezing far more from existing GPUs, with no new hardware.


Applied to logged-out ChatGPT, it now runs on just a couple hundred Nvidia GPUs — a fraction of the compute once needed.


The savings give OpenAI room to widen margins, raise ChatGPT usage limits, or ease API pricing, per The Information.