OpenAI Halves Its Inference Costs With Software Alone

02 Jul 202610:52

Project update

OpenAI Halves Its Inference Costs With Software Alone

OpenAI engineers found a software optimization that cuts the cost of running its models by more than 50% — squeezing far more from existing GPUs, with no new hardware. Applied to logged-out ChatGPT, it now runs on just a couple hundred Nvidia GPUs — a fraction of the compute once needed. The savings give OpenAI room to widen margins, raise ChatGPT usage limits, or ease API pricing, per The Information.

Source

Theinformation