The generative AI revolution faces a multi-trillion-dollar challenge: the soaring cost of inference, or running AI models.