vLLM 1 Forecasting Your Private LLM Resources: Unlocking Lightning-Fast, Scalable AI Performance Jul 14, 2024