VoltanaLLM: Feedback-Driven Frequency Control and State-Space Routing for Energy-Efficient LLM Serving

arXiv – cs.AI Original
Anzeige

Ähnliche Artikel