news · Reddit r/MachineLearning
How're you deploying LLMs in production now-a-days? What's the best and most affordable way? [D]
<!-- SC_OFF --><div class="md"><p>I've been developing an AI product using LLM APIs (from OpenRouter) but want to deploy an open-source LLM in my own Prod env. which I can control. </p> <p>Few reasons behind this are:</p> <p>- I wanna own the complete stack around my product.</p> <p>- Second I wanna fine-tune the model around my usecase. </p> <p>So, what's the most affordable but a good platform for this? I'm not an AI engineer so don't wanna stuck in CUDA or Transformers hell, anything which ca
Want the primary source?View original →