After self-hosting LLMs for a year, I realized that models are not the real bottleneck
…I treated the prompt box exactly like a Google search bar. I’d throw a messy handful of keywords, a vague sentence, and a prayer at the model, fully expecting it to…
