Custom with open-source (LangChain, custom RAG, llama.cpp, vLLM) wins when the process is strategically important but can be built on existing components. Most custom projects in mid-sized B2B fit this category. Example: AI knowledge search for a specific company with 200k technical documents. SaaS isn't enough (too specific), but you also don't need your own model — a good RAG on open-source is sufficient.
Custom from scratch (own model, fine-tuning, own on-prem GPU infrastructure) rarely wins in mid-market B2B. Realistic scenarios: pharma with highly sensitive patient data, defense, fintech with compliance that bans cloud, or companies with a very unique domain (e.g. AI for the mining industry with 50 years of geological data).
Build also wins when: you have 2–3 senior AI engineers in IT (or plan to hire them), long-term TCO is lower (typically at 100+ users on a core system), industry regulation requires on-prem (NIS2, DORA, sector-specific), or you want to build your own IP.