My SBC cluster runs bigger models than a single Raspberry Pi, but the trade-offs are brutal ...
We moved away from an LLM-first approach and shifted toward a code-first architecture with bounded AI assistance.