Discussion about this post

User's avatar
JP's avatar

Running locally is still the best option for privacy and zero restrictions. No argument there.

But there's a middle ground now for people who don't want to manage the hardware. Proxy services have popped up that give you flat-rate access to the full DeepSeek V3 (not the distilled versions) alongside GLM-4.7 and others for $30/mo. The tradeoff is obvious: you're sending requests to external servers. But for anyone already using Claude Code or similar cloud agents, the flat pricing removes the token anxiety you'd get with pay-per-use APIs. And you get the full 671B model rather than a distilled variant.

I wrote up the whole setup here https://reading.sh/how-to-get-3x-claude-rate-limits-for-30-a-month-1d3fdb8658df

No posts

Ready for more?