xiji2646-netizen

DeepSeek V4 is live in preview — should your team switch?

DeepSeek officially launched deepseek-v4-flash and deepseek-v4-pro in preview on April 24, 2026. The legacy routes (deepseek-chat, deepseek-reasoner) are scheduled for deprecation on July 24, 2026.

I’ve been thinking through how to actually structure a routing decision around this, and wanted to share a few observations:

The key trade-off is not “DeepSeek vs Claude/GPT” — it’s “which tier for which workload.”

Flash at $0.14/$0.28 per 1M tokens is a serious candidate for coding agents, repo analysis, and long-context summarization. Pro at $1.74/$3.48 sits between Flash and the premium tier.

The question I’d be asking: what percentage of your current workloads could Flash handle without quality regression? For many teams doing code gen and repo reading, the answer might be 60-80%. That’s a meaningful cost change.
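
A rough way to put numbers on that question is a blended-cost estimate. The sketch below assumes the two figures quoted for each tier are input and output prices per 1M tokens, and it uses a made-up premium price (`PREMIUM`) purely as a placeholder; swap in your own contract rates and token volumes.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Price:
    """USD per 1M tokens. Assumption: first figure is input, second is output."""
    input_per_m: float
    output_per_m: float


FLASH = Price(0.14, 0.28)      # figures quoted in the post
PRO = Price(1.74, 3.48)        # figures quoted in the post
PREMIUM = Price(3.00, 15.00)   # hypothetical placeholder, not a real quote


def cost(price: Price, input_m: float, output_m: float) -> float:
    """Cost in USD for a volume given in millions of tokens."""
    return price.input_per_m * input_m + price.output_per_m * output_m


def blended_cost(flash_share: float, input_m: float, output_m: float,
                 premium: Price = PREMIUM) -> float:
    """Cost when `flash_share` of traffic goes to Flash and the rest stays premium."""
    if not 0.0 <= flash_share <= 1.0:
        raise ValueError("flash_share must be between 0 and 1")
    rest = 1.0 - flash_share
    return (cost(FLASH, input_m * flash_share, output_m * flash_share)
            + cost(premium, input_m * rest, output_m * rest))


if __name__ == "__main__":
    # Example volume: 100M input and 20M output tokens per month (illustrative).
    for share in (0.0, 0.6, 0.8):
        print(f"flash share {share:.0%}: ${blended_cost(share, 100, 20):,.2f}")
```

This treats every workload as having the same input/output mix, which is rarely true in practice, so read the result as an order-of-magnitude check and not a forecast.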

The caveat: V4 is preview. Reuters used that word explicitly. Run your own eval set before committing to routing changes, and keep rollback paths to your existing premium routes.
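
One way to make "eval first, keep rollback" concrete is a small gate in front of the routing change. This is only a sketch: the `premium-route` name, the 2% regression tolerance, and the `primary`/`fallback` callables are hypothetical stand-ins for whatever clients and thresholds your team already uses.

```python
from typing import Callable, Sequence


def pass_rate(results: Sequence[bool]) -> float:
    """Fraction of eval cases that passed."""
    if not results:
        raise ValueError("eval set is empty")
    return sum(results) / len(results)


def choose_route(baseline: Sequence[bool], candidate: Sequence[bool],
                 max_regression: float = 0.02,
                 candidate_route: str = "deepseek-v4-flash",
                 fallback_route: str = "premium-route") -> str:
    """Pick the candidate only if its pass rate is within tolerance of the baseline."""
    regression = pass_rate(baseline) - pass_rate(candidate)
    return candidate_route if regression <= max_regression else fallback_route


def call_with_rollback(prompt: str,
                       primary: Callable[[str], str],
                       fallback: Callable[[str], str]) -> str:
    """Try the new route; on any error, fall back to the existing one."""
    try:
        return primary(prompt)
    except Exception:
        # Broad on purpose: any failure on the preview route triggers rollback.
        return fallback(prompt)
```

The gate decides per workload from your own eval results, and the wrapper keeps the existing premium route reachable at request time even after a workload has been moved.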


Has anyone already run Flash or Pro against production workloads? Curious what failure modes you’ve seen, if any. The tool-call reliability question is the one I’m most uncertain about.
