xiji2646-netizen
Gemini 3.5 Flash launched today - quick breakdown for anyone running agent workloads
Google shipped 3.5 Flash at I/O 2026. The “budget” Flash model now beats 3.1 Pro on coding and tool-calling benchmarks.
Key numbers (from Google):
- MCP Atlas (tool calling): 83.6% vs 3.1 Pro’s 78.2%
- Terminal-Bench (coding): 76.2% vs 70.3%
- Finance Agent v2: 57.9% vs 43.0%
- 4x faster and ~40% cheaper than 3.1 Pro
- $1.50/M input, $9/M output, $0.15/M cached
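To put the pricing in per-request terms, here is a rough sketch of a cost estimator using the three rates listed above. The function name and the billing model are my assumptions: I treat cached tokens as a separate bucket billed at the cached rate and not counted again as regular input. Check the actual billing docs before relying on it.

```python
# Prices quoted in the post, in USD per million tokens.
PRICE_PER_M = {"input": 1.50, "output": 9.00, "cached": 0.15}


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumption: cached_tokens are billed at the cached rate and are
    NOT also included in input_tokens.
    """
    total = (
        input_tokens * PRICE_PER_M["input"]
        + output_tokens * PRICE_PER_M["output"]
        + cached_tokens * PRICE_PER_M["cached"]
    )
    return total / 1_000_000


if __name__ == "__main__":
    # One agent step: 20k fresh input tokens, 80k cached context, 2k output.
    print(f"${estimate_cost(20_000, 2_000, cached_tokens=80_000):.4f}")
```

For long agent loops where most of the context is repeated between steps, the cached rate is what dominates the bill, so the cache hit ratio matters more than the headline input price.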
Where it does NOT win:
- Computer Use: not supported (GPT-5.5 only)
- SWE-Bench Pro: Opus 4.7 still leads
- Abstract reasoning: 3.1 Pro still edges it
My quick take on model routing:
- Multi-tool agent loops → Flash
- Heavy code refactoring → Opus 4.7
- GUI automation → GPT-5.5
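The routing above could be sketched as a simple lookup table. This is only an illustration: the task categories, the `pick_model` helper, and the model label strings are placeholders I made up, not real API model IDs, and falling back to Flash for unknown tasks is my own assumption.

```python
from enum import Enum


class Task(Enum):
    MULTI_TOOL_AGENT = "multi_tool_agent"
    HEAVY_REFACTOR = "heavy_refactor"
    GUI_AUTOMATION = "gui_automation"
    OTHER = "other"


# Placeholder labels for the models named in the post, not real API model IDs.
FLASH = "gemini-3.5-flash"
OPUS = "opus-4.7"
GPT = "gpt-5.5"

ROUTES = {
    Task.MULTI_TOOL_AGENT: FLASH,  # multi-tool agent loops
    Task.HEAVY_REFACTOR: OPUS,     # heavy code refactoring
    Task.GUI_AUTOMATION: GPT,      # GUI automation / computer use
}


def pick_model(task: Task, default: str = FLASH) -> str:
    """Return the model label for a task, falling back to `default`."""
    return ROUTES.get(task, default)


if __name__ == "__main__":
    for task in Task:
        print(task.value, "->", pick_model(task))
```

Keeping the table as data makes it easy to swap a route once real-world results come in.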
Anyone tested it on real agent workflows yet? Curious how the 4x speed claim holds up in practice.
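If anyone wants to check the speed claim on their own workload, a minimal timing harness might look like the sketch below. It is provider-agnostic on purpose: `baseline` and `candidate` are hypothetical zero-argument callables that you would wire up to your own client code for each model, and the helper names are mine.

```python
import statistics
import time
from typing import Callable


def median_latency(call: Callable[[], object], runs: int = 5) -> float:
    """Run `call` several times and return the median wall-clock seconds."""
    samples = []
    for _ in range(runs):
        start = time.perf_counter()
        call()
        samples.append(time.perf_counter() - start)
    return statistics.median(samples)


def speedup(baseline: Callable[[], object],
            candidate: Callable[[], object],
            runs: int = 5) -> float:
    """Ratio of baseline latency to candidate latency (>1 means candidate is faster)."""
    return median_latency(baseline, runs) / median_latency(candidate, runs)


if __name__ == "__main__":
    # Stand-ins for real model calls; replace with your own request functions.
    slow = lambda: time.sleep(0.04)
    fast = lambda: time.sleep(0.01)
    print(f"speedup: {speedup(slow, fast):.1f}x")
```

Using the same prompt and tool set for both callables, and the median rather than the mean, keeps one slow network round trip from skewing the comparison.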