We're hiring our first FDEs!
Being an FDE at Google DeepMind has a fun flavor. You will:
🤝 Act as a technical bridge between research teams & strategic partners
🛠️ Build benchmarks and evaluation tooling
Work at the frontier of AI
Join us:
https://t.co/3ybAhNZngz
The scariest bug from a coding agent isn't the one that crashes.
It's the one that runs cleanly, passes tests, and quietly produces wrong results.
So we built one on @Antigravity's Gemini Managed Agents API to hunt them.
Give it a repo. Get back the bugs that passed review.
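To make "runs cleanly, passes tests, and quietly produces wrong results" concrete, here is a minimal sketch of that kind of bug. The functions `median` and `median_fixed` are hypothetical illustrations, not code from the agent or from any repo it scanned.

```python
def median(values):
    """Intended to return the median of a list of numbers."""
    ordered = sorted(values)
    # Silent bug: for even-length input this returns the upper middle
    # element instead of averaging the two middle elements.
    return ordered[len(ordered) // 2]


def median_fixed(values):
    """Median that handles both odd- and even-length input."""
    ordered = sorted(values)
    mid = len(ordered) // 2
    if len(ordered) % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2


# The only existing test uses an odd-length list, so it passes
# for both versions and the bug survives review.
assert median([3, 1, 2]) == 2
assert median_fixed([3, 1, 2]) == 2

# An even-length input exposes the difference: no crash, just a wrong number.
buggy = median([1, 2, 3, 4])          # 3
correct = median_fixed([1, 2, 3, 4])  # 2.5
```

Nothing here raises an exception, which is why this class of bug is harder to catch than a crash.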
New stat from @vercel's AI Gateway in @BusinessInsider: Gemini 3 Flash is leading across AI models in token usage as of April. See more stats on how developers are using our models: https://t.co/0xbulg4X2S
via @BusinessInsider
Can you set up an autonomous fine-tuning loop entirely from your phone?
As a fun experiment, I used my @NousResearch Hermes agent to orchestrate @karpathy's autoresearch to improve @googlegemma 4 E2B's tool-calling perf on BFCL. @modal made it v easy to spin up the necessary compute and run the experiments.
All of this was set up from my phone! It took a bit of debugging, but the pipeline is finally stable and running experiments.
I also used the Gemini API's new flex inference to cut costs on the scientist by 50%. Since this is a long-running task, trading latency for lower cost was worth it.
Regardless of end performance, being able to do this all from my phone is a pretty cool glimpse into the future of agentic UX!
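For a rough sense of the flex inference trade-off mentioned above, here is a back-of-the-envelope sketch. The 50% discount comes from the post; the token counts and per-million-token prices are made-up placeholders, not real Gemini pricing, and `run_cost` is a hypothetical helper.

```python
def run_cost(input_tokens, output_tokens, price_in_per_m, price_out_per_m, discount=0.0):
    """Estimate the cost of a run given per-million-token prices.

    `discount` is the fraction taken off the standard price (0.5 = 50% off).
    """
    base = (
        input_tokens / 1_000_000 * price_in_per_m
        + output_tokens / 1_000_000 * price_out_per_m
    )
    return base * (1 - discount)


# Placeholder workload and prices (assumptions for illustration only).
INPUT_TOKENS = 40_000_000
OUTPUT_TOKENS = 5_000_000
PRICE_IN = 0.50   # hypothetical $ per 1M input tokens
PRICE_OUT = 3.00  # hypothetical $ per 1M output tokens

standard = run_cost(INPUT_TOKENS, OUTPUT_TOKENS, PRICE_IN, PRICE_OUT)
flex = run_cost(INPUT_TOKENS, OUTPUT_TOKENS, PRICE_IN, PRICE_OUT, discount=0.5)

print(f"standard: ${standard:.2f}, flex: ${flex:.2f}, saved: ${standard - flex:.2f}")
```

For a long-running background loop where nobody is waiting on each response, halving the bill in exchange for slower responses is an easy call.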
@bernaferrari @reah_ai If you're using AI Studio, you can enable Logs for your project. This lets you track all the requests made via the project.
https://t.co/2uEkwwnqj0