Ivan Nardini @ivnardini - Twitter Profile

Pinned Tweet

about 1 month ago

Presenting at the Next '26 Developer Keynote is one of those moments I'll remember for a long time. Thank you to everyone who played a part. Till the next Next! https://t.co/O08VVShuoG

5

16

1

1K

Ivan Nardini

@ivnardini

3 days ago

I will personally realize two dreams here: going to Japan and presenting with Kaz-san.

Kazunori Sato

@kazunori_279

3 days ago

At Code w/ Claude, the Anthropic-hosted event on 6/10, @ivnardini and I will present "Building with Claude on GoogleCloud". On-site seats are already full, but the session can be watched online, so please join. https://t.co/bIWi3Xmvee

0

30

3

12

7K

1

5

0

4K

Ivan Nardini

@ivnardini

4 days ago

With v2.1.158, Anthropic shipped Auto mode in Claude Code with Google Cloud. You can now run commands in Claude Code using Claude models on Google Cloud without stopping for permission prompts every time. https://t.co/iDHnBLoqko

0

3

2

1

1K

Ivan Nardini

@ivnardini

11 days ago

Next Friday we are running a hands-on Claude Code on Google Cloud workshop together with the @AnthropicAI team in SF. Half day, guided labs, and live Q&A. Link https://t.co/Lgi7B11HuB

0

6

0

1

258

Ivan Nardini

@ivnardini

11 days ago

GitHub https://t.co/xhAoEuFYpN Docs https://t.co/ULy5EpE0fT

0

113

Ivan Nardini

@ivnardini

11 days ago

I looked into Keras Kinetic recently. Keras Kinetic is a framework that lets you run Keras and JAX workloads on Cloud TPUs by writing a training function and adding a decorator. Personally, it is one of the easiest ways I’ve seen to run a first TPU job so far. Here is a great blog post on fine-tuning Gemma to speak Gen-Z slang using Kinetic. Blog https://t.co/lqZu5MCI6U

1

2

0

2

274

Ivan Nardini

@ivnardini

11 days ago

@graceg0ng Ofc! Great job Grace

0

38

Ivan Nardini

@ivnardini

12 days ago

A good read about Cloud TPU generations https://t.co/w16T3cO18J

0

9

3

0

1K

Ivan Nardini

@ivnardini

14 days ago

What a great series about getting started with TPUs https://t.co/6Cd9mdhhKJ

1

0

1

207

Ivan Nardini

@ivnardini

27 days ago

I spent some time testing elastic training capabilities on MaxText recently. MaxText is Google’s open-source JAX library for the full LLM lifecycle scaling from one host to hundreds of TPU chips. Pre-train with train method, run SFT/DPO/GRPO in the same package, and serve via vLLM. It supports several models including Gemma, DeepSeek, Qwen, Kimi and more. Docs https://t.co/ppOa6xUMu9 Tutorial coming soon.

0

452

Ivan Nardini

@ivnardini

29 days ago

Wrapping up the demo for Code with Claude in SF. If you’re around, I'm happy to talk. See you tomorrow!

0

1

0

298

Ivan Nardini

@ivnardini

about 1 month ago

Ray Serve now supports multi-host TPU slice deployments with gang scheduling. Before, TPU slices required manual host counts and bundle replication, with no guarantee of a single co-located slice. Now, Ray Serve uses Ray Core’s SlicePlacementGroup to pin deployments to one co-located TPU slice, matching Ray Train. Code https://t.co/YUSuQ27ZGe

0

4

0

2

398

Ivan Nardini

@ivnardini

about 1 month ago

Anthropic released the public beta of Cowork on Third-Party Providers (3P). Claude Desktop with Cowork and Code can now run using your own Google Cloud endpoint, billed as token consumption to your GCP project. Docs https://t.co/D5upoOpDzb

0

7

2

3

719

Ivan Nardini

@ivnardini

about 2 months ago

Release notes https://t.co/924W5kY7n5 Recipe https://t.co/TH4eiuj6a4

0

1

276

Ivan Nardini

@ivnardini

about 2 months ago

vLLM v0.19.1 shipped a bunch of optimizations and fixes for Gemma 4:
> Gemma 4 MoE quantization support
> Eagle3 speculative decoding for faster inference
> Streaming and tool-call bug fixes for production applications

1

7

1

622

Ivan Nardini

@ivnardini

about 2 months ago

Vertex AI Agent Engine Memory Bank just landed two features I’ve been looking for. You can now push events yourself and decide when memories get generated. Before, agent memory was passive: you knew conversations were flowing in, but you didn’t know when extraction happened. Now you have:
> An ingest events method that lets you push raw turns in per user (and force_flush if you want it now)
> A generation trigger config that sets idle-duration, fixed-interval, and event-count rules
Code https://t.co/I5vRgZR8Y7

1

3

0

1

373
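The three trigger rules mentioned in the tweet above (idle-duration, fixed-interval, event-count) plus force_flush can be illustrated with a toy model. This is only a sketch of the idea in plain Python, not the Vertex AI SDK: the class names `TriggerConfig` and `ToyMemoryBuffer`, their fields, and the `ingest`/`tick` methods are all hypothetical and were invented for illustration.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class TriggerConfig:
    """Hypothetical stand-in for a generation trigger config."""
    idle_seconds: Optional[float] = None      # fire after this much silence
    interval_seconds: Optional[float] = None  # fire at most this far apart
    event_count: Optional[int] = None         # fire after this many events


@dataclass
class ToyMemoryBuffer:
    """Hypothetical buffer that decides when 'memory generation' runs."""
    config: TriggerConfig
    pending: List[str] = field(default_factory=list)
    batches: List[List[str]] = field(default_factory=list)
    last_event_at: float = 0.0
    last_generation_at: float = 0.0

    def ingest(self, event: str, now: float, force_flush: bool = False) -> bool:
        """Push one raw turn; return True if generation ran immediately."""
        self.pending.append(event)
        self.last_event_at = now
        hit_count = (
            self.config.event_count is not None
            and len(self.pending) >= self.config.event_count
        )
        if force_flush or hit_count:
            self._generate(now)
            return True
        return False

    def tick(self, now: float) -> bool:
        """Check the time-based rules; return True if generation ran."""
        if not self.pending:
            return False
        cfg = self.config
        idle = (
            cfg.idle_seconds is not None
            and now - self.last_event_at >= cfg.idle_seconds
        )
        due = (
            cfg.interval_seconds is not None
            and now - self.last_generation_at >= cfg.interval_seconds
        )
        if idle or due:
            self._generate(now)
            return True
        return False

    def _generate(self, now: float) -> None:
        # Stand-in for memory extraction: move pending events into a batch.
        self.batches.append(self.pending)
        self.pending = []
        self.last_generation_at = now


buf = ToyMemoryBuffer(TriggerConfig(idle_seconds=30, event_count=2))
buf.ingest("user: hi", now=0.0)
buf.ingest("agent: hello", now=1.0)  # event-count rule fires here
```

Timestamps are passed in explicitly rather than read from a clock, which keeps the toy deterministic and easy to reason about.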

Ivan Nardini

@ivnardini

about 2 months ago

Release notes https://t.co/LXaVdD5132

0

208

Ivan Nardini

@ivnardini

about 2 months ago

Claude Code adds 1-hour prompt cache support for Vertex AI. Subsequent interactions are now cheaper for long-running agentic coding sessions. Under the hood, it is the ttl field on cache_control: {"cache_control": {"type": "ephemeral", "ttl": "1h"}} Documentation https://t.co/v89XgWhaLZ

1

7

0

4

535
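As a rough illustration of the cache_control snippet in the tweet above, here is a minimal Python sketch that builds a text content block carrying that field. The helper name `build_cached_block` is hypothetical, and the set of accepted TTL values ("5m" and "1h") is an assumption on my part; only the {"type": "ephemeral", "ttl": "1h"} shape comes from the tweet. Nothing here calls a real API.

```python
import json

# Assumption: the two TTL strings accepted for ephemeral caching.
ALLOWED_TTLS = ("5m", "1h")


def build_cached_block(text: str, ttl: str = "1h") -> dict:
    """Return a text content block marked for prompt caching (hypothetical helper)."""
    if ttl not in ALLOWED_TTLS:
        raise ValueError(f"unsupported ttl: {ttl!r}")
    return {
        "type": "text",
        "text": text,
        # Same shape as the snippet quoted in the tweet.
        "cache_control": {"type": "ephemeral", "ttl": ttl},
    }


block = build_cached_block("Long, stable system instructions go here.")
print(json.dumps(block["cache_control"]))
# prints {"type": "ephemeral", "ttl": "1h"}
```

The idea is that the long, unchanging prefix of a session carries the marker, so later turns in the same hour can reuse it instead of paying for it again.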
