Anthony Ivan @anthonyivn - Twitter Profile

2 days ago

Delta Sharing became one of the most popular ways to exchange data thanks to its open cross-platform nature. We’ve now expanded it to also support any Iceberg client and to share AI assets like agent skills and unstructured data. It needed a new name, so welcome OpenSharing!

matei_zaharia's tweet photo. Delta Sharing became one of the most popular ways to exchange data thanks to its open cross-platform nature. We’ve now expanded it to also support any Iceberg client and to share AI assets like agent skills and unstructured data. It needed a new name, so welcome OpenSharing! https://t.co/ZsBKTZX6LR

2

48

6

9

4K

anthonyivn retweeted

Aaron Levie

@levie

2 days ago

This is a critical post to read if you’re building an applied AI company right now. “An application earns its place in the untrainable corner by doing unglamorous work: arranging a company's private reality so a model can act on it, handing the model the tools to act, working with the customer to change the reality of its workforce. A company that brings the translation is tough to copy – and the translation never ends. Integration and maintenance run as long as the relationship does, won by teams that put domain-specialized engineers and tools next to the customer.” There’s still an insanely large gulf between model capabilities and what it takes to apply them to specific corporate workflows. Some of that is technology that needs to be built, a lot is access to (and formatting of) the right data to work with, and a ton more is on the change management and specific implementation work (FDEs, etc.) it takes to make AI work in any specific corporate setting. 2 things can be very true at once: frontier models and labs will continue to grow an incredible amount, and there will be a vast ecosystem of software and services companies that emerge to bring the power of these models to real enterprises. This makes room for new infrastructure provides, applied AI companies in every vertical, new versions of system integrators, and more players. Incredibly exciting time on all fronts.

74

999

103

2K

223K

anthonyivn retweeted

sarah guo

@saranormous

2 days ago

https://t.co/Hw02laH9yp

97

2K

192

5K

1M

anthonyivn retweeted

Matei Zaharia @matei_zaharia

about 1 month ago

Cool work from the Databricks Model Serving team and Superhuman, to scale their custom LLM serving to 200K QPS with sub-second P99 latency! Here's how our teams got a +60% throughput gain vs the previous engine to serve over 40 million daily users. https://t.co/csjPLQzQ7C

4

57

12

11

5K

Who to follow

Isabel

@Isabel67197206

The more I learn about people, the more I like my dog.

iLagrangian

@i_Lagrangian

Memory Palace Interior Designer

Imran Bhatti

@ib9994

Consultant PB and Hernia Surgeon. Robotic Surgery. ERCP. @UHDB @DerbyPBunit

anthonyivn retweeted

Yoonho Lee

@yoonholeee

4 days ago

https://t.co/jCgH0doXCQ

11

390

53

592

109K

anthonyivn retweeted

Matei Zaharia @matei_zaharia

5 days ago

There’s a ton of interest in custom model tuning as agents reach production and scale up. Here is how we made Databricks Knowledge Assistant 3x faster using our new Instructed Retriever model trained end-to-end to do parallel test-time compute. It’s rolling out to customers now!

5

114

10

53

17K

anthonyivn retweeted

Yuchen Jin

@Yuchenj_UW

6 days ago

Before AI, I’d spend a weekend building 1 useless app. Now I can build 67 useless apps over a weekend, each with a logo, a fancy webpage, and 0 user.

429

8K

557

412

263K

anthonyivn retweeted

Matei Zaharia @matei_zaharia

5 days ago

Stay tuned!

2

99

7

36

24K

anthonyivn retweeted

Sam Altman

@sama

10 days ago

one of the quotes i find most inspiring on a hard day: "Whatever your hand finds to do, do it with all your might, for in the realm of the dead, where you are going, there is neither working nor planning nor knowledge nor wisdom" Ecclesiastes 9:10

1K

19K

3K

6K

2M

anthonyivn retweeted

Marc Andreessen 🇺🇸

@pmarca

9 days ago

We have moved on to entirely new moral panics, such as [squints, checks notes] water consumption in datacenters. And in a few years (or months, or weeks, or days), that will be completely forgotten too.

79

2K

101

93

173K

anthonyivn retweeted

Austen Allred

@Austen

9 days ago

Water usage in data centers really is the fakest issue I’ve seen in a long, long time

50

890

50

62

43K

anthonyivn retweeted

Alex Volkov

@altryne

10 days ago

.@satyanadella just put the whole "water" debate to rest. Datacenters run on a closed loop cooling system, the water usage of a datacenter for an entire year is roughly equivalent to a usage of 1 restaurant!

290

7K

899

2K

1M

anthonyivn retweeted

VV

@visualizevalue

11 days ago

“We are trying to prove ourselves wrong as quickly as possible, because only in that way can we find progress.”

5

188

26

37

9K

anthonyivn retweeted

JC Investing

@AIInvestorHQ

12 days ago

30 year old $VOO holders watching 18 year olds retire after finding out what options and semiconductors are

16

3K

50

251

325K

anthonyivn retweeted

Ethan He

@EthanHe_42

11 days ago

"You can outsource thinking, but not understanding." I still find writing toy code one of the best ways to build real understanding. It catches the nuances that skimming code and explanations lets you skip. So I wrote nanoRL (nanoGPT, but for post-training). SFT, DPO, GRPO, PPO: four single files, ~150 lines each, converging on a toy task in ~30 steps on a MacBook. Readable end-to-end. Then I continue RL Qwen2.5-0.5B-Instruct on GSM8K with this toy code + autoresearch. Interestingly, the accuracy improves tho it's a trained model.

EthanHe_42's tweet photo. "You can outsource thinking, but not understanding."

I still find writing toy code one of the best ways to build real understanding. It catches the nuances that skimming code and explanations lets you skip.

So I wrote nanoRL (nanoGPT, but for post-training).
SFT, DPO, GRPO, PPO: four single files, ~150 lines each, converging on a toy task in ~30 steps on a MacBook. Readable end-to-end.

Then I continue RL Qwen2.5-0.5B-Instruct on GSM8K with this toy code + autoresearch. Interestingly, the accuracy improves tho it's a trained model.

20

771

53

636

43K

anthonyivn retweeted

Susan Zhang

@suchenzang

12 days ago

children unknowingly absorbing AI-isms will make AI detection completely irrelevant sooner or later

16

241

14

21

42K

anthonyivn retweeted

Marc Andreessen 🇺🇸

@pmarca

13 days ago

Legal AI superempowers normal individuals with no legal background to fight big institutions in bureaucracies and in courts on a level knowledge/skill playing field, for the first time in human history. As such, it is one of the most inspiring applications of AI.

318

4K

287

457

429K

anthonyivn retweeted

λux

@novasarc01

14 days ago

i’m increasingly convinced that the best agent evals will come from mining real agent failure traces. my view is that every failed trace contains a potential eval but not in its raw form. raw traces are messy, long and too specific. the research problem is to distill them into clean reproducible tests. the pipeline i’m interested in is (which i'm currently working on): failure trace → failure attribution → earliest divergence point → minimal reproducible state → targeted eval → regression suite this turns trace data from passive observability into an active improvement loop. like can we extract the exact decision point where the agent should have behaved differently? and can we convert that into an eval that catches the same failure class in the future? i guess this matters because most agent failures are trajectory-level failures and not just output-level failures. personally i think this is much more realistic than relying only on hand-written benchmarks (imo they should look more like failure memory systems). hand-written evals encode what we think agents will fail on. traces encode what agents actually failed on. also once you have the mechanism, you can mutate the trace into variants. that is basically fuzzing for agents.

26

302

23

371

55K

anthonyivn retweeted

dax

@thdxr

13 days ago

i have seen enough proof now that using a coding agent is a deep skill it's confusing because the people you see heavily using them produce horrible results but that's because it's a skill! you can get better and the ceiling seems pretty high - this is very exciting to me

320

6K

397

1K

378K

anthonyivn retweeted

Addy Osmani

@addyosmani

15 days ago

https://t.co/0DyFIXJueI

84

3K

366

5K

782K

Anthony Ivan

@anthonyivn

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users