Venu Vasudevan

about 1 month ago

Was Johannes Kepler an LLM?

0

75

about 20 hours ago

In their day meta-search engines were a but. In fact, service routers generally so. But in model routers, there is real money to be saved (at least for the near future)

1 day ago

Model routers are becoming a favored way for companies to cut AI costs by sending simpler tasks to cheaper models. The rise of routing could pressure frontier model providers as customers get more disciplined about token spending. Read more: https://t.co/EbSuYfgkcm

3

10

3

4

8K

0

13

4 days ago

This is totally at odds with the claim that glm 5.2 etc have now reached parity with closed models

5 days ago

Anthropic Mythos’ advancements that were “on a totally different level” gave DeepSeek’s CEO an “epiphany” he needed to raise $7.4B. "If DeepSeek were to stay in the game, I mean to remain competitive in the long run, he really needed to build a massive war chest, at least in the realms of like tens of billions of dollars to begin with.” — @jingyanghk, Asia Bureau Chief

11

105

24

38

34K

0

32

Perfect score on the LSAT, went to the same elite HS as geohot. Full-stack dev, indie hacker.

4 days ago

I’d feel better about this if I also knew the FAA has transitioned from 1960s mainframes for ATC

Blake Scholl 🛫

@bscholl

4 days ago

BREAKING: FAA officially announced the rulemaking to legalize supersonic flight, including the Boomless Cruise ("Mach cutoff") approach we demonstrated on XB-1. This is a major step toward the supersonic renaissance.

bscholl's tweet photo. BREAKING: FAA officially announced the rulemaking to legalize supersonic flight, including the Boomless Cruise ("Mach cutoff") approach we demonstrated on XB-1.

This is a major step toward the supersonic renaissance. https://t.co/1in06V68Qk

122

6K

412

176

477K

0

29

Who to follow

Drew Hinkes

@propelforward

6 days ago

@PythiaR many might intellectually, but few behaviorally (to Peter Lynch'es - "you may have the brain, but do you have the stomach to take a loss" ..)

0

1

23

6 days ago

CATL trying to 'Nvidia Brand' itself is a sign of both strength (current NMC battery monopoly with strong brand identity) and weakness (Blade 2 from BYD could inspire Geely and others)

8 days ago

Robin Zeng, exacting and detail obsessed, keeps a stranglehold over a market that touches everything from AI data centers to electric cars. Even if Silicon Valley wanted to, it couldn’t live without him. Full story: https://t.co/6tF3rJuk6h

1

0

4K

0

56

6 days ago

Perhaps inspired by Ramp, Coinbase getting into the AI Influencer game 😀

mark

@markletree

7 days ago

At @coinbase our AI spend is down nearly half this quarter while token usage keeps climbing. My team built the infrastructure behind it: routing, caching, cheaper defaults, and the spend services that track it. We route everything through our own gateway: a single endpoint and format for dozens of models, with cross-provider failover, redaction, logging, and cost controls all applied before anything reaches a vendor. We started with cheaper defaults and caching. 91% of employees weren't hitting their usage caps. Instead of lowering caps, we set cheaper model defaults to cut spend. Caching took more work to get consistent across every tool and model family. A cache hit needs the prefix to match exactly, so we keep building a long, stable prefix across turns. Each request only pays full rate on the new tokens and reads the rest from cache. Our routing accounts for caching too. The naive approach scores each turn on its own and sends it to whichever model fits, which seems reasonable but would run up spend. The cache is per-model, so switching mid-conversation invalidates it. Our router weighs cache state alongside how hard the task is: a conversation keeps its model while the cache is warm, and the chance to re-route comes only when it goes quiet long enough for the TTL to lapse. Once it does, the router is free again to pick the best model for the task. These improvements happened at the gateway, so they apply across every team and tool. Next we're going deeper on the coding harness, where we have the most signal and flexibility, tuning how subagents and context get managed.

101

2K

178

3K

571K

0

1

49

6 days ago

@charliebilello Gerontocracy then generational wealth transfer. Likely creates Trust Fund generation

0

18

6 days ago

Number of generational companies has not increased from the 80s, at the proportion of the population of VC backed companies (most existing on backs of talented immigrants). So there is a viable argument that the inefficiency needs to be optimized beyond 'invisible hands'

Paul Graham

@paulg

6 days ago

Nearly all those who say the US should only admit the most talented immigrants would not themselves clear the bar they're proposing. They're effectively saying "Immigration is ok so long as you keep out people like me."

2K

6K

354

497

1M

0

75

6 days ago

Expect the pricing of output tokens to be binary. Those that are exact equivalents of digital humans will be priced like human labor. Others will asymptote to the price of bits. Cost of human level output tokens may be achieved by OSS models - but only if funded by sovereign

Matt Harney

@SaaSletter

7 days ago

👀 excellent new "State of AI" deck (66-slides) from @azeem @ExponentialView Excerpts here, 🔗next tweet

2

685

80

1K

100K

0

1

59

7 days ago

A situation where both Elon and Masa are correct. One forgets that Masa comes later to the party, and bets bigger (bit like Druckenmiller). Elon has vertically integrated knowledge. Masa - an 'interpolation manifold' that others dont got

The Wall Street Journal

@WSJ

7 days ago

Even SoftBank’s Masayoshi Son—no shrinking violet when it comes to wild ideas—has reservations about data centers in outer space, @timkhiggins writes https://t.co/ajD1Bpb4JP

17

79

16

12

75K

0

1

53

7 days ago

isn't that the whole hoopla about world models?

9 days ago

If a chatbot hallucinates, you get a bad answer. If a physical robot hallucinates, it breaks itself or hurts someone nearby. @rocketalignment explains: "Robotics doesn't have access to the same kind of data that language models use, right? There's no internet worth of training data, just waiting there for robotics to take and to train on." "As a robot if you hallucinate something that isn't there. You trip, you fall over, you break, your parts break, maybe you hurt someone nearby. And the stakes are just much higher."

1

7

3

1

5K

0

30

7 days ago

when memory prices crash in 24 months, Apple gets to keep the $106 (or more) that it is temporarily paying for memory inflation per iphone

venuv62's tweet photo. when memory prices crash in 24 months, Apple gets to keep the $106 (or more) that it is temporarily paying for memory inflation per iphone https://t.co/2KG8KEporf

0

29

7 days ago

the transition from ICE to EV cars was not gas to diesel. It's more horse to car in competitive dynamics

Financial Times

@FT

7 days ago

German carmakers embark on historic job cuts as Chinese rivals flood market https://t.co/g0M4fof1o3

82

607

196

76

127K

0

35

7 days ago

Chinese system - capitalism at small scale, govt 'blessed' regulated capitalism at the top. US .. getting to the same place

The Wall Street Journal

@WSJ

8 days ago

The government is allowing trusted partners to access the Mythos 5 model following a two-week restriction that rattled the tech industry. https://t.co/jkNL2Ui2qU

19

126

28

16

61K

0

51

7 days ago

Add 1% to the capital costs of Data Centers. Dole it out as an annuity to the local community over 15 years to keep politics under control. Jobs are temporary, this has duration.

9 days ago

🧵 The AI data center boom has a new bottleneck: local opposition. The Information identified 300+ temporary and permanent bans on new data center development passed by state and local governments across the U.S. since 2023. More than 75 additional measures are under consideration. https://t.co/ictHGSc1J4

1

6

2

4

4K

0

30

7 days ago

@dwarkesh_sp My conspiracy theory (given that computer use is the most trivial 'world model') is that it is a business reason, not technical. It is a net negative for the entire internet industry based on eyeballs. It'll magically become possible once agentic commerce gets going

0

12

8 days ago

Vance admits he's a joker. And so it is 😏

Aaron Rupar

@atrupar

9 days ago

JD Vance: "I think Nixon's historical legacy is enjoying a bit of a renaissance, and deservedly so. I joked that if Watergate happened tomorrow, it would be like a 12 hours news story. The idea that it took down a presidency is crazy."

4K

24K

3K

6K

15M

0

19

8 days ago

This is indeed an interesting way to use Claude Code and Slack to creating dynamic (human-human-agent) teams. Might give $CRM something to hang its hat on ..

Andrej Karpathy

@karpathy

11 days ago

This is a new paradigm for interacting with Claude that is significantly more "inline" with all the other human activity org-wide. Once you do all of the under the hood engineering work to make this "just work" (e.g. across tools, integrations, compute environments, memory, security, etc.), Claude basically joins the team in a seamless way - you can talk to it as you would talk to a person and it can help with a very large variety of workloads. Imo this is the 3rd major redesign of LLM UIUX. The first paradigm was that the LLM is a website you go to, the second was that it is an app you download to your computer. This third one is that it is a self-contained, persistent, asynchronous entity with org-wide tools and context, working alongside teams of humans. It really takes a while to wrap your head around it, but it works and it is awesome.

1K

23K

2K

14K

8M

0

64

8 days ago

What a VC 'talking his book' looks like. Some validity bundled with a strong skew based on Benchmark's thesis and investments

Bill Gurley

@bgurley

8 days ago

This is what’s causing Anthropic to aggressively beg for govt protection (see below). Customers are finding cheaper alternatives. Keeping employees requires continuing ultra-rich secondaries ($$$) that are dependent on revenue growth. When you can’t win on the field go to DC.

167

4K

484

1K

927K

1

0

190