Midwest Frontier AI Consulting @mdwstfrontierAI - Twitter Profile

4 days ago

First, the released dataset has pretty good geographic coverage, and accounts for a majority of the US population! Everything including zoning, noise, housing codes, and whether or not you can ride an ATV without a license.

barrowjoseph's tweet photo. First, the released dataset has pretty good geographic coverage, and accounts for a majority of the US population!

Everything including zoning, noise, housing codes, and whether or not you can ride an ATV without a license.

12

963

102

114

38K

mdwstfrontierAI retweeted

Joe Barrow

@barrowjoseph

4 days ago

New paper: every law in America is technically public. But not really, until now! With @DenisPeskoff at UC Berkeley, we built a corpus of ~every publicly accessibly city and county law, and released a huge chunk of it! 2.2 million laws, you're (probably) covered in it! 🧵

barrowjoseph's tweet photo. New paper: every law in America is technically public. But not really, until now!

With @DenisPeskoff at UC Berkeley, we built a corpus of ~every publicly accessibly city and county law, and released a huge chunk of it!

2.2 million laws, you're (probably) covered in it!

🧵

210

7K

1K

4K

1M

mdwstfrontierAI retweeted

Neal Agarwal

@nealagarwal

5 days ago

Made a site that takes objects from wikipedia and turns them into endless I Spy > https://t.co/3cDJmbdpf9

234

36K

5K

31K

3M

mdwstfrontierAI retweeted

Claude

@claudeai

4 days ago

New in Claude Code: Artifacts. Interactive pages built from your session, like a PR walkthrough or a living project dashboard, shared with your team at a private link. Available in beta on Team and Enterprise plans.

704

18K

1K

8K

4M

mdwstfrontierAI retweeted

Kevin Roose

@kevinroose

5 days ago

https://t.co/V3Mb2drB1n

98

925

48

268

292K

mdwstfrontierAI retweeted

Cathy Russon

@cathyrusson

6 days ago

WOW - This trial just got moved to later in the year. This follows two full days of jury selection. A juror used CHATGPT to research this case, and then he told other jurors. That juror is going to be summoned for a contempt hearing. This trial is not happening this week.

27

323

44

30

109K

mdwstfrontierAI retweeted

Kobi Hackenburg

@KobiHackenburg

7 days ago

New w/ @AISecurityInst & @UniofOxford: Frontier AI can now out-persuade expert humans in conversation - incl. world-champ debaters and professional canvassers. This held even when humans chose their topics, prepared in advance, and competed for £1,000 prizes 🧵

58

943

228

598

205K

mdwstfrontierAI retweeted

Adam Thierer

@AdamThierer

7 days ago

this sort of headline -- "The Job That AI Was Supposed to Kill Needs More Humans Than Ever" -- is becoming more common as pundits and papers realize the AI job apocalypse narrative is exactly backwards.

AdamThierer's tweet photo. this sort of headline -- "The Job That AI Was Supposed to Kill Needs More Humans Than Ever" -- is becoming more common as pundits and papers realize the AI job apocalypse narrative is exactly backwards.

22

299

59

69

135K

mdwstfrontierAI retweeted

Harvey @harvey

7 days ago

Harvey is live inside Microsoft 365 Copilot and Copilot Cowork. Use Harvey in @Copilot for instant legal answers. Click through to Harvey for deeper analysis. Use Copilot Cowork for full multi-step legal workflows, all without leaving Cowork.

2

52

4

22

5K

Midwest Frontier AI Consulting

@mdwstfrontierAI

10 days ago

@Ned_Donovan I am once again asking that we make websites use serif fonts

Midwest Frontier AI Consulting

@mdwstfrontierAI

about 2 months ago

@mualphaxi @MADarbyshire I’ve wondered the same about people named Al. “Did Al write this?” “That’s fake, it’s from Al.” This is why we need serif fonts.

0

30

0

2

2K

0

2

0

577

Midwest Frontier AI Consulting

@mdwstfrontierAI

12 days ago

@TheZvi Obviously he timed it so we’d read after we hit the usage caps.

0

3

0

110

mdwstfrontierAI retweeted

Gabe Pereyra

@gabepereyra

12 days ago

We think Fable-5 is an incredible model and want to give our customers the controls to be able to use it safely for their legal work. We are currently allowing firms to opt-in to using Mythos-class models and being very explicit to avoid customers being unaware like you mentioned.

7

63

2

32

16K

Midwest Frontier AI Consulting

@mdwstfrontierAI

12 days ago

@emollick "without vowels, all that's left of poetry is shivering, growling, and falling asleep." (and emdashes)

0

51

0

2

3K

mdwstfrontierAI retweeted

Claude

@claudeai

13 days ago

Introducing Claude Fable 5: a Mythos-class model that we’ve made safe for general use. Its capabilities exceed those of any model we’ve ever made generally available.

5K

105K

15K

22K

56M

Midwest Frontier AI Consulting

@mdwstfrontierAI

14 days ago

@KurtisCarman @SMB_Attorney Misusing*

0

16

Midwest Frontier AI Consulting

@mdwstfrontierAI

14 days ago

@KurtisCarman @SMB_Attorney I don’t think so yet, but a Nebraska attorney was suspended recently until further notice. Bunch of hallucinated citations in a divorce case that went to the state Supreme Court and then initially denied missing AI.

1

0

62

Midwest Frontier AI Consulting

@mdwstfrontierAI

17 days ago

@AbeGreenwald There’s research showing that LLMs prefer their own output if the same model is rating it (just further supports the point from your friend’s job hunt), e.g., https://t.co/ILiDsbdl4A

0

73

mdwstfrontierAI retweeted

clem 🤗

@ClementDelangue

17 days ago

Token costs are why there will be no saas apocalypse / good dev tools are cached intelligence for agents! The popular theory goes: agents can write code, so they'll just rebuild every tool from scratch and hit raw APIs. no more dev tools, no more CLIs, no more software layers. just agents and endpoints! We just tested this and the data says the opposite. We benchmarked Claude Code and Codex on real Hugging Face Hub tasks (~1,000 graded runs), with two setups: the agent-optimized hf CLI vs the agent hand-rolling curl or SDK calls from scratch. Hand-rolling burns up to 6x more tokens on multi-step tasks and fails more often (84% vs 94% task success). And that's just dropping one abstraction layer. It would obviously be orders of magnitude more tokens and a dramatically higher failure rate if the agent tried to bypass HF altogether and rebuild model hosting, versioning, and distribution from scratch. Every time an agent re-derives a workflow from raw API calls, you pay for that reasoning in tokens. every single run. a good CLI compresses that entire chain into a few high-level commands the agent can't get wrong. In a world where everyone is complaining tokens are too expensive, abstraction is leverage: thousands of hours of design decisions your agent doesn't have to re-reason about at inference time. Good tools are cached intelligence for agents! So no, agents won't rebuild everything from scratch. they'll gravitate to the most token-efficient tools, because that's what their owners pay for. The software that survives won't just be accessible to agents, it will be accurate and cheap for them to drive. We're seeing it happen with HF, which is becoming the platform for agents to use AI: ~49M requests in just two months, and growing fast! https://t.co/Y7q6yuxZrZ

ClementDelangue's tweet photo. Token costs are why there will be no saas apocalypse / good dev tools are cached intelligence for agents!

The popular theory goes: agents can write code, so they'll just rebuild every tool from scratch and hit raw APIs. no more dev tools, no more CLIs, no more software layers. just agents and endpoints!

We just tested this and the data says the opposite. We benchmarked Claude Code and Codex on real Hugging Face Hub tasks (~1,000 graded runs), with two setups: the agent-optimized hf CLI vs the agent hand-rolling curl or SDK calls from scratch.

Hand-rolling burns up to 6x more tokens on multi-step tasks and fails more often (84% vs 94% task success).

And that's just dropping one abstraction layer. It would obviously be orders of magnitude more tokens and a dramatically higher failure rate if the agent tried to bypass HF altogether and rebuild model hosting, versioning, and distribution from scratch. Every time an agent re-derives a workflow from raw API calls, you pay for that reasoning in tokens. every single run. a good CLI compresses that entire chain into a few high-level commands the agent can't get wrong.
In a world where everyone is complaining tokens are too expensive, abstraction is leverage: thousands of hours of design decisions your agent doesn't have to re-reason about at inference time.

Good tools are cached intelligence for agents!

So no, agents won't rebuild everything from scratch. they'll gravitate to the most token-efficient tools, because that's what their owners pay for. The software that survives won't just be accessible to agents, it will be accurate and cheap for them to drive.

We're seeing it happen with HF, which is becoming the platform for agents to use AI: ~49M requests in just two months, and growing fast!

https://t.co/Y7q6yuxZrZ

95

544

92

325

117K

Midwest Frontier AI Consulting

@mdwstfrontierAI

18 days ago

@dbreunig @dbreunig you can see part of what I mean here. The point I saw here is the creativity comes with some nonsensical outputs, but it at least gets you out of the basin.

Midwest Frontier AI Consulting

@mdwstfrontierAI

8 months ago

Just wrote a blog about this paper from the perspective of Des Moines metro's quirky Halloween joke-telling tradition. https://t.co/3jYwY1XCGg @shi_weiyan

1

3

1

461

0

16

Midwest Frontier AI Consulting

@mdwstfrontierAI

18 days ago

@dbreunig I’ve had some success generating truly novel synthetic data for test purposes using the VerbalizedSampling method but I’m not sure I’d characterize it as necessarily “good” or “creative” on its own the way I’ve done it.

1

0

1

98

Midwest Frontier AI Consulting

@mdwstfrontierAI

Last Seen Users on Sotwe

Trends for you

Most Popular Users