Nick Lothian @nlothian - Twitter Profile

Pinned Tweet

2 months ago

I wrote a new Agentic text-to-SQL benchmark and tested every local model I could against it: https://t.co/SDQ9fTwmyG Thanks to DuckDB WASM you can try your own models from the browser.

2

210

22

178

28K

Nick Lothian @nlothian

2 days ago

@thomas_thoresen @jeremyphoward There is. I don't know the link but the agent interface is great. Point your agent at it and it will work it out https://t.co/or4dOpIH2e

1

0

43

Nick Lothian @nlothian

2 days ago

First time I've seen @jeremyphoward speak in person. Really great talk about building with purpose.

1

9

0

2

1K

Nick Lothian @nlothian

3 days ago

@igorcosta Great talk Igor. Was wondering if you've looked at techniques like https://t.co/C79Xm1huLp and how they compare to your HRM work? Super interested to see you move HRMs beyond the ARC benchmarks.

0

1

0

125

Nick Lothian @nlothian

3 days ago

Great talk from @sarahmsachs. Especially liked the "Forgo discounts for optionality" point, something often missed.

0

2

0

35

Nick Lothian @nlothian

3 days ago

New agentic SQL benchmark results.

Minmax 3: 23/25, $0.04, 369 sec
StepFun 3.7: 21/25, $0.06, 254 sec

MinMax lost a lot of time stuck on Q6, but otherwise a great looking model.

https://t.co/KwZDN1MFxO

0

1

0

1

57

Nick Lothian @nlothian

3 days ago

Great talk by @grmcameron from @ArtificialAnlys

0

3

0

80

Nick Lothian @nlothian

3 days ago

@swyx is huge!

0

20

Nick Lothian @nlothian

3 days ago

I'm speaking at AI Engineer Melbourne later today on Privacy Tech for AI. Come say hi: https://t.co/erdbmV4rWd

0

2

1

0

72

Nick Lothian @nlothian

6 days ago

I've been using SCAD a lot lately via AI and find Codex is much better than Opus. My tasks are much easier than this, but I don't know CAD at all so have to rely on the model's interpretation of my own terms.

Michael Rabinovich

@MikushRab

8 days ago

Opus 4.8 just dropped and I ran it through our CAD tasks. 4.6 → 4.7 → 4.8 side by side. The results are unexpected!

199

4K

193

2K

706K

0

102

Nick Lothian @nlothian

6 days ago

@dlwiest @checksumbyte There are Claude Teams Max plans (6.5x normal usage), but only up to 150 seats. Then everyone has to switch to API billing.

0

41

Nick Lothian @nlothian

6 days ago

Surprised people don't know the reason for this. Claude Teams only goes up to 150 seats before you have to switch to API billing. It also maxes out at 6.5x the Pro plan (there is no Claude Teams 20x plan).

cozybear

@dlwiest

7 days ago

Can anyone explain to me why companies don’t just give employees $100 / month Claude Code or Codex plans instead of paying per token? There has to be an explanation, because this keeps happening and doesn’t make sense otherwise

435

2K

16

330

613K

0

1

0

92

Nick Lothian @nlothian

9 days ago

@ade_oshineye Have you tried putting raw mermaid code into an image model? 😀 Not there for complex ones yet, but for simple to medium ones nano-banana and GPT-Image do great...

0

65

Nick Lothian @nlothian

11 days ago

I fixed the #anthropic #claude desktop buddy. Fork of Anthropic's Desktop buddy for the FNK0104 touchscreen devboard ($20) https://t.co/gqhk40bHhU

0

1

0

219

Nick Lothian @nlothian

11 days ago

@JakeKAllDay Auto-generated data and massive benchmark performance gains smell like benchmaxxing to me. Hope to be wrong though.

1

0

21

Nick Lothian @nlothian

11 days ago

@ade_oshineye I haven't tried this yet, but "Punch is seasoning. Mechanism is the meal" in the README reads like decorative contrast to me?

1

0

37

nlothian retweeted

Jack Zhang

@awxjack

12 days ago

I keep seeing headlines about layoffs driven by AI automation… and honestly, I don’t fully understand it yet.

At @airwallex, we’ve dedicated almost all of our engineering resources to building customer-facing AI products and infrastructure. We barely even have enough engineers working on internal automation yet.

The demand for AI talent inside our company has gone up, not down.

Maybe I’m missing something, but right now it feels like AI is creating more product opportunities, more engineering demand, and more ambitious problems to solve, not fewer.

What are others seeing?

17

52

6

5K

Nick Lothian @nlothian

13 days ago

@championswimmer That's a reasonable attempt at balancing cache and compaction. I think it's better to think of it as a less aggressive form of compaction though. It still drops large amounts of the cache.

0

24

Nick Lothian @nlothian

14 days ago

@AlexanderLong Congrats! That looks awesome.

0

1

0

64

nlothian retweeted

JFPuget 🇫🇷🇺🇦🇨🇦🇬🇱

@JFPuget

14 days ago

Another thing my team, incl @raja_biswas (and me a little) contributed to a lot.

0

33

4

15

4K
