Wilford Lam

3 days ago

🔥

Kalshi @Kalshi

3 days ago

Prediction markets are coming to Canada. For the first time, millions of Canadians will be able to trade what's next through Wealthsimple.

https://t.co/UemU39GOhQ

298

3K

203

399

1M

0

36

wlam919 retweeted

3 days ago

if you want an easier step before looping everything, I'd recommend something similar to Thomas here. Right now I'm using one orchestrator thread per project, which has collected a lot of context on my product goals and thinking. The orchestrator delegates to other threads, similar to subagents; the Codex app has thread creation, read thread, worktree options, etc. Then you can decide whether to set up a small loop from the task you kicked off, or just manually call the orchestrator to check afterward. It's a lot easier to start off this way without burning your tokens. Plus I find the orchestrator flow saves a lot of time I'd otherwise spend repeating previous context.

0

3

1

356

wlam919 retweeted

Director Cryptosityclub CEO of Receipts Finance, Tech and Politics

4 days ago

Built with LayerZero

6

163

22

2

12K

wlam919 retweeted

4 days ago

The tokenization of everything continues. With 173 new tokenized stocks and ETFs live across Ethereum, Solana and BNB Chain, Ondo Global Markets now brings 430+ of the world's most in-demand assets to users everywhere. Powered by LayerZero. https://t.co/KTlorK3oy0

17

260

56

9

24K

wlam919 retweeted

5 days ago

$260B+ across 830+ OFTs on 170+ chains. From memecoins to tokenized treasuries to state-issued stable tokens, builders of all shapes use LayerZero, configured exactly to suit their needs. Read more about the Builder Spectrum at the link below. https://t.co/77sMqimExT

https://t.co/77sMqimExT https://t.co/xcjX2D5c0L

21

252

60

6

12K

wlam919 retweeted

9 days ago

The world's most anticipated IPO, live onchain from day 1. Starting today, Ondo brings SpaceX exposure to Ondo Global Markets across Ethereum, Solana, and BNB Chain. Powered by LayerZero. https://t.co/LEQ09ZwfM4

https://t.co/LEQ09ZwfM4 https://t.co/safWpYSalz

15

260

82

13

21K

wlam919 retweeted

9 days ago

1. believe in yourself 2. commit 100% 3. win life

0

4

1

0

558

10 days ago

🔥

10 days ago

building a computer use mcp that works with any agent. Give your claude, devin, grok and any other agent computer use. computer use helps with a lot, some faves:
- work across your computer in the background
- agent self verification and feedback loop
- handle annoying tasks on computer
- help parents fix their computers
Link below. Initial version and will likely break

6

29

1

18

17K

1

0

54

11 days ago

@mattlam_ @petergyang LFG 🚀

1

0

5

wlam919 retweeted

11 days ago

Fable makes me even more bullish on the compute resource layer

1

2

1

0

139

11 days ago

@mattlam_ If AI agents become dramatically more useful, demand for the infrastructure that powers and hosts those agents will explode.

1

0

13

wlam919 retweeted

12 days ago

was wondering why FrontierCode's results were so different from DeepSWE from @datacurve, plus a lot of the vibes on X seem to agree with DeepSWE on gpt 5.5 performing better than opus 4.8. Turns out the benchmarks are grading differently. Asked codex and claude, and both agree that these benchmarks aren't directly comparable. FrontierCode judges more production readiness, and whether this code would be mergeable. DeepSWE is more judging if the agent can solve long horizon repo tasks with behavioral correctness. FrontierCode might be better at judging enterprise code, and DeepSWE seems to be more focused on "will this work". Hopefully someone can do a deep dive comparison, maybe @theo?

3

8

1

0

6K

12 days ago

@mattlam_ @LLMJunky Continuous learning is the goal

0

12

wlam919 retweeted

Cognition @cognition

13 days ago

Introducing FrontierCode: a coding eval that raises the bar for difficulty & quality. Each task took 40+ hrs of work by leading open-source maintainers. Models write sloppy code that works but isn’t maintainable. Our eval is first to measure: would you actually merge this code?

242

4K

317

2K

3M

wlam919 retweeted

13 days ago

Grok Build is improving crazy fast. Wanted to check out the latest updates since I see their team shipping updates multiple times per week, props to them, and it already has most of the features I expect out of cli agents. Plugins, marketplace, btw, etc. Also, with @elonmusk tweeting out that the 1.5T model is going through RL, it'll be very exciting to see how Grok performs after. Plus they seem to be the least compute constrained. I think with automations (/loop), /goal, and computer use that'll cover most of the features devs look for in cli agents, other than harness perf. Demoing some of my fave devx improvements, wish all cli agents had: /theme and double click to navigate to your last message. I know these are small features, but I think it shows the quality and devx details.

7

56

3

9

5K

wlam919 retweeted

13 days ago

auto review -> approve for me. I switched from full yolo mode to auto approval a while ago, and haven't noticed an impact on devx. Plus this gives me extra peace of mind. Would be cool if @OpenAIDevs gave a breakdown of how it works behind the scenes: what triggers a permission check, and how is that evaluated?

0

1

0

261

wlam919 retweeted

13 days ago

0

2

1

0

207

14 days ago

💯

15 days ago

if companies continue to have AI spend problems, I think model routing will be an important layer for AI labs to work on. So far I only know of @cursor_ai and @FactoryAI doing model routing, maybe @cognition? Yes, claude and codex let you spin up agents/subagents with different thinking levels and models. But it's not the same. I'm thinking of an enterprise option for an "auto" thinking level, where companies can get a default model routing, or optionally customize their own model layer. For example, gpt auto switches between low, med, and even mini. Claude switches between opus, sonnet, haiku, high, med, etc.

27

66

3

40

26K

0

1

0

17

wlam919 retweeted