debuggingfuture

Verified account

@debuggingfuture

🤝Fractional CTO/CISO @fractalboxdev Building secure, private and compliant-by-design solutions for agents

Base in Singapore, from HK

Twitter'a katıldığı tarih July 2022

5K Takip Edilen

821 Takipçi

533 Gönderi

debuggingfuture retweetledi

4 gün önce

This is the actual bottleneck. The models are smart enough already. What is missing is the company-specific context locked in senior people heads. Whoever cracks knowledge extraction at the company level unlocks the rest. As you work on this, please consider using GBrain as your OSS retrieval layer https://t.co/0F5uDQzPHu

267

2K

143

2K

548K

debuggingfuture retweetledi

Startup Archive

@StartupArchive_

19 gün önce

Shopify CEO Tobi Lutke explains Goodhart’s law and why he doesn’t like KPIs or OKRs “Goodhart’s law is real. The moment a metric becomes a goal, it’s no longer a useful metric… No metric by itself is a complete heuristic for a complex business. There’s a million different tensions in a company, and you can’t keep all of them in harmony by optimizing for one thing.” For this reason, Shopify doesn’t use KPIs or OKRs. But as Tobi explains, this doesn’t mean they don’t value data and metrics. “We are extremely data informed. We have invested enormous amounts of money and time into systems that give us basically everything at our fingertips… But what Shopify attempts to do is just not over-fit for what’s quantifiable.” People love optimizing for highly-quantifiable things because there’s immediate gratification that comes from seeing a number go up. But Tobi thinks that the most important aspects of a product are rarely quantifiable: “The overlap of the most valuable things you can do with a product and the things that happen to be fully quantifiable are like maybe 20%. Which leaves 80% of a value space unaddressable by the people who only look at quantifiable things.” He continues: “Shopify is comfortable with unquantifiable things like taste, quality, passion, love, hate… The sort of deep satisfaction that a craftsperson feels when they’ve done a job well is actually a better proxy if you allow it to be.” They then have robust analytics systems that tell the company if something’s wrong or a new rollout breaks something. “We think about it as a cockpit for a pilot. The decisions are still made by pilots, and we think this leads to better results… I think there needs to be more acceptance in business of unquantifiable things… And then metrics take a support function.” Source: @lennysan (Feb 2025)

58

3K

255

3K

687K

debuggingfuture

@debuggingfuture

yaklaşık 1 ay önce

@KernelKennethG @karpathy @leanprover Many case 3d animations really helps understanding how things are linked together from different perspectives

0

1

0

0

26

debuggingfuture

@debuggingfuture

yaklaşık 1 ay önce

It's pretty insane how AI has been transforming the way I study Math. On top of @karpathy 's wiki idea, I can immediate generate visualization and generate @leanprover programmable proofs to keep myself engaged and really ponder the concepts

1

3

0

0

87

Takip edebileceğin hesaplar

Verified account

@PingMe_xyz contributor , $MOCA OG

mocadigest.moca

Verified account

🎙️We digest all things @Moca_Network, @MOCAFoundation & @Animocabrands. MD, the place where the #MocaFam comes together 🤝. Hosted by @0xbeka & @420axiefarmer

🪐 CUBE ⭕️

debuggingfuture

@debuggingfuture

4 ay önce

Echoing this. My 'aha' moment with Al is internalizing open-source libraries to make them faster and safer. Lately I'm exploring @leanprover with Claude Code, utilizing it both for formal verification and as the most concise spec for agents (maths) https://t.co/ivTX6KtxVy

Andrej Karpathy

4 ay önce

I think it must be a very interesting time to be in programming languages and formal methods because LLMs change the whole constraints landscape of software completely. Hints of this can already be seen, e.g. in the rising momentum behind porting C to Rust or the growing interest in upgrading legacy code bases in COBOL or etc. In particular, LLMs are *especially* good at translation compared to de-novo generation because 1) the original code base acts as a kind of highly detailed prompt, and 2) as a reference to write concrete tests with respect to. That said, even Rust is nowhere near optimal for LLMs as a target language. What kind of language is optimal? What concessions (if any) are still carved out for humans? Incredibly interesting new questions and opportunities. It feels likely that we'll end up re-writing large fractions of all software ever written many times over.

696

8K

650

4K

1M

0

1

0

0

142

debuggingfuture retweetledi

4 ay önce

Thanks to good people at @AnthropicAI we now have an official MCP for Excalidraw! Take it for a spin on @claudeai (search for Excalidraw in Connectors, or use in Claude Code and elsewhere). More to come. ✌

190

6K

533

4K

797K

debuggingfuture

@debuggingfuture

4 ay önce

With AI, Working code is a low bar. Secure by Design. Elegant by Design.

@VitalikButerin

4 ay önce

> We built a browser with GPT-5.2 in Cursor. It ran uninterrupted for one week. It’s 3M+ lines of code across thousands of files. I would actually be more impressed if it had 3000 lines of code, and came with a Lean proof that its sandboxing is bug-free :D I think now that code in general (for non-frontier use cases) is on its way to being too cheap to meter, the next challenge is pushing everything up to the top tier of security.

67

160

24

22

9K

0

1

0

0

83

debuggingfuture

@debuggingfuture

5 ay önce

@martinfowler Software Engineering is unusual in that it works with deterministic machines. Maybe LLMs mark the point where we join our engineering peers in a world on non-determinism." https://t.co/nmIlm8ik9s

0

0

0

0

77

debuggingfuture

@debuggingfuture

5 ay önce

AI didnt just make us architect, it also calls for better engineer. Peter Rice, the legendary structural engineer behind Centre Pompidou, once said: The engineer is the objective inventor and the architect the creative input.

debuggingfuture's tweet photo. AI didnt just make us architect, it also calls for better engineer.

Peter Rice, the legendary structural engineer behind Centre Pompidou, once said:
The engineer is the objective inventor and the architect the creative input. https://t.co/rcpMMUApqY

1

2

1

0

127

debuggingfuture

@debuggingfuture

5 ay önce

@martinfowler wrote "Other forms of engineering have to take into account the variability of the world. A structural engineer builds in tolerance for all the factors she can’t measure... "

1

2

0

0

85

debuggingfuture

@debuggingfuture

6 ay önce

Really refreshing to see how sync engine Jazz handles decentralized permission in CRDT. With crypto signature, the chain of edits is verifiable like a blockchain. Check against the whitelist of who can read/write the values. (another example of crypto more useful outside CT)

debuggingfuture's tweet photo. Really refreshing to see how sync engine Jazz handles decentralized permission in CRDT.

With crypto signature, the chain of edits is verifiable like a blockchain. Check against the whitelist of who can read/write the values.

(another example of crypto more useful outside CT) https://t.co/Ur00aY6YMM

Anselm Eickhoff

6 ay önce

My talk from @sync_conf 2025 is ready! Check it out to learn how: 1) CRDTs + cryptographic permissions work 2) @jazz_tools is now a general-purpose database 3) our unique Jazz Cloud infra compares to - traditional stacks - other sync engines - Durable Objects

3

50

12

20

4K

0

15

3

7

2K

debuggingfuture

@debuggingfuture

6 ay önce

@Wise has the worst customer support ever and best @Reddit censorship

debuggingfuture's tweet photo. @Wise has the worst customer support ever and best @Reddit censorship https://t.co/tdhkt9BT0s

1

1

0

0

70

debuggingfuture

@debuggingfuture

6 ay önce

There is no way @claudeai talk about NEA photocathodes when asked to build a landing page, unless it is leaking response of other users's private query

debuggingfuture's tweet photo. There is no way @claudeai talk about NEA photocathodes when asked to build a landing page, unless it is leaking response of other users's private query https://t.co/zj2SBzQWOt

0

1

0

0

84

debuggingfuture

@debuggingfuture

7 ay önce

with $NET now it is actually possible for @Cloudflare to pay out for damages for each 5xx on them

0

1

0

0

113

debuggingfuture

@debuggingfuture

7 ay önce

The more I realize AI can do my job, the more I feel I should focus on what humans are good at. If I'm not getting enough sleep, I'm definitely doing something terribly wrong

0

3

1

0

108

debuggingfuture

@debuggingfuture

7 ay önce

Fund in my Coinbase is still locked with random API error. The non-existent CS motivates me to self-custodial, but wait, I can't

1

2

0

0

79

debuggingfuture

@debuggingfuture

7 ay önce

Today developers are shaping agents with skills shared on Github. Very soon it the COO office will be interviewing, onboarding, assigning, reviewing and laiding off agents like the HR today

debuggingfuture's tweet photo. Today developers are shaping agents with skills shared on Github. Very soon it the COO office will be interviewing, onboarding, assigning, reviewing and laiding off agents like the HR today https://t.co/0jgGI1B8ip

0

1

0

0

57

debuggingfuture

@debuggingfuture

7 ay önce

Sandboxing is no longer just a best practice --- it's a core part of the agentic workflow

7 ay önce

mcps are changing turns out designing mcps to load every tool definition into the model prompt was a bad idea anthropic’s nov 4 blog post suggests a new pattern treat each mcp server like a normal code library, e.g. typescript modules or files, and let the agent write and run small programs that do two things: discover only what is needed, list a servers directory to see what exists, open just the specific tool files, import only those functions process data locally, call mcp tools from code, then filter, join, and aggregate inside a sandboxed runner so only the small final bits go back to the model doing this dramatically cuts tokens anthropic shows a typical case dropping from 150k tokens down to ~2k (98.7% savings) below a viz showing before/after

alxfazio's tweet photo. mcps are changing

turns out designing mcps to load every tool definition into the model prompt was a bad idea

anthropic’s nov 4 blog post suggests a new pattern

treat each mcp server like a normal code library, e.g. typescript modules or files, and let the agent write and run small programs that do two things:

discover only what is needed, list a servers directory to see what exists, open just the specific tool files, import only those functions

process data locally, call mcp tools from code, then filter, join, and aggregate inside a sandboxed runner so only the small final bits go back to the model

doing this dramatically cuts tokens

anthropic shows a typical case dropping from 150k tokens down to ~2k (98.7% savings)

below a viz showing before/after

41

1K

89

1K

88K

0

2

1

0

278

debuggingfuture

@debuggingfuture

7 ay önce

Perks for WFH in Singapore What should I stream when claude code is doing the work

debuggingfuture's tweet photo. Perks for WFH in Singapore

What should I stream when claude code is doing the work https://t.co/w55gXTNFON

0

0

0

0

94

debuggingfuture retweetledi

8 ay önce

The takeaway nobody wants to hear from yesterday, is that perps are a fundamentally unsound market foundation. Just like options markets outsizing equities markets in tradfi, it's unhealthy to have perps markets outsizing spot markets in crypto

50

336

24

26

25K

Sotwe'de En Son Ziyaret Edilenler

Senin İçin Trendler

En Popüler Kullanıcılar