ukituki @ukituki - Twitter Profile

Pinned Tweet

over 2 years ago

Not your weights, not your algo, not open ai. We know that nondeterminism is a feature, but LLMs randomly leaking training data will be an important attack vector

Alex Ker 🔭

@thealexker

over 2 years ago

Wild: GPT-3.5 leaked a random dude's photo in the output... Lesson: what you upload online will probably become training data.

84

1K

162

392

594K

1

13

0

2

2K

ukituki @ukituki

3 days ago

Robinhood but for real

0xflorent.eth

@0xFlorent_

4 days ago

First white-hat exploit on Ethereum: I unlocked 1,003.62 Ξ ($2,000,000) trapped in a 2016 ICO smart contract for 9 years. The 48 original investors can now claim their funds.

https://t.co/lyh5iyaDu7

379

5K

377

873

534K

0

1

0

40

ukituki @ukituki

4 days ago

It rhymes well with the classical 12 leverage points for system intervention https://t.co/LVY3RSg6ok

Visa is doing marketing consults (see pinned!)

@visakanv

4 days ago

here is some vague abstract advice, may it be weirdly relevant to whatever specific thing you're stuck on

22

340

23

147

11K

0

39

ukituki @ukituki

4 days ago

Docs (full tutorial): https://t.co/4ymPTSKcPY
Docs (optimization): https://t.co/D1OBnKVAu9
Haiku scoring metric (with 20 sub-checks) by @dbreunig: https://t.co/9Ulufvyetd

0

1

0

1

67

ukituki @ukituki

4 days ago

This is the initial prompt:
"Write a classical haiku given the provided inputs."

The screenshot shows the new version.

This is how @DSPyOSS adds clarity:
- express intent in logical building blocks
- add your eval criteria + dataset
- GEPA optimization algo

https://t.co/2G3CE5CQsi

1

14

2

4

889
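A minimal sketch of what "eval criteria" can look like for the haiku task above: a score built from a few independent sub-checks. This is not the @dbreunig metric linked earlier (that one has 20 sub-checks and is not reproduced here). The function names, the four checks, and the vowel-group syllable heuristic are all assumptions made for illustration.

```python
import re


def count_syllables(word: str) -> int:
    """Rough syllable estimate: count vowel groups (a heuristic, not phonetics)."""
    lower = word.lower()
    count = len(re.findall(r"[aeiouy]+", lower))
    # Treat a trailing 'e' as silent unless the word ends in 'le'.
    if lower.endswith("e") and not lower.endswith("le") and count > 1:
        count -= 1
    return max(count, 1)


def line_syllables(line: str) -> int:
    """Sum the estimated syllables of every word on a line."""
    return sum(count_syllables(w) for w in re.findall(r"[A-Za-z']+", line))


def haiku_score(text: str) -> float:
    """Return the fraction of sub-checks passed (hypothetical checks)."""
    lines = [ln.strip() for ln in text.strip().splitlines() if ln.strip()]
    has_three = len(lines) == 3
    checks = [has_three]
    # One sub-check per line for the classical 5-7-5 pattern.
    for i, target in enumerate((5, 7, 5)):
        checks.append(has_three and line_syllables(lines[i]) == target)
    return sum(checks) / len(checks)
```

A score like this gives an optimizer a graded signal rather than a single pass/fail, which is the role the eval criteria play in the list above.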

ukituki @ukituki

5 days ago

@kanekallaway Turtle traders but for content 👌

0

27

ukituki @ukituki

5 days ago

@wojventures @SudoCorentin @AniC_dev @radek_baczynski Radek spotted, small world🤫

1

2

0

94

ukituki @ukituki

6 days ago

@visakanv having the ideas expressed at the right level of compression is already half of the battle won. The issue with X is poor discoverability, so the value doesn't have much room to spread outside the narrow recency window. Substack is also not perfect, but SEO juice flows better

0

21

ukituki @ukituki

9 days ago

@wojventures The overlap with those whose work is visible online is tiny

0

152

ukituki @ukituki

11 days ago

Tokenmaxxing is the symptom 🤫

Ashwin Gopinath

@ashwingop

13 days ago

Misreading the Bitter Lesson is how agents end up burning fortunes rebuilding context. Expensive amnesia, paid to anthropic in tokens. The fix: semantic state at ingestion, ontology at retrieval, tiny models for traversal, frontier models for judgment.

4

148

14

262

25K

0

1

0

613

ukituki retweeted

Isha Puri

@ishapuri101

13 days ago

It's never made sense to me that RL collapses all reward signals to a single scalar. Today, we fix that! Introducing Vector Policy Optimization: we train models to inherently optimize for the varied nature of a reward vector, creating diverse sets of answers ideal for test time search. Website and code coming soon!

11

712

67

575

68K

ukituki retweeted

Yohei

@yoheinakajima

15 days ago

i'm excited to open source Active Graph: an event-sourced reactive graph runtime for long-running agents 🔄🧠

events/logs project a graph. reactive behaviors react and affect the graph. fork-and-diff agent runs. no A2A, no workflows, no DAG

site: https://t.co/Bbknu3ieUi
docs: https://t.co/HAnKYjrZxZ
github: https://t.co/jXQpMcyP1n
quick start: pip install activegraph

this is an early experiment in a new paradigm for agent architecture 🧪

57

533

53

596

95K

ukituki @ukituki

14 days ago

Zuck quoted in Ben Evans presentation + 8k ppl fired = system collapses under higher velocity and oversupply

It's either: "we really can't handle and integrate all the new opportunities without sacrificing the revenue"

or

"we need to do less better"

https://t.co/DyIBp7tHaJ https://t.co/aY0PwNfj1S

0

36

ukituki @ukituki

14 days ago

Important yet intuitive idea: agents need a dynamic table of contents to navigate long-context tasks

Joshua Gu

@astrogu_

14 days ago

Recent agentic systems (Claude Code, Codex, RLM, etc.) push context out of the prompt and into the environment (e.g., as files). This helps them maintain long-term knowledge about their goals and functionality.

🚨 While this is a good idea, we show a surprising result: systems that use external environments like this perform much better when given a small, fixed-size, in-context, agent-managed cache that "peeks into" these environments.

🚀 Our paper, PEEK: a system for building and maintaining an orientation cache for LLM agents, introduces this idea.

Compared with strong baselines, including RAG, Compaction Agents, and SOTA prompt-learning frameworks, PEEK dominates the cost–quality Pareto frontier: achieving +6.3–34.0% in quality, with fewer iterations and lower cost.

Paper: https://t.co/67pm4Dqbw5
GitHub: https://t.co/JNMehuzN9M

More in the thread below! (1/N)

17

353

37

484

109K

0

2

0

51
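A rough sketch of the "small, fixed-size, in-context, agent-managed cache" idea from the quoted PEEK tweet. It is not the PEEK implementation: the class name, the methods, and the oldest-first eviction policy are assumptions made for illustration, and the paper may manage its cache quite differently.

```python
from collections import OrderedDict


class OrientationCache:
    """Hypothetical fixed-size set of notes about files in the environment."""

    def __init__(self, max_entries: int = 4):
        self.max_entries = max_entries
        self._notes = OrderedDict()  # path -> short summary, oldest first

    def note(self, path: str, summary: str) -> None:
        """Add or refresh a note, evicting the oldest when over capacity."""
        self._notes.pop(path, None)  # re-adding moves the note to the end
        self._notes[path] = summary
        while len(self._notes) > self.max_entries:
            self._notes.popitem(last=False)  # assumed policy: drop the oldest

    def drop(self, path: str) -> None:
        """Let the agent discard a note it no longer needs."""
        self._notes.pop(path, None)

    def render(self) -> str:
        """Text block intended to be placed in the prompt on every step."""
        return "\n".join(f"- {p}: {s}" for p, s in self._notes.items())
```

The point of the fixed size is that the rendered block stays cheap to include on every step, while the full context keeps living in the environment.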

ukituki @ukituki

17 days ago

@termsheetinator 300

0

1

0

26

ukituki retweeted

Jediwolf

@Jediwolf

21 days ago

What happens when you post a real Monet and say it’s AI? The coolest art social experiment I’ve seen in a while. Thank you @SHL0MS

980

21K

3K

6K

2M

ukituki @ukituki

26 days ago

Depth-first network is the life hack for the distraction era: a bunch of folks that get it and have aligned incentives beat wide and shallow networks all the time. It’s a meta heuristic that works everywhere:
- friends
- feedback from the right ICP
- fewer tools + smart defaults

Incentivising

@incentivising

27 days ago

Game theory proves that the size of your network is not the most important factor at all. If you have a network of a thousand weak ties with no mutual dependency, it will produce near-zero results most of the time. And when put under pressure, it collapses immediately. You should focus on a network of twelve people with highly overlapping incentives and clear reciprocity structures. It will outperform a grand network every time. That's because the brain's social cognition system can only maintain a high sense of trust with a limited number of people. Beyond that, everything feels transactional. Depth beats width every time. Aligned people outwork the crowd every time.

21

1K

183

606

31K

0

1

0

41

ukituki @ukituki

28 days ago

Brian Eno’s Oblique Strategies approve of this direction. Another banger piece of research from Omar’s lab, and RLM is not even half a year old 👌

Omar Khattab

@lateinteraction

29 days ago

I’ve never been this excited about search. 6-7 years ago, IR got an influx of the paradigms we still use, all enabled by the big headroom MS MARCO and then BEIR created. Then progress slowed. Today, Diane releases perhaps the most ambitious IR benchmark to date: OBLIQ-Bench. Queries in it are meant to be increasingly opaque to current first-stage retrieval paradigms. Oblique queries put the bottleneck very early in the search process, as the relevance of a document to the query is quite latent. I can't wait for core IR research on fundamentally more powerful paradigms for first-stage search to be reignited again. Stay tuned for more stories about this, and read Diane's thread and her paper below!!

8

361

44

212

39K

0

38