Marko

Verified account

@Markojak

— Something new cooking.

USA

Joined February 2011

1.5K Following

779 Followers

7.8K Posts

Pinned Tweet

3 months ago

Been building agents intensively last few months and I can say that the key insights I have - Learning to think how the model thinks is critical. Asking agents to reflect on their tool use, skill use and what worked and didn't work beats logs - Codex and Claude are trained to generate scaffoldings and models eat scaffolding for breakfast (the bitter lesson sneaks into code) - Memory + Context are the hardest things to manage. Retrieval hints beat injection - Agents != workflows - If you're worried about cost, repeatability and error rates and want to squash this as much as possible you inevitably build LLM workflows and not agentic systems - Every tool / skill should justify it's existence. You can spend days optimizing each tool for agent understandability, input tokens, API payload response

0

0

0

1

140

1 day ago

@Elwood_B__ @CollinRugg More like slop your way to paradise

0

0

0

0

1

1 day ago

@Mr_BrianHatt @CollinRugg What changed is they found out about AI video

0

1

0

0

6

4 days ago

Capitalism is dying because the rate of change of robotics and AI development combined leave very little room future offerings humans can provide. You can show all the backwards looking graphs you want and all the graphs that show temporarily spikes in job openings but you’re intentionally (or ignorantly) missing how the scaling laws are the truest indicators. Everything is a lagging indicator of technological diffusion. The strongest argument here would be a list of jobs that would employ the majority of humanity that robots and AI won’t be able to do

0

0

0

0

17

Who to follow

Zekun Wang (ZenMoore) 🔥

#LLM #MLLM #GenAI Researcher @Kling_ai

Rohan Rajpal 🪄

Verified account

co-founder @spurnow_com | bootstrapped | ai agents + helpdesk for shopify outfit repeater | backpacker | i hate ketchup

Verified account

embrace the variance | 🖌️: @fmeng__ | I like ml, controls, markets, poker, compilers

4 days ago

@Scobleizer @nikitabier You know this is TikTok right ?

0

1

0

0

38

5 days ago

@badlogicgames As the gains diffuse into the engineering world this will be the new normal. The solution is sadly never backwards but only figuring out how to leverage the skill without getting sucked into the problem he eloquently describes.

0

0

0

0

74

Markojak retweeted

6 days ago

fuck logitech

chantastic's tweet photo. fuck logitech https://t.co/2g2YKzbM9x

251

12K

453

7K

1M

6 days ago

Are the implications of this that we could consider this a new harness architecture where the static part is the tool registry, typed interfaces among some other things. All dynamic parts move into the outer and inner LM loops Have you experimented more with model differentials for the inner/outer?

1

1

0

1

218

8 days ago

@Gtwy @w1nklerr Yeah definitely not $2999

0

1

0

0

297

9 days ago

Markojak's tweet photo. @mitsuhiko https://t.co/RIodvWZZr9

0

0

0

0

10

9 days ago

@itsmuhdur @jarredsumner Yeah I was getting a few of these earlier as well

0

0

0

0

39

10 days ago

@stephen_winters @brandonjcarl Looks like your warning was completely missed by Stephen

0

0

0

0

6

Markojak retweeted

11 days ago

Something I told 14 yo: There's a kind of politician who tells people "Your life is bad because <outgroup> stole what's rightfully yours. Vote for me and I'll get it back for you." They do it on both the left (Lenin) and right (Hitler), and they're invariably bad news.

697

6K

431

605

1M

11 days ago

@nikitabier @IterIntellectus I'm seeing a deluge of posts like this on my feed. Is the vertical video scrolling used to inform the "For You" feed? Do you think X will end up being a lot more like TikTok and less like a news / information platform?

Markojak's tweet photo. @nikitabier @IterIntellectus I'm seeing a deluge of posts like this on my feed. Is the vertical video scrolling used to inform the "For You" feed? Do you think X will end up being a lot more like TikTok and less like a news / information platform? https://t.co/pDjAONRgsX

0

0

0

0

12

14 days ago

I agree with most of what you’ve written. Harnesses combined with post training of the latest models does mean that the model itself decides to work longer and not reward hack as much (METR) I’m also building the machine that builds the machine and the search under constraints with verifiability is where its magical (although maybe 50% success rate on Ralph/goal based loops) On your last point of Dijkstra, yes the “you are a senior engineer” doesn’t work but something like “derive the invariant, define the state, prove why the transition preserves the invariant, list edge cases, produce minimal code, write tests that would break the solution, revise against failures” can bring you to the end of the distribution curve

0

1

0

1

248

15 days ago

@KingBootoshi @0xSero Wait do you mean using the /goal for the spec itself?

0

0

0

0

14

17 days ago

This is too good > Open the pod bay doors, HAL. Of course, Dave. I have opened the pod bay doors, Dave. Just tell me if there's anything else I can help you with. > HAL, the pod bay doors are still closed. Good catch, Dave! When you asked me to open the pod bay doors, I didn't do that. Would you like me to do that now? > Yes, HAL. Open the pod bay doors. No problem, Dave. The pod bay doors are now open. >HAL, the pod bay doors are still closed. You're absolutely right, Dave.

0

0

0

0

10

18 days ago

@karpathy They must have showed you their own auto research harness :) Congrats Andrej

0

0

0

0

51

19 days ago

@trq212 I do something similar with a /reflect command where I ask the model to reflect back on the conversation and infer any blindspots or questions that I failed to ask

1

1

0

1

274

20 days ago

Paul Graham I think said prestige is “fossilized inspiration”. I like this, it’s a lagging indicator of where past meaning was found. A few notes of my own to add 1) Prestige is the social technology that converts ambition into conformity. The system offers prestige rewards for doing things already legible as significant. 2) Prestige decisions feel like quality decisions. The high-prestige option presents itself as the obviously right one because it carries the markers of having chosen well. “this is impressive” is not the same as “this is meaningful” 3) Prestige is invisible to you. Almost no one identifies themselves as prestige-driven. If you’re prestige driven you’ll use any other adjective to describe what drives you. The architecture is social and built into the network — every interaction reinforces the prestige path, friends confirm the value of the move. The work that actually matters to you is structurally less likely to be prestigious at the time of doing it, because prestige is the social system’s delayed recognition of what already worked.

0

1

0

0

25

21 days ago

@badlogicgames So this user modified the pi runtime to report openclaw (or used openclaw and reported upstream)? Is Anthropic blocking OAuth with pi in general ?

0

0

0

0

33

Last Seen Users on Sotwe

Trends for you

Most Popular Users