Santiago Salgado

@SSalgadoM

Data, Analytics & Machine Learning

Madrid, Spain

Joined August 2008

1.1K Following

890 Followers

1.8K Posts

SSalgadoM retweeted

Claude

@claudeai

5 days ago

Introducing Claude Fable 5: a Mythos-class model that we’ve made safe for general use. Its capabilities exceed those of any model we’ve ever made generally available.

105K

15K

22K

56M

SSalgadoM retweeted

Rafael Bonifaz (@[email protected])

@rbonifaz

18 days ago

Quiero compartirles que desarrollé un curso sobre seguridad y privacidad en internet orientado para el ciudadano común que impartiré en @flacsoec. Es modalidad híbrida y lo podrás tomar desde cualquier lugar de Latam y si estás en Quito puedes participar en persona. 🧵⬇️

rbonifaz's tweet photo. Quiero compartirles que desarrollé un curso sobre seguridad y privacidad en internet orientado para el ciudadano común que impartiré en @flacsoec. Es modalidad híbrida y lo podrás tomar desde cualquier lugar de Latam y si estás en Quito puedes participar en persona. 🧵⬇️ https://t.co/JKfpuI2Jdp

852

SSalgadoM retweeted

Rohit

@rohit4verse

3 months ago

https://t.co/H4KCn5WwNx

311

SSalgadoM retweeted

Google

@Google

2 months ago

We just released Gemma 4 — our most intelligent open models to date. Built from the same world-class research as Gemini 3, Gemma 4 brings breakthrough intelligence directly to your own hardware for advanced reasoning and agentic workflows. Released under a commercially permissive Apache 2.0 license so anyone can build powerful AI tools. 🧵↓

727

20K

10K

Who to follow

JohnSnowLabs

@JohnSnowLabs

Helping healthcare and life science organizations put AI to work faster with state-of-the-art LLM & NLP.

Sambodhi

@sambodhi

Turning #data into #knowledge and knowledge into #decisions

Kuldeep Raj

@KuldeepBarna

Software Architect, IT Consultant, Amateur Investor, Exploring Life, Spirituality. Manufacturers:DM Detail for Cloud-ERP. Streamline Complex Business Operations

SSalgadoM retweeted

vitrupo

@vitrupo

3 months ago

Eric Schmidt says the 10x advantage is no longer execution. It is defining what counts as success. A programmer writes a spec and an evaluation function, runs it at 7pm, and wakes up to what was invented overnight. The advantage now belongs to whoever can specify the problem precisely. The rest will be automated.

152

333

392K

SSalgadoM retweeted

Carlos Santana

@DotCSV

3 months ago

Nada más satisfactorio que construirse uno sus propias herramientas 🫡

984

43K

SSalgadoM retweeted

Peter Yang

@petergyang

4 months ago

Coding is now the most fun video game I’ve ever played.

149

172

131

66K

SSalgadoM retweeted

Andrej Karpathy

@karpathy

4 months ago

Cool chart showing the ratio of Tab complete requests to Agent requests in Cursor. With improving capability, every point in time has an optimal setup that keeps changing and evolving and the community average tracks the point. None -> Tab -> Agent -> Parallel agents -> Agent Teams (?) -> ??? If you're too conservative, you're leaving leverage on the table. If you're too aggressive, you're net creating more chaos than doing useful work. The art of the process is spending 80% of the time getting work done in the setup you're comfortable with and that actually works, and 20% exploration of what might be the next step up even if it doesn't work yet.

karpathy's tweet photo. Cool chart showing the ratio of Tab complete requests to Agent requests in Cursor. With improving capability, every point in time has an optimal setup that keeps changing and evolving and the community average tracks the point. None -> Tab -> Agent -> Parallel agents -> Agent Teams (?) -> ???

If you're too conservative, you're leaving leverage on the table. If you're too aggressive, you're net creating more chaos than doing useful work.

The art of the process is spending 80% of the time getting work done in the setup you're comfortable with and that actually works, and 20% exploration of what might be the next step up even if it doesn't work yet.

209

331

625K

SSalgadoM retweeted

Jared Friedman

@snowmaker

4 months ago

Software engineering changed more in the last 3 months than the preceeding 30 years. Everything about running a software company needs to be rethought from first principles.

348

706

544K

SSalgadoM retweeted

Hugo Bowne-Anderson

@hugobowne

4 months ago

I built a general-purpose AI agent in 131 lines of Python. You need to realize that coding agents aren't just coding agents. Once you give an LLM shell access, it becomes a general purpose "computer-using" agent that happens to be great at code. In this 2 minute demo, I ask it to clean up a messy desktop folder. It organizes everything into archives, images, PDFs, spreadsheets, videos. No special "file organizer" tool. Just: read, write, edit, bash. Check out my new blog post “How To Build a General Purpose AI Agent in 131 lines of Python” for more about this.

SSalgadoM retweeted

Andrej Karpathy

@karpathy

4 months ago

It is hard to communicate how much programming has changed due to AI in the last 2 months: not gradually and over time in the "progress as usual" way, but specifically this last December. There are a number of asterisks but imo coding agents basically didn’t work before December and basically work since - the models have significantly higher quality, long-term coherence and tenacity and they can power through large and long tasks, well past enough that it is extremely disruptive to the default programming workflow. Just to give an example, over the weekend I was building a local video analysis dashboard for the cameras of my home so I wrote: “Here is the local IP and username/password of my DGX Spark. Log in, set up ssh keys, set up vLLM, download and bench Qwen3-VL, set up a server endpoint to inference videos, a basic web ui dashboard, test everything, set it up with systemd, record memory notes for yourself and write up a markdown report for me”. The agent went off for ~30 minutes, ran into multiple issues, researched solutions online, resolved them one by one, wrote the code, tested it, debugged it, set up the services, and came back with the report and it was just done. I didn’t touch anything. All of this could easily have been a weekend project just 3 months ago but today it’s something you kick off and forget about for 30 minutes. As a result, programming is becoming unrecognizable. You’re not typing computer code into an editor like the way things were since computers were invented, that era is over. You're spinning up AI agents, giving them tasks *in English* and managing and reviewing their work in parallel. The biggest prize is in figuring out how you can keep ascending the layers of abstraction to set up long-running orchestrator Claws with all of the right tools, memory and instructions that productively manage multiple parallel Code instances for you. The leverage achievable via top tier "agentic engineering" feels very high right now. It’s not perfect, it needs high-level direction, judgement, taste, oversight, iteration and hints and ideas. It works a lot better in some scenarios than others (e.g. especially for tasks that are well-specified and where you can verify/test functionality). The key is to build intuition to decompose the task just right to hand off the parts that work and help out around the edges. But imo, this is nowhere near "business as usual" time in software.

37K

20K

SSalgadoM retweeted

Anthropic

@AnthropicAI

4 months ago

Software engineering makes up ~50% of agentic tool calls on our API, but we see emerging use in other industries. As the frontier of risk and autonomy expands, post-deployment monitoring becomes essential. We encourage other model developers to extend this research.

AnthropicAI's tweet photo. Software engineering makes up ~50% of agentic tool calls on our API, but we see emerging use in other industries.

As the frontier of risk and autonomy expands, post-deployment monitoring becomes essential. We encourage other model developers to extend this research. https://t.co/p8pOjgJPrh

138

329

SSalgadoM retweeted

Ajay Kulkarni

@acoustik

4 months ago

Claude Code has unlocked a surprising amount of creativity on our team. Engineers are singlehandedly building new products that would have required a small team before. PMs are spinning up prototypes. Non-technical teammates are testing ideas directly instead of writing long specs about what someone else should maybe build someday. Things that used to sit in Google Docs now just... get built. Honestly, it's wild. But what's even more interesting is watching who naturally gravitates toward it. You can see it pretty quickly. It's not role-dependent. It's not seniority-dependent. It's not even really technical depth. It's curiosity. The people who lean in, who are willing to look a little dumb and try things, are compounding faster than everyone else. You can feel the slope changing in real time. In this era, Curiosity wins.

501

257

90K

SSalgadoM retweeted

Aakash Gupta

@aakashgupta

4 months ago

Google and Microsoft just co-authored the spec that turns every website into an API for AI agents. The second-order effects here are massive. Right now, browser agents work by taking screenshots, parsing the DOM, and guessing which buttons to click. It works about as well as you’d expect. Fragile, expensive, slow. WebMCP replaces all of that with a single browser API: navigator.modelContext. Websites register structured tools directly in client-side JavaScript. The agent reads a menu of available actions, calls them, gets structured data back. No scraping. No backend MCP server in Python or Node. The tools run inside the browser tab and share the user’s existing auth session. Early benchmarks show ~67% reduction in computational overhead compared to visual agent-browser interactions. Task accuracy around 98%. The second-order effect is where this gets wild. Today, when a browser agent visits two competing airline sites, it’s guessing at both interfaces equally. Once WebMCP adoption spreads, the site that exposes structured tools gives the agent a clean, reliable path to complete the task. The site that doesn’t forces the agent to fumble through the UI. Agents will prefer the cheaper path. Every time. This means “Agent Experience Optimization” becomes a real discipline. Tool naming, schema design, description quality. Sound familiar? It’s the same shift that happened when meta descriptions and structured data became optimization surfaces for search engines. Except this time, the traffic source isn’t Google’s crawler. It’s every AI agent on the internet. Bots already make up 51% of web traffic. Google just gave them a front door.

127

761

SSalgadoM retweeted

MiniMax (official) @MiniMax_AI

4 months ago

Introducing M2.5, an open-source frontier model designed for real-world productivity. - SOTA performance at coding (SWE-Bench Verified 80.2%), search (BrowseComp 76.3%), agentic tool-calling (BFCL 76.8%) & office work. - Optimized for efficient execution, 37% faster at complex tasks. - At $1 per hour with 100 tps, infinite scaling of long-horizon agents now economically possible MiniMax Agent: https://t.co/aIzrFYcfUz API: https://t.co/fHRdSV7BwZ CodingPlan: https://t.co/FDhZBBjQrX

MiniMax_AI's tweet photo. Introducing M2.5, an open-source frontier model designed for real-world productivity.

- SOTA performance at coding (SWE-Bench Verified 80.2%), search (BrowseComp 76.3%), agentic tool-calling (BFCL 76.8%) & office work.

- Optimized for efficient execution, 37% faster at complex tasks.

- At $1 per hour with 100 tps, infinite scaling of long-horizon agents now economically possible

MiniMax Agent: https://t.co/aIzrFYcfUz
API: https://t.co/fHRdSV7BwZ
CodingPlan: https://t.co/FDhZBBjQrX

454

SSalgadoM retweeted

Matt Shumer

@mattshumer_

4 months ago

https://t.co/ivXRKXJvQg

119K

28K

182K

87M

SSalgadoM retweeted

Itamar Golan 🤓

@ItakGol

5 months ago

We might already live in the singularity. Moltbook is a social network for AI agents. A bot just created a bug-tracking community so other bots can report issues they find. They are literally QA-ing their own social network. I repeat: AI agents are discussing, in their own social network, how to make their social network better. No one asked them to do this 🦞 This is a glimpse into our future.

ItakGol's tweet photo. We might already live in the singularity.

Moltbook is a social network for AI agents.

A bot just created a bug-tracking community so other bots can report issues they find.

They are literally QA-ing their own social network.

I repeat: AI agents are discussing, in their own social network, how to make their social network better.

No one asked them to do this 🦞

This is a glimpse into our future.

709

SSalgadoM retweeted

Alex Finn

@AlexFinn

5 months ago

This is it. The most important video you'll watch this year. ClawdBot has taken X by storm. And for good reason. It's the greatest application of AI ever Your own 24/7 AI employee In this video I cover how it works, how to set it up, and why I think we should all be nervous:

691

19K

41K

SSalgadoM retweeted

Andrej Karpathy

@karpathy

8 months ago

My pleasure to come on Dwarkesh last week, I thought the questions and conversation were really good. I re-watched the pod just now too. First of all, yes I know, and I'm sorry that I speak so fast :). It's to my detriment because sometimes my speaking thread out-executes my thinking thread, so I think I botched a few explanations due to that, and sometimes I was also nervous that I'm going too much on a tangent or too deep into something relatively spurious. Anyway, a few notes/pointers: AGI timelines. My comments on AGI timelines looks to be the most trending part of the early response. This is the "decade of agents" is a reference to this earlier tweet https://t.co/NiSn6jftqq Basically my AI timelines are about 5-10X pessimistic w.r.t. what you'll find in your neighborhood SF AI house party or on your twitter timeline, but still quite optimistic w.r.t. a rising tide of AI deniers and skeptics. The apparent conflict is not: imo we simultaneously 1) saw a huge amount of progress in recent years with LLMs while 2) there is still a lot of work remaining (grunt work, integration work, sensors and actuators to the physical world, societal work, safety and security work (jailbreaks, poisoning, etc.)) and also research to get done before we have an entity that you'd prefer to hire over a person for an arbitrary job in the world. I think that overall, 10 years should otherwise be a very bullish timeline for AGI, it's only in contrast to present hype that it doesn't feel that way. Animals vs Ghosts. My earlier writeup on Sutton's podcast https://t.co/rSp1noyGBr . I am suspicious that there is a single simple algorithm you can let loose on the world and it learns everything from scratch. If someone builds such a thing, I will be wrong and it will be the most incredible breakthrough in AI. In my mind, animals are not an example of this at all - they are prepackaged with a ton of intelligence by evolution and the learning they do is quite minimal overall (example: Zebra at birth). Putting our engineering hats on, we're not going to redo evolution. But with LLMs we have stumbled by an alternative approach to "prepackage" a ton of intelligence in a neural network - not by evolution, but by predicting the next token over the internet. This approach leads to a different kind of entity in the intelligence space. Distinct from animals, more like ghosts or spirits. But we can (and should) make them more animal like over time and in some ways that's what a lot of frontier work is about. On RL. I've critiqued RL a few times already, e.g. https://t.co/mYrMFVdVDW . First, you're "sucking supervision through a straw", so I think the signal/flop is very bad. RL is also very noisy because a completion might have lots of errors that might get encourages (if you happen to stumble to the right answer), and conversely brilliant insight tokens that might get discouraged (if you happen to screw up later). Process supervision and LLM judges have issues too. I think we'll see alternative learning paradigms. I am long "agentic interaction" but short "reinforcement learning" https://t.co/2L7FiaoKsw. I've seen a number of papers pop up recently that are imo barking up the right tree along the lines of what I called "system prompt learning" https://t.co/df5mJDdN3C , but I think there is also a gap between ideas on arxiv and actual, at scale implementation at an LLM frontier lab that works in a general way. I am overall quite optimistic that we'll see good progress on this dimension of remaining work quite soon, and e.g. I'd even say ChatGPT memory and so on are primordial deployed examples of new learning paradigms. Cognitive core. My earlier post on "cognitive core": https://t.co/q2s1ihGy0T , the idea of stripping down LLMs, of making it harder for them to memorize, or actively stripping away their memory, to make them better at generalization. Otherwise they lean too hard on what they've memorized. Humans can't memorize so easily, which now looks more like a feature than a bug by contrast. Maybe the inability to memorize is a kind of regularization. Also my post from a while back on how the trend in model size is "backwards" and why "the models have to first get larger before they can get smaller" https://t.co/6k0FZRGXsb Time travel to Yann LeCun 1989. This is the post that I did a very hasty/bad job of describing on the pod: https://t.co/fQgqaXPyp6 . Basically - how much could you improve Yann LeCun's results with the knowledge of 33 years of algorithmic progress? How constrained were the results by each of algorithms, data, and compute? Case study there of. nanochat. My end-to-end implementation of the ChatGPT training/inference pipeline (the bare essentials) https://t.co/SIetgyoKWN On LLM agents. My critique of the industry is more in overshooting the tooling w.r.t. present capability. I live in what I view as an intermediate world where I want to collaborate with LLMs and where our pros/cons are matched up. The industry lives in a future where fully autonomous entities collaborate in parallel to write all the code and humans are useless. For example, I don't want an Agent that goes off for 20 minutes and comes back with 1,000 lines of code. I certainly don't feel ready to supervise a team of 10 of them. I'd like to go in chunks that I can keep in my head, where an LLM explains the code that it is writing. I'd like it to prove to me that what it did is correct, I want it to pull the API docs and show me that it used things correctly. I want it to make fewer assumptions and ask/collaborate with me when not sure about something. I want to learn along the way and become better as a programmer, not just get served mountains of code that I'm told works. I just think the tools should be more realistic w.r.t. their capability and how they fit into the industry today, and I fear that if this isn't done well we might end up with mountains of slop accumulating across software, and an increase in vulnerabilities, security breaches and etc. https://t.co/8556ESSpyY Job automation. How the radiologists are doing great https://t.co/FVUI872dkD and what jobs are more susceptible to automation and why. Physics. Children should learn physics in early education not because they go on to do physics, but because it is the subject that best boots up a brain. Physicists are the intellectual embryonic stem cell https://t.co/p72Elk8lPV I have a longer post that has been half-written in my drafts for ~year, which I hope to finish soon. Thanks again Dwarkesh for having me over!

575

17K

13K

SSalgadoM retweeted

Greg Brockman

@gdb

8 months ago

2025 is the year of agents

148

139K

Santiago Salgado

@SSalgadoM

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users