Benjamin Polge @benjaminpolge - Twitter Profile

Pinned Tweet

about 2 months ago

🧵 THREAD: I'm looking for solo founders who run their ENTIRE company with AI agents. No employees. No contractors. Just you and the machines. If that's you, or you know someone, read on. 👇

1

0

1

0

132

BenjaminPolge retweeted

Lisan al Gaib

@scaling01

3 days ago

Anthropic: AI written code is now as good as human written code and will be strictly better within the year!

26

539

26

32

23K

BenjaminPolge retweeted

Anthropic

@AnthropicAI

3 days ago

Our internal data shows Claude is accelerating AI development—a possible path to recursive self-improvement, or AI autonomously building a more capable successor. It’s happening faster than we thought, and the implications deserve greater attention. https://t.co/OVVPJO7VQx

2K

28K

5K

15K

18M

BenjaminPolge retweeted

Shital Shah

@sytelus

5 days ago

We are so happy to announce our new model Aion 1.0 today! Our team at AI Frontiers Lab at Microsoft Research had been cooking hard on this for quite a while. Aion 1.0 is a 14B model that can run locally with reasoning + tool calling capabilities. You can choose whatever agentic harness you like or make your own. Calls to the model never leave your device and no one charges you for any tokens you use 🥳.

64

1K

86

627

93K

BenjaminPolge retweeted

Liam

@Liamdbav

5 days ago

Claude Code quietly accumulates gigabytes of data over the course of its sessions. These files stay on disk indefinitely, even when they no longer serve any purpose. So I started building a tool that scans, measures, and safely deletes this data, with a clear indication of the risk before each deletion. Link in the comments, help yourselves!

7

40

8

81

12K

BenjaminPolge retweeted

Polymarket

@Polymarket

5 days ago

NEW: Uber is reportedly capping employee use of AI vibe-coding tools at $1,500 per month after blowing through its AI budget.

232

6K

363

625

8M

BenjaminPolge retweeted

Google AI Developers

@googleaidevs

5 days ago

Building autonomous agents for scientific discovery? 🧬🤖 @GoogleDeepMind Science Skills is now available on GitHub. We've open-sourced this specialized toolkit to accelerate your agentic workflows with scientific grounding and higher token efficiency. Download now ↓ https://t.co/cwp1HOeKvo

31

2K

269

1K

87K

BenjaminPolge retweeted

ClaudeDevs

@ClaudeDevs

5 days ago

How do you get Claude Code to check its own work before handing it back? Watch how you can encode your manual checks so Claude closes its own feedback loop:

118

5K

334

7K

414K

BenjaminPolge retweeted

Jeff Wang

@jeffwsurf

5 days ago

Today we are saying goodbye to Windsurf …and we are transforming it to Devin Desktop.

Windsurf has been an absolutely amazing experience for me and the team. Though it has been rocky at times, we have seen every phase of AI coding and we want to keep embracing where things are going. That means we need to once again reorient ourselves towards a more focused goal and remove the Windsurf branding.

Believe it or not, the Windsurf brand has been around less than a year and a half, and before that, the previous name Codeium was only around a similar timeframe as well. I’ve actually had to change my email every year all the way to the eventual acquisition to Cognition. In AI, most products only have a 1 year lifespan before you need to drastically change it to the next.

Devin now encompasses all our form factors, whether it’s the cloud agent, the agent command center (with IDE), CLI, review, or our other products. This way we can really focus our efforts around one name.

We are doubling down on our neutrality and making Devin Desktop compatible with other agents via ACP. We may be the only “Switzerland” of AI left and we embrace this role.

As for me, I’ll be transitioning from CEO of Windsurf to Cognition’s President of New Enterprise, helping open new regions and verticals, accelerating velocity, and filling in gaps as usual.

The story of Windsurf doesn’t end here, it continues on as part of Devin’s journey.

96

1K

57

169

275K

BenjaminPolge retweeted

Perplexity

@perplexity_ai

5 days ago

Today we're announcing that hybrid agentic inference is coming to Perplexity Computer. Computer can split tasks between a local model running on your machine and frontier models in the cloud. This keeps private data on your device and maximizes token efficiency. Coming soon.

145

2K

201

737

334K

BenjaminPolge retweeted

H

@hcompany_ai

5 days ago

Computer-use agents are moving from the cloud to your local machine. Fast. When we launched Holo3 two months ago, the production feedback was clear: digital agents need to be blazing fast, cost-effective, and versatile. Today, we're dropping Holo 3.1, engineered to run anywhere, instantly. Massive token throughput. Low latency. Ready for your local workflow!

35

496

71

366

226K

BenjaminPolge retweeted

Lisan al Gaib

@scaling01

5 days ago

Microsoft trained three DeepSeek-V3 sized models just for funsies and you are wondering if there's a compute gap between US and China lmao https://t.co/RInAtF9Akc

35

844

32

153

89K

BenjaminPolge retweeted

Google Antigravity

@antigravity

5 days ago

Keep your Antigravity workflow unified. Sync conversations in Antigravity 2.0 with Antigravity CLI by simply confirming the import.

46

763

53

171

52K

BenjaminPolge retweeted

ollama

@ollama

5 days ago

You can use Hermes Desktop with Ollama using local or cloud models. Get started 👇👇👇

33

1K

100

357

83K

BenjaminPolge retweeted

OpenAI

@OpenAI

5 days ago

Building apps has never been easier. With Sites, Codex can turn your work, ideas, and plans into an interactive website or app your team can explore, use, and share with a URL. Rolling out to Business and Enterprise plans, before expanding more broadly.

943

19K

2K

10K

9M

BenjaminPolge retweeted

elie

@eliebakouch

5 days ago

microsoft MAI tech report is a gold mine, one of the most transparent for a model at this scale.

this model uses zero synthetic data or distillation from previous models. this means reasoning, agentic behavior, tool use are all learned fully during post-training with no cold start. bold choice that makes it harder and requires more iterations to reach sota, but you get FULL control over your model series and it proves they are serious about being a frontier lab.

the tech report is insanely detailed and precise about numbers. to give an example, they give the exact MFU across all the iterations of the model, with the exact changes etc. they also share the full scaling ladder recipe, to my knowledge this is the first time i've seen this in a tech report at this scale

let's look at all of this in this likely very long thread 🧵

41

2K

264

2K

276K

BenjaminPolge retweeted

International Cyber Digest

@IntCyberDigest

6 days ago

❗️ Over 30 official Red Hat npm packages were compromised. How they got in:

- A Red Hat employee's GitHub account was compromised.
- Attackers pushed "orphan commits" (detached from branch history) straight in, bypassing code review with no pull request.
- Payload "Miasma" (Mini Shai-Hulud variant) steals GitHub/cloud/Vault/SSH/npm secrets. Rotate everything since June 1.
- The commits added a workflow (ci.yaml) + script (_index.js) that abused npm trusted publishing, requesting a real OIDC token to publish backdoored versions.

57

2K

448

466

194K

BenjaminPolge retweeted

Perplexity

@perplexity_ai

6 days ago

Introducing Search as Code, our new search architecture for AI agents.

It writes Python that calls our search stack directly, instead of looping through function calls one at a time.

Available in the Perplexity Agent API, and now default in Computer.

https://t.co/ut6GGWQTVO https://t.co/jrF2nQE3bC

153

2K

190

1K

553K

BenjaminPolge retweeted

ClaudeDevs

@ClaudeDevs

6 days ago

We've reset 5-hour and weekly rate limits for all users on Pro and Max plans. We fixed an issue that caused some Claude Code sessions to spawn excessive parallel subagents, burning through usage faster than expected.

1K

20K

1K

3M

BenjaminPolge retweeted

ARC Prize

@arcprize

6 days ago

Anthropic Opus 4.8 is new SOTA on ARC-AGI-3

Score: 1.5%, ~$10K

ARC-AGI-3 analysis notes:
* Opus 4.8 read the environment an abstraction *above* Opus 4.7, as objects & systems, not pictures
* Opus 4.8 succeeded on early levels, but still committed to a wrong sub-goal

https://t.co/PkQQ1u8NaX

53

1K

115

168

127K

BenjaminPolge retweeted

xAI

@xai

6 days ago

Composer 2.5 is now available inside Grok Build. Composer 2.5 is a fast, highly intelligent model that excels on long-running tasks and following complex instructions.

583

7K

833

1K

32M
