Get paid to wait
The Claude Code spinner might be the most watched line on Earth.
So I turned it into an ad marketplace.
Advertisers bid on it. You keep 50% of the money.
Install the extension → get cash from ads.
Introducing Kickbacks
Very useful blog showing how much fear-mongering is going on by the closed/frontier labs! IPO-hype, no open releases, let's slow down progress.
I understand that it's part of their strategy but it doesn't need to be at the expense of other firms, business models and research.
New blog.
I looked into the actual evidence and what models were used by bad actors to see whether closed models are safer.
Turns out: Nope, they are used to hack, misinform and scam. There is one exception, though.
Link in replies.
@GergelyOrosz I think they are correct in thinking about AI safety but I feel like as a group they are in a bubble of their own making. Too much group-think which prevents them from stepping back and viewing the situation objectively. Plus IPO-hype.
Saw this interaction on Twitter yesterday and built it in a couple of tries with @Rork. If your apps look like AI slop, just get better at prompting 🤷🏽♂️
Today we’re releasing DeepSWE, a new standard for agentic coding benchmarks.
On public leaderboards, top models often look relatively close in capability. DeepSWE shows where they actually diverge, reflecting the realistic experience of developers in their day-to-day work.
We’ve shipped a security-guidance plugin for Claude Code that helps identify and fix vulnerabilities as you’re writing code.
Available for all Claude Code users. Install from the plugin marketplace (/plugins).
@ponnappa Assuming that the C-suite was intelligent enough to recognize what they don't understand well but were still able to attract and hire the right chief AI officer and 'listen' to them!
This is hands down the best book on writing I have read. Mine is full of margin notes.
It uses examples in notable writing to explain how sentence structure is a tool to convey emotion, meaning, etc.
Directionally correct, and a clever hack to expand the constraints of a CLI tool. But still only single player and working within CLI limitations.
Excited we're doing the larger vision of this in Ace. Multiplayer, docs as first-class primitives inside an agentic coding workspace
Not enough people are talking about how much AI is impacting the role of data science.
I was chatting with a DS friend, and he said that most of his team's work now is reviewing half-assed AI data analysis from PMs and engineers. And that 50% of the time, that analysis is wrong.
The role is becoming less fun.
This paper could not have been written without the help of my amazing Tübingen co-authors, @guinansu , Yanwu Yang and Xueyan Li!
Finally, the link to the paper is: https://t.co/SZZ8OIKMaG
where we also link to code, data and models.
Running an in-person Hermes Buildathon this Sunday (17 May), in collaboration with folks at @GrowthX_Club.
Mumbai folks, come hang out and build something with us. Register here → https://t.co/beL7swZDE6
Gave a talk at Microsoft Research on how @lossfunk operates.
We're still iterating, but so far our style of operating has resulted in four papers accepted at the ICML, RLC, ACL and ICLR main conferences (+ many workshops).
All undergrads, btw! 🚀
Slides below 👇
Big congratulations to @__kunvar__ for this major milestone!
Super grateful to have had the opportunity to partner up early via @except_raised when the project was still a “what if?” thought.
The ICML acceptance makes him the first solo, independent author from India to achieve this milestone!
Onwards & upwards! 🚀