Sergio ‘shadown’ Alvarez

Sapient Intelligence @Sapient_Int

22 days ago

If other providers don't lower their prices they are destined to go bankrupt, period.

DeepSeek

@deepseek_ai

22 days ago

We are making our discount permanent! 🎉 Enjoy building with DeepSeek-V4-Pro and bring your innovative ideas to life! 🚀

searchio retweeted

25 days ago

In this benchmark deep-dive, Sapient’s founders William and Guan are joined by research team members Changling and Yasin to unpack HRM-Text’s performance across MATH, DROP, ARC-Challenge, and MMLU. 📊 Beyond the scores, they discuss what each benchmark measures, how HRM-Text compares with larger models, and why efficiency matters. Watch the full discussion to learn more about HRM-Text and Sapient’s leaner path toward general intelligence.

29 days ago

@elonmusk Winning

2 months ago

This article shows how important it is to use several models to analyze and reason during the dataset generation and distillation process. This helps ensure that the dataset used to train the models is as accurate as possible. https://t.co/ldNzvSoIgN

3 months ago

@awnihannun Congrats! What will happen to MLX?

3 months ago

Yep

andrew chen

@andrewchen

3 months ago

AI is supposed to save me time, but now I find myself building stuff all evening and weekend and it's actually increasing my time in front of the computer WTF

searchio retweeted

Clara Bennett

@CodeswithClara

3 months ago

🚨BREAKING: Anthropic just dropped free courses to master AI with certificates. No tuition. No waitlist. No BS. Here're 10 courses that will replace a $50K degree👇

https://t.co/Yp9NQN3uv9

searchio retweeted

Andrej Karpathy

@karpathy

4 months ago

CLIs are super exciting precisely because they are a "legacy" technology, which means AI agents can natively and easily use them, combine them, interact with them via the entire terminal toolkit.

E.g ask your Claude/Codex agent to install this new Polymarket CLI and ask for any arbitrary dashboards or interfaces or logic. The agents will build it for you. Install the Github CLI too and you can ask them to navigate the repo, see issues, PRs, discussions, even the code itself.

Example: Claude built this terminal dashboard in ~3 minutes, of the highest volume polymarkets and the 24hr change. Or you can make it a web app or whatever you want. Even more powerful when you use it as a module of bigger pipelines.

If you have any kind of product or service think: can agents access and use them?

- are your legacy docs (for humans) at least exportable in markdown?
- have you written Skills for your product?
- can your product/service be usable via CLI? Or MCP?
- ...

It's 2026. Build. For. Agents.

searchio retweeted

Matthew Berman

@MatthewBerman

4 months ago

I'm one of the most advanced users of OpenClaw. OpenClaw + GPT5.3 Codex + Opus 4.6 has been the trifecta that changed everything. I made a video going over everything I'm doing with these tools. Learn these tools, stay ahead. Watch this video right now.

0:00 Intro
1:02 Overview
4:17 Sponsor
5:12 Personal CRM
7:11 Knowledge Base
8:30 Video Idea Pipeline
11:09 Twitter/X Search
12:47 Analytics Tracker
13:33 Data Review
15:34 HubSpot
16:13 Humanizer
16:52 Image/Video Generation
18:22 To-Do List
19:37 Usage Tracker (Saves Money)
20:45 Services
21:25 Automations
22:42 Backup
23:30 Memory
24:06 Building OpenClaw
25:22 Updating Files

5 months ago

Clawdbot (now Moltbot) shows how fast a killer AI agent idea can turn into a security mess. The perfect example of a brilliant idea with zero security foresight.

Lessons learned:
• Agentic AI is game-changing, but secure-by-default is non-negotiable.
• Hype moves fast, security must move faster.

searchio retweeted

5 months ago

New MIT + ETH Zurich + Improbable AI lab paper on scalable, low-overhead continual learning.

Shows Self-Distillation Fine-Tuning improves accuracy from 80% to 89% on knowledge acquisition tasks while reducing catastrophic forgetting.

It uses no extra parameters or reward models, works via in-context prompting,

---

arxiv .org/pdf/2601.19897v1

5 months ago

@LiuGang8964 @callmeMizuko Nice! I didn’t think about different sizes and alignment.

5 months ago

@callmeMizuko 7581 ?

searchio retweeted

5 months ago

Terence Tao predicts the end of math gatekeeping with AI. AI proof assistants are smashing the technical barriers that keep amateurs out. By automating verification, AI empowers anyone to contribute rigorous pro-level Math. The Ivory Tower is falling

searchio retweeted

5 months ago

🇨🇳 China put 256 GW of new solar on the grid in H1-25, while the whole world added 380 GW in that same window, so China alone was 67% of global additions.

For AI's progress, electricity abundance is becoming the absolute key competitive variable

And now 6 months of solar additions in China > decades of solar additions in the US.

searchio retweeted

Jonatan Viale

@JonatanViale

5 months ago

You always watch what matters on TN @todonoticias 🙌

searchio retweeted

Computer

@AskPerplexity

5 months ago

🚨 BREAKING: DeepSeek just dropped a fundamental improvement in Transformer architecture

CEO Wenfeng Liang on the author list

THE WHALE IS BACK 🐋 https://t.co/h57w5SF2pK

searchio retweeted

5 months ago

🚨 BREAKING: China's new opensource code model beats Claude Sonnet 4.5 & GPT 5.1 despite way fewer params.

SWE-Bench Verified (81.4%), BigCodeBench (49.9%), LiveCodeBench v6 (81.1%) - with just 40B-param model.

IQuest-Coder from Quest Research, backed by China’s quant hedge fund giant UBIQUANT.

UBIQUANT has leaned hard into AI for years, running teams like AILab, DataLab, and Waterdrop Lab.

As of Q3 2025, AUM sat at CNY 70–80B ($10.01–11.43B), with about 24% average returns from Jan to Nov 2025, and CNY 463M ($66.18M) paid out in dividends.

Bifurcated post-training delivers two specialized variants—Thinking models (utilizing reasoning-driven RL for complex problem-solving) and Instruct models (optimized for general coding assistance and instruction-following).

Efficient Architecture: The IQuest-Coder-V1-Loop variant introduces a recurrent mechanism that optimizes the trade-off between model capacity and deployment footprint.

Native Long Context: All models natively support up to 128K tokens without requiring additional scaling techniques.
