@tiwaridevesh @imdt29 - Twitter Profile

@tiwaridevesh @ImDT29

1 day ago

@TheCoderShow Going to be fun😄

1

0

51

ImDT29 retweeted

NVIDIA AI

@NVIDIAAI

3 days ago

We took a 30B model and split it in two to write tokens in parallel instead of one at a time. Introducing Nemotron-Labs-TwoTower: a diffusion language model from NVIDIA Research adapted from Nemotron-3-Nano-30B-A3B. Here’s how it works: one half holds the context, the other writes the tokens, with both reusing the pretrained model instead of training a new one from scratch. We found it kept 98.7% of the original model’s quality at 2.42× faster generation.

126

5K

602

3K

730K

ImDT29 retweeted

Rohan Paul

@rohanpaul_ai

13 days ago

ASML’s CEO: Europe is falling behind in AI hardware as the US is buying 80% of the world’s advanced chips while megafabs like Tesla’s TeraFab may demand capacity at the scale of millions of wafers per month.

wccftech.com/asml-ceo-warns-europe-is-quite-behind-in-ai-race-as-us-buys-80-of-advanced-chips/

13

134

31

27

24K

ImDT29 retweeted

27.

@dimvji

15 days ago

i should've locked in when i was thirteen

642

245K

34K

7K

3M

ImDT29 retweeted

Andrew Ng

@AndrewYNg

15 days ago

Over the last two weeks, both the U.S. Government and Anthropic took significant actions that demonstrated their power to control access to AI by restricting what others can do with frontier models. This has been one of those moments that, once seen, will be hard to unsee, and it is significantly accelerating many businesses’ and nation states’ efforts to ensure reliable access to AI that no one else can terminate.

Anthropic first released Claude Fable 5, a version of its Mythos model with additional guardrails, including some restrictions that seem well justified on safety grounds (such as limitations on applying it to hacking, bioweapons, and so forth). However, it also restricted developers’ ability to use it to build competing LLM technology. This move was concerning, given that the whole AI community, including Anthropic, has benefitted tremendously from open research. Indeed, the AI revolution was kicked off by my former team (Google Brain) freely publishing the Transformers paper! Imagine if Microsoft’s terms of use barred anyone from using their tools to build competitive software, or if Google barred using it to search for information to work on competing search engines. Anthropic’s argument that it was unsafe for others to be able to make advances in AI also rang hollow.

Initially, Anthropic silently degraded Fable 5’s performance for users detected to be working on LLM research through invisible interventions that weakened the model’s outputs without notifying the user. After significant backlash, it walked back this decision and decided to be transparent when it did this, but it still refuses to use its latest capabilities to help AI researchers.

This move represents a raw demonstration of power by Anthropic. It has used “safety” arguments to hinder potential competitors. Platforms succeed when they are viewed as stable, reliable partners that one can build on. The sudden rule changes by Anthropic (including a mandatory 30 day data retention policy for Fable usage) have made developers wonder about the stability of building on any one proprietary LLM provider, not just Anthropic.

The U.S. Government then shortly followed with an even greater demonstration of power. It used the Commerce Department’s authority to regulate technologies that may be national security threats to restrict exports of Mythos and Fable, requiring a license for use by any foreign national, whether inside or outside of the U.S., including employees of Anthropic. This led Anthropic to disable access to Fable to all users worldwide.

Sam Altman pointed out, referring to Anthropic, “It is clearly incredible marketing to say, ‘We have built a bomb, we are about to drop it on your head. We will sell you a bomb shelter for $100 million.’” But when one engages in this type of fear-based marketing, it increases the odds that the U.S. Government will agree with you and slap export controls on the bomb you say you have built. To be clear, I don't think Anthropic has built anything like a bomb, and I don't think export controls on Fable are appropriate.

However, following the U.S. Government making this move, many nations, including U.S. allies, saw how the U.S. can suddenly yank their access to AI models. In many capitals around the world, this has spurred discussions on AI sovereignty and how others can ensure uninterrupted access to this critical technology.

For decades, many nations were comfortable having many parts of their supply chain rely on the U.S., China, and other major producers. Once a nation issues a threat, or takes action, to limit other nations’ access, other nations will rationally try to secure alternatives. For decades, semiconductor manufacturing in China made slow progress; once the U.S. moved to limit China’s access, China’s efforts kicked into high gear. Similarly, once China threatened U.S. access to rare earth minerals, U.S. efforts to secure alternatives accelerated.

Now that it has become crystal clear that private U.S. companies and the U.S. government can limit, in short order, other nations’ access to frontier AI models, the incentive of others to invest more in alternatives like open source grows significantly. Of course, training frontier models is not easy, so it remains to be seen how successful they are, but we have crossed the rubicon.

Satya Nadella wrote an essay about the importance of building a healthy ecosystem on top of frontier AI technology. I heartily agree with him, and hope this week’s events will ultimately prove to be constructive steps toward this. I hope we can build a more free, more open world, where research is freely shared, and laws and societal norms shape a level playing field that allows everyone to make progress. A silver lining of the events of these past two weeks is now that everyone better realizes key points of instability of the current system, we can all work to create a more stable foundation.

[Original text: The Batch newsletter]

165

1K

278

394

153K

@tiwaridevesh @ImDT29

17 days ago

Kimi Agent is impressive https://t.co/h2La4kjgRv

0

5

@tiwaridevesh @ImDT29

17 days ago

MiniMax-M3 comes with MSA (MiniMax Sparse Attention). Compared to MiniMax 2.7, it reduces the computational complexity from quadratic to linear. I have written a small article on this: https://t.co/2wf8dzK6kJ Thanks to @rasbt for the architecture diagram. https://t.co/iW6NFHVRbT

0

1

0

85
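
The quadratic-versus-linear claim in the MiniMax-M3 post above can be illustrated with a rough count of query-key pairs. This is a generic sketch of sparse attention cost, not MiniMax's actual MSA mechanism, which the post does not describe. The per-token key budget `k` and both function names are hypothetical, chosen only for illustration.

```python
def dense_attention_pairs(n: int) -> int:
    """Query-key pairs scored by full causal attention over n tokens.

    Token i attends to itself and every earlier token, so the total
    is 1 + 2 + ... + n, which grows quadratically in n.
    """
    return n * (n + 1) // 2


def sparse_attention_pairs(n: int, k: int) -> int:
    """Pairs scored when each token attends to at most k keys.

    ASSUMPTION: a fixed per-token budget k stands in for whatever
    selection rule a real sparse-attention layer uses. With k fixed,
    the total grows linearly in n.
    """
    return sum(min(i + 1, k) for i in range(n))


if __name__ == "__main__":
    for n in (1_000, 2_000, 4_000):
        print(n, dense_attention_pairs(n), sparse_attention_pairs(n, k=64))
```

Doubling the sequence length roughly quadruples the dense count but only doubles the sparse count, which is the sense in which the cost moves from quadratic to linear.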

ImDT29 retweeted

@tiwaridevesh @ImDT29

18 days ago

How far we have come💀

0

1

0

20

ImDT29 retweeted

Yann LeCun

@ylecun

24 days ago

@ClementDelangue @Dan_Jeffries1 Everyone, please join Project Tapestry https://t.co/5MOgouVplV

47

1K

166

727

436K

ImDT29 retweeted

Joseph Suarez 🐡

@jsuarez

25 days ago

You can train drones in 30 seconds. This was a year ago. PufferLib is 5x faster now!

17

2K

118

1K

161K

ImDT29 retweeted

Sebastian Raschka

@rasbt

25 days ago

Always back to the basics: LatentMoE was probably inspired by MLA, which was inspired by LoRA, which was inspired by SVD, which was inspired by eigendecomposition. https://t.co/bWqo5iOPbP

29

793

93

460

35K
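
The common thread in the chain named in the post above (LoRA back to SVD) is low-rank factorization: replacing one large matrix with the product of two thin ones. The sketch below only compares parameter counts. The layer size and rank are assumed values for illustration, and the function names are hypothetical, not taken from any library.

```python
def dense_params(d_out: int, d_in: int) -> int:
    """Entries in a full d_out x d_in weight matrix."""
    return d_out * d_in


def low_rank_params(d_out: int, d_in: int, rank: int) -> int:
    """Entries in a rank-`rank` factorization B @ A,
    with B of shape (d_out, rank) and A of shape (rank, d_in)."""
    return rank * (d_out + d_in)


if __name__ == "__main__":
    # ASSUMPTION: a 4096 x 4096 layer and rank 8, chosen only as an example.
    d_out = d_in = 4096
    rank = 8
    full = dense_params(d_out, d_in)
    small = low_rank_params(d_out, d_in, rank)
    print(f"full: {full}, low-rank: {small}, ratio: {full / small:.0f}x")
```

A truncated SVD gives the best approximation of a given rank for an existing matrix, while LoRA trains the two thin factors directly. The parameter saving is the same in both cases.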

@tiwaridevesh @ImDT29

25 days ago

@vvinit594 👏

1

0

11

ImDT29 retweeted

Sakana AI

@SakanaAILabs

27 days ago

Building AI that builds AI: RSI Lab launches https://t.co/Tvc1zQyKkk

Sakana AI is launching "RSI Lab" in Tokyo, a dedicated research group working on Recursive Self-Improvement (RSI). RSI is the mechanism by which AI builds AI itself.

Over the past two years, we have built up a body of research including LLM-Squared, Darwin Gödel Machine, Shinka Evolve, ALE-Agent, Digital Red Queen, and The AI Scientist. Each is a step toward a single cycle in which models designed for agentic use give rise to AI that conducts research automatically, and that AI in turn produces even better models.

The idea of self-improving AI is no longer ours alone. In 2026, RSI has become a major trend, and organizations championing this idea are launching one after another around the world. Amid this, Sakana AI has, since its founding, pioneered its own approach to developing AI without relying on sheer amounts of compute.

Our aim is to realize RSI without pouring in unlimited compute. We believe this research is worth pursuing precisely because we are in Japan, where it is difficult to compete with the world's top countries on compute scale. To advance RSI research responsibly, we will draw on our experience and share our findings with the community as we go.

We will bring together researchers and engineers from Japan and abroad to form the Sakana AI RSI Lab. We look forward to collaborating with everyone who gets involved in this effort, in whatever form.

30

644

107

233

81K

ImDT29 retweeted

Manu Arora

@mannupaaji

about 1 month ago

AGENTS.md DESIGN.md CLAUDE.md SKILL.md REQUIREMENTS.md PROPOSAL.md PLAN.md . . GOD-HELP-ME.md

253

8K

707

891

218K

ImDT29 retweeted

Vishal Tiwari ( Believer) @VishalT12094272

2 months ago

Buddy2Buddy is getting into @ycombinator with this: a platform where u can rent a Buddy for shopping, movies, local tourist places, or a night out https://t.co/fZgr33JtYM @_shivamtiwari9 @ommtiwariii @ImDT29

2

20

4

0

719

ImDT29 retweeted

The Wall Street Journal

@WSJ

2 months ago

Tim Cook, the longtime leader of Apple, is stepping down after transforming the company into a titan of the tech industry, handing the reins to veteran engineer John Ternus. Here’s the advice that Cook said he’d give his successor. 🔗 Watch more: https://t.co/RTC8pncVlH

48

377

96

76

77K

ImDT29 retweeted

Harkirat Singh

@kirat_tw

3 months ago

noida is mad.

99

4K

110

95

59K

ImDT29 retweeted

clem 🤗

@ClementDelangue

3 months ago

Introducing Kernels on the Hugging Face Hub ✨ What if shipping a GPU kernel was as easy as pushing a model? - Pre-compiled for your exact GPU, PyTorch & OS - Multiple kernel versions coexist in one process - torch.compile compatible - 1.7x–2.5x speedups over PyTorch baselines

67

2K

220

722

209K

@tiwaridevesh @ImDT29

3 months ago

@VishalT12094272 @lifiprotocol U got a user for this product... really loved this

0

4

0

55
