Aryan Kargwal @aryankarg - Twitter Profile

aryankarg retweeted

Tyler

@rezoundous

about 2 months ago

Opus 4.7 is insane guys. It one shotted my session usage limit.

597

31K

1K

857

1M

aryankarg retweeted

Rhys

@RhysSullivan

4 months ago

Claude: “I estimate this will take 1-2 weeks to complete” Me:

289

34K

2K

964K

aryankarg retweeted

Satya Nadella

@satyanadella

4 months ago

Our newest AI accelerator Maia 200 is now online in Azure. Designed for industry-leading inference efficiency, it delivers 30% better performance per dollar than current systems. And with 10+ PFLOPS FP4 throughput, ~5 PFLOPS FP8, and 216GB HBM3e with 7TB/s of memory bandwidth it's optimized for large-scale AI workloads. It joins our broader portfolio of CPUs, GPUs, and custom accelerators, giving customers more options to run advanced AI workloads faster and more cost-effectively on Azure.

710

4K

640

602

847K

aryankarg retweeted

Manthan Gupta

@manthanguptaa

6 months ago

I spent the last few days prompting ChatGPT to understand how its memory system actually works. Spoiler alert: There is no RAG used https://t.co/zxvRRP2GK8

manthanguptaa's tweet photo. I spent the last few days prompting ChatGPT to understand how its memory system actually works.

Spoiler alert: There is no RAG used

https://t.co/zxvRRP2GK8 https://t.co/BxGOlGY9kg

223

4K

485

8K

2M

Who to follow

https://t.co/H9aFun39zc

Sejal Mohata

@SejalMohata26

overthinking paglu 🫠🎀

aryankarg retweeted

Sam Altman

@sama

7 months ago

Small-but-happy win: If you tell ChatGPT not to use em-dashes in your custom instructions, it finally does what it's supposed to do!

3K

29K

1K

3K

7M

aryankarg retweeted

Dr Singularity

@Dr_Singularity

8 months ago

This is insane. New AI model from Samsung, 10,000x smaller than DeepSeek and Gemini 2.5 Pro just beat them on ARC-AGI 1 and 2 Samsung’s Tiny Recursive Model (TRM) is about 10,000x smaller than typical LLMs yet smarter because it thinks recursively instead of just predicting text. It first drafts an answer, then builds a hidden "scratchpad" for reasoning, repeatedly critiques and refines its logic (up to 16 times), and produces improved answers each cycle. This approach shows that architecture and reasoning loops (not just size), can drive intelligence. It enables powerful, efficient models that run cheaply, validate neuro symbolic ideas, and open highest quality reasoning to far more applications. Acceleration is everywhere

Dr_Singularity's tweet photo. This is insane.

New AI model from Samsung, 10,000x smaller than DeepSeek and Gemini 2.5 Pro just beat them on ARC-AGI 1 and 2

Samsung’s Tiny Recursive Model (TRM) is about 10,000x smaller than typical LLMs yet smarter because it thinks recursively instead of just predicting text. It first drafts an answer, then builds a hidden "scratchpad" for reasoning, repeatedly critiques and refines its logic (up to 16 times), and produces improved answers each cycle.

This approach shows that architecture and reasoning loops (not just size), can drive intelligence. It enables powerful, efficient models that run cheaply, validate neuro symbolic ideas, and open highest quality reasoning to far more applications.

Acceleration is everywhere

218

8K

1K

5K

1M

aryankarg retweeted

Google DeepMind @GoogleDeepMind

8 months ago

We’re making robots more capable than ever in the physical world. 🤖 Gemini Robotics 1.5 is a levelled up agentic system that can reason better, plan ahead, use digital tools such as @Google Search, interact with humans and much more. Here’s how it works 🧵

191

3K

524

620

862K

aryankarg retweeted

Peter Henderson

@PeterHndrsn

9 months ago

Quick take: Are open-weight AI models getting a fair shake in evals? A few thoughts on comparing systems-to-models, sparked by Anthropic’s recent postmortem. Anthropic published a careful account of a routing bug that degraded Claude responses. It was refreshingly specific. Some short requests to Claude were misrouted to long-context servers resulting in degradation. Meaning, in short: two different models (or model configs) are specialized for different context length. But this raises an ongoing thought I've had: closed providers can lean on routing, specialization, multiple models, and other scaffolding, while open-weight models are often judged as if they must perform well in every condition, alone. If we allowed comparable routing/specialization around open models, how much of the apparent gap would close? For research—and policy—we should compare system-to-system (or model-to-model), not model-to-system. Ideally, we'd get per-call metadata from closed APIs so researchers know what they actually hit. But in the alternative, maybe we should be building more systems around open-weight models to give them a fair shake in capabilities evals.

PeterHndrsn's tweet photo. Quick take: Are open-weight AI models getting a fair shake in evals? A few thoughts on comparing systems-to-models, sparked by Anthropic’s recent postmortem.

Anthropic published a careful account of a routing bug that degraded Claude responses. It was refreshingly specific.

Some short requests to Claude were misrouted to long-context servers resulting in degradation. Meaning, in short: two different models (or model configs) are specialized for different context length.

But this raises an ongoing thought I've had: closed providers can lean on routing, specialization, multiple models, and other scaffolding, while open-weight models are often judged as if they must perform well in every condition, alone. If we allowed comparable routing/specialization around open models, how much of the apparent gap would close?

For research—and policy—we should compare system-to-system (or model-to-model), not model-to-system. Ideally, we'd get per-call metadata from closed APIs so researchers know what they actually hit. But in the alternative, maybe we should be building more systems around open-weight models to give them a fair shake in capabilities evals.

2

67

10

15

10K

aryankarg retweeted

AI at Meta

@AIatMeta

9 months ago

New from Meta FAIR: Code World Model (CWM), a 32B-parameter research model designed to explore how world models can transform code generation and reasoning about code. We believe in advancing research in world modeling and are sharing CWM under a research license to help empower the community to build upon our work. ➡️ Read the technical report: https://t.co/i9BqtfyJ7L ➡️Download the open weights: https://t.co/S2CxqCOMn0 ➡️Download the code: https://t.co/wOsDR8Q5OQ

90

1K

223

498

313K

aryankarg retweeted

Mistral AI

@MistralAI

10 months ago

Mistral Medium 3.1 just landed on @lmarena_ai leaderboard—punching way above its weight! 🏆 #1 in English (no Style Control) 🏆 2nd overall (no Style Control) 🏆 Top 3 in Coding & Long Queries 🏆 8th overall Small model. Big impact. Try it now on Le Chat and the API!

MistralAI's tweet photo. Mistral Medium 3.1 just landed on @lmarena_ai leaderboard—punching way above its weight!

🏆 #1 in English (no Style Control)
🏆 2nd overall (no Style Control)
🏆 Top 3 in Coding & Long Queries
🏆 8th overall

Small model. Big impact. Try it now on Le Chat and the API! https://t.co/nwJCHRvX5D

77

2K

255

339

548K

aryankarg retweeted

neural nets.

@cneuralnetwork

over 1 year ago

me and my llama 70B model

21

3K

91

81

53K

Aryan Kargwal @aryankarg

over 1 year ago · Montréal

From AI-driven customer service to decision-making, is automation moving too fast? https://t.co/vAFO70XvLg

0

29

aryankarg retweeted

near

@nearcyan

over 1 year ago

i'm assembling a team

186

8K

455

774

963K

aryankarg retweeted

abhishek

@abhi1thakur

over 1 year ago

🤣

abhi1thakur's tweet photo. 🤣 https://t.co/bSRaPNu4d7

22

670

70

61

64K

aryankarg retweeted

Alex Napier Holland 🦍

@NapierHolland

over 1 year ago

Every. Fucking. Day.

105

10K

564

2K

526K

aryankarg retweeted

Junyang Lin

@JustinLin610

over 1 year ago

Seems that Qwen2-VL is not good enough to appear on Llava o1 paper.

23

414

11

50

98K

Aryan Kargwal @aryankarg

over 1 year ago · Montréal

This is the first time ending a coding session with a happy note. Thanks to Qwen Coder!!! Try it out for yourself at @Tunehq_ai today!

aryankarg's tweet photo. This is the first time ending a coding session with a happy note. Thanks to Qwen Coder!!!

Try it out for yourself at @Tunehq_ai today! https://t.co/LS4cbTbeDZ

0

3

1

0

276

Aryan Kargwal @aryankarg

over 1 year ago · Montréal

🚀 Excited to share my latest tutorial on Janus 1.3B! This ultra-lightweight multimodal model packs a punch with just 1.3B parameters, handling text and image generation with ease. Perfect for streamlined VLM tasks without massive compute power! https://t.co/spSWUdop28

0

5

0

116

Aryan Kargwal @aryankarg

over 1 year ago

@davidstout Can be dicey when even “explicit” enterprises cannot bypass them.

0

66

Aryan Kargwal

@aryankarg

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users