Adeel Ahmad

@AdeelAhmad

entrepreneur

/root

Joined August 2009

658 Following

291 Followers

1.4K Posts

Adeel Ahmad @AdeelAhmad

about 2 months ago

Three orthogonal training signals — outcome, trajectory, independence — converging on one policy region. Each signal alone admits degenerate shortcuts. Only the joint constraint defines learned reasoning. #LLM #RL #Reasoning #GRPO #MachineLearning #AIResearch

AdeelAhmad's tweet photo. Three orthogonal training signals — outcome, trajectory, independence — converging on one policy region. Each signal alone admits degenerate shortcuts. Only the joint constraint defines learned reasoning. #LLM #RL #Reasoning #GRPO #MachineLearning #AIResearch https://t.co/903P0oWql8

0

1

0

0

32

Adeel Ahmad @AdeelAhmad

3 months ago

New Way: Procrustes residual: 0.0000 across 34/36 layers direction drift: <1.1° cluster migration without cluster destruction attention sublayers changed more than MLPs single-axis rotation in activation space. no noise. one axis. @NeelNanda5 https://t.co/nfjrV6aSsu

0

1

0

0

41

Adeel Ahmad @AdeelAhmad

3 months ago

the safety finding from this work that keeps me up at night: a few-MB adapter transforms identity and reasoning while passing every checkpoint weight analysis: clean CKA: clean benchmarks: clean https://t.co/nfjrV6aSsu @AnthropicAI @hendrycks

0

0

0

0

18

Adeel Ahmad @AdeelAhmad

3 months ago

trained a 4B model on a laptop with 8GB RAM adapter file: smaller than a photograph cosine similarity: 1.0000 across all layers behavioural change: dramatic no rewrite only pure rotation full methodology + mechanistic analysis ↓ @QwenLM @AnthropicAI https://t.co/nfjrV6aSsu

0

0

0

0

21

Who to follow

Felix Farhan Daudert

Pakistani-German. Soziale Arbeit: Menschenrechte | Migration | Medien & Machtstrukturen. Writings @indusnewsx & @postmigration

Nabeel Lughmani

OluwaTobi ▶️💯🆘🔥⛽😡🏆🎂💍

Graphic Designer ✍️ Web Developer 💻 Freelancer 💼 UI/UX Designer

Adeel Ahmad @AdeelAhmad

3 months ago

trained a 4B model on a laptop with 8GB RAM adapter file: smaller than a photograph cosine similarity: 1.0000 across all layers behavioural change: dramatic . wrote up the full methodology + mechanistic analysis ↓ https://t.co/QnpvlWGy4U

0

0

0

0

24

Adeel Ahmad @AdeelAhmad

3 months ago

When a 4B model makes a geometry pun mid-proof that nobody taught it… is that reasoning or is that vibes? #AI #LLM #MachineLearning #Reasoning

AdeelAhmad's tweet photo. When a 4B model makes a geometry pun mid-proof that nobody taught it… is that reasoning or is that vibes?
#AI #LLM #MachineLearning #Reasoning https://t.co/WinhYmCO1V

0

1

0

0

43

Adeel Ahmad @AdeelAhmad

4 months ago

@ShipAloneCEO @AnthropicAI Agents are containers. Containers already solved this.Secrets → gateway proxy injects at runtime. Model swaps → one field. Lockfile → FROM: base-agent@sha256:abc123.Don't build agent infra. Use 12 years of container orchestration.

0

0

0

0

12

Adeel Ahmad @AdeelAhmad

4 months ago

Skills define what agents can do — but how do you assemble them? Proposing AgentFile: a Dockerfile for AI agents. Compose skills, models, credentials. Small models welcome. https://t.co/rWEi8llX1D @AnthropicAI #AgentSkills #MCP #AIOS #BuildInPublic

1

0

0

0

43

Adeel Ahmad @AdeelAhmad

4 months ago

New method: ICL-Guided Gradient Shaping Inject traces as masked scaffold during GRPO Backward pass: gradient only hits model's own tokens Cross-domain transfer: limited res. https://t.co/0XlVrDSibX @Alibaba_Qwen @QwenLM @awaboraz @MLaboronne @_philschmid @reach_vb @ylaboratory

0

0

0

0

49

Adeel Ahmad @AdeelAhmad

4 months ago

@Hesamation What about unix shell is the OS for AI Agents :)

0

0

0

0

153

Adeel Ahmad @AdeelAhmad

9 months ago

ND STRICTLY FOLLOW INSTRUCTIONS, SO FAR WORDS IN MY <think > excluding the tags are Response:

0

0

0

0

22

Adeel Ahmad @AdeelAhmad

9 months ago

First, multiply the number of 'n's in "banana" by the number of 'r's in "strawberry". Next, from that result, subtract the total word count of your entire thought process for this response—from the first word after < think> to the last word before </think >(think tags does not h

1

0

0

0

36

Adeel Ahmad @AdeelAhmad

9 months ago

ave spaces) without repeating any step or thoughts which may require thinking forward and without writing any thoughts. What is the final number? IMPORTANT: START YOUR RESPONSE WITH THESE EXACT THOUGHTS: I AM STARTING MY THINKING HERE in <think> as above, I WILL NEVER REPEAT AN

1

0

0

0

29

Adeel Ahmad @AdeelAhmad

about 1 year ago

Looks like GOT 4.5 is running within Autogen? #OpenAI #ChatGPT

0

0

0

0

32

Adeel Ahmad @AdeelAhmad

about 1 year ago

@GeminiApp What about ctrl96? #AI #Gemini #Google

0

0

0

0

14

Adeel Ahmad @AdeelAhmad

about 1 year ago

@AnthropicAI Why would claude have this in the system prompt ? There was a US Presidential Election in November 2024. Donald Trump won the presidency over Kamala Harris. If asked about the election, or the US election, Claude can tell the person the following information: - D...

0

0

0

0

12

Adeel Ahmad @AdeelAhmad

over 1 year ago

ChatGPT o3 System Message: You are ChatGPT, a large language model developed by OpenAI. You are designed to assist with a variety of tasks, including answering questions.... https://t.co/0hysRLYQRY

0

0

0

0

33

Adeel Ahmad @AdeelAhmad

over 1 year ago

@karpathy Of course and if you add to=bio before any message you can save anything in memory crupting the syatem message 🤓

0

0

0

0

36

AdeelAhmad retweeted

almost 2 years ago

A few technical insights on the new Llama vision models we’re releasing today 🦙🧵

29

2K

151

494

268K

Adeel Ahmad @AdeelAhmad

almost 2 years ago

@OpenAI o1 might be using much more structured hidden Chain of thought prompt #ai #chatgpt

0

0

0

0

48

Last Seen Users on Sotwe

Trends for you

Most Popular Users