Akash Agarwal @akash_agarwal - Twitter Profile

Akash Agarwal @akash_agarwal

about 2 months ago

@dashboardlim META CLAUDE

0

14

Akash Agarwal @akash_agarwal

6 months ago

@sundeep Took a cool $20 billion but a cease and desist didn’t (copyright violation on the name Groq) shows capitalism at work. 😏

0

89

Akash Agarwal @akash_agarwal

6 months ago

Beer makers spend part of the year brewing small quantities of high-octane beer with as much as 30% alcohol. It’s as scary to make as it is to drink. https://t.co/SdTx3cLubI via @WSJ

0

25

Akash Agarwal @akash_agarwal

6 months ago

Enterprise and Sovereign Inferencing from @SCXAICloud https://t.co/cuJuTi3bml

0

17

Who to follow

David @ NextView

@davidbeisel

Co-Founder & Partner @NextViewVC Seed Investor: @attentivemobile @TripleLiftHQ @thredUP @ParsecTeam AI is disrupting VC: https://t.co/AqyyDdfVlY

Brian Jacobs

@brianjacobsvc

Founder of @MoaiCapital & @EmergenceCapital, Lecturer @StanfordGSB, inventor, wanderer, stone carver, student, dreamer, concerned citizen, still hopeful

Marc Michel

@marcmichelvc

Managing Partner Runway Venture Partners, early stage venture firm focused on post seed funded companies in software tech. Tennis nut!

Akash Agarwal @akash_agarwal

7 months ago

@roelofbotha We need it on NASDAQ as well …😏

0

4

0

1K

akash_agarwal retweeted

Andrej Karpathy

@karpathy

7 months ago

As a fun Saturday vibe code project and following up on this tweet earlier, I hacked up an **llm-council** web app. It looks exactly like ChatGPT except each user query is 1) dispatched to multiple models on your council using OpenRouter, e.g. currently: "openai/gpt-5.1", "google/gemini-3-pro-preview", "anthropic/claude-sonnet-4.5", "x-ai/grok-4", Then 2) all models get to see each other's (anonymized) responses and they review and rank them, and then 3) a "Chairman LLM" gets all of that as context and produces the final response. It's interesting to see the results from multiple models side by side on the same query, and even more amusingly, to read through their evaluation and ranking of each other's responses. Quite often, the models are surprisingly willing to select another LLM's response as superior to their own, making this an interesting model evaluation strategy more generally. For example, reading book chapters together with my LLM Council today, the models consistently praise GPT 5.1 as the best and most insightful model, and consistently select Claude as the worst model, with the other models floating in between. But I'm not 100% convinced this aligns with my own qualitative assessment. For example, qualitatively I find GPT 5.1 a little too wordy and sprawled and Gemini 3 a bit more condensed and processed. Claude is too terse in this domain. That said, there's probably a whole design space of the data flow of your LLM council. The construction of LLM ensembles seems under-explored. I pushed the vibe coded app to https://t.co/EZyOqwXd2k if others would like to play. ty nano banana pro for fun header image for the repo

karpathy's tweet photo. As a fun Saturday vibe code project and following up on this tweet earlier, I hacked up an **llm-council** web app. It looks exactly like ChatGPT except each user query is 1) dispatched to multiple models on your council using OpenRouter, e.g. currently:

"openai/gpt-5.1",
"google/gemini-3-pro-preview",
"anthropic/claude-sonnet-4.5",
"x-ai/grok-4",

Then 2) all models get to see each other's (anonymized) responses and they review and rank them, and then 3) a "Chairman LLM" gets all of that as context and produces the final response.

It's interesting to see the results from multiple models side by side on the same query, and even more amusingly, to read through their evaluation and ranking of each other's responses.

Quite often, the models are surprisingly willing to select another LLM's response as superior to their own, making this an interesting model evaluation strategy more generally. For example, reading book chapters together with my LLM Council today, the models consistently praise GPT 5.1 as the best and most insightful model, and consistently select Claude as the worst model, with the other models floating in between. But I'm not 100% convinced this aligns with my own qualitative assessment. For example, qualitatively I find GPT 5.1 a little too wordy and sprawled and Gemini 3 a bit more condensed and processed. Claude is too terse in this domain.

That said, there's probably a whole design space of the data flow of your LLM council. The construction of LLM ensembles seems under-explored.

I pushed the vibe coded app to
https://t.co/EZyOqwXd2k
if others would like to play. ty nano banana pro for fun header image for the repo

904

17K

1K

13K

5M

Akash Agarwal @akash_agarwal

8 months ago

Interested in learning about #SovereignAI, #EnterpriseAI, or #Inferencing, then check out this amazing podcast by @rajivparikh in conjunction with Position² @Position2 & @effinfunny https://t.co/1UxtrKFDkw

0

27

Akash Agarwal @akash_agarwal

8 months ago

@JeffDean Insane - looking fast inference for frontier models then check out @SCXAICloud #sambanova

0

1

0

188

Akash Agarwal @akash_agarwal

8 months ago

@sundeep Congrats. #SovereignAi seems to be all the rage - will it deliver? @SCXAICloud

0

18

Akash Agarwal @akash_agarwal

8 months ago

@DylanMitic @MLB @JonathanRoss321 Indeed, well done. Although, these chips have serious competition now and are of the older generation. @Google @SambaNovaAI @Qualcomm @AMD @cerebras are fast inferencing with much lower power intake.

0

22

Akash Agarwal @akash_agarwal

8 months ago

@ThomasOrTK Checkout @SCXAICloud for Fast Inference on ASICS based RDUs @SambaNovaAI

0

1

0

13

Akash Agarwal @akash_agarwal

8 months ago

@DylanMitic Agree @SCXAICloud

0

1

0

22

Akash Agarwal @akash_agarwal

8 months ago

Palantir thinks college might be a waste. So it tried hiring 22 high-school grads for a fellowship. https://t.co/KzJrRb1yCj via @WSJ

0

18

Akash Agarwal @akash_agarwal

8 months ago

@poezhao0605 very interesting. Cc:

0

33

Akash Agarwal @akash_agarwal

8 months ago

@mukund @bgurley @altcap Not a surprise- @bgurley left because of that. @altcap needs to change the name of the pod @BG2Pod to bg1pod 😏

0

6

Akash Agarwal @akash_agarwal

8 months ago

@BrendanFoody @felicis @benchmark @generalcatalyst Well done mate... if you are looking to scale your Inference then check out https://t.co/D4MJ4kxr2f -- orthogonal approach to scaling frontier models.

0

13

Akash Agarwal @akash_agarwal

8 months ago

@andrewdfeldman Check out @SCXAICloud for Inference As A Service on ASiCs based technology. Cc: @Georgeye458

0

6

Akash Agarwal @akash_agarwal

8 months ago

@MollySOShea @ChaseLochmiller @LeeJacobs Check out @SCXAICloud for your interference needs, @CrusoeAI for training & low cost, scalable, fast inferencing & btw, we offer #SovereignAI solutions for countries/customers.

0

3

Akash Agarwal @akash_agarwal

8 months ago

@sundeep United?

0

1

0

22

Akash Agarwal @akash_agarwal

8 months ago

@Electron_Cowboy Well done. Have you seen what we have done with 1/100 of the capital you have raised? Launched an Inference Cloud on ASIC based technology that can run in an existing 2018 data centres; supporting all the major open models -- check out https://t.co/2N0hqGbsMY

0

8

Akash Agarwal

@akash_agarwal

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users