Ashmal Vayani @AshmalVayani - Twitter Profile

Ashmal Vayani @AshmalVayani

5 days ago

SubQ-1.1-Small model card is out. Excited for what’s coming next to this 🎉

Alexander Whedon

@alex_whedon

5 days ago

Here is the technical report on SubQ 1.1 Small. https://t.co/bu8AEc4lsk This is the second iteration on our Subquadratic Sparse Attention (SSA) model, and the first to be deployed with design partners in the coming weeks. The results are compelling and verified by @AppenResearch. - Near-perfect long-context retrieval up to 12M tokens on the needle-in-a-haystack test, with up to nearly 1,000x attention compute reduction. - A balance of long-context optimization and general reasoning ability, with strong performance retained across knowledge, coding, and non-coding enterprise agent benchmarks. - At 1M tokens, SubQ 1.1 Small requires 64.5x less compute than dense attention and runs 56x faster than FlashAttention-2. These results highlight a significant scaling advantage thanks to the efficiency gains from the SSA architecture. We included some details and learnings from the development process which may be helpful to the community. Comment with questions, I’ll try to respond!

alex_whedon's tweet photo. Here is the technical report on SubQ 1.1 Small.
https://t.co/bu8AEc4lsk

This is the second iteration on our Subquadratic Sparse Attention (SSA) model, and the first to be deployed with design partners in the coming weeks.

The results are compelling and verified by @AppenResearch.

- Near-perfect long-context retrieval up to 12M tokens on the needle-in-a-haystack test, with up to nearly 1,000x attention compute reduction.

- A balance of long-context optimization and general reasoning ability, with strong performance retained across knowledge, coding, and non-coding enterprise agent benchmarks.

- At 1M tokens, SubQ 1.1 Small requires 64.5x less compute than dense attention and runs 56x faster than FlashAttention-2.

These results highlight a significant scaling advantage thanks to the efficiency gains from the SSA architecture.

We included some details and learnings from the development process which may be helpful to the community.

Comment with questions, I’ll try to respond!

65

527

73

272

137K

0

5

0

111

Ashmal Vayani @AshmalVayani

about 2 months ago

@alex_whedon This really feels like the beginning of something impactful in AI. Excited for what we’ll build next. 🚀

0

5

0

200

Ashmal Vayani @AshmalVayani

about 2 months ago

Introducing SubQ. World’s first fully sub-quadratic Sparse Attention Architecture! Stay tuned for more.

Alexander Whedon

@alex_whedon

about 2 months ago

Introducing SubQ - a major breakthrough in LLM intelligence. It is the first model built on a fully sub-quadratic sparse-attention architecture (SSA), And the first frontier model with a 12 million token context window which is: - 52x faster than FlashAttention at 1MM tokens - Less than 5% the cost of Opus Transformer-based LLMs waste compute by processing every possible relationship between words (standard attention). Only a small fraction actually matter. @subquadratic finds and focuses only on the ones that do. That's nearly 1,000x less compute and a new way for LLMs to scale.

1K

23K

3K

19K

13M

1

2

0

99

AshmalVayani retweeted

Alexander Whedon

@alex_whedon

about 2 months ago

We finally have swag.

17

101

4

2

24K

Who to follow

Mage (magex.bsky.social)

@Mage_MDL

I care about stupid things and also politics, so just stupid things I guess. | Ella/she/her.

Ashmal Vayani @AshmalVayani

about 2 months ago

Onto better things 🚀

Alexander Whedon

@alex_whedon

about 2 months ago

Watching training jobs be like…

3

56

3

0

17K

0

2

0

33

AshmalVayani retweeted

VidLLMs CVPR2026 @vidllms

3 months ago

CFP: VidLLMs @ CVPR 2026 Workshop. Non-archival papers on video-language models, VLA, world models, data, eval, safety, and applications. Due Apr 20, 2026. 4 or 8 pages. Submit: https://t.co/tNRUnUVPdm #CVPR2026

0

3

0

283

AshmalVayani retweeted

VidLLMs CVPR2026 @vidllms

4 months ago

🚨 CFP: VidLLMs @ CVPR 2026 Workshop 🚨 Non-archival papers welcome: Video LLMs, unified multimodal, VLA, world models, data, eval/benchmarks, apps, safety. Deadline Apr 3, 2026. Submit: https://t.co/2gOGB8kXhH #CVPR2026 #VidLLMs

vidllms's tweet photo. 🚨 CFP: VidLLMs @ CVPR 2026 Workshop 🚨 Non-archival papers welcome: Video LLMs, unified multimodal, VLA, world models, data, eval/benchmarks, apps, safety. Deadline Apr 3, 2026.

Submit: https://t.co/2gOGB8kXhH #CVPR2026 #VidLLMs https://t.co/Faic03FoY0

1

5

3

2

1K

Ashmal Vayani @AshmalVayani

9 months ago

Excited to share some interesting insights on Multicultural and Multilingual Model.

Cohere Labs

@Cohere_Labs

9 months ago

Our Geo Regional Asia group is excited to host @AshmalVayani for a session "Seeing the World as It Speaks: Multilingual, Culturally Aware Multimodal AI" on October 15th. Thanks to program leads @AhmadMustafaAn1 and @KanwalMehreen2 for organizing this session! 🔥 Learn more: https://t.co/iyYqhpOoMp

Cohere_Labs's tweet photo. Our Geo Regional Asia group is excited to host @AshmalVayani for a session "Seeing the World as It Speaks: Multilingual, Culturally Aware Multimodal AI" on October 15th.

Thanks to program leads @AhmadMustafaAn1 and @KanwalMehreen2 for organizing this session! 🔥

Learn more: https://t.co/iyYqhpOoMp

0

9

2

1

3K

0

3

0

338

AshmalVayani retweeted

Franky.

@FrankYouChill

12 months ago

[Thinking Beyond Tokens: From Brain-Inspired Intelligence to Cognitive Foundations for Artificial General Intelligence and its Societal Impact] To get us to AGI, new cognitive foundations are needed. This paper argues that token prediction alone (of those top LLMs) is not enough for achieving AGI. We need systems that combine brain-inspired memory, modular reasoning, and other components, along with built-in alignment and governance. What's emerging Today? - Modular architectures (e.g., MoEs, neuro-symbolic hybrids) - Large Reasoning Models (LRMs) and Concept Models (LCMs) - Multi-agent societies and tool-using agents What’s missing in today’s LLMs: - No long-term memory or World model - Weak causal reasoning and planning - Shallow alignment (just fine-tuned for alignment) Key components for future AGI: - Memory: Lifelong learning, RAG, external memory - Reasoning: Chain-of-Thought, agent collaboration - World models: Predictive simulators for long-term goals - Embodiment: Real-world interaction, vision-action loops - Social cognition: Emotional awareness, value alignment Paper link: https://t.co/ou0IXjexza

FrankYouChill's tweet photo. [Thinking Beyond Tokens: From Brain-Inspired Intelligence to Cognitive Foundations for Artificial General Intelligence and its Societal Impact]

To get us to AGI, new cognitive foundations are needed.

This paper argues that token prediction alone (of those top LLMs) is not enough for achieving AGI.

We need systems that combine brain-inspired memory, modular reasoning, and other components, along with built-in alignment and governance.

What's emerging Today?
- Modular architectures (e.g., MoEs, neuro-symbolic hybrids)
- Large Reasoning Models (LRMs) and Concept Models (LCMs)
- Multi-agent societies and tool-using agents

What’s missing in today’s LLMs:
- No long-term memory or World model
- Weak causal reasoning and planning
- Shallow alignment (just fine-tuned for alignment)

Key components for future AGI:
- Memory: Lifelong learning, RAG, external memory
- Reasoning: Chain-of-Thought, agent collaboration
- World models: Predictive simulators for long-term goals
- Embodiment: Real-world interaction, vision-action loops
- Social cognition: Emotional awareness, value alignment

Paper link: https://t.co/ou0IXjexza

2

7

3

4

562

AshmalVayani retweeted

VidLLMs CVPR2026 @vidllms

about 1 year ago

Remarks from our chair Prof @ucfmshah kicking off the VideoLLMs workshop @CVPR #CVPR #CVPR2025 #ComputerVision #AI

0

20

7

0

2K

AshmalVayani retweeted

VidLLMs CVPR2026 @vidllms

about 1 year ago

🚀 Heading to #CVPR2025 in Nashville? Don’t miss the VideoLLMs Workshop on June 11 (Grand A1)! • 🔑 6 keynotes • 🗣️ Panel “VideoLLMs vs Expert Models” • 🏆 $6 K challenge-prize reveal • 📑 Paper talks and Poster session See you there! #AI #ComputerVision #GenerativeAI

vidllms's tweet photo. 🚀 Heading to #CVPR2025 in Nashville? Don’t miss the VideoLLMs Workshop on June 11 (Grand A1)!
• 🔑 6 keynotes
• 🗣️ Panel “VideoLLMs vs Expert Models”
• 🏆 $6 K challenge-prize reveal
• 📑 Paper talks and Poster session

See you there!

#AI #ComputerVision #GenerativeAI https://t.co/gqziogxfyY

1

19

8

2

1K

AshmalVayani retweeted

Mubarak Shah @ucfmshah

about 1 year ago

Travel grants sponsored by Amazon and Apple are available for participants at CVPR-2025 VidLLMs workshop. Fill out this form if you are an attendee looking for support. Limited grants of $1,000 per person are available. #CVPR @CVPR https://t.co/6wzIX2rRvQ

ucfmshah's tweet photo. Travel grants sponsored by Amazon and Apple are available for participants at CVPR-2025 VidLLMs workshop. Fill out this form if you are an attendee looking for support. Limited grants of $1,000 per person are available. #CVPR @CVPR

https://t.co/6wzIX2rRvQ https://t.co/Vrk8kTAV2M

1

30

12

1

4K

Ashmal Vayani @AshmalVayani

about 1 year ago

With up to $3,000 in prizes from #Amazon and #Apple, brush up on your VidLMM skills in our multilingual challenge. Demo code also released.

VidLLMs CVPR2026 @vidllms

about 1 year ago

🚀 Join the Multilingual Video Reasoning Challenge! 🌍 🎥 879 real videos, 8,025 QA, 15 domains, 14 languages 🧠 Prove your VLM’s cultural IQ in open‑ended & MCQ tracks. 💰 Prizes + global glory! Register & rules: https://t.co/xlIOoWR0st #AI #MultilingualAI #CVPR2025

vidllms's tweet photo. 🚀 Join the Multilingual Video Reasoning Challenge! 🌍
🎥 879 real videos, 8,025 QA, 15 domains, 14 languages
🧠 Prove your VLM’s cultural IQ in open‑ended & MCQ tracks.
💰 Prizes + global glory!
Register & rules: https://t.co/xlIOoWR0st

#AI #MultilingualAI #CVPR2025 https://t.co/knQSGGCI9a

1

31

3

6

2K

0

1

0

96

Ashmal Vayani @AshmalVayani

about 1 year ago

@CathyChenZhao @vidllms @CVPR https://t.co/gpYg0oRzNf As mentioned in the workshop website above, the submissions are non-archival; they will not be published in the proceedings.

0

37

Ashmal Vayani @AshmalVayani

about 1 year ago

Challenge yourself to beat the best results in our @CVPR @vidllms Workshop at Nashville!

VidLLMs CVPR2026 @vidllms

about 1 year ago

🚨 Complex Video Reasoning Challenge @ CVPR is LIVE! 🎬🚀 🎥 214 real videos | 2.4K Q&A | avg 22 s ⏱ 🧠 Multi‑action ➕ temporal 🕒, partial/none actions 🚫, social 👥 & implausible 🌀 events 🏆 Cash prizes & glory 🌍 Link: https://t.co/gEG1BLxEGC #CVPR2025 @CVPR

1

14

4

3K

0

2

0

126

Ashmal Vayani @AshmalVayani

about 1 year ago

Participate in the Multilingual Video Reasoning Challenge in CVPR2025 @vidllms Workshop. 🥇

VidLLMs CVPR2026 @vidllms

about 1 year ago

🚀 Join the Multilingual Video Reasoning Challenge! 🌍 🎥 879 real videos, 8,025 QA, 15 domains, 14 languages (Arabic→Urdu). 🧠 Prove your VLM’s cultural IQ in open‑ended & MCQ tracks. 💰 Prizes + global glory! Register & rules: https://t.co/xlIOoWRyi1 #AI #MultilingualAI #VLM

0

18

5

1

3K

0

2

0

75

AshmalVayani retweeted

VidLLMs CVPR2026 @vidllms

about 1 year ago

🚨 Submission deadline for papers to 1st Video LLMs Workshop at #CVPR2025 has been extended by 5 days to April 20 ! 📌 Non-archival | 📷 Both Novel and published work welcome 📷 🎯CFP: https://t.co/VNI4cCUJaZ Submit at: https://t.co/c8ogzwIRr0

0

19

7

3

2K

Ashmal Vayani @AshmalVayani

over 1 year ago

📊 We hope SB-bench serves as a critical tool in building fairer, more responsible AI systems. We welcome the community to explore, critique, and expand on this work! 🙏 Thanks to the entire core team! Vishal Narnaware, Rohit Gupta, Sirnam Swetha, and Mubarak Shah.

Ashmal Vayani @AshmalVayani

over 1 year ago

🚀 Introducing SB-Bench, a comprehensive framework for evaluating stereotype biases in LMMs. With over 7.5K real-world images across 9 domains, it rigorously tests LMMs’ fairness in ambiguous scenarios. ⚖️ 🔗 Website: https://t.co/rlxtxHM04I 📖 Paper: https://t.co/rX2Ep892ST

AshmalVayani's tweet photo. 🚀 Introducing SB-Bench, a comprehensive framework for evaluating stereotype biases in LMMs. With over 7.5K real-world images across 9 domains, it rigorously tests LMMs’ fairness in ambiguous scenarios. ⚖️
🔗 Website: https://t.co/rlxtxHM04I
📖 Paper: https://t.co/rX2Ep892ST https://t.co/euGQU4QSSV

0

86

0

1

0

55

Ashmal Vayani @AshmalVayani

over 1 year ago

🚀 Introducing SB-Bench, a comprehensive framework for evaluating stereotype biases in LMMs. With over 7.5K real-world images across 9 domains, it rigorously tests LMMs’ fairness in ambiguous scenarios. ⚖️ 🔗 Website: https://t.co/rlxtxHM04I 📖 Paper: https://t.co/rX2Ep892ST

0

86

Ashmal Vayani @AshmalVayani

over 1 year ago

We have multiple challenge tracks, and expert sessions in VidLLM. More details will be shared soon.

VidLLMs CVPR2026 @vidllms

over 1 year ago

🚨 Exciting news! 🚨 We’re thrilled to announce the First Workshop on Video Large Language Models at @CVPR ! 🎥🤖 Join us for an exciting lineup of expert speakers and engaging challenges for participants, with cash prizes for the winners! 💸 #CVPR2025 #VidLLMs #VideoLLMs

vidllms's tweet photo. 🚨 Exciting news! 🚨
We’re thrilled to announce the First Workshop on Video Large Language Models at @CVPR ! 🎥🤖 Join us for an exciting lineup of expert speakers and engaging challenges for participants, with cash prizes for the winners! 💸
#CVPR2025 #VidLLMs #VideoLLMs https://t.co/Qs7vvQ3jOD

3

25

5

2

5K

0

64

Ashmal Vayani

@AshmalVayani

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users