lmms-lab @lmmslab - Twitter Profile

lmms-lab @lmmslab

3 months ago

find the cutest paws across platforms https://t.co/4QlCmgtFgl

0

9

3

0

4K

lmms-lab @lmmslab

4 months ago

safely engram❤️

Shuai Liu @choiszt

4 months ago

Agents are mind-blowing. But they don't remember things consistently. Or when they do — it's not safe. We built Engram. AES-256 encrypted. Keys stay on your device. Zero-knowledge sync. No cloud. No middleman. Use it. Your agent memory is yours. @lmmslab https://t.co/fQG72UC1b4 https://t.co/2btlaq1JPH

1

10

3

5

5K

0

2

1

0

2K

lmmslab retweeted

Shuai Liu @choiszt

4 months ago

Agents are mind-blowing. But they don't remember things consistently. Or when they do — it's not safe. We built Engram. AES-256 encrypted. Keys stay on your device. Zero-knowledge sync. No cloud. No middleman. Use it. Your agent memory is yours. @lmmslab https://t.co/fQG72UC1b4 https://t.co/2btlaq1JPH

1

10

3

5

5K

lmmslab retweeted

Adhiraj Ghosh✈️CVPR 2026 @adhiraj_ghosh98

4 months ago

Every day I'm alive I become more of a fan of @lmmslab. lmms-eval is coming in clutch during #CVPR2026 rebuttals!

0

9

3

0

1K

lmmslab retweeted

Ziwei Liu

@liuziwei7

5 months ago

🥳Year-End Reflection on the Growth of LMMs-Lab🥳 2025 has been a fruitful year for 🧠LMMs-Lab🧠 @lmmslab (https://t.co/QpJXVu7EWo), a non-profit open-source research organization dedicated to feeling and building the future of multimodal intelligence with: 🌟 > 12,000 Total GitHub Stars 🍴 > 2,000 Forks 🧑‍💻 > 30 Core Repositories

3

245

28

31

10K

lmms-lab @lmmslab

9 months ago

@DJiafei @liuziwei7 yes sure!

0

2

0

39

lmmslab retweeted

Ziwei Liu

@liuziwei7

9 months ago

🔥LLaVA-OneVision upgraded to V1.5🔥 We @lmmslab present 🌋LLaVA-OV-1.5🌋, a fully open framework for democratized multimodal training * Superior Performance surpassing Qwen2.5-VL * High-Quality Data at Scale * Ultra-Efficient Training Framework - Repo: https://t.co/1Mm6Mq5jqR

6

155

36

82

18K

lmms-lab @lmmslab

11 months ago

@_jasonwei bro really have the insight and the peace mind to find more insights

0

68

lmms-lab @lmmslab

about 1 year ago

@gazorp5 @liuziwei7 @Gradio @_akhaliq Yes we plan to release our tech report and propose a plug-play method without re-train the model to directly generate SRT output.

0

2

0

28

lmms-lab @lmmslab

about 1 year ago

Feel the vibe~

4

6

1

588

lmmslab retweeted

Brian Li

@Brian_Bo_Li

over 1 year ago

VideoMMMU is a meticulously crafted benchmark designed to evaluate multimodal models’ video understanding abilities for college-level videos. Videos have tremendous knowledge and learning from them remains challenging for current models, but it is expected to become a crucial capability on the path toward achieving AGI.

2

28

5

3

3K

lmmslab retweeted

Ziwei Liu

@liuziwei7

over 1 year ago

🤖Interpreting Large Multimodal Models (LMM)🤖 We present an automatic framework to identify, interpret and steer neurons within LMM for safe AGI - Paper: https://t.co/YIyk06DuK0 - Code: https://t.co/Z6FexYSxOF - Model @huggingface : https://t.co/yh5iH8WqBY . Thanks @_akhaliq !

liuziwei7's tweet photo. 🤖Interpreting Large Multimodal Models (LMM)🤖

We present an automatic framework to identify, interpret and steer neurons within LMM for safe AGI

- Paper: https://t.co/YIyk06DuK0
- Code: https://t.co/Z6FexYSxOF
- Model @huggingface : https://t.co/yh5iH8WqBY . Thanks @_akhaliq ! https://t.co/opzbdu6UFA

2

271

40

125

32K

lmms-lab @lmmslab

over 1 year ago

New work from LMMs-Lab! This time we present our latest research on the interpretation and safety of multimodal models

Brian Li

@Brian_Bo_Li

over 1 year ago

TL;DR We present Large Multi-modal Models Can Interpret Features in Large Multi-modal Models We successfully use a 72B large model to interpret the open-semantic features of an 8B small model, uncovering numerous important thought patterns inside multimodal models. Paper: https://t.co/iZmMq0vrcr Code: https://t.co/cxbWhlYbRt Examples: https://t.co/DcwtaI03TG

Brian_Bo_Li's tweet photo. TL;DR
We present Large Multi-modal Models Can Interpret Features in Large Multi-modal Models

We successfully use a 72B large model to interpret the open-semantic features of an 8B small model, uncovering numerous important thought patterns inside multimodal models.

Paper: https://t.co/iZmMq0vrcr
Code: https://t.co/cxbWhlYbRt
Examples: https://t.co/DcwtaI03TG

1

52

14

11

8K

0

20

5

3

34K

lmmslab retweeted

Wenhao Chai @ CVPR 2026

@wenhaocha1

over 1 year ago

🔥 We just submitted some baselines and benchmarks to lmms-eval @lmmslab (LLaVA team) — evaluation is now just one line of code away! We call for the reporting of visual token numbers when evaluating LMM performance! - lmms-eval repo: https://t.co/RZrAJMqyTA - VDC, first benchmark for detailed video captions: https://t.co/FxyH8i4rAc - AuroraCap (VDC baseline): https://t.co/QRItwb31ti - MovieChat, first long-video understanding benchmark: https://t.co/tlElVqVeFg - MovieChat baseline: https://t.co/Fus36O252j

wenhaocha1's tweet photo. 🔥 We just submitted some baselines and benchmarks to lmms-eval @lmmslab (LLaVA team) — evaluation is now just one line of code away! We call for the reporting of visual token numbers when evaluating LMM performance!

- lmms-eval repo: https://t.co/RZrAJMqyTA

- VDC, first benchmark for detailed video captions: https://t.co/FxyH8i4rAc
- AuroraCap (VDC baseline): https://t.co/QRItwb31ti

- MovieChat, first long-video understanding benchmark: https://t.co/tlElVqVeFg
- MovieChat baseline: https://t.co/Fus36O252j

1

20

4

3K

lmms-lab @lmmslab

over 1 year ago

👍 OpenAI's CriticGPT, not opensourced, language only 🤯 LMMs-Labs LLaVA-Critic, opensourced, for multimodal tasks mindblown_meme.gif

Tianyi Xiong ✈️ CVPR @tianyixiong23

over 1 year ago

🚀🔥Introducing LLaVA-Critic--the first open-source large multimodal model designed to assess model performance across diverse multimodal tasks! LLaVA-Critic excels in two primary scenarios: - 👨‍⚖️LMM-as-a-Judge: It provides pointwise scores and pairwise rankings that closely align with human and GPT-4o preferences across multiple evaluation tasks, offering a viable open-source alternative to commercial GPT models. - 🩷Preference Learning: It offers reliable reward signals that significantly enhance the visual chat capabilities of LMMs through preference alignment. To develop the "critic" capacity, we curate LLaVA-Critic-113k, a high-quality critic instruction-following dataset tailored to provide quantitative judgment and the corresponding reasoning process across a range of complex evaluation settings. Explore more: - 📰Paper: https://t.co/N71QwFSr7D - 🪐Project Page: https://t.co/qWLoZ0fX9J - 📦Dataset: https://t.co/t0RXNEC3Q6 - 🤗Models: https://t.co/V0zZHdLi0J Try our released models and dataset👆

tianyixiong23's tweet photo. 🚀🔥Introducing LLaVA-Critic--the first open-source large multimodal model designed to assess model performance across diverse multimodal tasks!

LLaVA-Critic excels in two primary scenarios:
- 👨‍⚖️LMM-as-a-Judge: It provides pointwise scores and pairwise rankings that closely align with human and GPT-4o preferences across multiple evaluation tasks, offering a viable open-source alternative to commercial GPT models.
- 🩷Preference Learning: It offers reliable reward signals that significantly enhance the visual chat capabilities of LMMs through preference alignment.

To develop the "critic" capacity, we curate LLaVA-Critic-113k, a high-quality critic instruction-following dataset tailored to provide quantitative judgment and the corresponding reasoning process across a range of complex evaluation settings.

Explore more:
- 📰Paper: https://t.co/N71QwFSr7D
- 🪐Project Page: https://t.co/qWLoZ0fX9J
- 📦Dataset: https://t.co/t0RXNEC3Q6
- 🤗Models: https://t.co/V0zZHdLi0J
Try our released models and dataset👆

2

141

27

85

32K

0

14

3

0

1K

lmms-lab @lmmslab

over 1 year ago

👍 SOTA Level Video Models 🤯 With Open-sourced Data and Training Recipes mindblown_meme.gif

Chunyuan Li @ChunyuanLi

over 1 year ago

(1/4)🚀 Ready to supercharge your Video LLMs? 🎥Meet LLaVA-Video-178K, a high-quality dataset for video instruction tuning with 1.3M samples in captions, Q&A! 💡Perfect for further boosting Video LLMs, on top of strong capability transfer from image/language shown in LLaVA-OV🤖

ChunyuanLi's tweet photo. (1/4)🚀 Ready to supercharge your Video LLMs? 🎥Meet LLaVA-Video-178K, a high-quality dataset for video instruction tuning with 1.3M samples in captions, Q&A!
💡Perfect for further boosting Video LLMs, on top of strong capability transfer from image/language shown in LLaVA-OV🤖 https://t.co/7JG1vM1oyt

1

79

13

33

10K

0

15

4

0

2K

lmmslab retweeted

Shuang Li

@ShuangL13799063

over 1 year ago

We are organizing a new workshop on "Knowledge in Generative Models" at #ECCV2024 to explore how generative models learn representations of the visual world and how we can use them for downstream applications. https://t.co/6iW8lcdrZt 📅30 September 2024, 2 PM

1

35

3

5

7K

lmmslab retweeted

Brian Li

@Brian_Bo_Li

almost 2 years ago

Great experience working with Lianmin to integrate LLaAV-OneVision into SGLang, and huge thanks to @PY_Z001 and @KaichenZhang358 to help finish this. Try it on: https://t.co/1xhUxNw0Oc Directly try our demo (with SGLang SRT API service): https://t.co/yAav2XwrAe

0

22

8

0

2K