Lingpeng Kong @ikekong - Twitter Profile

ikekong retweeted

about 2 months ago

Our work was cited by π0.7! This further highlights an important direction for VLA training: leveraging abundant human video data to address the bottleneck of task diversity. Excited to share our new paper: CLAP: Contrastive Latent Action Pretraining for Learning Vision-Language-Action Models from Human Videos. CLAP aligns human video dynamics with executable robot actions, enabling VLAs to transfer manipulation skills from human demonstrations to real robot control. We will open-source the code in May. arXiv link: https://t.co/enofYlocon video on youtube: https://t.co/G3rgRZbAH1

zhuci19's tweet photo. Our work was cited by π0.7! This further highlights an important direction for VLA training: leveraging abundant human video data to address the bottleneck of task diversity.

Excited to share our new paper: CLAP: Contrastive Latent Action Pretraining for Learning Vision-Language-Action Models from Human Videos.

CLAP aligns human video dynamics with executable robot actions, enabling VLAs to transfer manipulation skills from human demonstrations to real robot control.

We will open-source the code in May.

arXiv link: https://t.co/enofYlocon

video on youtube: https://t.co/G3rgRZbAH1

2

336

39

262

16K

Lingpeng Kong @ikekong

about 2 months ago

Great to see the AI applications from local teams in HKU!

ALAGENT-HKU @AlagentHku

about 2 months ago

🚀 Turn ANY investment idea or research article into a backtested quant strategy. Let's have a try! 🌐 Pro Beta Investment Assistant Web App (🎁Sign up now to claim 500 FREE credits!): https://t.co/sO4aksmhvd 🔗 Lightweight Open-Source Agent Skill: https://t.co/tnFGIZDVeQ

0

1

0

1K

0

10

0

953

Lingpeng Kong @ikekong

3 months ago

It is happening!

He He

@hhexiy

3 months ago

https://t.co/H3TAsaThYQ

18

877

128

1K

118K

0

8

0

855

ikekong retweeted

Lei Li

@_TobiasLee

3 months ago

Agents are doing real work, but existing benchmarks still test them in isolation. Today we’re releasing Claw-Eval 🦞: an open-source, transparent evaluation framework for AI agents. We feature 104 tasks spanning daily assistants, Office QA, deep finance research, and terminal usage. We test completion, robustness, and safety across real and mock services with configurable error injection. Fully traceable and human-verified. First leaderboard results: Claude Opus 4.6 @AnthropicAI tops pass rate (68.3%), but Gemini 3.1 @GeminiApp Pro edges it on avg score (0.764 vs 0.759). Agents have a long way to go.🤨 Check it out: https://t.co/NSt33x1toh @steipete @openclaw

_TobiasLee's tweet photo. Agents are doing real work, but existing benchmarks still test them in isolation.

Today we’re releasing Claw-Eval 🦞: an open-source, transparent evaluation framework for AI agents.

We feature 104 tasks spanning daily assistants, Office QA, deep finance research, and terminal usage.
We test completion, robustness, and safety across real and mock services with configurable error injection.
Fully traceable and human-verified.

First leaderboard results: Claude Opus 4.6 @AnthropicAI tops pass rate (68.3%), but Gemini 3.1 @GeminiApp Pro edges it on avg score (0.764 vs 0.759).
Agents have a long way to go.🤨

Check it out: https://t.co/NSt33x1toh

@steipete @openclaw

10

154

27

72

42K

Who to follow

Huan Sun

@hhsun1

Prof. @OhioState, endowed CoE Innovation Scholar, advancing the capability and safety/security of LLM-based agents, understanding transformers' limitations

Yizhong Wang

@yizhongwyz

Researching AI for an infinite-sum future. RS@ByteDance Seed, incoming AP@UT Austin. Formerly @uwcse @allen_ai @meta @microsoft

Yu Su

@ysu_nlp

co-founder @NeoCognition | prof. @osunlp | sloan fellow | building towards abundance of specialized intelligence

ikekong retweeted

Lin Zheng @linzhengisme

4 months ago

Introducing proxy compression for end-to-end language modeling: train on compressed (e.g., tokenized) data for efficiency, but run inference entirely on raw bytes without a tokenizer. No architectural changes required. At scale, proxy-trained byte models match or surpass tokenizer baselines at 7B and 14B. 📄 Paper: https://t.co/4NGVagTocP 💻 Code: https://t.co/tPcbReJ915 [1/9] 🧵👇

linzhengisme's tweet photo. Introducing proxy compression for end-to-end language modeling: train on compressed (e.g., tokenized) data for efficiency, but run inference entirely on raw bytes without a tokenizer. No architectural changes required. At scale, proxy-trained byte models match or surpass tokenizer baselines at 7B and 14B.

📄 Paper: https://t.co/4NGVagTocP
💻 Code: https://t.co/tPcbReJ915

[1/9]
🧵👇

2

99

16

61

21K

Lingpeng Kong @ikekong

6 months ago

🚀 Introducing Dream-VL & Dream-VLA! We’re proving that dLLMs have an amazing advantage in building VLA models. The result is stunning performance: 🏆 97.2% on LIBERO ⚡ 27x speedup vs AR models 🔥 Beats OpenVLA & $\pi_0$ ✅ Fully Open Source Blog: https://t.co/klCKlUeR1l

Jiacheng Ye @JiachengYe15

6 months ago

🚀Building on the success of Dream 7B, we introduce Dream-VL and Dream-VLA, open VL and VLA models that fully unlock discrete diffusion’s advantages in long-horizon planning, bidirectional reasoning, and parallel action generation for multimodal tasks.

1

58

16

22

17K

1

128

25

62

13K

Lingpeng Kong @ikekong

7 months ago

@YizheZhangNLP @NotebookLM notebooklm is so good lol

0

1

0

40

Lingpeng Kong @ikekong

7 months ago

Smarter TTS by Xueliang! See everyone in this year NeurIPS!

Xueliang Zhao @xlzhao_hku

7 months ago

🚀 Thrilled to share our #NeurIPS2025 paper DynaAct: Large Language Model Reasoning with Dynamic Action Spaces A new test-time scaling view — optimizing the action space itself, while providing a general MCTS acceleration framework for reasoning. 💻 https://t.co/FFWIDBcbCV

xlzhao_hku's tweet photo. 🚀 Thrilled to share our #NeurIPS2025 paper

DynaAct: Large Language Model Reasoning with Dynamic Action Spaces

A new test-time scaling view — optimizing the action space itself, while providing a general MCTS acceleration framework for reasoning.

💻 https://t.co/FFWIDBcbCV https://t.co/BtKxG93Brf

2

52

14

37

6K

0

12

1

1K

ikekong retweeted

Sansa Gong @sansa19739319

7 months ago

This is super cool. I strongly believe that the flexibility of dLLMs during generation will enable new features for agent use.

0

8

1

489

ikekong retweeted

HKUNLP @hkunlp2020

8 months ago

We will have a guest talk from Cai Zhou. He is a second-year PhD in MIT EECS. "Continuous modeling in diffusion language models: HDLM and CCDD ". All are welcome to join via the following link. https://t.co/ZlLDO5pKRH

hkunlp2020's tweet photo. We will have a guest talk from Cai Zhou. He is a second-year PhD in MIT EECS. "Continuous modeling in diffusion language models: HDLM and CCDD
". All are welcome to join via the following link.
https://t.co/ZlLDO5pKRH https://t.co/2u1ntwiKz1

0

16

6

4

4K

ikekong retweeted

Lei Li

@_TobiasLee

8 months ago

DeepSeek-OCR: Exploring the boundaries of visual-text compression. Ambitious! They might use 10X (near-lossless) compressed vision tokens to replace the KV cache of dialog histories. https://t.co/gxjLBrkCWW

_TobiasLee's tweet photo. DeepSeek-OCR: Exploring the boundaries of visual-text compression.

Ambitious! They might use 10X (near-lossless) compressed vision tokens to replace the KV cache of dialog histories.

https://t.co/gxjLBrkCWW https://t.co/7KQiV5FZXT

1

19

2

6

3K

ikekong retweeted

Zhihui Xie

@_zhihuixie

9 months ago

The full Dream-Coder pipeline is now open-sourced—covering data prep, training, and evaluation. Check it out! https://t.co/TuPQhv5DAo

1

26

8

3K

Lingpeng Kong @ikekong

9 months ago

Saw the paper like a month ago. Now with the demo it only gets cooler :p

Yanzhe Zhang

@StevenyzZhang

10 months ago

Introducing Generative Interfaces - a new paradigm beyond chatbots. We generate interfaces on the fly to better facilitate LLM interaction, so no more passive reading of long text blocks. Adaptive and Interactive: creates the form that best adapts to your goals and needs!

4

150

39

102

60K

0

13

5

7

9K

ikekong retweeted

JingqiZhou @zhou_jingqi_

9 months ago

🌟 Thrilled to share our paper, "TreeSynth," has been accepted for a Spotlight presentation at #NeurIPS2025! 🤔 Struggling with repetition & space collapse in data synthesis? Our work introduces 🌳TreeSynth, a novel framework using tree-guided partitioning to generate large-scale, diverse datasets from scratch. 🏆 Models trained on TreeSynth data consistently outperform those trained on human-crafted datasets and other synthetic methods. See you all at NeurIPS! 🔗 Paper: https://t.co/nc784c7X5Y 💻 Code: https://t.co/p0C75ZgWvz

zhou_jingqi_'s tweet photo. 🌟 Thrilled to share our paper, "TreeSynth," has been accepted for a Spotlight presentation at #NeurIPS2025!

🤔 Struggling with repetition & space collapse in data synthesis? Our work introduces 🌳TreeSynth, a novel framework using tree-guided partitioning to generate large-scale, diverse datasets from scratch.

🏆 Models trained on TreeSynth data consistently outperform those trained on human-crafted datasets and other synthetic methods.

See you all at NeurIPS!
🔗 Paper: https://t.co/nc784c7X5Y
💻 Code: https://t.co/p0C75ZgWvz

0

9

3

1

920

ikekong retweeted

HKUNLP @hkunlp2020

10 months ago

Jinjie Ni @NiJinjie from NUS will be giving a talk titled "Diffusion Language Models are Super Data Learners" at Friday Aug 22 11am HKT. link to talk: https://t.co/WxTUSok1in

hkunlp2020's tweet photo. Jinjie Ni @NiJinjie from NUS will be giving a talk titled "Diffusion Language Models are Super Data Learners" at Friday Aug 22 11am HKT. link to talk: https://t.co/WxTUSok1in https://t.co/kZqibRgGFZ

0

45

13

11

5K

ikekong retweeted

Lei Li

@_TobiasLee

10 months ago

🚀 MiMo‑VL 2508 is live! Same size, much smarter. We’ve upgraded performance, thinking control, and overall user experience. 📈 Benchmark gains across image + video: MMMU 70.6, VideoMME 70.8. Consistent improvements across the board. 🤖 Thinking Control: toggle reasoning with `no_think`. On (default): full reasoning visible; Off: direct answers, no reasoning ⚡⚡; ❤️ Real‑world user experience: our VLM Arena rating improved from 1093.9 → 1131.2 (+37.3). More capable, flexible, and reliable in everyday tasks. Feedback welcome. 🤗 RL Version: https://t.co/ID71evQJLL 🤗 SFT Version: https://t.co/cm14ZtZzt9 #XiaomiMiMo

_TobiasLee's tweet photo. 🚀 MiMo‑VL 2508 is live! Same size, much smarter.

We’ve upgraded performance, thinking control, and overall user experience.

📈 Benchmark gains across image + video: MMMU 70.6, VideoMME 70.8.
Consistent improvements across the board.

🤖 Thinking Control: toggle reasoning with `no_think`.
On (default): full reasoning visible;
Off: direct answers, no reasoning ⚡⚡;

❤️ Real‑world user experience: our VLM Arena rating improved from 1093.9 → 1131.2 (+37.3).
More capable, flexible, and reliable in everyday tasks.
Feedback welcome.

🤗 RL Version: https://t.co/ID71evQJLL
🤗 SFT Version: https://t.co/cm14ZtZzt9
#XiaomiMiMo

2

89

15

24

9K

ikekong retweeted

HKUNLP @hkunlp2020

11 months ago

Xinyu Yang from CMU will be giving a talk titled "Multiverse: Your Language Models Secretly Decide How to Parallelize and Merge Generation" at Friday July 25 11am HKT (Thursday July 24 8pm PDT). Link to talk: https://t.co/Cdn9TGqWQ2

hkunlp2020's tweet photo. Xinyu Yang from CMU will be giving a talk titled "Multiverse: Your Language Models Secretly
Decide How to Parallelize and Merge Generation" at Friday July 25 11am HKT (Thursday July 24 8pm PDT). Link to talk: https://t.co/Cdn9TGqWQ2 https://t.co/X3diB6Jgve

1

23

8

1

3K

ikekong retweeted

Jiacheng Ye @JiachengYe15

11 months ago

📢 Update: Announcing Dream's next-phase development. - Dream-Coder 7B: A fully open diffusion LLM for code delivering strong performance, trained exclusively on public data. - DreamOn: targeting the variable-length generation problem in dLLM!

1

79

21

15

10K

ikekong retweeted

Zhihui Xie

@_zhihuixie

11 months ago

🚀 Thrilled to announce Dream-Coder 7B — the most powerful open diffusion code  LLM to date.

3

127

37

48

16K

ikekong retweeted

Zirui Wu @WilliamZR7

11 months ago

We present DreamOn: a simple yet effective method for variable-length generation in diffusion language models. Our approach boosts code infilling performance significantly and even catches up with oracle results.

2

119

28

58

16K

Lingpeng Kong

@ikekong

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users