encounter1997 @encounter19972 - Twitter Profile

encounter1997 @encounter19972

13 days ago

Full details and theoretical proofs are in the paper. Happy to discuss and answer questions.

0

28

encounter1997 @encounter19972

13 days ago

Sharing our latest work (StreamMA): exploring how to make multi-agent reasoning faster, more accurate, and cheaper. Hop on HuggingFace for an upvote, GitHub for a star 🤗: HuggingFace: https://t.co/BgajgGFziE Project: https://t.co/5qWvp3o1Ek GitHub: https://t.co/sZrXjL97CT

1

3

0

1

64

encounter1997 @encounter19972

13 days ago

Hard numbers: ① +7.3pp avg over 8 benchmarks across math / science / code (Claude Opus 4.6-high); ② 26.9× wall-clock speedup at A=64, S=64 (83% of the theoretical bound); ③ Stream×4 at half the price ($2.75 vs $5.46) beats Serial×16 (90.9% vs 89.4%).

encounter19972's tweet photo. Hard numbers: ① +7.3pp avg over 8 benchmarks across math / science / code (Claude Opus 4.6-high); ② 26.9× wall-clock speedup at A=64, S=64 (83% of the theoretical bound); ③ Stream×4 at half the price ($2.75 vs $5.46) beats Serial×16 (90.9% vs 89.4%). https://t.co/HKnpLuWAU4

1

0

59

encounter19972 retweeted

AK

@_akhaliq

6 months ago

The World is Your Canvas Painting Promptable Events with Reference Images, Trajectories, and Text

5

84

13

28

14K

Who to follow

Ryan Yuan

@RainbowYuhui

Research Director@Canva; ex-MSR. Build a research team focused on fundamental research for world-leading graphic design generation. Email: [email protected]

Computer Vision and Artificial Intelligence for Healthcare

encounter19972 retweeted

AK

@_akhaliq

7 months ago

MagicQuillV2 Precise and Interactive Image Editing with Layered Visual Cues

6

264

46

225

21K

encounter19972 retweeted

AK

@_akhaliq

7 months ago

Meta presents TUNA Taming Unified Visual Representations for Native Unified Multimodal Models

2

166

20

88

33K

encounter19972 retweeted

weijia wu @weijiawu7

7 months ago

🔥 New paper out: WEAVE — a 100K-sample interleaved multimodal dataset + WEAVEBench, a human-annotated benchmark for visual memory, multi-turn editing. 📄 arXiv: https://t.co/wAE9Wvy7xy 🐙 GitHub: https://t.co/6roLGE4CmZ 🤗 HF Dataset: https://t.co/gTrygOAyne

weijiawu7's tweet photo. 🔥 New paper out: WEAVE — a 100K-sample interleaved multimodal dataset + WEAVEBench, a human-annotated benchmark for visual memory, multi-turn editing.
📄 arXiv: https://t.co/wAE9Wvy7xy
🐙 GitHub: https://t.co/6roLGE4CmZ
🤗 HF Dataset: https://t.co/gTrygOAyne https://t.co/NnJXP7S4f1

2

145

28

90

11K

encounter19972 retweeted

AK

@_akhaliq

8 months ago

HoloCine Holistic Generation of Cinematic Multi-Shot Long Video Narratives

2

62

11

45

17K

encounter19972 retweeted

Bruce Yue Yu @Bruce_YuYue

8 months ago

🤩🎬 HoloCine is here! The first open-source multi-shot long video model, generating minute-long cinematic narratives as stunning as Sora 2. Watch the demo ↓

1

35

9

32

5K

encounter19972 retweeted

DogeDesigner

@cb_doge

8 months ago

Elon Musk v/s Sam Altman The A.I. simulation got way too real.

1K

13K

2K

741K

encounter19972 retweeted

Yongliang Shen @itricktreat

9 months ago

Introducing GSM8K-V: Can vision-language models solve grade school math when problems are shown visually instead of text? 🧮👁️ We converted all 1,319 GSM8K problems into comic-style multi-image sequences (5,343 images total). The results are surprising! 🧵

itricktreat's tweet photo. Introducing GSM8K-V: Can vision-language models solve grade school math when problems are shown visually instead of text? 🧮👁️
We converted all 1,319 GSM8K problems into comic-style multi-image sequences (5,343 images total).
The results are surprising! 🧵 https://t.co/fSPr9F9t1C

1

10

2

1

263

encounter19972 retweeted

Yongliang Shen @itricktreat

9 months ago

Introducing EasySteer: High-performance LLM steering framework built on vLLM. Achieves 5.5-11.4× speedup over existing tools while maintaining 71-84% throughput. Paper: https://t.co/KtQFs54FuH Code: https://t.co/Oyrk8QG1r8 HF Paper: https://t.co/aS4BwXY8Xp

itricktreat's tweet photo. Introducing EasySteer: High-performance LLM steering framework built on vLLM. Achieves 5.5-11.4× speedup over existing tools while maintaining 71-84% throughput.
Paper: https://t.co/KtQFs54FuH
Code: https://t.co/Oyrk8QG1r8
HF Paper: https://t.co/aS4BwXY8Xp https://t.co/WvIOcSQCHD

4

12

5

3

640

encounter19972 retweeted

机器之心 JIQIZHIXIN

@jiqizhixin

10 months ago

What if LLMs already had the right answer—but erased it before finishing? 🤯 New work on diffusion LLMs (dLLMs) uncovers temporal oscillation: correct answers often appear mid-denoising, only to vanish in later steps. Two fixes that harness temporal consistency: - Temporal Self-Consistency Voting → training-free decoding that aggregates stable predictions across steps - Temporal Consistency Reinforcement → post-training with Temporal Semantic Entropy (TSE) as a reward for semantic stability

jiqizhixin's tweet photo. What if LLMs already had the right answer—but erased it before finishing? 🤯

New work on diffusion LLMs (dLLMs) uncovers temporal oscillation: correct answers often appear mid-denoising, only to vanish in later steps.

Two fixes that harness temporal consistency:

- Temporal Self-Consistency Voting → training-free decoding that aggregates stable predictions across steps
- Temporal Consistency Reinforcement → post-training with Temporal Semantic Entropy (TSE) as a reward for semantic stability

4

170

34

78

13K

encounter19972 retweeted

Tanishq Mathew Abraham, Ph.D.

@iScienceLuvr

10 months ago

Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models "Our work here reveals a critical phenomenon, temporal oscillation, where correct answers often emerge in the middle process, but are overwritten in later denoising steps. To address this issue, we introduce two complementary methods that exploit temporal consistency: 1) Temporal Self-Consistency Voting, a training-free, test-time decoding strategy that aggregates predictions across denoising steps to select the most consistent output; and 2) a post-training method termed Temporal Consistency Reinforcement, which uses Temporal Semantic Entropy (TSE), a measure of semantic stability across intermediate predictions, as a reward signal to encourage stable generations."

iScienceLuvr's tweet photo. Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models

"Our work here reveals a critical phenomenon, temporal oscillation, where correct answers often emerge in the middle process, but are overwritten in later denoising steps. To address this issue, we introduce two complementary methods that exploit temporal consistency: 1) Temporal Self-Consistency Voting, a training-free, test-time decoding strategy that aggregates predictions across denoising steps to select the most consistent output; and 2) a post-training method termed Temporal Consistency Reinforcement, which uses Temporal Semantic Entropy (TSE), a measure of semantic stability across intermediate predictions, as a reward signal to encourage stable generations."

6

219

43

133

16K

encounter1997

@encounter19972

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users