Zhensu Sun @v587su - Twitter Profile

v587su retweeted

Elon Musk

@elonmusk

3 months ago

Yeah

3K

79K

5K

4K

104M

Zhensu Sun @v587su

4 months ago

The conclusion of this paper is very interesting. Images may be a friendlier representation for AI to understand source code, since they can be easily compressed to reduce cost. 😃

Kevin Lin

@KevinQHLin

4 months ago

Today's most interesting paper: CodeOCR. This work provides a good explanation of how code indentation and highlighting are designed to serve the human eye. https://t.co/oG3TXIP5A6

1

12

2

5

1K

1

4

0

227

Zhensu Sun @v587su

9 months ago

How could two ICSE reviewers think my paper is novel while the remaining one thinks the paper is incremental? It doesn't make sense 😮‍💨

0

1

0

110

Zhensu Sun @v587su

9 months ago

@IndianInExile @alxfazio This paper has been accepted at ICSE 2026. You can find it on arXiv: https://t.co/2yBjh3gjEp

0

40

Zhensu Sun @v587su

9 months ago

@presstab_dev @alxfazio The paper is available here: https://t.co/m3FBA4fc0p

0

1

0

51

v587su retweeted

alex fazio

@alxfazio

9 months ago

java is the most token-efficient language, let that sink in

77

3K

85

967

235K

v587su retweeted

Rohan Paul

@rohanpaul_ai

10 months ago

Stripping code formatting cuts LLM token cost without hurting accuracy.

Average input tokens drop by 24.5%, with output quality basically unchanged.

The core issue is simple: indentation, spaces, and newlines help humans read, but they inflate the tokens that models pay to process.

They remove only cosmetic formatting while keeping program meaning identical, checked by matching the abstract syntax tree of the code.

They test Fill in the Middle code completion, where a model fills a missing block, across Java, C++, C#, and Python.

Performance stays stable on unformatted input: big models barely move, smaller ones wobble a bit, and Python sees less savings because its layout is part of the language.

One surprise: models still print nicely formatted code even when given smashed input, so output token savings are small.

To fix that, 2 cheap tactics work: explicit prompts that say to output without formatting, and light fine-tuning on unformatted samples.

With clear instructions or tiny training, output length shrinks by 25% to 36% while pass rate on the first try holds.

They also ship a tool that strips formatting before inference then restores it after, so humans read clean code while the model pays less.

----

Paper – arxiv.org/abs/2508.13666

Paper Title: "The Hidden Cost of Readability: How Code Formatting Silently Consumes Your LLM Budget"

8

206

36

148

14K
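To make the idea in the tweet above concrete, here is a minimal Python sketch of "strip cosmetic formatting, then confirm the abstract syntax tree is unchanged". It is not the paper's tool: the function names (`strip_cosmetic_formatting`, `same_ast`, `compress`) and the sample snippet are hypothetical, and the sketch only handles Python, where indentation is syntax, so it shrinks each indentation level to one space instead of removing it. If the AST check fails (for example, when a multi-line string would be altered), it falls back to the original source.

```python
import ast


def strip_cosmetic_formatting(source: str) -> str:
    """Drop blank lines and trailing whitespace; shrink indentation to one space per level."""
    lines = [line.rstrip() for line in source.splitlines()]
    lines = [line for line in lines if line]  # remove blank lines

    def width(line: str) -> int:
        # Number of leading spaces on the line.
        return len(line) - len(line.lstrip(" "))

    # Map each distinct indentation width to its rank (0, 1, 2, ...).
    # The mapping keeps the relative order of widths, so block nesting is preserved.
    rank = {w: level for level, w in enumerate(sorted({width(l) for l in lines}))}
    return "\n".join(" " * rank[width(l)] + l.lstrip(" ") for l in lines) + "\n"


def same_ast(a: str, b: str) -> bool:
    """Return True when both sources parse to the same AST (positions ignored)."""
    try:
        return ast.dump(ast.parse(a)) == ast.dump(ast.parse(b))
    except SyntaxError:
        return False


def compress(source: str) -> str:
    """Return the compact form only if it is provably the same program."""
    compact = strip_cosmetic_formatting(source)
    return compact if same_ast(source, compact) else source


# Hypothetical sample input, used only for illustration.
example = (
    "def add(a, b):\n"
    "\n"
    "    total = a + b\n"
    "\n"
    "    if total > 10:\n"
    "        return 10\n"
    "\n"
    "    return total\n"
)

print(compress(example))
```

The sketch measures nothing about tokens. To estimate savings for a particular model, the original and compact strings would need to be run through that model's own tokenizer.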

Zhensu Sun @v587su

10 months ago

Want to save your LLM budget without sacrificing performance? Here's a useful trick: removing non-essential code formatting, like indentation, newlines, and extra whitespace, cuts input tokens by an average of 24.5%! Check out our full study: https://t.co/951LFOEres

0

1

0

193

Zhensu Sun @v587su

about 1 year ago

A very interesting match

The Humanoid Hub

@TheHumanoidHub

about 1 year ago

The humanoid robot half-marathon in Beijing just started!

171

4K

665

761

721K

0

205

Zhensu Sun @v587su

about 1 year ago

@davidlo2015 @msrconf Well deserved!

1

0

89

v587su retweeted

FORGE @ConfForge

over 1 year ago

🚨 Big Announcement! 🚨 We’re thrilled to welcome two distinguished keynote speakers to #FORGE2025! ✨ Prem Devanbu @devanbu (@UCDavis Professor) 🔗 https://t.co/VCqXxnWR2a ✨ Graham Neubig @gneubig (@CarnegieMellon Associate Professor) 🔗 https://t.co/CPuqQ1XVqi

1

6

3

0

2K

Zhensu Sun @v587su

over 1 year ago

I'll ride a dog to the lab in the near future.

Unitree

@UnitreeRobotics

over 1 year ago

Unitree B2-W Talent Awakening! 🥳 One year after mass production kicked off, Unitree’s B2-W Industrial Wheel has been upgraded with more exciting capabilities. Please always use robots safely and friendly. #Unitree #Quadruped #Robotdog #Parkour #EmbodiedAI #IndustrialRobot #InspectionRobot #IntelligentRobot #FoundationModels #LeggedRobot #WheeledLegs

840

16K

3K

5K

14M

0

1

0

270

v587su retweeted

BNO News

@BNONews

over 1 year ago

OpenAI whistleblower Suchir Balaji, who accused the company of breaking copyright law, found dead in apparent suicide

1K

53K

8K

7K

27M

v587su retweeted

FORGE @ConfForge

over 1 year ago

🎉 Exciting News! 🎉 We are thrilled to announce that ACM SIGSOFT has officially upgraded FORGE from an ICSE Special Event to an ICSE Co-Located Conference! 🚀 We can’t wait to see your submissions for FORGE 2025! See more below👇 #FORGE #FORGE2025 @ICSEconf

1

22

8

1

2K

v587su retweeted

Philipp Schmid

@_philschmid

almost 2 years ago

"AI is not making any progress"? Look closer. 🙄 GPT-4 level models got 240x cheaper in just 2 years! AI progress isn't linear and is just about bigger models.

BERT -> DistilBERT
Llama 2 70B -> Llama 3 8B
GPT-4 -> GPT-4o-mini
Llama 3 405B → Llama 4 70B?? 🤔

Models get bigger, then smaller but equally powerful. It's a cycle of innovation. Today's quality per $ is the most expensive we'll see.

Making it cheaper will lead to more people using, learning, and building with AI, which might unlock more potential and “goodput” for everyone than yet another Foundation Model!

AI's real progress: Getting into more hands.🤗

[Image credits: @davidtsong]

12

249

50

93

30K

Zhensu Sun @v587su

almost 2 years ago

https://t.co/lUWvubi7ie

0

118

Zhensu Sun @v587su

almost 2 years ago

Our recent work on self-healing software systems is now available on arXiv 🥳: [2408.01055] LLM as Runtime Error Handler: A Promising Pathway to Adaptive Self-Healing of Software Systems (https://t.co/fLLWHFubFF)

1

0

197

Zhensu Sun @v587su

almost 2 years ago

Impressive

Robert Scoble

@Scobleizer

almost 2 years ago

Wow. @Jandodev just showed me a prompt humans can’t read but LLMs understand this language better. The San Francisco AI people are designing a new language. In stealth. You are first to see it.

366

3K

284

2K

618K

0

198

v587su retweeted

Robert Scoble

@Scobleizer

almost 2 years ago

Wow. @Jandodev just showed me a prompt humans can’t read but LLMs understand this language better. The San Francisco AI people are designing a new language. In stealth. You are first to see it.

366

3K

284

2K

618K

v587su retweeted

Guillaume Lample @ NeurIPS 2024

@GuillaumeLample

almost 2 years ago

Today we are releasing two small models: Mathstral 7B and Codestral Mamba 7B. On the MATH benchmark, Mathstral 7B obtains 56.6% pass@1, outperforming Minerva 540B by more than 20%. Mathstral scores 68.4% on MATH with majority voting@64, and 74.6% using a reward model. Codestral Mamba is one of the first open source models with a Mamba 2 architecture. It is the best 7B code model available, and is trained with a context length of 256k tokens. Both models are released under the Apache 2 license. https://t.co/R2pc6u45zL https://t.co/9szAM62xi5

13

692

103

146

99K
