Yiru Yang @yranny53 - Twitter Profile

Pinned Tweet

Yiru Yang @yranny53

13 days ago

https://t.co/j9lkIfaywh

0

36

yranny53 retweeted

Transluce

@TransluceAI

8 months ago

Comparing with sparse autoencoders (SAEs), we show that circuits traced directly on a model’s MLP neurons can be just as sparse and faithful. We use two advances to achieve this result.

1

34

1

3

3K

yranny53 retweeted

Gabriele Berton

@gabriberton

about 11 hours ago

Finally!

0

18

2

0

4K

yranny53 retweeted

Cankay Koryak

@CankayKoryak

about 13 hours ago

Shapes and forms are a construct of the human mind. In ultimate reality there is nothing as shape or matter. Color blindness is a good paradigm for understanding this.

7

26

2

8

937

Who to follow

互联网从业者｜喜欢金融，为赚钱而努力｜去过日本、印尼、新加坡、马来、泰国等国家｜

yranny53 retweeted

4 days ago

Heading to @icmlconf 2026 in Seoul🇰🇷 next week✈️ I'm co-hosting the workshop on ML for audio on Friday, July 10: https://t.co/qmlW5MPDrP And who knows, maybe a circular gathering of diffusion practitioners will coalesce at some point during the week👀

7

108

4

17

9K

yranny53 retweeted

Alexi Gladstone

@AlexiGlad

about 12 hours ago

@mengyer is there a recording :)

1

0

1

0

266

yranny53 retweeted

fly51fly @fly51fly

about 12 hours ago

[LG] Geometric Signatures of Reasoning: A Spectral Perspective on Task Hardness A Masoomi, M Bazzaz, A Javanmard, V Mirrokni [Northeastern University & University of Southern California & Google Research] (2026) https://t.co/CEgx00xjjH

fly51fly's tweet photo. [LG] Geometric Signatures of Reasoning: A Spectral Perspective on Task Hardness
A Masoomi, M Bazzaz, A Javanmard, V Mirrokni [Northeastern University & University of Southern California & Google Research] (2026)
https://t.co/CEgx00xjjH https://t.co/CMqCbq1JZt

0

10

5

12

1K

yranny53 retweeted

New Scientist

@newscientist

about 14 hours ago

NASA’s Swift space telescope is reaching the end of its two-decade run in orbit – unless a satellite launched on 3 July can give it a lifesaving boost https://t.co/6iiRkuJMrt

0

16

7

3

7K

yranny53 retweeted

Arshia Afzal @ ICML2026🇰🇷

@rshia_afz

1 day ago

I will be at #ICML2026 in Korea next week presenting violin and as always in case you like ssms and hybrids dm me to grab a coffee and chat!! I will be hanging around @MistralAI booth as well you can find me there too ;)

0

13

4

8

2K

yranny53 retweeted

Alexandr Wang

@alexandr_wang

1 day ago

First, Mark was clearly talking about the industry’s progress on agentic capabilities on the whole. But, while we’re on the topic: Our next Muse Spark update is coming soon. Big improvements in coding and agentic capabilities to be more competitive with other leading models. Excited to get these into your hands—will be rolling out to Meta AI and our new API!

369

3K

241

633

1M

yranny53 retweeted

Cesium @CesiumJS

1 day ago

WorldLens has now added AI-powered 3D depth to Google Street View environments. Instead of a traditional flat photosphere, the update infers spatial structure from 2D panoramas. Read how WorldLens leverages Cesium for Unity: https://t.co/k9XiwneKlQ #CesiumForUnity #MetaQuest

2

37

6

35

3K

yranny53 retweeted

Francois Chaubard

@FrancoisChauba1

2 days ago

New paper coming soon.. teaser.. no transformer, no backprop, no problem! Zero Order CAN pretrain! very exciting.. stay tuned!

FrancoisChauba1's tweet photo. New paper coming soon.. teaser..

no transformer, no backprop, no problem!

Zero Order CAN pretrain!

very exciting.. stay tuned! https://t.co/Rgu11vnPO8

22

754

53

527

70K

yranny53 retweeted

Joan Puigcerver @joapuipe

1 day ago

@albobia Allí estarem!

1

0

73

yranny53 retweeted

Lucas Beyer (bl16)

@giffmana

1 day ago

Of course, love me some confirmation bias! DCLM was for text, DCVLM is the same for vision: analyzing VLM data mix across scales. Filtering is no bueno, but the "type" mixing matters, with unfortunately (but imo expectedly) the best small-scale mix != the best mid-scale mix.

2

120

15

66

18K

yranny53 retweeted

Lucas Beyer (bl16)

@giffmana

1 day ago

Uhm... I'm not sure how to tell him, but i think this guy is not actually Dario.

25

739

9

114

133K

yranny53 retweeted

Google Home @GoogleHome

1 day ago

Automate your happy place🪴 “Help me Create” in the Google Home App helps you schedule tasks that automate¹ your (and your plants’) state of bliss https://t.co/NJwEzbdBVl

GoogleHome's tweet photo. Automate your happy place🪴

“Help me Create” in the Google Home App helps you schedule tasks that automate¹ your (and your plants’) state of bliss https://t.co/NJwEzbdBVl https://t.co/bSMKUw6uEa

2

6

2

3

2K

yranny53 retweeted

Mingchen Zhuge

@MingchenZhuge

1 day ago

This is cool :D “A benchmark designed to study how agents learn from environments over at least 12~72-hour runs” from @tikgiau 🩵

1

25

3

9

4K

Yiru Yang @yranny53

1 day ago

🥳

Jyrki Alakuijala 🇺🇦

@jyzg

1 day ago

Another 150k iters of the same gives some more clarity for the 300m 2x faster model. This is now finished training, and next up, going for the 4x faster model -- one step up in the mipmap pyramid. Yiru @yranny53 tells I probably shouldn't be using ordinary (multi-level) diffusion at all, but music generation could work more better with flow matching. Once I have all the pyramid levels trained and the system up and running -- and hopefully the glassiness removed from the sound, I'll try flow matching next.

1

0

134

0

85

yranny53 retweeted

Lucas Beyer (bl16)

@giffmana

1 day ago

@IlyasHairline I had the same initial reaction, but after closer look i think it did have enough twists ("but with..."s) that it's fine and actually interesting.

1

16

1

2

2K

yranny53 retweeted

Mathonymics

@Mathonymics

2 days ago

What are some equations that totally changed the world we live in?

5

4

1

0

800

yranny53 retweeted

OpenEvidence

@EvidenceOpen

3 days ago

One especially interesting finding buried in the details of https://t.co/nZ8BU9t9VU: When foundation models replace experts in rating accuracy, Gemini and Claude directionally agree with expert human raters, whereas GPT5.5 says its own answers win almost every time. Claude also seems to have a strong bias against Gemini. Any LLM-driven evaluation seems to have real pitfalls that are hard to entirely protect against.

EvidenceOpen's tweet photo. One especially interesting finding buried in the details of https://t.co/nZ8BU9t9VU: When foundation models replace experts in rating accuracy, Gemini and Claude directionally agree with expert human raters, whereas GPT5.5 says its own answers win almost every time. Claude also seems to have a strong bias against Gemini. Any LLM-driven evaluation seems to have real pitfalls that are hard to entirely protect against.

2

55

5

27

7K

Yiru Yang

@yranny53

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users