Yeounoh Chung

@yeounoh

AI & data systems research @Google. Opinions are my own.

San Jose, CA

Joined March 2010

288 Following

98 Followers

63 Posts

yeounoh retweeted

Vahab Mirrokni @mirrokni

7 days ago

Proud of the team behind Gemini-SQL2 and collaborators from Cloud Research. +2.5% improvement over previous SOTA for a single model! 👇 @yanbang_wang, @qitianwu_, Sami Abu-El-Haija, Mohammadreza Pourreza, @michael_galkin, @hemmatihadi, Hailong Li, @yeounoh, Fatma Ozcan, @phanein

2

18

4

4

5K

Yeounoh Chung @yeounoh

29 days ago

What is the purpose of education? Learning or passing? 🤔 But then, what is it in the AI era? 😂

0

1

0

0

31

Yeounoh Chung @yeounoh

29 days ago

Capping at 20%. I looked up why this is needed… A's have been 64% of the class/course. And more: “Harvard does not publicly publish exact percentages for failing grades, internal reports indicate that failing an undergraduate course at Harvard is extraordinarily rare.”

The New York Times

30 days ago

Breaking News: Harvard University voted to cap the number of A’s they are permitted to award to undergraduate students, in an attempt to reduce grade inflation. https://t.co/cRVFs2Bb9i

67

297

73

60

188K

1

0

0

0

82

Yeounoh Chung @yeounoh

29 days ago

https://t.co/EX3isFJVnA Yes, context window size is about symmetric input/output tradeoffs. Batched inference works, and will work more accurately with more advanced models; we can think about maximizing the block sizes (batches) from the join tables.

0

0

0

0

38

yeounoh retweeted

about 2 months ago

LEBRON JAMES TIES THE GAME AT 101 🤯 13.1 SECONDS TO GO ON PRIME.

515

31K

4K

1K

6M

Yeounoh Chung @yeounoh

3 months ago

Yikes! I had entrusted Gemini-3 Flash to handle much of my debugging quests, iterating and repeating the same bugs over minutes. Switching to Sonnet 4.6 resolved it all at once… 😅 #antigravity

0

0

0

0

78

Yeounoh Chung @yeounoh

6 months ago

An interesting work on semantic join. The idea is to extract logical feature expressions to filter out (cover) positive matches, and with guarantees. This performs better than embedding based pre-filtering, up to 10x cost reduction vs. SOTA.

Aditya Parameswaran

6 months ago

Trying to perform LLM-powered joins at scale without the quadratic cost? @SepantaZeighami's new preprint proposes featurized-decomposition join: extract features from each "side" (ie LLM-synthesized fuzzy blocking rules), and uses those to limit the number of pairs sent to an LLM. Sounds easy enough, but devil is in the details - how does one identify features, how does one get guarantees on recall/precision, etc... This join algorithm does way better than using thresholds on embedding similarity, as is done in other LLM-powered data systems. The main reason: embedding similarity is often a poor proxy for the join! See paper for more: https://t.co/PhPlg1Md73

0

25

8

9

4K

0

0

0

0

79
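The blocking idea in the quoted post can be illustrated with a toy sketch: derive features from each side, then send only pairs that share a feature to the expensive matcher. This is not the preprint's algorithm. The `toy_features` function is a hypothetical stand-in for the LLM-synthesized features, and nothing here provides recall or precision guarantees.

```python
from collections import defaultdict
from typing import Callable, Iterable, List, Set, Tuple


def candidate_pairs(
    left: List[str],
    right: List[str],
    features: Callable[[str], Iterable[str]],
) -> List[Tuple[int, int]]:
    """Return index pairs (i, j) whose records share at least one feature."""
    # Inverted index: feature -> positions of right-side records that have it.
    index = defaultdict(set)
    for j, record in enumerate(right):
        for f in features(record):
            index[f].add(j)

    pairs: Set[Tuple[int, int]] = set()
    for i, record in enumerate(left):
        for f in features(record):
            for j in index.get(f, ()):
                pairs.add((i, j))
    return sorted(pairs)


def toy_features(text: str) -> Set[str]:
    """Hypothetical feature extractor: lowercased words longer than 3 chars."""
    return {w.lower() for w in text.split() if len(w) > 3}


left = ["Apple iPhone 15", "Samsung Galaxy S24"]
right = ["iphone 15 by apple", "galaxy tab", "pixel phone"]

# Only the surviving pairs would be sent to the LLM for a final match decision.
print(candidate_pairs(left, right, toy_features))
```

In this toy input only 2 of the 6 possible pairs survive blocking, which is where the cost reduction comes from.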

Yeounoh Chung @yeounoh

7 months ago

Intriguing!

7 months ago

NeurIPS received 21,575 paper submissions this year. Our Agentic Reviewer, released last week, just surpassed this in number of papers submitted and reviewed. It's clear agentic paper reviewing is here to stay and will be impactful!

76

2K

267

958

343K

0

0

0

0

71

Yeounoh Chung @yeounoh

7 months ago

👀

7 months ago

Sam Altman's worst nightmare.

pmddomingos's tweet photo. Sam Altman's worst nightmare. https://t.co/o6mJ3mk1U9

63

882

75

106

44K

0

0

0

0

52

Yeounoh Chung @yeounoh

8 months ago

Catching up on distributed streaming. “Three Steps is All You Need: Fast Accurate Automatic Scaling Decisions for Distributed Streaming Dataflows” from 2018, solves unstable/slow autoscaling by using fine-grained operator performance metrics to calculate optimal parallelism.

0

0

0

0

34
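One plausible reading of "fine-grained operator performance metrics" is a per-instance processing rate measured over useful (non-idle) time, from which the parallelism needed for a target input rate follows by division. The sketch below assumes that reading; the function and parameter names are hypothetical and not taken from the paper.

```python
import math


def true_rate(records_processed: int, useful_seconds: float) -> float:
    """Records per second over time spent doing useful work (assumed metric)."""
    return records_processed / useful_seconds


def required_parallelism(target_rate: float, per_instance_rate: float) -> int:
    """Smallest instance count whose combined rate covers the target rate."""
    return max(1, math.ceil(target_rate / per_instance_rate))


# One operator instance handled 5000 records in 2.0 s of useful time.
rate = true_rate(5000, 2.0)
print(required_parallelism(10_000, rate))
```

Measuring over useful time rather than wall-clock time keeps waiting and backpressure from distorting the estimate of an operator's capacity.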

Yeounoh Chung @yeounoh

10 months ago

There is a theoretical limit to embedding dimensions with dense embeddings. Something to keep in mind; in turn, we may not need to strive for the largest embeddings if the corpus size is small. That got me thinking… and better appreciating sparse embeddings.

0

0

0

0

77

Yeounoh Chung @yeounoh

10 months ago

Played GM @ShreyasRoyal , thank you!

0

0

0

0

76

Yeounoh Chung @yeounoh

10 months ago

Agreed.

10 months ago

This is the #1 post in r/OpenAI today.

deedydas's tweet photo. This is the #1 post in r/OpenAI today. https://t.co/IepJfoxgwQ

347

19K

710

3K

3M

0

1

0

0

140

Yeounoh Chung @yeounoh

12 months ago

Learned today that quantum computing can crack RSA and maybe someday #bitcoin encryption, too. And then, I also read about #LatticeCryptography, which is hard even for quantum computing to crack 🤔

0

0

0

0

119

Yeounoh Chung @yeounoh

about 1 year ago

Three logo threes in a row, in less than 40 seconds!!! #WNBA #caitlin #clark

about 1 year ago

CAITLIN CLARK 9 POINTS IN 40 SECONDS 😭🔥 https://t.co/6HlZkvBRQC

461

37K

3K

2K

2M

0

2

0

1

728

Yeounoh Chung @yeounoh

about 1 year ago

@jaltucher Your book Choose Yourself inspired me to take better control of my life. Thank you!

0

1

0

0

50

Yeounoh Chung @yeounoh

about 1 year ago

Result: achieves up to 3.4× decreases in end-to-end query latency with Llama-3-8B and Llama-3-70B and also achieves up to 32% cost savings under OpenAI and Anthropic pricing models.

0

0

0

0

65

Yeounoh Chung @yeounoh

about 1 year ago

"OPTIMIZING LLM QUERIES IN RELATIONAL DATA ANALYTICS WORKLOADS" https://t.co/ynA1U7EGED demonstrates how reordering rows and cols of relational workloads for LLM can greatly improve prefix cache hit rate, thus reducing the cost. #review #llm #cache

1

0

0

0

73

Yeounoh Chung @yeounoh

about 1 year ago

Solution: finding the optimal ordering has exponential complexity. The Greedy Group Recursion (GGR) algorithm recurses greedily (maximizing the prefix hit count at each step) and efficiently approximates the optimal ordering.

1

0

0

0

59
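The effect described in the thread above can be seen in a small sketch that counts how many leading fields consecutive rows share, before and after reordering. The reordering here is a much simpler heuristic than GGR: it picks one global column order (fewest distinct values first) and sorts the rows, whereas GGR recurses per group. All names are hypothetical.

```python
from typing import List, Sequence, Tuple


def shared_prefix_len(a: Sequence[str], b: Sequence[str]) -> int:
    """Number of leading fields that two rows have in common."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


def prefix_hits(rows: List[Tuple[str, ...]]) -> int:
    """Total fields reusable from the previous row's prefix, as a cache proxy."""
    return sum(shared_prefix_len(p, c) for p, c in zip(rows, rows[1:]))


def simple_reorder(rows: List[Tuple[str, ...]]) -> List[Tuple[str, ...]]:
    """Put low-cardinality columns first, then sort rows (not GGR itself)."""
    if not rows:
        return []
    ncols = len(rows[0])
    col_order = sorted(range(ncols), key=lambda c: len({r[c] for r in rows}))
    permuted = [tuple(r[c] for c in col_order) for r in rows]
    return sorted(permuted)


rows = [
    ("r1", "movie", "US"),
    ("r2", "movie", "US"),
    ("r3", "book", "US"),
    ("r4", "movie", "US"),
]
print(prefix_hits(rows), prefix_hits(simple_reorder(rows)))
```

With the unique id column first, no row can reuse any of the previous row's prefix; moving the repeated fields to the front lets consecutive prompts share a prefix.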
