Yichen Jiang @yichenjiang9 - Twitter Profile

Pinned Tweet

over 2 years ago

We show Transformers generalize on complex data by using shared attention patterns for similar structures BUT how to avoid overfitting on low-complexity data? 🚨SQ-Transformer explicitly quantizes embeddings structurally & learns systematic attention https://t.co/eaeG5gBo0d 🧵

5

201

59

129

42K

Yichen Jiang @YichenJiang9

3 months ago

@CalFerguson @MARCAinENGLISH Bro living in a parallel universe where away goal advantage still exists 🤣

1

0

38

Yichen Jiang @YichenJiang9

about 1 year ago

@EliasEskin @UTAustin @UTCompSci This is awesome! Congrats Elias!

1

0

111

YichenJiang9 retweeted

Yi Lin Sung @yilin_sung

over 1 year ago

🚀 New Paper: RSQ: Learning from Important Tokens Leads to Better Quantized LLMs We show that not all tokens should be treated equally during quantization. By prioritizing important tokens through a three-step process—Rotate, Scale, and Quantize—we achieve better-quantized models on LLaMA3, Mistral, and Qwen2.5. 🧵👇

2

130

39

69

21K

Yichen Jiang @YichenJiang9

over 1 year ago

@Simchabenmoshe @JundeMorsenWu “China is not gonna share their AI surplus” 😂😂 I guess DeepSeek is too difficult for you to spell.

0

2

0

301

Yichen Jiang @YichenJiang9

almost 2 years ago

Check out these other awesome works from my labmates & I will present my poster virtually on Aug 22 --> "Inducing Systematicity in Transformers by Attending to Structurally Quantized Embeddings (https://t.co/eaeG5gBVPL)" Detailed thread: https://t.co/2ghFO7v26N #ACL2024nlp

Mohit Bansal

@mohitban47

almost 2 years ago

🚨 Check out an exciting batch of papers this week at #ACL2024!

Say hi to some of our awesome students & collaborators who are attending in person, and feel free to ask about our postdoc openings too 🙂

Topics:
-- multi-agent reasoning collaboration
-- structured systematicity/quantization in transformers
-- easy-to-hard generalization
-- very long-term conversational memory in LLMs
-- soft self-consistency
-- self-refining multimodal summarization
-- multimodal reasoning over image sequences
-- fine-grained hallucination evaluation and correction
-- summary-source alignments

#ACL2024nlp
👇👇

4

118

34

24

19K

0

18

8

1

3K

YichenJiang9 retweeted

Swarnadeep Saha @swarnaNLP

almost 2 years ago

🚨 New: my last PhD paper 🚨

Introducing System-1.x, a controllable planning framework with LLMs. It draws inspiration from Dual-Process Theory, which argues for the co-existence of fast/intuitive System-1 and slow/deliberate System-2 planning.

System 1.x generates hybrid plans & balances between the two planning modes (efficient + inaccurate System-1 & inefficient + more accurate System-2) based on the difficulty of the decomposed (sub-)problem at hand.

Some exciting results+features of System-1.x:

-- performance: beats System-1, System-2 & a symbolic planner (A*) both ID and OOD (up to 39%), given an exploration budget.
-- training-time control/balance: user can train a System-1.25/1.5/1.75 to balance accuracy + efficiency.
-- test-time control/balance: user can bias the planner to solve more/less sub-goals using System-2.
-- flexibility to integrate symbolic solvers: allows building neuro-symbolic System-1.x with a symbolic System-2 (A*).
-- generalizability: can learn from different search algos (DFS/BFS/A*).

🧵👇

3

293

68

201

58K

YichenJiang9 retweeted

Mohit Bansal

@mohitban47

almost 2 years ago

Having a great time at #LxMLS in Lisbon + meeting awesome people & exploring the beautiful city 🙂 (highly recommended ML school*) ➡️➡️➡️ Next stop: #ICML2024 in Vienna for MoE tutorial/panel + papers on MAGDi, ReGAL, etc. 👇 (ping me if you want to meet up / chat about faculty+postdoc+phd positions, etc.)! (*thanks to @andre_t_martins @mariotelfig @RamonAstudill12 @bgmartins + whole team for the thoughtful organization!)

6

103

26

5

13K

YichenJiang9 retweeted

Swarnadeep Saha @swarnaNLP

almost 2 years ago

We are going to present MAGDi at #ICML2024. If you are attending, say hi to @EliasEskin and @mohitban47 to know more about this work and PhD/postdoc positions @uncnlp! 🧵👇

5

7

6

0

2K

YichenJiang9 retweeted

Shoubin Yu

@shoubin621

about 2 years ago

Check out 2 useful updates on CREMA! 🚨

(1a) A new modality-sequential modular training for generalizable and efficient reasoning on video+language+any other modalities by eliminating modality interference.

(1b) A novel modality-adaptive early exit strategy allows the model to bypass the training of specific sensory inputs if this modality information is converged.

(2) More unique/rare multimodal reasoning tasks (video-touch/thermal QA) to further demonstrate the generalizability of CREMA.

See more details in Arxiv v2 👉 https://t.co/i1bmJ6wKVD

(original thread below👇)

1

56

19

15

12K

Yichen Jiang @YichenJiang9

about 2 years ago

Welcome to UNC NLP! I’m sure you will have a lot of fun doing interesting projects and living in a warmer place full of great college sports matches 😀 Best of luck!

Zaid Khan

@codezakh

about 2 years ago

🥳Some of the first papers I read at the start of my ML journey were @mohitban47's papers on multimodal language understanding, and after a great couple of years at Northeastern working on vision-language, I'm excited to be joining his lab at @uncnlp as a PhD student to work on program synthesis + multimodal agents! 😁 (1/3) 🧵

18

113

13

18

32K

1

8

2

0

3K

Yichen Jiang @YichenJiang9

about 2 years ago

Check out this work by my labmates on how to make LLMs not overly confident on bad answers. Spoiler 🚨: they made the model less confident on wrong data and more confident on correct ones.

Elias Stengel-Eskin

@EliasEskin

about 2 years ago

🚨 Excited to share our new work on **confidence calibration** in LLMs! LLMs are often badly calibrated & overconfident, explicitly (eg. "I'm 100% sure") and implicitly, eg. giving details/authoritative tone. We address both w/ a pragmatic speaker-listener multi-agent method 🧵

3

144

42

73

34K

1

10

3

1

2K

YichenJiang9 retweeted

Jaehong Yoon

@jaeh0ng_yoon

about 2 years ago

🚨New paper👉RACCooN: remove/add/change video content effortlessly/interactively via our MLLM+Video Diffusion (V2P2V) framework with auto-generated descriptions!

▶️ 1. Video-to-Paragraph (V2P): RACCooN first generates well-structured/detailed descriptions of videos with MLLM leveraging a multi-granular pooling strategy

▶️ 2. Paragraph-to-Video (P2V): Users can then enjoy diverse video editing skills by refining auto-narratives for the video diffusion model

🧵

1

85

34

21

19K

Yichen Jiang @YichenJiang9

about 2 years ago

@lateinteraction Congrats Omar!

0

1

0

118

YichenJiang9 retweeted

Shoubin Yu

@shoubin621

about 2 years ago

🚨 Introducing VideoTree! Captioning + LLMs can perform well on long-video QA, but dense frame captioning leads to inefficiency (redundancy) and sub-optimality (irrelevance).

VideoTree addresses these issues & improves LLM-based long-video QA by:

▶️ Structured Video Representation: iteratively organizing the video’s frames into a hierarchical tree representation via visual frame clustering & cluster scoring.

▶️ Adaptive Keyframe Selection and Coarse-to-Fine Sampling: dynamically selecting query-related frame clusters for captioning. Its tree structure encodes varying granularity levels, allowing VideoTree to allocate more frames (zoom-in) in relevant clusters and fewer in irrelevant ones.

Those lead to major gains on popular benchmarks, including SOTA on NExT-QA & IntentQA, and 7.0% gains on EgoSchema, while cutting ~40% inference time.

https://t.co/dUZy2DgXb3
🧵

3

196

49

98

48K

Yichen Jiang @YichenJiang9

about 2 years ago

🎉Excited to announce that SQ-Transformer is accepted to #ACL2024nlp! We induce systematicity & achieve stronger generalization in Transformers (w/o pretraining on complex data) by structurally quantizing word embedding & regularizing attention outputs. @XiangZhou14 @mohitban47

Yichen Jiang @YichenJiang9

over 2 years ago

We show Transformers generalize on complex data by using shared attention patterns for similar structures BUT how to avoid overfitting on low-complexity data? 🚨SQ-Transformer explicitly quantizes embeddings structurally & learns systematic attention https://t.co/eaeG5gBo0d 🧵

5

201

59

129

42K

1

53

23

12

6K

YichenJiang9 retweeted

Swarnadeep Saha @swarnaNLP

about 2 years ago

Agentic workflows with LLMs are now getting popular for solving complex tasks! In one of the early works on this topic -- ReConcile, at #ACL2024nlp 🎉 -- we study collaborative model-model interactions w/ confidence-estimation & corrective convincingness btwn diverse LLMs. 🧵👇

0

22

9

6

3K

Yichen Jiang @YichenJiang9

about 2 years ago

@ramakanth1729 @unccs @mohitban47 Thanks Ram! See u in Seattle

0

1

0

55

Yichen Jiang @YichenJiang9

about 2 years ago

Last weekend, I graduated from @unccs, 10 years after I wrote my first line of code in COMP 116. I'm super grateful to my advisor @mohitban47, labmates, intern mentors, and many others. Y'all can see how excited I was as I threw my cap out of the frame to the 2nd floor.

Mohit Bansal

@mohitban47

about 2 years ago

🎉🎓 Congratulations to these awesome new+old MURGeLab graduates on their hooding ceremony --> PhDs @peterbhase @yichenjiang9 @adyasha10 @swarnanlp (+ last year's @byryuer and @xiangzhou14, who joined us for this year's commencement) & MS @abhayzala7 🥳 Was a fun celebration with families+friends & many photo sessions in perfect weather + blue carolina skies 😀

6

85

19

2

24K

8

45

4

0

12K

Yichen Jiang @YichenJiang9

about 2 years ago

@swarnaNLP @Apple Many thanks Dr. Saha🫡

0

89

Yichen Jiang @YichenJiang9

about 2 years ago

Also, after 10 unforgettable years at Chapel Hill, 2 of those generously sponsored by @Apple Scholars in AIML PhD Fellowship, "I'm going to take my talents to Seattle and join Apple AIML". I will continue to do research in efficient and safe AI that generalizes compositionally.

Yichen Jiang @YichenJiang9

about 2 years ago

Last weekend, I graduated from @unccs, 10 years after I wrote my first line of code in COMP 116. I'm super grateful to my advisor @mohitban47, labmates, intern mentors, and many others. Y'all can see how excited I was as I threw my cap out of the frame to the 2nd floor.

8

45

4

0

12K

7

49

5

4

7K

Yichen Jiang @YichenJiang9

about 2 years ago

@peizNLP @Apple Sounds great! Looking forward to meeting you there!

0

1

0

71

Yichen Jiang @YichenJiang9

about 2 years ago

@vaidehi_patil_ @Apple Thank you and have fun at Apple this summer!

0

1

0

93
