Adyasha Maharana @adyasha10 - Twitter Profile

Pinned Tweet

almost 2 years ago

Our code for generating extremely long multi-session conversations with custom personas and evaluating LLMs on the LoCoMo dataset is now live at https://t.co/tpyXE6nEVJ! Come talk to us about this work at Poster session 5 (Aug 13, 16:00 - 17:30 local time) at ACL 2024

Adyasha Maharana @adyasha10

over 2 years ago

Can LLMs keep track of very long conversations? We evaluate 'conversational memory' of LLMs via 3 tasks on our dataset of multi-session multimodal dialogs --> LLMs struggle to remember, reason over history, draw long-range temporal/causal connections https://t.co/JrGP7imeMh 🧵

adyasha10's tweet photo. Can LLMs keep track of very long conversations?

We evaluate 'conversational memory' of LLMs via 3 tasks on our dataset of multi-session multimodal dialogs --> LLMs struggle to remember, reason over history, draw long-range temporal/causal connections

https://t.co/JrGP7imeMh

🧵 https://t.co/n3jUaUJXR1

4

183

59

111

30K

1

32

11

14

4K

adyasha10 retweeted

Samip

@industriaalist

5 months ago

Introducing Q Labs, a research lab focused on solving generalization. Alongside others (SSI, Flapping Airplanes), we see data efficiency as the key problem, but we're taking an unconventional approach to solve it: a new learning algorithm approximating Solomonoff induction.

35

681

54

467

163K

adyasha10 retweeted

Jonathan Frankle

@jefrankle

7 months ago

Special Databricks swag for the first five people to send me a selfie with Ashu in the Databricks booth at NeurIPS!

5

39

3

10

21K

adyasha10 retweeted

Mohit Bansal

@mohitban47

8 months ago

🚨 🤯 Wow! Yi Lin is an amazing researcher, who works on very hard and important problems in LLM and VLM training, RL, PEFT, Quantization, etc. -- ironically, he had several other top offers just a few months ago! Hire him ASAP if you want to pick up a top talent (and several other affected amazing folks)! 👇👇

5

159

31

39

46K

Who to follow

Jialu Li

@JialuLi96

Applied Scientist @Adobe; Previous @unccs @Cornell_CS; Past intern @Amazon @Apple @Google. Working on VLN, image generation, multi-modal LLM.

Zineng Tang

@ZinengTang

PhD in @Berkeley_ai and @BerkeleyNLP. Previously @UNCNLP and @MSFTResearch.

UNC AI

@unc_ai_group

AI Group (NLP/CV/ML etc) at @UNCCS @UNC Faculty: @mohitban47+@gberta227+@snigdhac25+@shsriva+@tianlongchen4+@huaxiuyaoml+@dingmyu+@zhun_deng +@SenguptRoni et al

adyasha10 retweeted

Yi Lin Sung @yilin_sung

8 months ago

Tough week! I also got impacted less than 3 months after joining. Ironically, I just landed some new RL infra features the day before. Life moves on. My past work spans RL, PEFT, Quantization, and Multimodal LLMs. If your team is working on these areas, I’d love to connect.

42

498

64

88

174K

adyasha10 retweeted

Awni Hannun

@awnihannun

8 months ago

I always thought the decline in fundamental AI research funding would happen because AI didn’t generate enough value to be worth the cost. But it seems like it’s happening because it generated too much value. And the race to capture that value is taking priority. Just remembering that a lot of this started in curiosity driven industry research labs.

32

364

36

71

71K

adyasha10 retweeted

Physical Intelligence

@physical_int

10 months ago

We've added pi-05 to the openpi repo: pi05-base, pi05-droid, pi05-libero. Also added PyTorch training code!🔥 Instructions and code here: https://t.co/EOhNYfpq9B This is an updated version of the model we showed cleaning kitchens and bedrooms in April: https://t.co/t09P0nJJFv

23

870

131

415

369K

adyasha10 retweeted

Alex Trott @alexrtrott

11 months ago

Ever wonder what it'd look like if an LLM Judge and a Reward Model had a baby? So did we, which is why we created PGRM -- the Prompt-Guided Reward Model. TLDR: You get the instructability of an LLM judge + the calibration of an RM in a single speedy package (1/n)

alexrtrott's tweet photo. Ever wonder what it'd look like if an LLM Judge and a Reward Model had a baby? So did we, which is why we created PGRM -- the Prompt-Guided Reward Model.

TLDR: You get the instructability of an LLM judge + the calibration of an RM in a single speedy package (1/n) https://t.co/7uUxdDrXLZ

6

154

24

120

31K

Adyasha Maharana @adyasha10

12 months ago

@prateeky2806 @AIatMeta Amazing!! Congratulations Prateek, very well-deserved 🥳🥳

0

79

adyasha10 retweeted

Jonathan Frankle

@jefrankle

12 months ago

I'm at ICML 🇨🇦 and I'm hiring at @databricks. Visit our booth if you're interested. My scientific focus: It's 1972 in AI, there's an AI crisis, Dijkstra isn't here to save us, and maybe RL can. Why Databricks? The long road to AGI is being paved here and we have the real evals 🧵

9

225

24

96

42K

adyasha10 retweeted

David Fan

@DavidJFan

about 1 year ago

Can visual SSL match CLIP on VQA? Yes! We show with controlled experiments that visual SSL can be competitive even on OCR/Chart VQA, as demonstrated by our new Web-SSL model family (1B-7B params) which is trained purely on web images – without any language supervision.

DavidJFan's tweet photo. Can visual SSL match CLIP on VQA?

Yes! We show with controlled experiments that visual SSL can be competitive even on OCR/Chart VQA, as demonstrated by our new Web-SSL model family (1B-7B params) which is trained purely on web images – without any language supervision.

12

460

95

303

86K

adyasha10 retweeted

Gedas Bertasius

@gberta227

about 1 year ago

For those of you who know me, I've always been very excited to combine my two passions for basketball and CV. Our #CVPR2025 paper does this by introducing a large-scale video dataset for fine-grained skill estimation in 🏀. Paper, code & data available: https://t.co/cqyJ5cRbaU

0

69

13

2

9K

adyasha10 retweeted

Jonathan Frankle

@jefrankle

over 1 year ago

The hardest part about finetuning LLMs is that people generally don't have high-quality labeled data. Today, @databricks introduced TAO, a new finetuning method that only needs inputs, no labels necessary. Best of all, it actually beats supervised finetuning on labeled data.

jefrankle's tweet photo. The hardest part about finetuning LLMs is that people generally don't have high-quality labeled data. Today, @databricks introduced TAO, a new finetuning method that only needs inputs, no labels necessary. Best of all, it actually beats supervised finetuning on labeled data. https://t.co/7ICyOQKGWN

13

891

134

838

91K

adyasha10 retweeted

Dong-Ho Lee

@Dongho_Lee_

over 1 year ago

🤖 Even wonder why chatbots feel off? We unveil REALTALK: 21 days of REAL human chats showing what AI misses: emotions, shifting personas, memory gaps. https://t.co/cGrrMN71CE https://t.co/SuW9sD7v4u @Snap @saharaai @USC_ISI @nlp_usc #LLM #EmotionalIntelligence

Dongho_Lee_'s tweet photo. 🤖 Even wonder why chatbots feel off?

We unveil REALTALK: 21 days of REAL human chats showing what AI misses: emotions, shifting personas, memory gaps.

https://t.co/cGrrMN71CE
https://t.co/SuW9sD7v4u

@Snap @saharaai @USC_ISI @nlp_usc
#LLM #EmotionalIntelligence https://t.co/k4iXxHpBdi

1

24

9

7

3K

adyasha10 retweeted

Ryan Marten

@ryanmart3n

over 1 year ago

Announcing the Open Thoughts project. We are building the best reasoning datasets out in the open. Building off our work with Stratos, today we are releasing OpenThoughts-114k and OpenThinker-7B.

ryanmart3n's tweet photo. Announcing the Open Thoughts project. We are building the best reasoning datasets out in the open.

Building off our work with Stratos, today we are releasing OpenThoughts-114k and OpenThinker-7B. https://t.co/naxil0wdWX

14

373

59

191

38K

Adyasha Maharana @adyasha10

over 1 year ago

Thanks to fabulous collaborators @jaeh0ng_yoon @TianlongChen4 @mohitban47 !!! @uncnlp @unccs 📖: https://t.co/yB1EJlnPfx Code: https://t.co/LX28At30lF

0

3

0

189

Adyasha Maharana @adyasha10

over 1 year ago

🎉 Adapt-♾ has been accepted to #ICLR2024 @iclr_conf! We propose a dynamic, multi-way data selection strategy for continual VLM learning with growing instruction-tuning datasets. Stay tuned for the camera-ready version with additional results on LLMs! 🙌

Jaehong Yoon

@jaeh0ng_yoon

over 1 year ago

🚨 Introducing Adapt-♾: Scalable Lifelong Multimodal Instruction Tuning via Dynamic Data Selection! ▶️ New multimodal instruction tuning datasets are continuously released, often containing redundant/similar content or targeting highly varied skills (i.e., tasks). ▶️ How can we enable scalable, lifelong instruction tuning for MLLMs, where a temporal stream of multi-task, multimodal instruction-tuning datasets are continually added to the existing training pool? ▶️ We present Adapt-♾, a scalable and adaptive data selection strategy that facilitates the effective learning of MLLM on new skills while reinforcing previously learned skills over time. 📖: https://t.co/KxWHesjfVc Thread 🧵👇

jaeh0ng_yoon's tweet photo. 🚨 Introducing Adapt-♾: Scalable Lifelong Multimodal Instruction Tuning via Dynamic Data Selection!

▶️ New multimodal instruction tuning datasets are continuously released, often containing redundant/similar content or targeting highly varied skills (i.e., tasks).

▶️ How can we enable scalable, lifelong instruction tuning for MLLMs, where a temporal stream of multi-task, multimodal instruction-tuning datasets are continually added to the existing training pool?

▶️ We present Adapt-♾, a scalable and adaptive data selection strategy that facilitates the effective learning of MLLM on new skills while reinforcing previously learned skills over time.

📖: https://t.co/KxWHesjfVc

Thread 🧵👇

6

141

45

89

30K

2

40

15

9

4K

adyasha10 retweeted

Javier Rando @javirandor

over 1 year ago

Carlini’s website will be auto-generated daily by a different LLM for the next 12 days. We joked about this during a dinner after NeurIPS and Christmas made it happen 💫🎄 https://t.co/8GaEnMBuu8

5

76

7

25

18K

adyasha10 retweeted

Jonathan Frankle

@jefrankle

over 1 year ago

Excited for our Data and AI Eras Tour at the @databricks booth at NeurIPS!

4

95

4

2

6K

adyasha10 retweeted

Zack Ankner

@ZackAnkner

over 1 year ago

There have been a lot of anectodes about the Llama3 series of models being harder to post-training quanitze (PTQ) than Llama2. As part of this paper, we investigated the hypothesis that the degradation from PTQ grows with the token-to-parameter ratio (TPR), .ie as you overtrain.

ZackAnkner's tweet photo. There have been a lot of anectodes about the Llama3 series of models being harder to post-training quanitze (PTQ) than Llama2. As part of this paper, we investigated the hypothesis that the degradation from PTQ grows with the token-to-parameter ratio (TPR), .ie as you overtrain. https://t.co/PxxPcmx5Vp

2

87

10

31

11K

Adyasha Maharana @adyasha10

over 1 year ago

@anikembhavi I learnt so much from you as an intern, Ani. Always looking forward to the next wonderful thing that comes from your leadership! Wish you the very best :)

1

0

426

Adyasha Maharana

@adyasha10

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users