Seungwhan Shane Moon

Verified account

@shane_moon

| Research Scientist @ Facebook | | PhD @ LTI SCS, CMU |

Seattle, WA

Joined March 2010

190 Following

697 Followers

26 Posts

Seungwhan Shane Moon

20 days ago

Thrilled to announce the Wearable AI Workshop at ECCV 2026 🎉 If you work on Proactive AI, Multimodal Assistants, or Long-form Streaming Video understanding, definitely consider participating. 🔗 Call for Papers: https://t.co/M56RBboLHw 🔗 Granc Challenge: Dataset & Toolkit on HF https://t.co/pOqN4EjPCp What's at stake: - Access to a rich, multi-modal egocentric dataset for research - A venue to present your work at ECCV 2026 - $21K in challenge prizes across three tracks Tag a colleague or student who'd be interested 👇 📅 Wearable AI Workshop - ECCV 2026 · 📍 Malmö, Sweden

shane_moon's tweet photo. Thrilled to announce the Wearable AI Workshop at ECCV 2026 🎉 If you work on Proactive AI, Multimodal Assistants, or Long-form Streaming Video understanding, definitely consider participating.

🔗 Call for Papers: https://t.co/M56RBboLHw

🔗 Granc Challenge: Dataset & Toolkit on HF https://t.co/pOqN4EjPCp

What's at stake:
- Access to a rich, multi-modal egocentric dataset for research
- A venue to present your work at ECCV 2026
- $21K in challenge prizes across three tracks

Tag a colleague or student who'd be interested 👇
📅 Wearable AI Workshop - ECCV 2026 · 📍 Malmö, Sweden

0

4

1

1

343

Seungwhan Shane Moon

about 1 year ago

We're organizing a visual Q&A benchmark challenge at KDD, focusing on the Multimodal RAG task. Join the CRAG-MM Challenge! More details here: https://t.co/vS2Jp9gRLb

0

4

0

0

344

Seungwhan Shane Moon

over 1 year ago

We're hiring exceptional AI Research Scientists to join our team at Meta Reality Labs, where you'll work on cutting-edge projects in Vision LLMs. Please reach out to me directly via email with your resume! (Check minimum qualifications) https://t.co/n6e6qQfUcp

2

226

33

177

28K

Seungwhan Shane Moon

over 2 years ago

We are hiring PhD AI Research Interns to work on various projects around Multimodal LLM for Summer 2024 (Reality Labs). Please reach out to me directly via email with your resume!

8

215

28

173

52K

Who to follow

Verified account

Assistant professor @Berkeley_EECS @berkeley_ai || Research scientist at @allen_ai || PhD from @uwcse @uwnlp

Dongyeop Kang (DK)

Verified account

Assistant professor @UMNComputerSci, a member of @MinnesotaNLP group, research on human-centered language technologies, he/him/his

Zhiyu Zoey Chen

Verified account

NLP researcher. Assistant Professor @UT_Dallas.

Seungwhan Shane Moon

over 2 years ago

@KuterDinel Hi @KuterDinel, great point, I do agree it'll be more robust that way, but it'll be computationally much more costly to pre-train it e2e from scratch, and re-do instruction tuning & RLHF for the LLM (hence "scalable and efficient" in our title).

0

0

0

0

66

Seungwhan Shane Moon

over 2 years ago

Excited to share our recent work, AnyMAL -- a unified Multimodal LLM built on LLaMA-2 that can reason over various inputs, e.g. images, audio, motion sensors. Check out our paper for more information on the model training, evaluation, safety and more! ➡️ https://t.co/HmyVynWXPH

shane_moon's tweet photo. Excited to share our recent work, AnyMAL -- a unified Multimodal LLM built on LLaMA-2 that can reason over various inputs, e.g. images, audio, motion sensors.

Check out our paper for more information on the model training, evaluation, safety and more!
➡️ https://t.co/HmyVynWXPH https://t.co/zj7xbY8qFp

4

120

24

46

23K

shane_moon retweeted

over 2 years ago

Meta introduces AnyMAL - a unified model that reasons over diverse input modality signals (i.e. text, image, video, audio, IMU motion sensor), and generates textual responses - best model achieves strong zero-shot performance in both automatic and human evaluation on diverse tasks and modalities, setting new SOTA with +7.0% relative accuracy improvement on VQAv2, +8.4% CIDEr on zeroshot COCO image captioning, and +14.5% CIDEr on AudioCaps, when compared with the models available in the literature.

_akhaliq's tweet photo. Meta introduces AnyMAL

- a unified model that reasons over diverse input modality signals (i.e. text, image, video, audio, IMU motion sensor), and generates textual responses

- best model achieves strong zero-shot performance in both automatic and human evaluation on diverse tasks and modalities, setting new SOTA with +7.0% relative accuracy improvement on VQAv2, +8.4% CIDEr on zeroshot COCO image captioning, and +14.5% CIDEr on AudioCaps, when compared with the models available in the literature.

6

587

105

318

234K

Seungwhan Shane Moon

about 3 years ago

Meta Reality Lab is organizing "Ambient AI Workshop" -- focusing on multimodal understanding with wearable sensors, combining NLP + Vision + Sensor Signals. For more details & call for paper (now due Mar 26): https://t.co/yxINJn0r7n We look forward to your participation!

0

8

3

0

1K

Seungwhan Shane Moon

over 3 years ago

We are hiring PhD Research Interns to work on various Multimodal & NLP related projects (Reality Labs) for 2023. See JDs here -- apply directly or reach out to me directly via email! - https://t.co/8P649TIftC - https://t.co/CCVvLy47yz

0

44

9

21

33K

Seungwhan Shane Moon

over 4 years ago

We are hiring research interns to work on various multimodal & NLP related projects (Reality Labs). See JDs here -- or reach out to me directly via email! https://t.co/Le9D2c2cNh https://t.co/gYUqR49QmH

0

7

0

2

0

Seungwhan Shane Moon

almost 5 years ago

4 papers accepted at #EMNLP2021🎉 #NLProc - ToD Dataset for Immersive Multimodal Conversation; @SatwikKottur et al - Continual Learning in ToD System; @AndreaMadotto et al - Zero-Shot DST via CrossTask Transfer; @zlinao_lin et al - Annotation for Nuanced Conversation; Chen et al

0

17

1

0

0

Seungwhan Shane Moon

almost 5 years ago

(3/3) ACCENTOR datasets: We propose a Human ↔ AI collaborative data collection approach for generating diverse chitchat responses to augment ToD dialogs with minimal annotation effort. Results: chit-chat additions to 23K+ dialogs from two popular ToD datasets (SGD & MultiWoZ2.1)

shane_moon's tweet photo. (3/3) ACCENTOR datasets:
We propose a Human ↔ AI collaborative data collection approach for generating diverse chitchat responses to augment ToD dialogs with minimal annotation effort. Results: chit-chat additions to 23K+ dialogs from two popular ToD datasets (SGD & MultiWoZ2.1) https://t.co/1CpeIfJSOA

0

2

0

0

0

Seungwhan Shane Moon

almost 5 years ago

Introducing our work at #NAACL2021 w/ Sun et al. -- bridging the gap between task-oriented dialog systems and open-domain dialog systems (chit-chat) #NLPRoc #ConvAI 📰Paper, 📂Dataset, 💻Code (for a suite of chit-chat & task code-switching models): https://t.co/almWVHDp8M (1/3)

about 5 years ago

We are releasing ACCENTOR, a new data set that combines contextual chit-chat and traditional task-oriented dialogs. Automatic & human evaluations show our models can code-switch seamlessly, making virtual assistant conversations more natural & interactive. https://t.co/HjOzZkpLfC

3

150

41

20

0

1

14

4

0

0

Seungwhan Shane Moon

almost 5 years ago

(2/3) Results? - (Interaction eval) People like them! Our models are consistently preferred by human judges across the four axes (engagingness, etc.), compared to the baseline assistant models. - (Task eval) Our models still maintain competitive task performances.

shane_moon's tweet photo. (2/3) Results?
- (Interaction eval) People like them! Our models are consistently preferred by human judges across the four axes (engagingness, etc.), compared to the baseline assistant models.
- (Task eval) Our models still maintain competitive task performances. https://t.co/AkjjKcDLAK

1

2

0

0

0

Seungwhan Shane Moon

about 5 years ago

Two papers from our group were accepted at #NAACL2021 🎉 * Adding chit-chat to enhance task-oriented dialogues: https://t.co/almWVHDp8M w/ Kai Sun * A new SOTA for zeroshot cross-domain DST: manuscript📑 to be released soon! @zlinao_lin Kudos to our amazing interns! 😀

2

10

1

0

0

Seungwhan Shane Moon

over 5 years ago

The call for track proposals for the next Dialogue System Technology Challenge (DSTC10) is out! More info: https://t.co/WsKsBFDWt8

0

0

0

0

0

Seungwhan Shane Moon

over 5 years ago

Check out our work on Conversational Curiosity at #emnlp2020! 📄arXiv: https://t.co/4tU9a8Yujo

Dr. Pedro Rodriguez @[email protected] @EntilZhaPR

over 5 years ago

Hey <wake-word>, tell me about Punta Cana🇩🇴. Our #emnlp2020 paper introduces a conversational information-seeking dataset on geographic entities. 📜Paper + 📁Dataset + 💻Code: https://t.co/Zh7ebt3mqq Gather 5H: Nov 18 18UTC w/Paul Crook, @shane_moon, Stephen Wang 1/4

EntilZhaPR's tweet photo. Hey <wake-word>, tell me about Punta Cana🇩🇴. Our #emnlp2020 paper introduces a conversational information-seeking dataset on geographic entities.

📜Paper + 📁Dataset + 💻Code: https://t.co/Zh7ebt3mqq
Gather 5H: Nov 18 18UTC
w/Paul Crook, @shane_moon, Stephen Wang 1/4 https://t.co/dCFurqfpDe

2

18

6

0

0

0

5

0

0

0

Seungwhan Shane Moon

almost 6 years ago

We are running a challenge track at DSTC9 around multimodal conversational AI! To participate: - paper: https://t.co/CzmbbfHuSF - code & challenge website: https://t.co/5V6QY4AYqM

almost 6 years ago

We’ve released SIMMC, a data set on situated and interactive multimodal conversations, to help conversational AI researchers ground conversations in a co-observed and evolving multimodal context. A challenge track at DSTC9 around SIMMC is currently live. https://t.co/09QKGQ6pXF

AIatMeta's tweet photo. We’ve released SIMMC, a data set on situated and interactive multimodal conversations, to help conversational AI researchers ground conversations in a co-observed and evolving multimodal context.
A challenge track at DSTC9 around SIMMC is currently live.
https://t.co/09QKGQ6pXF https://t.co/pNYtYjesD1

3

88

26

12

0

0

2

1

0

0

shane_moon retweeted

almost 6 years ago

We’ve released SIMMC, a data set on situated and interactive multimodal conversations, to help conversational AI researchers ground conversations in a co-observed and evolving multimodal context. A challenge track at DSTC9 around SIMMC is currently live. https://t.co/09QKGQ6pXF

AIatMeta's tweet photo. We’ve released SIMMC, a data set on situated and interactive multimodal conversations, to help conversational AI researchers ground conversations in a co-observed and evolving multimodal context.
A challenge track at DSTC9 around SIMMC is currently live.
https://t.co/09QKGQ6pXF https://t.co/pNYtYjesD1

3

88

26

12

0

Seungwhan Shane Moon

over 12 years ago

Mariah wrote a blog on XAML text improvements in Windows 8.1 and it just got published: http://t.co/CYvZfBUYJu

1

1

0

0

0

Last Seen Users on Sotwe

Trends for you

Most Popular Users