Lucy Lu Wang @lucyluwang - Twitter Profile

3 months ago

🔎 Deep research agents like Asta ScholarQA and OpenAI Deep Research are transforming how we perform literature review. But how do we know if the way we evaluate them is actually meaningful? Announcing our new paper: “Deep Research, Shallow Evaluation: A Case Study in Meta-Evaluation for Long-Form QA Benchmarks” 🧵

5

156

20

92

12K

Lucy Lu Wang @lucyluwang

6 months ago

try out our new prototype system! you can ask questions about a paper and the system will answer with both text and figures from the paper. your data will go towards understanding how to better serve diverse visual needs!

Arnavi Chheda-Kothary @arnavic

6 months ago

Ever want to ask questions about a paper, including its figures & tables? 📊📈 Want smoother interactions w/papers on desktop & mobile? Try Paper+Figure QA, a new tool from @allen_ai that answers with the original figures, tables, and excerpts from papers: https://t.co/hoKCgPVBOI

1

5

3

1

720

0

3

1

0

327

lucyluwang retweeted

Jihan Yao @jihan_yao

about 1 year ago

We introduce MMMG: a Comprehensive and Reliable Evaluation Suite for Multitask Multimodal Generation

✅ Reliable: 94.3% agreement with human judgment
✅ Comprehensive: 4 modality combination × 49 tasks × 937 instructions

🔍 Results and Takeaways:

> GPT-Image-1 from @OpenAI leads image generation at 78.3% accuracy—13.7% ahead of the next-best model. The top open-source model, BAGEL from #ByteDance, achieves 45.5% accuracy.

> Audio generation is still challenging: Top open-sourced models achieve only 48.7% accuracy in sound (Make-An-Audio 2 from #ByteDance) and 41.9% in music (MusicGen from @AIatMeta).

📜 Paper: https://t.co/pFsEkJZfw8
🛠️ Code and Evaluation Suite: https://t.co/QRU05NlGSO
🥇 Leaderboard: https://t.co/oGEFw7YRpc
🧵1/N

2

27

18

8

13K

lucyluwang retweeted

Martin Saveski @msaveski

over 1 year ago

[Please RT] I’m recruiting PhD students to work with me at @UW! I’m looking for students passionate about developing new *social media algorithms*, both broadly and within the scope of this NSF grant: https://t.co/oMaPj7phwE More info: https://t.co/vnBqn40XWs @UW / @UW_iSchool

2

212

107

78

26K

lucyluwang retweeted

Isabelle Augenstein @IAugenstein

over 1 year ago

📢 📅 After a long process of soliciting & vetting bids, I'm excited that we've finally been able to reveal the location for #EMNLP2025 -- it'll be at the International Expo Centre, Suzhou, China from 5-9 November 2025. Looking forward to seeing you there! @emnlpmeeting #NLProc

4

137

12

5

16K

lucyluwang retweeted

Melanie Walsh

@mellymeldubs

over 1 year ago

I'm recruiting a PhD student to join my group @uw_ischool in 2025-26. If you like the mountains and interdisciplinary research that blends data and culture, this could be a good fit! PhD apps due Dec 2: https://t.co/S2dMirSr0d More info about my group: https://t.co/2jHUw4O74S

4

328

120

165

44K

lucyluwang retweeted

Anukriti @Anukriti_Kr

over 1 year ago

📉 Open access papers previously had higher accessibility compliance than closed access papers, but since 2019, we observe a sharp decline in compliance among OA papers (from the same publishers), driving much of the overall drop in PDF accessibility.

1

3

1

0

573

Lucy Lu Wang @lucyluwang

over 1 year ago

In 2019 when we first did this analysis, PDF accessibility trends were mostly improving, slowly. These 2024 results surprised me, and reflect major shifts in #OA publishing since Plan S and exacerbated by Covid. Has OA mostly been a win? Sure. But not evenly for everyone…

Anukriti @Anukriti_Kr

over 1 year ago

📢 Crisis alert in academic publishing! Less than 3.2% of scholarly PDFs meet #accessibility standards for blind and low-vision readers, and compliance has dramatically declined since 2019, especially for #OpenAccess papers! What’s going on?👇 Joint w/ @lucyluwang @uw_ischool

2

25

3

1

3K

0

14

1

0

1K

lucyluwang retweeted

Jihan Yao @jihan_yao

over 1 year ago

🚀Varying Shades of Wrong: When no correct answers exist, can alignment still unlock better outcome? Introducing wrong-over-wrong alignment, where models learn to prefer "less-wrong" over "more-wrong". Surprisingly, aligning with wrong answers only can lead to correct solutions!

1

24

5

10

6K

lucyluwang retweeted

Lucy Li @lucy3_li

over 1 year ago

Hi friends, colleagues, followers. I am on the faculty job market! I am a PhD student @BerkeleyISchool + @berkeley_ai. I work on NLP, and I believe all language, whether AI- or human-generated, is ✨social and cultural data✨. My work includes: 🧵

10

388

72

63

58K

Lucy Lu Wang @lucyluwang

over 1 year ago

@einfeyn oh yes, you’re in good company, just humans exhibiting very human tendencies 🥲

0

1

0

136

Lucy Lu Wang @lucyluwang

over 1 year ago

today i left a bunch of comments for a collaborator on a grant like “what did you mean here?” and “you should expand upon this” only to realize later that i wrote those sections 😭

1

70

2

0

6K

lucyluwang retweeted

Bingbing Wen @bingbingwen1

over 1 year ago

🚨Curious how LLMs deal with uncertainty? In our new #EMNLP2024 Findings paper, we dive deep into their ability to abstain from answering when given insufficient or incorrect context in science questions 💡https://t.co/2pAWkSwHN7 Joint work w/ @billghowe @lucyluwang @uw_ischool

2

62

21

45

8K

lucyluwang retweeted

Semantic Scholar Research @ AI2 @ai2_s2research

over 1 year ago

@allen_ai @SemanticScholar is hiring #nlproc #hci #ml #ai researchers for the following positions with target start dates in 2025, apply by *Nov 1* for the 1st rolling deadline.
- Research intern
- Young investigator (Postdoc)
- Research scientist
Apply: https://t.co/xYovFaZ5Mn

0

36

9

39

12K

Lucy Lu Wang @lucyluwang

over 1 year ago

@aarontay maybe small scale demonstrations could offer a more realistic alternative vision? agree it would take navigating a lot of powerful players and divided opinions..

1

0

29

Lucy Lu Wang @lucyluwang

over 1 year ago

@sabeerawa05 @uw_ischool I am also recruiting PhD students this year in biomedical NLP and accessibility

1

3

0

1

242

Lucy Lu Wang @lucyluwang

over 1 year ago

Come be my colleague! We're hiring TWO tenure-track Assistant Professors at @UW_iSchool in AI, Data Science, and HCI 📊💻👩‍💻🌄 Link to apply: https://t.co/o5ksz5YhsS Feel free to reach out with any questions!

1

157

54

49

19K

lucyluwang retweeted

Yue Guo @YueGuo10

over 1 year ago

Excited to share that our paper on plain language summarization evaluation has been accepted to the #EMNLP2024 main conference! I’ll be in Miami and will have several PhD openings for Fall 2025. Feel free to reach out if you’d like to chat!

2

41

6

5

8K

lucyluwang retweeted

Maria Antoniak @maria_antoniak

almost 2 years ago

Sexual harassment is a horrible impediment to academic research, shutting out talented researchers and slowing scientific progress. What can we do? I believe we're not helpless; we can improve our communities through practical actions. Take a look: https://t.co/dIL52QwqOM

1

116

19

33

10K

lucyluwang retweeted

Chao-Chun (Joe) Hsu @chaochunh

almost 2 years ago

1/ 🎉 Excited to share our #ACL2024 Findings paper on using LLMs to assist with literature review! 📝 "CHIME: LLM-Assisted Hierarchical Organization of Scientific Studies for Literature Review Support" Please check out our virtual poster session today at 8:15 p.m. PT!

1

19

3

9

2K
