Qingyun Wang

@eagle_hz

Assistant Professor @williamandmary. CS Ph.D. in @uiuc_nlp. #AI4Research #AI4Science #NLP

Joined May 2017

1.2K Following

725 Followers

135 Posts

eagle_hz retweeted

Yi R. (May) Fung @May_F1_

3 days ago

@livgorton We have a solution to this problem, "CiteGuard: Faithful Citation Attribution for LLMs via Retrieval-Augmented Validation" (ACL'26 oral) led by @yeemanchoi and @eagle_hz ! Hope research reviewing platforms such as @openreviewnet can consider adopting the integration as well :)

261

eagle_hz retweeted

Software Engineering Papers @ComputerPapers

10 days ago

Enhancing Software Engineering Through Closed-Loop Memory Optimization Xuehang Guo, Zora Zhiruo Wang, Qingyun Wang, Graham Neubig, Xingyao Wang https://t.co/Yi2CfewgIJ [𝚌𝚜.𝚂𝙴 𝚌𝚜.𝙰𝙸]

202

Qingyun Wang

@eagle_hz

2 months ago

Can LLMs cite like humans?🧐 Meet CiteGuard 🛡️our retrieval-augmented agent for faithful citation attribution. +17% over prior baselines and 68.1% on CiteME, near-human accuracy #ACL2026 #ai4scientist #LLM

Kath Choi @yeemanchoi

2 months ago

🚨 Excited to share that our paper CiteGuard https://t.co/1mEElRMkm8 is accepted to ACL 2026 (Main)! LLMs are powerful for scientific writing—but up to 90% of their citations can be fabricated. Why this matters + our solution 👇

yeemanchoi's tweet photo. 🚨 Excited to share that our paper CiteGuard https://t.co/1mEElRMkm8 is accepted to ACL 2026 (Main)!

LLMs are powerful for scientific writing—but up to 90% of their citations can be fabricated.

Why this matters + our solution 👇 https://t.co/seXM14LozX

eagle_hz retweeted

EMNLP 2026 @emnlpmeeting

3 months ago

📢 The First Call for Papers for EMNLP 2026 is officially out! 📝 We welcome long & short papers featuring original research on empirical methods for NLP. 🗓️ ARR Submission Deadline: May 25, 2026 🔗 Read the full CFP here: https://t.co/GU2kaISjUG #EMNLP2026

238

100

50K

Who to follow

Jie Huang

@jefffhj

Building intelligence @xAI. Grok-2🍍, 3🍫, 4🫐, Video Gen🪄. PhD from UIUC CS.

Zhiyuan Liu

@zibuyu9

Associate Professor @TsinghuaNLP @OpenBMB. Research interests include NLP, KG and social computation.

Manling Li

@ManlingLi_

Assistant Professor@NU, Amazon Scholar, Postdoc@Stanford, PhD@UIUC #NLP #CV Language+Vision/EmbodiedAI, Reasoning, Planning, Compositionality, Trustworthiness

eagle_hz retweeted

SEA Workshop

@SEAWorkshop

6 months ago

Lightening Oral presentations are starting! Here is the schedule, presenters and topics！

425

eagle_hz retweeted

Yi R. (May) Fung @May_F1_

6 months ago

We need more 𝗼𝗽𝗲𝗻, 𝗿𝗲𝗮𝗹𝗶𝘀𝘁𝗶𝗰 𝗮𝗴𝗲𝗻𝘁 𝗲𝗻𝘃𝗶𝗿𝗼𝗻𝗺𝗲𝗻𝘁𝘀 for training and evaluating agents! 💡 🚀But what are the 𝗶𝗺𝗽𝗼𝗿𝘁𝗮𝗻𝘁 𝗲𝗻𝘃𝗶𝗿𝗼𝗻𝗺𝗲𝗻𝘁𝘀 𝘁𝗼 𝗯𝘂𝗶��𝗱? 📈What are the 𝗶𝗻𝗳𝗿𝗮𝘀𝘁𝗿𝘂𝗰𝘁𝘂𝗿𝗲 𝗯𝗼𝘁𝘁𝗹𝗲𝗻𝗲𝗰𝗸𝘀 for these environments in training and evaluation, and how can we 𝘀𝗰𝗮𝗹𝗲 𝘂𝗽 the number of available environments? 🌟Most importantly, how should we utilize these environments: 𝗥𝗟 𝗼𝗿 𝗯𝗲𝘆𝗼𝗻𝗱? If you’re interested in discussing together, come join us at our workshop on “𝙎𝙘𝙖𝙡𝙞𝙣𝙜 𝙀𝙣𝙫𝙞𝙧𝙤𝙣𝙢𝙚𝙣𝙩𝙨 𝙛𝙤𝙧 𝘼𝙜𝙚𝙣𝙩𝙨” @NeurIPSConf tmr (7th Dec)! We have an amazing lineup of invited speakers and panelists, including 𝐄𝐝𝐰𝐚𝐫𝐝 𝐆𝐫𝐞𝐟𝐞𝐧𝐬𝐭𝐞𝐭𝐭𝐞 from 𝐆𝐨𝐨𝐠𝐥𝐞 𝐃𝐞𝐞𝐩𝐌𝐢𝐧𝐝 and 𝐒𝐡𝐮𝐲𝐚𝐧 𝐙𝐡𝐨𝐮 from 𝐃𝐮𝐤𝐞. Also check out our latest 𝐬𝐮𝐫𝐯𝐞𝐲 𝐩𝐚𝐩𝐞𝐫 on the topic led by Yuchen Huang: https://t.co/gvsdqkp6jf 🎯

May_F1_'s tweet photo. We need more 𝗼𝗽𝗲𝗻, 𝗿𝗲𝗮𝗹𝗶𝘀𝘁𝗶𝗰 𝗮𝗴𝗲𝗻𝘁 𝗲𝗻𝘃𝗶𝗿𝗼𝗻𝗺𝗲𝗻𝘁𝘀 for training and evaluating agents! 💡

🚀But what are the 𝗶𝗺𝗽𝗼𝗿𝘁𝗮𝗻𝘁 𝗲𝗻𝘃𝗶𝗿𝗼𝗻𝗺𝗲𝗻𝘁𝘀 𝘁𝗼 𝗯𝘂𝗶��𝗱?

📈What are the 𝗶𝗻𝗳𝗿𝗮𝘀𝘁𝗿𝘂𝗰𝘁𝘂𝗿𝗲 𝗯𝗼𝘁𝘁𝗹𝗲𝗻𝗲𝗰𝗸𝘀 for these environments in training and evaluation, and how can we 𝘀𝗰𝗮𝗹𝗲 𝘂𝗽 the number of available environments?

🌟Most importantly, how should we utilize these environments: 𝗥𝗟 𝗼𝗿 𝗯𝗲𝘆𝗼𝗻𝗱?

If you’re interested in discussing together, come join us at our workshop on “𝙎𝙘𝙖𝙡𝙞𝙣𝙜 𝙀𝙣𝙫𝙞𝙧𝙤𝙣𝙢𝙚𝙣𝙩𝙨 𝙛𝙤𝙧 𝘼𝙜𝙚𝙣𝙩𝙨” @NeurIPSConf tmr (7th Dec)! We have an amazing lineup of invited speakers and panelists, including 𝐄𝐝𝐰𝐚𝐫𝐝 𝐆𝐫𝐞𝐟𝐞𝐧𝐬𝐭𝐞𝐭𝐭𝐞 from 𝐆𝐨𝐨𝐠𝐥𝐞 𝐃𝐞𝐞𝐩𝐌𝐢𝐧𝐝 and 𝐒𝐡𝐮𝐲𝐚𝐧 𝐙𝐡𝐨𝐮 from 𝐃𝐮𝐤𝐞.

Also check out our latest 𝐬𝐮𝐫𝐯𝐞𝐲 𝐩𝐚𝐩𝐞𝐫 on the topic led by Yuchen Huang: https://t.co/gvsdqkp6jf 🎯

eagle_hz retweeted

SEA Workshop

@SEAWorkshop

6 months ago

🚀 SEA Workshop is going LIVE TOMORROW! Join us at NeurIPS 2025 for a full day diving into the Scaling Environments for Agents featuring an incredible lineup of speakers and panelists： @egrefen @Mike_A_Merrill @mialon_gregoire @deepaknathani11 @jl_marino @syz0x1 @qhwang3 Anthony G. Cohn, Eric Sommerlade, @fredsala 📍 Upper Level Room 23ABC 🕘 08:00–17:00 Huge thanks to our sponsors: @TheInclusionAI (@AntLingAGI) @SnorkelAI @SonicjobsApp and @VmaxAI 🙌 🔥 Get ready for a day of insights and inspiring conversations!

SEAWorkshop's tweet photo. 🚀 SEA Workshop is going LIVE TOMORROW!
Join us at NeurIPS 2025 for a full day diving into the Scaling Environments for Agents featuring an incredible lineup of speakers and panelists：

@egrefen @Mike_A_Merrill @mialon_gregoire @deepaknathani11 @jl_marino @syz0x1 @qhwang3 Anthony G. Cohn, Eric Sommerlade, @fredsala

📍 Upper Level Room 23ABC
🕘 08:00–17:00

Huge thanks to our sponsors:
@TheInclusionAI (@AntLingAGI) @SnorkelAI @SonicjobsApp and @VmaxAI 🙌

🔥 Get ready for a day of insights and inspiring conversations!

13K

eagle_hz retweeted

Guohao Li 🐫

@guohao_li

7 months ago

The SEA Workshop at @NeurIPSConf 2025 is coming next Sunday. It seems we urgently need more open, realistic agent environments for training and evaluating agents. But what are the important environments to build? What are the infrastructure bottlenecks for these environments in training and evaluation? How can we scale up the number of available environments? And how should we use these environments, RL or beyond? These questions are still not clear. We’re bringing together an amazing list of speakers and panelists to spark the discussion: @egrefen, @Mike_A_Merrill, @mialon_gregoire, @deepaknathani11, @jl_marino, @syz0x1, @qhwang3, Anthony G. Cohn, Eric Sommerlade, and @fredsala. You won’t want to miss it if you’re around. Also, huge thanks to our four sponsors, @TheInclusionAI (@AntLingAGI), @SnorkelAI, @SonicjobsApp, and @VmaxAI for their generous support!

guohao_li's tweet photo. The SEA Workshop at @NeurIPSConf 2025 is coming next Sunday. It seems we urgently need more open, realistic agent environments for training and evaluating agents. But what are the important environments to build? What are the infrastructure bottlenecks for these environments in training and evaluation? How can we scale up the number of available environments? And how should we use these environments, RL or beyond? These questions are still not clear.

We’re bringing together an amazing list of speakers and panelists to spark the discussion: @egrefen, @Mike_A_Merrill, @mialon_gregoire, @deepaknathani11, @jl_marino, @syz0x1, @qhwang3, Anthony G. Cohn, Eric Sommerlade, and @fredsala. You won’t want to miss it if you’re around.

Also, huge thanks to our four sponsors, @TheInclusionAI (@AntLingAGI), @SnorkelAI, @SonicjobsApp, and @VmaxAI for their generous support!

23K

eagle_hz retweeted

Ai2 @allen_ai

7 months ago

Announcing Olmo 3, a leading fully open LM suite built for reasoning, chat, & tool use, and an open model flow—not just the final weights, but the entire training journey. Best fully open 32B reasoning model & best 32B base model. 🧵

allen_ai's tweet photo. Announcing Olmo 3, a leading fully open LM suite built for reasoning, chat, & tool use, and an open model flow—not just the final weights, but the entire training journey.
Best fully open 32B reasoning model & best 32B base model. 🧵 https://t.co/vnGrArA44X

327

692

610K

eagle_hz retweeted

Manling Li

@ManlingLi_

7 months ago

#EMNLP Keynote by @hengjinlp: No more Processing. Time to Discover! AI for Science is just so exciting! Let us make LLMs discover like true scientists: Observe → Think → Propose and Verify (A pity to miss the talk. Photo from @May_F1_ @emnlpmeeting )

ManlingLi_'s tweet photo. #EMNLP Keynote by @hengjinlp:

No more Processing. Time to Discover!

AI for Science is just so exciting! Let us make LLMs discover like true scientists: Observe → Think → Propose and Verify

(A pity to miss the talk. Photo from @May_F1_ @emnlpmeeting ) https://t.co/W94Nerj6jl

115

eagle_hz retweeted

Alexi Gladstone

@AlexiGlad

7 months ago

What if your policy could reason and think dynamically, especially about uncertainty, enabling better real-world behavior? ⚡️Introducing EBT-Policy, an instantiation of Energy-Based Transformers for Policies! TLDR: - EBT-Policy broadly outperforms Diffusion Policy in both simulated and real-world tasks, while using significantly less (up to 50x less) resources during both training and inference. - EBT-Policy is the first vanilla Behavior Cloning approach to demonstrate emergent zero-shot retry behavior—recovering from failures/OOD states using only successful demos, with no retry data/training. - EBT-Policy successfully learns uncertainty, enabling dynamic compute allocation for action sequences it’s more uncertain about. 🧵Thread:

AlexiGlad's tweet photo. What if your policy could reason and think dynamically, especially about uncertainty, enabling better real-world behavior?

⚡️Introducing EBT-Policy, an instantiation of Energy-Based Transformers for Policies!
TLDR:
- EBT-Policy broadly outperforms Diffusion Policy in both simulated and real-world tasks, while using significantly less (up to 50x less) resources during both training and inference.
- EBT-Policy is the first vanilla Behavior Cloning approach to demonstrate emergent zero-shot retry behavior—recovering from failures/OOD states using only successful demos, with no retry data/training.
- EBT-Policy successfully learns uncertainty, enabling dynamic compute allocation for action sequences it’s more uncertain about.

🧵Thread:

eagle_hz retweeted

Salesforce AI Research

@SFResearch

8 months ago

Introducing UniDoc-Bench: The First Unified Benchmark for Document-Centric Multimodal RAG 📄 Paper: https://t.co/33S6yvibzO Real documents mix text, tables, and charts—but most RAG benchmarks test them in isolation. We built UniDoc-Bench to change that. 📊 What's inside: ➡️ 70K PDF pages across 8 domains ➡️ 1,600 QA pairs grounding text, tables & images ➡️ Fair comparison across 4 RAG paradigms 🔍 Key finding: Text-image fusion RAG (68.4%) beats both multimodal joint retrieval (64.1%) and single-modality approaches. Current multimodal embeddings still lag behind combining strong unimodal retrievers. 💻 Code: https://t.co/gzVHUqRb0i 📊 Data: https://t.co/UndBiJ3Aqy ➡️ Work by Xiangyu Peng @beckypeng6, Can Qin @canqin001, Zeyuan Chen @ZeyuanChen, Ran Xu @stanleyran, Caiming Xiong @CaimingXiong, and Chien-Sheng Wu @jasonwu0731. #FutureOfAI #EnterpriseAI #MultimodalAI #DocumentIntelligence

SFResearch's tweet photo. Introducing UniDoc-Bench: The First Unified Benchmark for Document-Centric Multimodal RAG

📄 Paper: https://t.co/33S6yvibzO

Real documents mix text, tables, and charts—but most RAG benchmarks test them in isolation. We built UniDoc-Bench to change that.

📊 What's inside:
➡️ 70K PDF pages across 8 domains
➡️ 1,600 QA pairs grounding text, tables & images
➡️ Fair comparison across 4 RAG paradigms

🔍 Key finding: Text-image fusion RAG (68.4%) beats both multimodal joint retrieval (64.1%) and single-modality approaches. Current multimodal embeddings still lag behind combining strong unimodal retrievers.

💻 Code: https://t.co/gzVHUqRb0i
📊 Data: https://t.co/UndBiJ3Aqy

➡️ Work by Xiangyu Peng @beckypeng6, Can Qin @canqin001, Zeyuan Chen @ZeyuanChen, Ran Xu @stanleyran, Caiming Xiong @CaimingXiong, and Chien-Sheng Wu @jasonwu0731.

#FutureOfAI #EnterpriseAI #MultimodalAI #DocumentIntelligence

eagle_hz retweeted

Zhenhailong Wang @zhenhailongW

8 months ago

Multimodal conversational agents struggle to follow complex policies, which also impose a fixed computational cost. We ask: 👉 How can we achieve stronger policy-following behavior without having to include policies in-context? 🌐: https://t.co/mIdhuPw6Cj 🧵1/3

zhenhailongW's tweet photo. Multimodal conversational agents struggle to follow complex policies, which also impose a fixed computational cost.
We ask:
👉 How can we achieve stronger policy-following behavior without having to include policies in-context?
🌐: https://t.co/mIdhuPw6Cj 🧵1/3 https://t.co/2oktzstXWq

eagle_hz retweeted

Yuan He

@lawhy_X

9 months ago

The decision notification letters have been sent! 🎉 We sincerely thank all authors and reviewers for their valuable contributions to this workshop. Kudos to our organizing committee, advisors, and support team for their incredible efforts: @guohao_li @May_F1_ @eagle_hz @FangruLin99 @hxyscott @AlisiaLupidi @thu_yushengsu Ziyu Ye @Wade_Yin9712 @ZiyiYang35007 Jialin Yu @sunandosengupta @agarwl_ @BernardSGhanem @AnimaAnandkumar @philiptorr @douglas_ym @celineee_xie

Qingyun Wang

@eagle_hz

10 months ago

Our VISTA workshop at ICDM 2025 is still open for submissions! If you’re working on GenAI standards, legal constraints, copyright risks, & compliance, we’d love to see your papers! 📄✨ 🧵More information and submit:

Yide Ran @ran_yide42201

12 months ago

🚨 Call for Papers: VISTA Workshop @ ICDM 2025 🚨 📅 Nov 12, 2025 | 📍 Washington, DC Explore GenAI standards, legal constraints, copyright risks, & compliance. Submit by Sep 5! 🔗 https://t.co/MjmZx8UunI Speakers: V. Braverman, D. Atkinson, A. Li #ICDM2025 #GenAI #AIStandards

ran_yide42201's tweet photo. 🚨 Call for Papers: VISTA Workshop @ ICDM 2025 🚨
📅 Nov 12, 2025 | 📍 Washington, DC
Explore GenAI standards, legal constraints, copyright risks, & compliance. Submit by Sep 5!
🔗 https://t.co/MjmZx8UunI
Speakers: V. Braverman, D. Atkinson, A. Li
#ICDM2025 #GenAI #AIStandards https://t.co/GwzFiOWBHv

830

eagle_hz retweeted

Fangru Lin @FangruLin99

10 months ago

🚨 Deadline Extended! 🚨 Our Scaling Environments for Agents 🧑‍💻🤖 workshop at @NeurIPSConf 2025 is still open for submissions! If you’re working on scaling, environments, or agents, we’d love to see your papers! 📄✨ 📅 New deadline: Sept 1st 🧵More information and submit:

Qingyun Wang

@eagle_hz

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users