Dr. Yanjun Qi @Qdatalab - Twitter Profile

Dr. Yanjun Qi @Qdatalab

about 1 month ago

Paper: https://t.co/7tPoaHfhel

0

24

Dr. Yanjun Qi @Qdatalab

about 1 month ago

🎉 Excited to share our latest work: "Reward Is Enough: LLMs Are In-Context Reinforcement Learners” just presented at ICLR 2026. Summary: 🤔 What if an LLM could teach itself to get better — just from a reward score — without any retraining? #TestTimeScaling

1

3

0

1

142

Dr. Yanjun Qi @Qdatalab

about 1 month ago

ICRL — a minimal multi-round framework where an LLM sees its own past responses alongside their scalar reward scores, and iteratively self-improves. No gradient updates. No textual gradients. Just: try → get a number → try better. 🔁 The results are striking:

1

0

71

Qdatalab retweeted

AISecHub

@AISecHub

5 months ago

Sequential Tool Attack Chaining - https://t.co/s6FVOIXzON The AI safety community has fundamentally misallocated its research priorities. While extensive investigation addresses hallucination, bias, and toxicity in LLMs, there is an equally, if not more, critical vulnerability that threatens safe deployment: the inability of these systems to understand context and user intent. To address this gap, we introduce and investigate Sequential Tool Attack Chaining (STAC)—a novel category of multi-turn attacks targeting tool-enabled LLM agents. STAC exploits a unique vulnerability of agents by orchestrating sequences of seemingly innocuous tool calls that individually pass safety checks but collectively achieve harmful goals. Unlike prior multi-turn attacks that aim to elicit unsafe text responses, STAC drives agents into performing harmful tool calls. This paper positions contextual blindness as the most exploitable weakness in contemporary LLMs, rendering existing safety mechanisms inadequate against determined adversaries. Authors: @drjingjing2026, Jianfeng He, Chao Shang, Devang Kulshreshtha, Xun Xian, Yi Zhang, Hang Su, Sandesh Swamy, @Qdatalab - @AWSAI, @UCBerkeley #AISecurity #LLMSecurity #AgentSecurity #PromptInjection #ToolSecurity #ToolChaining #AIJailbreak #AdversarialML #RedTeaming #GenAI #Cybersecurity #MLSafety #STAC

AISecHub's tweet photo. Sequential Tool Attack Chaining - https://t.co/s6FVOIXzON

The AI safety community has fundamentally misallocated its research priorities. While extensive investigation addresses hallucination, bias, and toxicity in LLMs, there is an equally, if not more, critical vulnerability that threatens safe deployment: the inability of these systems to understand context and user intent.

To address this gap, we introduce and investigate Sequential Tool Attack Chaining (STAC)—a novel category of multi-turn attacks targeting tool-enabled LLM agents. STAC exploits a unique vulnerability of agents by orchestrating sequences of seemingly innocuous tool calls that individually pass safety checks but collectively achieve harmful goals. Unlike prior multi-turn attacks that aim to elicit unsafe text responses, STAC drives agents into performing harmful tool calls.

This paper positions contextual blindness as the most exploitable weakness in contemporary LLMs, rendering existing safety mechanisms inadequate against determined adversaries.

Authors: @drjingjing2026, Jianfeng He, Chao Shang, Devang Kulshreshtha, Xun Xian, Yi Zhang, Hang Su, Sandesh Swamy, @Qdatalab - @AWSAI, @UCBerkeley

#AISecurity #LLMSecurity #AgentSecurity #PromptInjection #ToolSecurity #ToolChaining #AIJailbreak #AdversarialML #RedTeaming #GenAI #Cybersecurity #MLSafety #STAC

2

46

5

37

2K

Who to follow

Chenlin Meng

@chenlin_meng

Co-founder & CTO @pika_labs | ex @StanfordAILab @Stanford | Hiring: Head of Engineering and Senior Engineer

Huan Sun

@hhsun1

Prof. @OhioState, endowed CoE Innovation Scholar, advancing the capability and safety/security of LLM-based agents, understanding transformers' limitations

Marinka Zitnik

@marinkazitnik

Associate Professor at Harvard | @Harvard @KempnerInst @broadinstitute | @ProjectTDC @AI_for_Science @ScientistTools | https://t.co/AHessmLLO9

Dr. Yanjun Qi @Qdatalab

12 months ago

Don't miss out on this excellent development! Check it out now! GitHub: https://t.co/9BPlKjPHYD #CodeSharing #TurboFuzzLLM

0

1

0

51

Dr. Yanjun Qi @Qdatalab

12 months ago

🚀 Exciting Code Release Alert! 🚀 GitHub: https://t.co/DJytC3iOaF Get ready to explore the latest code sharing with TurboFuzzLLM! 🌟 -- BEST template based LLM jailbreaking method! https://t.co/9BPlKjPHYD

1

0

90

Dr. Yanjun Qi @Qdatalab

12 months ago

Taking GPTFuzz to the next level, we introduce TurboFuzzLLM, a significantly improved and more efficient method. (Source: https://t.co/3lU5ZWSKw7) -- 3x reduction in queries while generating 2x more jailbreaking templates automatically

Qdatalab's tweet photo. Taking GPTFuzz to the next level, we introduce TurboFuzzLLM, a significantly improved and more efficient method. (Source: https://t.co/3lU5ZWSKw7) -- 3x reduction in queries while generating 2x more jailbreaking templates automatically https://t.co/j25h8U8Zyj

1

0

64

Qdatalab retweeted

Awesome Machine Learning Repositories @MLRepositories

almost 3 years ago

TextAttack: TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://t.co/044cbt6Gj0 Lang: Python ⭐️ 2401 Author: @Qdatalab #MachineLearning https://t.co/ioSDMvVH76

0

10

4

3

893

Dr. Yanjun Qi @Qdatalab

almost 4 years ago

Congratulations to the wonderful work, Zeming!

Zeming Lin

@ebetica

almost 4 years ago

We find that ESMFold does much better than Alphafold2 when both are given just single sequence input. This has implications for de novo design and metagenomic sequences without any homologous sequences.

ebetica's tweet photo. We find that ESMFold does much better than Alphafold2 when both are given just single sequence input. This has implications for de novo design and metagenomic sequences without any homologous sequences. https://t.co/b9cez0R1e4

1

13

0

1

0

1

0

Qdatalab retweeted

elvis

@omarsar0

almost 4 years ago

ML Course Notes (3000⭐️) ICYMI, this repo provides detailed notes on deep learning topics. I'll be releasing the first set of notes on Deep Learning for NLP this coming month. Stay tuned! https://t.co/f3G6ARdH11

omarsar0's tweet photo. ML Course Notes (3000⭐️)

ICYMI, this repo provides detailed notes on deep learning topics.

I'll be releasing the first set of notes on Deep Learning for NLP this coming month. Stay tuned!

https://t.co/f3G6ARdH11 https://t.co/nSZeFJzX39

14

1K

281

637

0

Qdatalab retweeted

Yann LeCun

@ylecun

almost 4 years ago

Reasoning as energy minimization! One of the key points of my recent position paper on autonomous machine intelligence https://t.co/EmT1On8Y9I (and of my 2006 "tutorial on energy-based learning").

2

152

34

62

0

Qdatalab retweeted

Jian Ma

@jmuiuc

almost 4 years ago

On the door of my 10-year-old’s room at home. She made this

0

64

3

0

Qdatalab retweeted

elvis

@omarsar0

almost 4 years ago

MLOps Primer If you are curious about MLOPs and why it matters in designing ML systems, I've put together a collection of my favorite references. Check it out: https://t.co/YrOkyTVxTg

omarsar0's tweet photo. MLOps Primer

If you are curious about MLOPs and why it matters in designing ML systems, I've put together a collection of my favorite references.

Check it out: https://t.co/YrOkyTVxTg https://t.co/qATJK2tVv5

3

686

125

271

0

Qdatalab retweeted

Sahil Bloom

@SahilBloom

almost 4 years ago

Lie: The world is a zero-sum game. If it bothers you to see other people succeed, you’re definitely not gonna make it. Distance yourself from anyone who spends time bringing others down. Celebrate everyone’s wins and you’ll start winning more. A rising tide lifts all boats.

20

4K

498

77

0

Qdatalab retweeted

Ishwariya Venkatesh

@Ishwariya13

almost 4 years ago

I remember breaking down to my Ph.D. advisor about how stupid I felt while troubleshooting a problem in my project when she handed me this paper. Years later it is still relevant to young students starting off in Science. It's ok to feel stupid, we all do on a regular basis.

Ishwariya13's tweet photo. I remember breaking down to my Ph.D. advisor about how stupid I felt while troubleshooting a problem in my project when she handed me this paper. Years later it is still relevant to young students starting off in Science. It's ok to feel stupid, we all do on a regular basis. https://t.co/FZAxfJcRJm

110

12K

3K

0

Qdatalab retweeted

Jascha Sohl-Dickstein

@jaschasd

almost 4 years ago

After 2 years of work by 442 contributors across 132 institutions, I am thrilled to announce that the https://t.co/wezEGzDEHt paper is now live: https://t.co/4Yg36EB9Ru. BIG-bench consists of 204 diverse tasks to measure and extrapolate the capabilities of large language models.

jaschasd's tweet photo. After 2 years of work by 442 contributors across 132 institutions, I am thrilled to announce that the https://t.co/wezEGzDEHt paper is now live: https://t.co/4Yg36EB9Ru. BIG-bench consists of 204 diverse tasks to measure and extrapolate the capabilities of large language models. https://t.co/h3vKFWhNZc

34

2K

541

507

0

Dr. Yanjun Qi @Qdatalab

about 4 years ago

Ten years ago, after reading the sandy hook tragedy, I cries for days . This time, could not even read the news on Uvalde tragedy. Just a peek of titles made me into tears. WHY,WHY, after 10years, this type of tragedy happened again?

0

2

0

Qdatalab retweeted

Jian Ma

@jmuiuc

over 4 years ago

The Fence @CarnegieMellon

1

153

15

0

Dr. Yanjun Qi

@Qdatalab

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users