In 2024, Gavin Newsom called on the California legislature to pass laws restricting the use of A.I. to parody politicians. Thankfully, @ADFLegal teamed up with us at @TheBabylonBee to file a lawsuit against these laws. They were ruled unconstitutional earlier this year.
Now, Gavin is parodying politicians with A.I. You're welcome, Governor!
Last week I presented 2 EMNLP papers (virtually). Here are the recordings and official publications:
RedHerring Attack
Paper: https://t.co/FBV2Z2Ucb2
Video: https://t.co/gKhTahBDxi
Hybrid Select
Paper: https://t.co/fzA8CXkpN4
Video: https://t.co/iyQD03GDWe
Please enjoy!
Hybrid/Dynamic Select
We propose two new methods to improve efficiency of adversarial attacks. We find both strongly reduce the number of required model queries for successful black-box attacks. We verify the methods on both encoder and LLM models.
Arxiv: https://t.co/EQOW0hY3qj
Grateful to be a part of two research papers being virtually presented at EMNLP 2025:
1. "RedHerring Attack: Testing the Reliability of Attack Detection"
2. "Overcoming Black-box Attack Inefficiency with Hybrid and Dynamic Select Algorithms"
w/ A. Belde and R. Ramkumar
RedHerring Attack
We propose/test a novel threat model and attack which causes sows distrust between humans and attack detection models. We expose vulnerabilities in current detection methods as well as investigate defenses to respond.
Arxiv: https://t.co/ErbLMrUxoJ
There should be social consequences for people who openly celebrate the murder of an innocent man. But there obviously shouldn’t be any legal repercussions for “hate speech,” which is not even a valid or coherent concept. There is no law against saying hateful things, and there shouldn’t be.
Just use Brave. 🦁
Brave's built-in protections block ads and trackers across the Web.
Plus, you can enable uBlock Origin directly through our browser (Settings > Extensions > Manifest v2 extensions).
We propose and explore a black-box attack selection method which aims to be more efficient in finding which words to modify compared to Greedy select (and all its variations [e.g. Importance Score]). We find a good tradeoff between reduction in queries and attack effectiveness.
📣Excited to announce a paper which will be presented (virtually) at COLING 2025:
"BinarySelect to Improve Accessibility of Black-Box Attack Research"
W/ Shatarupa Ghosh (https://t.co/TjAdDPJT08)
Arxiv:
https://t.co/bBqZn6aUse
We propose and test a new attack which takes advantage of text classifiers' inability to process text vertically. The attack finds high weighted words and rewrites them vertically. This messes up the processing of a classifier, but allows a human to read the obfuscated message.
🎉Announcing a new paper which was accepted at NAACL 2024 main conference:
"VertAttack: Taking advantage of Text Classifiers' horizontal vision"
Arxiv: https://t.co/hkThwmQ3WX