In case you're planning to use MTurk for any NLP papers, hurry up:
"After careful consideration, we have made the decision to close new customer access to AWS Mechanical Turk, effective 7/30/26."
The proceedings of the (2^6)th Annual Meeting of the Association for Computational Linguistics are now available in the ACL Anthology.
https://t.co/sjRDEzn0cq
STEB: Style Text Embedding Benchmark
@rafaelrivera01 et al. introduce a benchmark for style embeddings across 96 datasets, finding semantic embeddings underperform on stylistic tasks with no single model dominating.
📝 https://t.co/aBqSuuF98A
👨🏽💻 https://t.co/zW0DeCpAZ1
Claude Fable 5 will be available again globally tomorrow.
After a series of productive conversations with the US government, we're redeploying the model with a new set of classifiers to target and block more cybersecurity tasks. In the near term, some routine tasks like coding and debugging will fall back to Opus 4.8. We’ll continue to refine these classifiers over the coming weeks to reduce false positives and better distinguish genuine misuse from legitimate requests.
We’ve also begun drafting a consensus framework—with Amazon, Microsoft, Google, and other Glasswing partners—for assessing the severity of AI jailbreaks and how AI developers should respond to them. We invite other industry partners and model providers to join us in this effort.
Finally, we’re scaling up our collaboration with the US government on model testing and safeguards. This will include pre-release access to models and safeguards for evaluation, information sharing on jailbreaks and misuse, and dedicated resources for joint research.
Thank you to our users for your patience, and to our partners across the government, industry, and the research community who worked alongside us to make Fable 5 available again.
Read our full blog: https://t.co/VHyum831ri
Introducing Claude Sonnet 5, our most agentic Sonnet yet.
It makes plans, uses tools like browsers and terminals, and runs autonomously at a level that just a few months ago required larger and more expensive models.
📣 #EACL2027 updates: the Call for Papers is live, and our keynote speakers are confirmed!
Main conference: 9–14 March 2027. Special theme: The Human in Language. 🧵👇
https://t.co/zVy6509Pos
@eaclmeeting "author response (14–19 Sept) & author–reviewer discussion (20–24 Sept) are now two separate stages."
Interesting change. IMO it's good. More time and having 2 stages facilitates having a proper discussion.
If you use LLM-as-judge, this one is worth reading.
(bookmark it)
It's actually one of the most effective ways to use LLM-as-a-Judge for evals.
Holistic judge scores hide both their reasoning and their ceiling effects.
BINEVAL decomposes each evaluation criterion into atomic yes-or-no questions, answers each independently per output, then aggregates the verdicts into calibrated multi-dimensional scores.
Every question-level verdict is inspectable, so you can diagnose exactly why an output scored low, and the same verdicts feed straight back as targeted prompt-improvement signal.
Across SummEval, Topical-Chat, and QAGS, it matches or beats UniEval and G-Eval, training-free, with especially strong results on factual consistency.
Paper: https://t.co/oar6BZcasm
Learn to build effective AI agents in our academy: https://t.co/1e8RZKs4uX
Looking for a fun weekend read?
Introducing the Illustrated ICML 🌎
We indexed all 6000+ ICML papers and built a visual way to explore the whole landscape
Search your favorite topics, open a cluster, and dive right in
Inspired by @JayAlammar
🚨 ARR is recruiting extra emergency reviewers and emergency Area Chairs (ACs) for this cycle.
Emergency Reviewer Registration:
https://t.co/DtVZqCHm1o
Emergency AC Registration:
https://t.co/jKxtqzvYbX
Thank you for helping support the ARR review process.
#ARR#ACL#NLProc
🔊CFP for Special Issue on the Ethics of NLP and CL in Computational Linguistics:
https://t.co/gV4Ut9dGVW
⏰Deadline: 27 November, 2026
#CLjournal#NLP#NLproc
@ReviewAcl - Limit the number of submissions for authors qualified to review to 5.
- Require each author who is qualified to review to review three times the number of their submitted papers.
- Limit the number of submission for each author not qualified to review to 2.
Absolutely insane graph here. I’m sorry, if you wrote 40 papers for ACL, no you didn’t. @CVPR put a cap and a bunch of other reforms to try to head off the paper-pocalypse. ACL was caught flat-footed and now it sounds like they’re getting absolutely dogpiled by angry reviewers.
📢 ARR May 2026 cycle update
We have published a blog post about the increased review assignments in the May cycle, what happened, and what comes next.
Please read the full post here:
https://t.co/55D2Og9mz3
#ARR#ACL#NLProc