Really grateful to have 7 papers accepted at @icmlconf 2026, including 2 spotlights!
Massive thanks to all my collaborators—I’ve been lucky to work with such brilliant people
See you all in Seoul?…it feels surreal saying that while still in Rio for ICLR 😄 #ICML2026
Barcelona’s wondrous church La Sagrada Familia came one step closer to a long-sought completion this week! Gaudi’s architectural marvel got a blessing from Pope Leo XIV to inaugurate its latest, and tallest, tower.
In film, "we'll fix it in post" is what you say when something went wrong on set and you don't want to redo it. AI research has made it our entire methodology: train the model, then patch whatever comes out. Our new ICML oral argues this can't be the basis of a science of AI. 🧵
[1/8]
🐶 AgentDoG 1.5: a diagnostic guardrail for AI agents.
Most safety models say "unsafe." AgentDoG names which step, what risk, what harm.
4B hits 55.2% on fine-grained diagnosis. GPT-5.4: 25.8%.
📄 https://t.co/oOGDYfEHjV
💻 https://t.co/wmi1DUtyV7
#AISafety#Agent
We're Neo Research (新衡). Asia’s first independent frontier AI safety evaluation & research lab.
Today we're publishing our first report: an independent safety evaluation of DeepSeek v4 Pro. (1/5)
Artificial intelligences do not undergo experiences, do not possess a body, do not feel joy or pain, do not mature through relationships, and do not know from within what love, work, friendship or responsibility mean. Nor do they have a moral conscience, since they do not judge good and evil, grasp the ultimate meaning of situations, or bear responsibility for consequences. They may imitate or even simulate, but they do not understand what they produce, for they lack the affective, relational, and spiritual perspective through which human beings grow in wisdom. #MagnificaHumanitas
Suppose evaluations were sufficient for deployment: What’d happens once models become continuous learners?
Simple: every update would require new evaluation and your pre deployment evals would expire at step 1!
Checkout our new paper lead by @LPacchiardi
🚨 New paper: AI evaluation is structurally unsuitable for continual learning (CL). To address this, evaluation should be centred on the "behavioural trajectories" that CL systems develop, with the goals of characterising possible behaviours and forecasting their evolution. 🧵
@DakingRai@ZiyuYao@megamor2 Our works been public since Sep'25 and accepted at ICML, it's great to see this used to refine tasks. raw single-instance gradients are noisy, using an un-smoothed s exposes your clusters to noise. BoN would directly stabilise DCD. Looking forward to the update and discussion! 4
@DakingRai@ZiyuYao@megamor2 Also, evaluating specialised subsets hits the exact instance-level evaluation bottlenecks we solved which is why DCD’s CMD′ metric (clipping over-recovery to zero) exactly mirrors the bounding principles of our introduced NDF metric.
(3/)