Some of the most brilliant minds I know are building one benchmark after another instead of finding more principled ways of understanding behaviours. Is this what science has come to?
Totally agree. Application design is primordial in LLM/agentic security because of how hard it is to actually make the model itself safe.
By the way, isn’t that documented as “the golden rule” in the Nvidia Guardrails’ documentation’s LLM security guidelines @rharang ?
One of the best examples of unintentional data poisoning: replicating a common pattern of public code that was used as training data for an LLM.
Workarounds are everywhere because they are often fast solutions. Actual fixes can often take longer and thus may be less frequent
Does anyone know companies hiring for entry level roles (in Canada/remote)? And I mean *real* entry level, not degree + 2 certs + 3 years experience “entry level”.
Not just cybersecurity, any entry level roles at all, in any area.
Dropbox is looking for a senior ML engineer to join our threat intelligence and product trust & safety team, link in reply. DM me if you want to know more
Attacks such as this highlight the importance of system-level security analysis at all
stages of model deployment, starting as the design of the architecture, and extending towards as late
as the actual deployment of the model and how different user queries are batched together.
Folks, our ML Security & Privacy team at DeepMind is looking for a Student Researcher to start in March in London! What will the student do? Hack and fix LLMs.
Please reach out to me if you have any questions or find me at neurips.
I strongly recommend turning this off.
It's unbelievable that they quietly enabled this while everyone was focused on their 'Recall' AI feature. Now they're collecting and using everyone's Microsoft Word and Excel data to train their AI models.
I'm a PhD student at @UniofOxford and I think I'm living in a fairytale :-)
Foxes playing around in the snow at Magdalen College this morning — absolutely magical!