Daniël Vos

@daniel_a_vos

PhD student in machine learning (decision tree whisperer) at @tudelft 👨‍🎓 and organizer for the TU Delft CTF Team 👨‍💻

Delft, Nederland

Joined January 2016

287 Following

124 Followers

23 Posts

Daniël Vos @daniel_a_vos

almost 2 years ago

@gabrielpeyre @YihongWu7 Classification and Regression Trees (CART) is a greedy heuristic that runs efficiently but does not offer a performance guarantee. (e.g. XOR-shaped data can still be problematic)

127

Daniël Vos @daniel_a_vos

almost 2 years ago

@gabrielpeyre @YihongWu7 Both the problems of finding the size-limited tree that minimizes loss and the smallest tree that achieves 0 loss are NP-complete. Dynamic programming can be used when the objective is separable w.r.t. the leaves, but this runs exponentially in tree size: https://t.co/O3GoetEdWw

136

Daniël Vos @daniel_a_vos

almost 4 years ago

@adad8m @_joaogui1 Yes, so input shape is 2 * p and output shape is p, in the figure p=97. I tried some other modular functions as well but I would have to do some digging to find my old code.

Daniël Vos @daniel_a_vos

almost 4 years ago

@adad8m @_joaogui1 Yes exactly! One-hot encoding and a 1 hidden layer MLP, trained with AdamW. The task used in the figures is modular addition. I wanted to see if I could get a minimal example where grokking occurs 🙂

Who to follow

Marcin Przewięźlikowski

@pszwnzl

PhD Student @ Jagiellonian University / GMUM. Working on meta-learning and self-supervised learning.

Oguzhan Ersoy

@oguzer90

Blockchain, AI and Applied Cryptography | Building Decentralized Compute @gensynai | CS PhD from TU Delft | EE & Math from Bogazici University

Dennis Frauen

@dennisfrauen

PhD student @ LMU Munich. Interested in causal inference & machine learning

Daniël Vos @daniel_a_vos

almost 4 years ago

@_joaogui1 When I replicated this work with 1 hidden layer ReLU networks it did seem like increasing width increased the sharpness of the grokking effect by a bit. (left: 128 neurons, right: 8192)

daniel_a_vos's tweet photo. @_joaogui1 When I replicated this work with 1 hidden layer ReLU networks it did seem like increasing width increased the sharpness of the grokking effect by a bit. (left: 128 neurons, right: 8192) https://t.co/7vm62qv4Qz

Daniël Vos @daniel_a_vos

about 4 years ago

@tverven @Hidde_Fokkema @RdeHeide Very interesting paper and I noticed it just while I was writing a section on the robustness of explanations! Great to see that the Dutch Railways were able to assist the paper with footnote 2 😄

Daniël Vos @daniel_a_vos

over 4 years ago

If you are interested in robust optimization, decision trees, adversarial examples or all of the above, then come talk to me at #AAAI2022! Our poster is featuring now and tonight starting at 17:45 GMT+1

daniel_a_vos's tweet photo. If you are interested in robust optimization, decision trees, adversarial examples or all of the above, then come talk to me at #AAAI2022!

Our poster is featuring now and tonight starting at 17:45 GMT+1 https://t.co/G3vrDJtzaj

Daniël Vos @daniel_a_vos

over 4 years ago

@HochreiterSepp It’s interesting that you observed this with SGD! I have been working on reproducing the paper’s results and have only been successful with AdamW. For AdamW I agree with @ykilcher ‘s intuition that weight decay gives a smooth function, I wonder what happens with ‘grokking’ SGD.

Daniël Vos @daniel_a_vos

over 4 years ago

References: Chen et al., 2019: https://t.co/7MkGIYMYxe TREANT: https://t.co/tjK8gjggEl GROOT: https://t.co/wR8CvN7GTr

Daniël Vos @daniel_a_vos

over 4 years ago

I'm proud to announce that my paper with @siccoverwer "Robust Optimal Classification Trees Against Adversarial Examples" has been accepted at the #AAAI2022 conference! 🎊 Paper: https://t.co/qlFCD2IjwP A thread with more details below 👇

Daniël Vos @daniel_a_vos

over 4 years ago

Please check our paper for many more results: https://t.co/qlFCD2IjwP And let's see if we can improve the efficiency of ROCT in the future to train deeper optimal trees. Soon I will explain more about the adversarial accuracy bound from the paper.

Daniël Vos @daniel_a_vos

almost 5 years ago

Excited to present the first paper for my PhD at #ICML2021, if you have any questions I would love to hear them at the virtual poster session (18:00-20:00 CEST)! #betterposter @analytics_cyber

Daniël Vos

@daniel_a_vos

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users