Scool

@InriaScool

Scool is a #MachineLearning research team in @Inria & CRIStAL interested in designing algorithms that learn & adapt on-the-go. It is the new avatar of "SequeL".

Lille, France

Joined November 2020

104 Following

347 Followers

95 Posts

Scool @InriaScool

almost 2 years ago

4/n Alena, Thomas, Phillipe & Bruno, motivated by the uniqueness ambiguity of the value func as a solution to the HJB eqn in CTRL, propose to approximate the value func by training a PINN through a specific iterative scheduling process that constrains it to converge to the viscosity solution.

Scool @InriaScool

almost 2 years ago

1/n Today, we concluded @icmlconf with 4 presentations at the #FORLAC workshop conjoining RL theory and Control. Following their UAI work, @tuanquangdam, Odalric & Emilie presented their work to address biased value function estimation in #MCTS using power means. #ICML2024 @Inria_Lille

Scool @InriaScool

almost 2 years ago

3/n MCTS with deep NN shows promising performance in deterministic envs, but fails in stochastic envs. @tuanquangdam, Odalric & Brahim propose CATS & PATS, leveraging TS to handle selection randomness. They achieve regret guarantees as well as good performance in stochastic envs.

Scool @InriaScool

almost 2 years ago

Congratulations to @tuanquangdam, Odalric, and Emilie on their work to address biased value function estimation in #MCTS using power means. 🥳 #UAI2024 @Inria_Lille @RechercheUlille

almost 2 years ago

#UAI2024 Interested in how power mean can enhance value function estimation in tree search methods? Learn about our approach to solving biases in MCTS for stochastic settings. Join us tomorrow at 4:30 PM in the Exhibition room, building 20. Paper: https://t.co/ZH8M3glKsT

Scool @InriaScool

almost 2 years ago

We've derived tight lower and upper bounds for differentially private finite-armed & linear bandits, while we lack the same for contextual bandits. At #COLT2024, @achraf_azize presents open problems in contextual bandits with privacy. @BasuDebabrota @Inria_Lille @RechercheUlille

InriaScool retweeted

Debabrota Basu @BasuDebabrota

about 2 years ago

It's fun to revisit the sanctum sanctorum: how does a brain learn? Today at Convention on Mathematics of #Neuroscience & #AI, @GuillaumeAP presents our work with @AdityaGilra on how to design a bio-plausible learning rule rather than backprop type methods to learn a time series.

Scool @InriaScool

about 2 years ago

The submission link is https://t.co/HZFz1WLQe0. Contact @kohler_hector and the organisers if you have any query.

Scool @InriaScool

about 2 years ago

We are glad to announce the 1st edition of Workshop on Interpretable Policies in Reinforcement Learning (InterpPol) @RL_Conference. Plz submit your original/published papers on Interpretable/Explainable RL, Policy Distillation, Formal Verification & RL. 👉https://t.co/G2QyW2XtlG

Scool @InriaScool

about 2 years ago

2/2 If each arm has multiple objectives, how do we identify an arm whose mean vector is not worse than any of the others? Tomorrow @aistats_conf, Emilie & Cyrille will present the "first" algo to detect such Pareto sets with finite budget & bandit feedback. @RechercheUlille @Inria_Lille

Scool @InriaScool

about 2 years ago

1/2 Is exploration harder if we have constraints on policies? No, it depends on how the constraints change the geometry of the alternating set. Today @aistats_conf, @BasuDebabrota & collaborators present insights & algorithms for pure exploration with constraints. #AISTATS2024 @chalmersuniv

Scool @InriaScool

about 2 years ago

2/2 Today @satml_conf, @achraf_azize is presenting these nuances of "Concentrated DP for Bandits" along with information-theoretic lower bounds and near-optimal algorithms for linear, contextual, and multi-armed bandits. #privacy #bandits @Inria_Lille @RechercheUlille #SaTML24

Scool @InriaScool

about 2 years ago

1/2 To define privacy in bandits, we have to ask what are the input and output of a bandit algorithm? What differs if the adversary is interactive or passive? @achraf_azize & @BasuDebabrota address these in their work https://t.co/z93Cd3R1G0.

Scool @InriaScool

over 2 years ago

In @TmlrOrg, we address the corrupted bandit problem, i.e. a stochastic multi-armed bandit problem with unknown reward distributions, which are heavy-tailed and corrupted by a history-independent adversary or Nature. We provide another set of lower bounds and an algorithm. #robustness

Scool @InriaScool

over 2 years ago

What happens in a bandit problem if an epsilon fraction of the feedback is arbitrarily corrupted? What are the new lower bounds on the regret? Can we design an optimal algorithm for #Bandits_corrupted_by_nature? We address this question in two parts.

Scool @InriaScool

over 2 years ago

Today at #ALT2024, @ShubhadaAgrawal presents CRIMED: a joint work with Timothée, @BasuDebabrota & Odalric. CRIMED achieves a matching regret upper bound for symmetric distributions and unbounded corruption. #bandits #corrupted_observations @univ_lille @Inria_Lille

Scool @InriaScool

over 2 years ago

Congratulations to Émilie for the well-deserved achievement! 😊🥳

Centre Inria de l'Université de Lille @Inria_Lille

over 2 years ago

Congratulations 👏 to our colleague Émilie Kaufmann, a member of the @InriaScool team. A well-deserved bronze medal 🙂 #MachineLearning

Scool @InriaScool

over 2 years ago

Today @NeurIPSConf, visit the #WANT workshop to know more about tools and algorithms to make deep network training computationally friendly and resource efficient. #NeurIPS2023

Scool @InriaScool

almost 3 years ago

@InriaScool's Alena Shilova with a team from @nvidia @Inria & @ufrj is organising the #WANT workshop @NeurIPSConf. If interested in tools & algorithms to make training computationally efficient & scalable with optimal resource utilisation, visit https://t.co/S9aTClu5YU #HPC #NeurIPS23

Scool @InriaScool

over 2 years ago

What happens if you have multiple objectives/rewards for each arm? How can you find the Pareto set with bandits? At 5PM @NeurIPSConf, Cyrille will present adaptive & sequential sampling to identify the Pareto set (or a relaxed Pareto set) of multivariate distributions. #NeurIPS23 #Bandit
