Blog post with code to generate real time policy and value function plots while actor critic RL agent is interacting with environment: https://t.co/3UNH3pvJHT
We put together resources to get started in #reinforcementlearning at https://t.co/odeynrhz74, including observations from our own experience with them in the @OpenAI Scholars 2020 program. May this boost the sample efficiency of other learners! 💪
Write up of an old idea for reducing test counts needed to screen for disease: Pooling of samples. Written up here to promote what might be a good idea in some circumstances.
https://t.co/3fQwNieZlM
New post: "Universal limiting mean return of CPPI investment portfolios"
This is a sort-of central-limit-theorem like result that holds in some cases for investment portfolios subject to portfolio insurance. See below.
https://t.co/JiY6MNJYkP