Long-term planning is one of the hardest tasks in RL. As a hobby, I am experimenting with Sokoban, and I've created an AI model that can plan up to 128 actions ahead. See the live demo here: https://t.co/Zl0eX6tVta #reinforcementlearning #AI #hrl #algorithmresearch