This thesis has been realized for the obtention of the Master's in Mathematical Engineering at the Politecnico di Milano. The goal of this project was to apply some reinforcement learning techniques ...