Closed-loop dynamic control of a soft manipulator using deep reinforcement learning