A Reinforced Learning (RL) algorithm for optimal control of HVAC: Comparison with a standard PI control for the supermarket DOE reference building