Coordinated energy management for a cluster of buildings through deep reinforcement learning