A study on the reward generation method to be used in reinforcement learning to reduce the peak load | IEEE Conference Publication | IEEE Xplore