Abstract: This paper aims to explore a new hybrid algorithm that combines the advantages of Q-learning and Deep Deterministic Policy Gradient (Deep Deterministic Policy Gradient, DDPG) algorithms to ...