Improving Deep Deterministic Policy Gradient For Sparse Reward And Goal-Conditioned Continuous Control