Probabilistic models for data efficient reinforcement learning