Reinforcement Learning For Order Distribution In Self-Organizing Logistics