Performance-Robust, Non-Blocking, Data-Driven Barrier Synchronization for Multicore, Multithreaded Parallel Algorithms