This page overstates the issues. Almost all the initial GCD design is still good, we just ended up with a lot more smaller low-core and non-SMP processors than expected.
Also most of the performance issues are with excess parallelism (runnable threads), not concurrency (hiding IO latencies).