4 Big Data Riddles: The Straggler, the Slacker, the Fatso, and the Heckler
This article discusses four bottlenecks in big data applications and introduces a number of tools, some of them new, for identifying and removing these bottlenecks. They can occur in any framework, but particular emphasis is given to Apache Spark and PySpark.