O'Reilly - Debugging Apache Spark by Holden Karau | Released November 2018 | ISBN: 9781492039167...
Apache Spark is an extremely powerful general-purpose distributed system that also happens to be extremely difficult to debug. This video, designed for intermediate-level Spark developers and data scientists, looks at some of the most common (and baffling) ways Spark can explode (e.g., out-of-memory exceptions, unbalanced partitioning, strange...