Written form to keep for later: https://blog.rockthejvm.com/reading-query-plans/
Download Spark: https://spark.apache.org/downloads.html
This video is for Spark programmers who know the essentials (e.g. create a DataFrame, basic selects/joins) and wants a sneak peek at how Spark works internally and get an essential skill for performance analysis and improvement.
In this video I'll show you how Spark creates computation dependencies before it can run anything, and I'll show you how to read query plans and identify performance bottlenecks with shuffles.
Follow Rock the JVM on:
LinkedIn: https://linkedin.com/company/rockthejvm
Twitter: https://twitter.com/rockthejvm
Blog: https://rockthejvm.com/blog
-------------------------------------------------------------------------
Home: https://rockthejvm.com
-------------------------------------------------------------------------