Understanding Spark on YARN: Resource Management and Communication
Original URL: https://medium.com/@wx.london.cun/spark-on-yarn-f74e82ab6070
Article Written: January 21, 2017
Added: November 23, 2025
Type: tech2
Summary
This article explores how Apache Spark interacts with YARN for resource management in a cluster environment. It details the roles of three YARN components (the ResourceManager, the ApplicationMaster, and the NodeManager) and explains how they communicate while a Spark application runs. The author also discusses common exceptions encountered when running Spark on YARN and stresses that understanding these interactions is important for effective troubleshooting. The article serves as a guide for advanced users looking to optimize Spark applications on YARN.