Understanding Spark on YARN: Resource Management and Communication

Understanding Spark on YARN: Resource Management and Communication

Understanding Spark on YARN: Resource Management and Communication

Original URL: https://medium.com/@wx.london.cun/spark-on-yarn-f74e82ab6070

Article Written: January 21, 2017

Added: November 23, 2025

Type: tech2

Summary

This article explores how Apache Spark interacts with YARN for resource management in a cluster environment. It details the roles of YARN's components: Resource Manager, Application Master, and Node Manager, and explains the communication process during Spark application execution. The author discusses common exceptions encountered when running Spark on YARN, emphasizing the importance of understanding these interactions for effective troubleshooting. The article serves as a guide for advanced users looking to optimize Spark applications on YARN.

Technologies Referenced