Hadoop YARN
YARN is a general-purpose, open-source cluster manager that was introduced with Hadoop MapReduce version 2.0, where the term "YARN" also became synonymous with this updated version of the Hadoop framework. YARN separates cluster resource management from job execution logic, making it a more flexible and extensible platform for running various distributed computing engines.

Key Features and Architecture
YARN allows not only MapReduce jobs but also other compute engines such as Apache Spark, Apache Tez, and more to run on the same cluster infrastructure. This unified cluster management is made possible by its modular architecture.
Core Components of YARN:
Acts like the master node (similar to Mesos master). It manages and schedules the resources of the entire cluster by receiving resource reports from all NodeManagers.
Acts like the worker node (similar to Mesos slave). It manages the resources of a single machine and reports them to the ResourceManager. It also oversees the execution of tasks in containers on that node.
The ResourceManager consolidates resource information from all NodeManagers and allocates those resources to different applications. In essence, it functions as a global scheduler for the cluster.
YARN Application Components
A distributed application running on YARN typically consists of the following three components:
Advantages of Using YARN
Statlearner
Statlearner