The elevation of Apache Tez to a top-level project highlights the impending battle of succession (“A Game of YARNs”??) as the Map Reduce analytic framework is overtaken. The battle for hearts and minds among the Big Data ecosystem for multi-workload, multi-data engine application platforms is clearly on: will it be reliance on YARN (now withTez) or Spark ?
Apache™ Tez is an extensible framework for building YARN based, high performance batch and interactive data processing applications in Hadoop that need to handle TB to PB scale datasets. It allows projects in the Hadoop ecosystem, such as Apache Hive and Apache Pig, as well as 3rd-party software vendors to express fit-to-purpose data processing applications in a way that meets their unique demands for fast response times and extreme throughput at petabyte scale. Apache Tez provides a developer API and framework to write native YARN applications that bridge the spectrum of interactive and batch workloads and is used with Apache Hive 0.13 as part of Hortonworks Data Platform.