Rittman Mead
  • Services
  • Training
  • Products
  • Case Studies
  • About
Subscribe
Tagged

Tez

A collection of 2 posts

Technical

OBIEE and ODI on Hadoop : Next-Generation Initiatives To Improve Hive Performance

The other week I posted a three-part series (part 1, part 2 and part 3) on going beyond MapReduce for Hadoop-based ETL, where I looked at a typical Apache Pig dataflow-style ETL process and showed how Apache Tez and Apache Spark can potentially make these processes run faster and make

Mark Rittman Dec 18, 2014 • 8 min read
Technical

Going Beyond MapReduce for Hadoop ETL Pt.2 : Introducing Apache YARN and Apache Tez

In the first post in this three part series on going beyond MapReduce for Hadoop ETL, I looked at how a typical Apache Pig script gets compiled into a series of MapReduce jobs, and those MapReduce jobs pass data between themselves by writing intermediate resultsets to disk (HDFS, the Hadoop

Mark Rittman Dec 8, 2014 • 5 min read
Rittman Mead © 2022
Powered by Ghost