The Complete Apache Oozie Tutorial
This Apache Oozie tutorial, created by a Stanford alumni team, teaches you how to work with Coordinators, Bundles and Workflows in Oozie, using real-world examples to schedule Hadoop jobs.
Apache Oozie, a workflow scheduler system for Apache Hadoop, makes it easy to work with complex dependencies, run a multitude of jobs on different schedules, and manage end-to-end data pipelines. It is sometimes considered formidable, because Oozie workflows are defined entirely in XML and are challenging to debug when things go wrong. But once you have figured out how it works, it's a piece of cake. Oozie lets you manage Hadoop jobs, Java programs, scripts, and other executables with the same basic setup. It facilitates clean and logical management of dependencies. The key to mastering Oozie is knowing the right configuration parameters to get the job done.
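To give a sense of what an Oozie workflow definition looks like, here is a minimal sketch of a workflow.xml with a single shell action. The names example-wf and shell-node are hypothetical, the schema versions are assumptions that depend on your Oozie release, and ${jobTracker} and ${nameNode} are assumed to be properties supplied at submission time (for example in a job.properties file).

```xml
<!-- Minimal illustrative workflow: start -> one shell action -> end (or kill on error). -->
<!-- Names and schema versions are assumptions; adjust them to your Oozie installation. -->
<workflow-app xmlns="uri:oozie:workflow:0.5" name="example-wf">
    <start to="shell-node"/>

    <action name="shell-node">
        <shell xmlns="uri:oozie:shell-action:0.2">
            <!-- Both values are expected to come from the job's properties. -->
            <job-tracker>${jobTracker}</job-tracker>
            <name-node>${nameNode}</name-node>
            <exec>echo</exec>
            <argument>hello</argument>
        </shell>
        <!-- Each action declares where control goes on success and on failure. -->
        <ok to="end"/>
        <error to="fail"/>
    </action>

    <kill name="fail">
        <message>Action failed, error message[${wf:errorMessage(wf:lastErrorNode())}]</message>
    </kill>

    <end name="end"/>
</workflow-app>
```

Every action names its ok and error transitions explicitly, which is how dependencies between steps are expressed in a workflow.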
This Apache Oozie tutorial broadly covers Workflow Management, Time-based and Data-based triggers for Workflows, and Data Pipelines using Bundles.
By the end of this Apache Oozie tutorial you will be able to:
- Install and set up Oozie on your system
- Configure workflows to run jobs on Hadoop
- Configure data-triggered and time-triggered workflows
- Use Bundles to configure data pipelines
Prerequisites and Target Audience
This Apache Oozie tutorial requires basic knowledge of the Hadoop ecosystem. You should also know how to run MapReduce jobs on Hadoop.