The Oozie Orchestration Framework
Prerequisites: Working with Oozie requires some basic knowledge of the Hadoop eco-system and of running MapReduce jobs.
Taught by a team that includes two Stanford-educated ex-Googlers and two ex-Flipkart Lead Analysts. This team has decades of practical experience in working with large-scale data processing jobs.
Oozie is like the formidable, yet super-efficient admin assistant who can get things done for you, if you know how to ask.
Let's parse that.
Formidable, yet super-efficient: Oozie is formidable because everything you ask of it is specified in XML, which is hard to debug when things go wrong. However, once you've figured out how to work with it, it's like magic. Complex dependencies, a multitude of jobs running on different schedules, and entire data pipelines are all made easy with Oozie.
Gets things done for you: Oozie allows you to manage Hadoop jobs as well as Java programs, scripts, and any other executable with the same basic setup. It manages your dependencies cleanly and logically.
Workflow Management: Workflow specifications, Action nodes, Control nodes, Global configuration, real examples with MapReduce and Shell actions which you can run and tweak
Time-based and data-based triggers for Workflows: Coordinator specification, Mimicking simple cron jobs, specifying the time and data availability triggers for Workflows, dealing with the backlog, running time-triggered and data-triggered coordinator actions
Data Pipelines using Bundles: Bundle specification, the kick-off time for bundles, running a bundle on Oozie
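To give a feel for what a Workflow specification looks like, here is a minimal sketch in Python that generates the XML for a Workflow with one Shell action plus the usual start, kill, and end control nodes. It is an illustration under stated assumptions, not material from the course: the function name build_workflow, the node names (shell-node, fail, end), and the schema versions (workflow 0.5, shell-action 0.2) are choices made for this example, and ${jobTracker} and ${nameNode} are placeholders that would normally be resolved from a job.properties file.

```python
import xml.etree.ElementTree as ET

# Schema versions are assumptions for this sketch; match them to your Oozie install.
WF_NS = "uri:oozie:workflow:0.5"
SHELL_NS = "uri:oozie:shell-action:0.2"


def build_workflow(app_name, command, argument):
    """Return workflow XML with a single Shell action (hypothetical helper)."""
    app = ET.Element("workflow-app", {"name": app_name, "xmlns": WF_NS})

    # Control node: where execution begins.
    ET.SubElement(app, "start", {"to": "shell-node"})

    # Action node: runs a shell command, then follows ok or error.
    action = ET.SubElement(app, "action", {"name": "shell-node"})
    shell = ET.SubElement(action, "shell", {"xmlns": SHELL_NS})
    ET.SubElement(shell, "job-tracker").text = "${jobTracker}"  # placeholder
    ET.SubElement(shell, "name-node").text = "${nameNode}"      # placeholder
    ET.SubElement(shell, "exec").text = command
    ET.SubElement(shell, "argument").text = argument
    ET.SubElement(action, "ok", {"to": "end"})
    ET.SubElement(action, "error", {"to": "fail"})

    # Control nodes: failure and success exits.
    kill = ET.SubElement(app, "kill", {"name": "fail"})
    ET.SubElement(kill, "message").text = "Shell action failed"
    ET.SubElement(app, "end", {"name": "end"})

    return ET.tostring(app, encoding="unicode")


if __name__ == "__main__":
    print(build_workflow("demo-wf", "echo", "hello"))
```

Saving the printed XML as workflow.xml gives a starting point to tweak; a Coordinator or Bundle would wrap a Workflow like this one rather than replace it.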
Using discussion forums
Please use the discussion forums on this course to engage with other students and to help each other out. Unfortunately, much as we would like to, it is not possible for us at Loonycorn to respond to individual questions from students.
We're super small and self-funded with only 2 people developing technical video content. Our mission is to make high-quality courses available at super low prices.
The only way to keep our prices this low is to *NOT offer additional technical support over email or in person*. The truth is, direct support is hugely expensive and just does not scale.
We understand that this is not ideal and that a lot of students might benefit from this additional support. Hiring resources for additional support would make our offering much more expensive, thus defeating our original purpose.
It is a hard trade-off.
Thank you for your patience and understanding!
Who is the target audience?
Yep! Engineers, analysts, and sysadmins who are interested in big data processing on Hadoop
Nope! Beginners who have no knowledge of the Hadoop eco-system
Students should have basic knowledge of the Hadoop eco-system and should be able to run MapReduce jobs on Hadoop
What will you learn?
Install and set up Oozie
Configure Workflows to run jobs on Hadoop
Configure time-triggered and data-triggered Workflows