Talentcrowd operates as a digital talent platform — providing employers with pipelines of highly vetted senior-level technology talent and on-demand engineering resources. We're tech agnostic and cost-competitive.
About Apache Oozie
Apache Oozie is a Java Web application used to schedule Apache Hadoop jobs. Oozie combines multiple jobs sequentially into one logical unit of work. It is integrated with the Hadoop stack, with YARN as its architectural center, and supports Hadoop jobs for Apache MapReduce, Apache Pig, Apache Hive, and Apache Sqoop. Oozie can also schedule jobs specific to a system, like Java programs or shell scripts.
Apache Oozie is a tool for Hadoop operations that allows cluster administrators to build complex data transformations out of multiple component tasks. This provides greater control over jobs and also makes it easier to repeat those jobs at predetermined intervals. At its core, Oozie helps administrators derive more value from Hadoop.
There are two basic types of Oozie jobs:
- Oozie Workflow jobs are Directed Acyclical Graphs (DAGs), specifying a sequence of actions to execute. The Workflow job has to wait.
- Oozie Coordinator jobs are recurrent Oozie Workflow jobs that are triggered by time and data availability.
Oozie Bundle provides a way to package multiple coordinator and workflow jobs and to manage the lifecycle of those jobs.