airflow

Airflow is a system to programmatically author, schedule and monitor data pipelines.

Version information

  • 0.1.2 (latest), released Jan 25th 2016
  • 0.1.1
  • 0.1.0 (deleted)

This version is compatible with:
  • Puppet >=3.8.2 < 4.0.0

Start using this module

Add this module to your Puppetfile:

mod 'similarweb-airflow', '0.1.2'

Add this module to your Bolt project:

bolt module add similarweb-airflow

Manually install this module globally with Puppet module tool:

puppet module install similarweb-airflow --version 0.1.2

Direct download is not typically how you would use a Puppet module to manage your infrastructure, but you may want to download the module in order to inspect the code.

Documentation

similarweb/airflow — version 0.1.2 Jan 25th 2016

Airflow

Table of Contents

  1. Overview
  2. Module Description - What the module does and why it is useful
  3. Setup - The basics of getting started with airflow
  4. Usage - Configuration options and additional functionality
  5. Reference - An under-the-hood peek at what the module is doing and how
  6. Contributing - Guide for contributing to the module

Overview

This module manages airflow by Airbnb.

Module Description

The airflow module sets up and configures airflow.

This module has been tested against airflow versions 1.5.2 and 1.6.2.

Setup

Limitations

This module does not initialize the airflow database schema - you can do so by executing:

airflow initdb

More info here.
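
If you would rather have Puppet run the schema initialization once, an exec resource along the following lines may work. This is a minimal sketch, not part of the module: the search path and the marker file `.initdb_done` are assumptions chosen for illustration, and it assumes the airflow configuration lives under the home folder. The command runs as whatever user Puppet runs as, so adjust ownership to suit your setup.

```puppet
# Sketch only: run `airflow initdb` once after the module has installed airflow.
# The search path and the marker file below are hypothetical, not module defaults.
$airflow_home = '/usr/local/airflow'

class { 'airflow':
  home_folder => $airflow_home,
}

exec { 'airflow-initdb':
  # The marker file is touched only if initdb succeeds, so a failed run is retried.
  command     => "airflow initdb && touch ${airflow_home}/.initdb_done",
  environment => ["AIRFLOW_HOME=${airflow_home}"],
  path        => ['/usr/local/bin', '/usr/bin', '/bin'],
  provider    => shell,
  creates     => "${airflow_home}/.initdb_done",
  require     => Class['airflow'],
}
```

The `creates` attribute keeps the exec from running again on later Puppet runs; removing the marker file forces a re-run.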

The module has been tested on CentOS 7.

The module manages the following:

  • Airflow package.
  • Airflow configuration file.
  • Airflow services.
  • Airflow templates.

Important Note

Please refer to the airflow installation documentation before using this module.

Setup Requirements

The airflow module depends on the following Puppet modules:

  • puppetlabs-stdlib >= 1.0.0
  • stankevich-python >= 1.9.8
  • camptocamp-systemd >= 0.2.2

Beginning with airflow

Install this module via any of the approaches listed under "Start using this module" above: a Puppetfile entry, Bolt, or the Puppet module tool.

Usage

Main class

Install airflow 1.6.2 to /usr/local/airflow:

class { 'airflow':
  version     => '1.6.2',
  home_folder => '/usr/local/airflow',
}

Install airflow, the scheduler, and the Celery-based worker:

class { 'airflow': } ->
class { 'airflow::service::scheduler': }
class { 'airflow::service::worker': }

Hiera Support

  • Example: Defining LDAP authentication and Mesos settings in Hiera.
airflow::ldap_settings:
  ldap_url: ldap://<your.ldap.server>:<port>
  user_filter: objectClass=*
  user_name_attr: uid
  bind_user: cn=Manager,dc=example,dc=com
  bind_password: insecure
  basedn: dc=example,dc=com


airflow::mesos_settings:
  master: localhost:5050
  framework_name: Airflow
  task_cpu: 1
  task_memory: 256
  checkpoint: false
  failover_timeout: 604800
  authenticate: false
  default_principal: admin
  default_secret: admin
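
The Hiera keys above rely on Puppet's automatic parameter lookup, which suggests the main class accepts `ldap_settings` and `mesos_settings` parameters. Assuming that is the case, the same settings could be passed directly in a manifest. This is a sketch under that assumption; check the class signature before relying on it.

```puppet
# Sketch only: assumes class airflow takes ldap_settings and mesos_settings
# hashes, as the Hiera key names (airflow::ldap_settings, ...) imply.
class { 'airflow':
  ldap_settings  => {
    'ldap_url'       => 'ldap://<your.ldap.server>:<port>',
    'user_filter'    => 'objectClass=*',
    'user_name_attr' => 'uid',
    'bind_user'      => 'cn=Manager,dc=example,dc=com',
    'bind_password'  => 'insecure',
    'basedn'         => 'dc=example,dc=com',
  },
  mesos_settings => {
    'master'            => 'localhost:5050',
    'framework_name'    => 'Airflow',
    'task_cpu'          => 1,
    'task_memory'       => 256,
    'checkpoint'        => false,
    'failover_timeout'  => 604800,
    'authenticate'      => false,
    'default_principal' => 'admin',
    'default_secret'    => 'admin',
  },
}
```

Keeping secrets such as `bind_password` in Hiera rather than in manifests is usually preferable, since the Hiera data can be encrypted or access-controlled separately.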

Reference

Classes

Public classes

  • airflow - Installs and configures airflow.
  • airflow::service::worker - Handles airflow's worker service.
  • airflow::service::scheduler - Handles airflow's scheduler service.
  • airflow::service::webserver - Handles airflow's webserver service.
  • airflow::service::flower - Handles airflow's flower service.
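
The webserver and flower classes can presumably be declared the same way as the scheduler and worker. The sketch below assumes they have no required parameters, which this README does not confirm.

```puppet
# Sketch only: assumes the webserver and flower service classes need no
# parameters, mirroring the scheduler and worker usage.
class { 'airflow': }

# Each service is ordered after the main class so the package and
# configuration file are in place before the service is managed.
class { 'airflow::service::webserver':
  require => Class['airflow'],
}

class { 'airflow::service::flower':
  require => Class['airflow'],
}
```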

Private classes

  • airflow::install - Installs airflow python package.
  • airflow::config - Configures airflow.

Contributing

  1. Fork the repository on GitHub.
  2. Create a named feature branch (like add_component_x).
  3. Commit your changes.
  4. Submit a pull request using GitHub.