Ansible + Hadoop. Deploying Hortonworks Data Platform with Ansible. Michael Young Solutions Engineer February 23, 2017

About Me
- Michael Young, Solutions Engineer @ Hortonworks
- 16+ years of experience (almost all in Public Sector)
- Information Retrieval (Solr, Elasticsearch)
- Hadoop (HDP, MapR, Cloudera)
- DevOps (Ansible, Puppet, Docker, Vagrant)
- Development (Python, Perl, Node.js)
- @jaraxal, myoung@hortonworks.com
Hortonworks Inc. 2011-2017. All Rights Reserved

About Hortonworks
- Only 100% Open Source Hadoop Company
- Over 1,000 customers
- Over 2,100 partners
- Hortonworks Data Platform (HDP)
- Hortonworks Data Flow (HDF)
- Hortonworks Community Connection (HCC)

Hortonworks Data Platform 2.5

Ambari: Management and Monitoring

HDP Provisioning Workflow
- Prepare Infrastructure
  - Package Repos
  - DNS
  - NTP
- Prepare OS
  - Disable Transparent Huge Pages
  - Disable Swapping
  - Jumbo Frames
  - Format and mount disk drives
- Bootstrap Ambari
  - Install Ambari Server
  - Install Ambari Agents
- Install HDP
  - Interactively via Ambari's web-based UI
  - Automatically via Ambari Blueprints
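
The "Prepare OS" and NTP steps in this workflow could be expressed as Ansible tasks roughly as follows. This is a minimal sketch, not the playbook used in the talk: it assumes RHEL/CentOS 7 hosts, the group name hadoop_nodes is hypothetical, and the swappiness value is an assumption rather than something the slides state.

```yaml
# Sketch only: OS preparation for cluster nodes, assuming RHEL/CentOS 7.
# The inventory group "hadoop_nodes" is a hypothetical name.
- hosts: hadoop_nodes
  become: true
  tasks:
    - name: Install the NTP daemon
      yum:
        name: ntp
        state: present

    - name: Start ntpd and enable it at boot
      service:
        name: ntpd
        state: started
        enabled: yes

    - name: Disable Transparent Huge Pages on the running kernel
      # Not persistent across reboots. RHEL/CentOS 6 exposes this setting
      # under a redhat_transparent_hugepage directory instead.
      shell: echo never > /sys/kernel/mm/transparent_hugepage/enabled

    - name: Discourage swapping (the value 1 is an assumption)
      sysctl:
        name: vm.swappiness
        value: "1"
        state: present
        reload: yes
```

Jumbo frames and disk formatting are left out because they depend on the network and storage layout of the target environment.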

Ambari Blueprints for HDP Deployments
- https://cwiki.apache.org/confluence/display/ambari/blueprints
- Declarative definition of a cluster written in JSON
- Preserves best practice configuration across deployments
- Requires OS configuration prerequisites already in place
  - Ambari will perform checks and warn you
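
To make the declarative shape of a blueprint concrete, the sketch below holds one as an Ansible variable and registers it through Ambari's REST API with the uri module. Everything under vars is a placeholder (URL, credentials, blueprint name), and the topology is illustrative only: it has not been validated against Ambari, and a real blueprint lists every component and any configuration overrides. Ambari then receives the structure as JSON.

```yaml
# Sketch only: all names and values under "vars" are placeholders.
- hosts: localhost
  connection: local
  gather_facts: false
  vars:
    ambari_url: "http://ambari.example.com:8080"
    ambari_user: admin
    ambari_password: admin
    blueprint_name: demo-blueprint
    # The blueprint itself: stack version plus host groups and the
    # components each group should run. Deliberately incomplete.
    blueprint:
      Blueprints:
        stack_name: HDP
        stack_version: "2.5"
      host_groups:
        - name: master
          cardinality: "1"
          components:
            - name: NAMENODE
            - name: SECONDARY_NAMENODE
            - name: ZOOKEEPER_SERVER
        - name: slave
          cardinality: "3"
          components:
            - name: DATANODE
  tasks:
    - name: Register the blueprint with Ambari
      uri:
        url: "{{ ambari_url }}/api/v1/blueprints/{{ blueprint_name }}"
        method: POST
        user: "{{ ambari_user }}"
        password: "{{ ambari_password }}"
        force_basic_auth: yes
        headers:
          X-Requested-By: ansible   # Ambari rejects modifying calls without this header
        body: "{{ blueprint }}"
        body_format: json
        status_code: 201
```

Registering a blueprint does not build anything by itself. A second request that maps real hosts to the host groups is what starts the installation.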

Automation! Why Ansible?

Ansible for HDP Deployments
- Playbooks
  - Bootstrap baseline configuration
  - Install DBs
  - Install HDP software
- Roles
  - Master Servers
  - Slave Servers
  - Ambari Server
  - Ambari Agent
- Tasks
  - Install prerequisite packages
  - Install Ambari Server packages
  - Install Ambari Agent packages
  - Disable SELinux
  - Turn on NTP
- Templates
  - /etc/hosts
  - Ambari Blueprints
- Files
  - Disable THP
  - Disable Swapping
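
One way these pieces could fit together is a top-level playbook that maps roles onto inventory groups. The sketch below is an assumption about layout, not the structure used in the talk: every group name and role name in it is hypothetical.

```yaml
# Sketch only: group names and role names are hypothetical.
- hosts: all
  become: true
  roles:
    - common            # prerequisite packages, SELinux, NTP, /etc/hosts, THP, swap

- hosts: ambari_server
  become: true
  roles:
    - ambari-server     # Ambari Server packages and the blueprint templates

- hosts: master_nodes:slave_nodes
  become: true
  roles:
    - ambari-agent      # Ambari Agent packages, pointed at the Ambari Server
```

Keeping the baseline work in one shared role means every node gets the same OS preparation before any Hadoop software is installed.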

Create 6-node Environment
- Using Amazon AWS
  - 6 x c4.4xlarge instances
- Simple Ansible solution
  - AWS provisioning using ec2 and ec2_group modules
  - Simple inventory
  - Simple playbook
  - Simple ansible.cfg

Simple Inventory
- All Ansible commands run locally
  - Uses AWS API
- Using Anaconda Python

Simple Playbook: hadoop-demo.yml
- 2 Tasks
  - Create Security Group
  - Create EC2 Instances

Task: Provision Security Group
- ec2_group module
  - Region
  - VPC
  - Rules
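
The slide's task is not reproduced in this transcript, so the following is a minimal sketch of what an ec2_group task with these three settings might look like. The region, VPC ID, group name, ports, and CIDR ranges are all placeholders, not values from the talk.

```yaml
# Sketch only: region, VPC ID, group name, and CIDR ranges are placeholders.
- hosts: localhost
  connection: local        # the module calls the AWS API from the control machine
  gather_facts: false
  vars:
    aws_region: us-east-1
    vpc_id: vpc-xxxxxxxx
  tasks:
    - name: Create the security group for the cluster
      ec2_group:
        name: hadoop-demo
        description: Security group for the Hadoop demo cluster
        region: "{{ aws_region }}"
        vpc_id: "{{ vpc_id }}"
        rules:
          # SSH from an administrative network
          - proto: tcp
            from_port: 22
            to_port: 22
            cidr_ip: 203.0.113.0/24
          # Ambari web UI
          - proto: tcp
            from_port: 8080
            to_port: 8080
            cidr_ip: 203.0.113.0/24
          # Unrestricted TCP between nodes inside the VPC
          - proto: tcp
            from_port: 0
            to_port: 65535
            cidr_ip: 10.0.0.0/16
```

AWS credentials are expected to come from the environment or a boto configuration file rather than from the playbook.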

Task: Provision Servers
- ec2 module
  - Region
  - Group
  - Instance type
  - AMI
  - Volumes
  - Counts
  - Tags
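
A task covering the settings listed on this slide might look like the sketch below. It uses the legacy ec2 module named in the talk, which I believe newer Ansible releases have replaced with ec2_instance. The key pair, AMI ID, subnet ID, root device name, and volume size are placeholders and not taken from the slides.

```yaml
# Sketch only: key pair, AMI, subnet, device name, and sizes are placeholders.
- hosts: localhost
  connection: local
  gather_facts: false
  vars:
    aws_region: us-east-1
  tasks:
    - name: Launch the cluster instances
      ec2:
        region: "{{ aws_region }}"
        key_name: my-keypair
        group: hadoop-demo               # security group created beforehand
        instance_type: c4.4xlarge
        image: ami-xxxxxxxx
        vpc_subnet_id: subnet-xxxxxxxx
        assign_public_ip: yes
        wait: yes
        # exact_count with count_tag keeps the task idempotent: rerunning it
        # tops the tagged set up to six instead of launching six more.
        exact_count: 6
        count_tag:
          Name: hadoop-demo
        instance_tags:
          Name: hadoop-demo
        volumes:
          - device_name: /dev/sda1
            volume_type: gp2
            volume_size: 100
            delete_on_termination: true
      register: ec2_result
```

The registered ec2_result holds the details of the launched instances, including their public IP addresses, which can then be copied into a static inventory.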

Run Playbook
$ ansible-playbook -i inventory/hosts playbooks/hadoop-demo.yml
- Takes ~35 seconds

DEMO

Ansible AWS Ad-Hoc Examples
- Dynamic Inventory
  - https://aws.amazon.com/blogs/apn/getting-started-with-ansible-and-dynamic-amazon-ec2-inventory-management/
  - https://raw.githubusercontent.com/ansible/ansible/devel/contrib/inventory/ec2.py
  - https://raw.githubusercontent.com/ansible/ansible/devel/contrib/inventory/ec2.ini
- Handy Python script allows you to interact with AWS instances

Ready to Create?
- Inventory
  - Dev
  - Test
  - Prod
- Playbook
  - Roles
  - Tasks
  - Templates
  - Files
  - Handlers
- Generally an iterative process
- Start small, move towards more complex
- Entire process could take a couple of days to a couple of weeks

Why re-invent the wheel?
- https://github.com/objectrocket/ansible-hadoop
  - ObjectRocket is a Rackspace company
- Enables deployment of Hadoop clusters using Ansible
- Supports Rackspace cloud and existing environments
- Ansible == 2.1.3.0 (2.2 is not supported at the moment)
- Expects RHEL/CentOS 6/7 or Ubuntu 14 hosts
- Simple: configure, then run two scripts

DEMO

Minimal Configuration Needed
- inventory/static
- playbooks/group_vars/master_nodes
- playbooks/group_vars/slave_nodes
- playbooks/group_vars/hortonworks
- ansible.cfg
- Optional: custom repos and blueprints

Modify inventory/static
- Add information for master, slave and edge nodes
- Use the public IP for ansible_host
- Default user for my AMI is centos. Set ansible_ssh_user appropriately.
- Using a key, so no password is specified
- Don't forget to comment out unused node types (edge-nodes)

Modify playbooks/group_vars/*_nodes
- Refer to template files for examples
- Most options are geared towards Rackspace cloud

Modify playbooks/group_vars/hortonworks
- Specify configuration details
  - Version of HDP and Ambari to install
  - Components to install
  - Admin and service passwords
  - Repo URL (I left this as-is)

Modify ansible.cfg
- Change library value to playbooks/library/site_facts
- Specify location of private_key_file

Run bootstrap_static.sh
- Performs the common bootstrap configurations
$ bash bootstrap_static.sh
- Takes ~8 minutes
- Consistent timing regardless of node count
  - Same tasks run on all servers in parallel (the Ansible approach)

DEMO

Run hortonworks_static.sh
- Performs the Hortonworks installation
$ bash hortonworks_static.sh
- Takes ~19 minutes (4-node m4.xlarge cluster)
- master01 had significantly more tasks to implement

Retrying Tasks is Normal
- The last task waits for the cluster to be built
- It is normal to see many failed checks with retry attempts
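
The retry behaviour described here is what Ansible's until/retries/delay keywords produce: each unsuccessful poll is reported as a failed attempt until the condition holds. The sketch below polls Ambari's request resource for completion. It is not the repository's actual task: the URL, credentials, and cluster name are placeholders, and request ID 1 assumes the cluster build is the first request issued on a fresh Ambari server.

```yaml
# Sketch only: URL, credentials, cluster name, and request ID are assumptions.
- hosts: localhost
  connection: local
  gather_facts: false
  vars:
    ambari_url: "http://ambari.example.com:8080"
    ambari_user: admin
    ambari_password: admin
    cluster_name: demo
  tasks:
    - name: Wait for Ambari to finish building the cluster
      uri:
        url: "{{ ambari_url }}/api/v1/clusters/{{ cluster_name }}/requests/1"
        user: "{{ ambari_user }}"
        password: "{{ ambari_password }}"
        force_basic_auth: yes
        headers:
          X-Requested-By: ansible
        return_content: yes
      register: build
      # Parse the body explicitly instead of relying on the response's
      # content type. Retries until the request reports COMPLETED.
      until: "build.status | default(0) == 200 and (build.content | from_json).Requests.request_status == 'COMPLETED'"
      retries: 60      # 60 attempts
      delay: 30        # 30 seconds apart, so roughly 30 minutes at most
```

A build that ends in FAILED is not detected early in this sketch. The task simply keeps polling until the retries run out.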

Monitor Ambari During Install
- Monitor Ambari during cluster installation

One Node: ~1,000 seconds
- One node took ~1,000 seconds to complete install and startup
- This node is the master node, so it has more components
- Room to decrease deployment time by adding more master nodes

Five Node Cluster
- 5 x m4.xlarge
  - 2 master and 3 slave nodes
- Took ~15 minutes
  - ~3 minutes faster than 4-node cluster
- More even distribution of components on master servers

Six Node Cluster
- 6 x m4.xlarge
  - 3 master and 3 slave nodes
- Took ~15 minutes
  - No apparent improvement in deployment times over 5-node cluster

Comparing Instance Sizes - Six Node Cluster
- m4.xlarge vs c4.4xlarge
- Same cluster configuration
  - 3 master and 3 slave nodes
- Took ~12 minutes
  - ~3 minutes faster than m4.xlarge cluster

Number & Size of Nodes
- Factoring the number and size of nodes to decrease deployment time is interesting, but not generally important
- Size your cluster based on data size and workload
  - More Data: more local storage per slave node, more slave nodes
  - More Queries: more memory and CPU per slave node, more slave nodes
  - High Availability: use at least 3 master nodes and at least 3 slave nodes
- Minimum recommended cluster size for production is ~12 nodes

Summary
- Easily created an AWS environment using a simple Ansible playbook
  - Takes ~1-2 minutes, includes modifying playbook
- Easily deployed 6-node HDP cluster
  - Ran playbook from an AWS node with Ansible
  - Modify a couple of configuration files
  - Run 2 commands and have an HDP cluster in < 20 minutes
- Demonstrated how cluster size and instance type affected deployment times

Questions?