Skip to content
/ tack Public

Terraform module for creating Kubernetes cluster running on Container Linux by CoreOS in an AWS VPC

License

Notifications You must be signed in to change notification settings

kz8s/tack

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

terraform-aws-coreos-kubernetes

Circle CI

Opinionated Terraform module for creating a Highly Available Kubernetes cluster running on Container Linux by CoreOS (any channel) in an AWS Virtual Private Cloud VPC. With prerequisites installed make all will simply spin up a default cluster; and, since it is based on Terraform, customization is much easier than CloudFormation.

The default configuration includes Kubernetes add-ons: DNS, Dashboard and UI.

tl;dr

# prereqs
$ brew update && brew install awscli cfssl jq kubernetes-cli terraform

# build artifacts and deploy cluster
$ make all

# nodes
$ kubectl get nodes

# addons
$ kubectl get pods --namespace=kube-system

# verify dns - run after addons have fully loaded
$ kubectl exec busybox -- nslookup kubernetes

# open dashboard
$ make dashboard

# obliterate the cluster and all artifacts
$ make clean

Features

  • Cluster-internal Certificate Authority infrastructure for TLS certificate generation
  • etcd3

AWS

  • EC2 Key Pair creation
  • AWS VPC Public and Private subnets
  • IAM protected S3 bucket for asset distribution
  • Bastion Host
  • Multi-AZ Auto-Scaling Worker Nodes
  • NAT Gateway

Container Linux by CoreOS (1353.7.0, 1381.1.0, 1381.0.0)

  • etcd3 DNS Discovery Bootstrap
  • kubelet runs under rkt (using Container Linux by CoreOS recommended Kubelet Wrapper Script)

Kubernetes (v1.6.2)

Terraform (v0.9.3)

  • Container Linux by CoreOS AMI sourcing
  • Terraform Pattern Modules

Prerequisites

Quick install prerequisites on Mac OS X with Homebrew:

$ brew update && brew install awscli cfssl jq kubernetes-cli terraform

Tested with prerequisite versions:

$ aws --version
aws-cli/1.11.80 Python/2.7.10 Darwin/16.5.0 botocore/1.5.43

$ cfssl version
Version: 1.2.0
Revision: dev
Runtime: go1.7.1

$ jq --version
jq-1.5

$ kubectl version --client
Client Version: version.Info{Major:"1", Minor:"6", GitVersion:"v1.6.2", GitCommit:"477efc3cbe6a7effca06bd1452fa356e2201e1ee", GitTreeState:"clean", BuildDate:"2017-04-19T22:51:55Z", GoVersion:"go1.8.1", Compiler:"gc", Platform:"darwin/amd64"}

$ terraform --version
Terraform v0.9.3

Launch Cluster

make all will create:

  • AWS Key Pair (PEM file)
  • AWS VPC with private and public subnets
  • Route 53 internal zone for VPC
  • Bastion host
  • Certificate Authority server
  • etcd3 cluster bootstrapped from Route 53
  • High Availability Kubernetes configuration (masters running on etcd nodes)
  • Autoscaling worker node group across subnets in selected region
  • kube-system namespace and addons: DNS, UI, Dashboard
make all

To open dashboard:

make dashboard

To display instance information:

make instances

To display status:

make status

To destroy, remove and generally undo everything:

make clean

make all and make clean should be idempotent - should an error occur simply try running the command again and things should recover from that point.

How Tack works

Tack Phases

Tack works in three phases:

  1. Pre-Terraform
  2. Terraform
  3. Post-Terraform

Pre-Terraform

The purpose of this phase is to prep the environment for Terraform execution. Some tasks are hard or messy to do in Terraform - a little prep work can go a long way here. Determining the Container Linux by CoreOS AMI for a given region, channel and VM Type for instance is easy enough to do with a simple shell script.

Terraform

Terraform does the heavy lifting of resource creation and sequencing. Tack uses local modules to partition the work in a logical way. Although it is of course possible to do all of the Terraform work in a single .tf file or collection of .tf files, it becomes unwieldy quickly and impossible to debug. Breaking the work into local modules makes the flow much easier to follow and provides the basis for composing variable solutions down the track - for example converting the worker Auto Scaling Group to use spot instances.

Post-Terraform

Once the infrastructure has been configured and instantiated it will take some type for it to settle. Waiting for the 'master' ELB to become healthy is an example of this.

Components

Like many great tools, tack has started out as a collection of scripts, makefiles and other tools. As tack matures and patterns crystalize it will evolve to a Terraform plugin and perhaps a Go-based cli tool for 'init-ing' new cluster configurations. The tooling will compose Terraform modules into a solution based on user preferences - think npm init or better yet yeoman.

TLS Certificates

curl --cacert /etc/kubernetes/ssl/ca.pem --cert /etc/kubernetes/ssl/k8s-etcd.pem --key /etc/kubernetes/ssl/k8s-etcd-key.pem https://etcd.test.kz8s:2379/health
openssl x509 -text -noout -in /etc/kubernetes/ssl/ca.pem
openssl x509 -text -noout -in /etc/kubernetes/ssl/k8s-etcd.pem

ElasticSearch and Kibana

To access Elasticseach and Kibana first start kubectl proxy.

$ kubectl proxy
Starting to serve on localhost:8001

FAQs

Advanced Features and Configuration

Using an Existing VPC

If you have an existing VPC you'd like to deploy a cluster into, there is an option for this with tack.

Constraints

  • You will need to allocate 3 static IPs for the etcd servers - Choose 3 unused IPs that fall within the IP range of the first subnet specified in subnet-ids-private under vpc-existing.tfvars
  • Your VPC has to have private and public subnets (for now)
  • You will need to know the following information:
    • VPC CIDR Range (e.g. 192.168.0.0/16)
    • VPC Id (e.g. vpc-abc123)
    • VPC Internet Gateway Id (e.g. igw-123bbd)
    • VPC Public Subnet Ids (e.g. subnet-xyz123,subnet-zyx123)
    • VPC Private Subnet Ids (e.g. subnet-lmn123,subnet-opq123)

Enabling Existing VPC Support

  • Edit vpc-existing.tfvars
    • Uncomment the blocks with variables and fill in the missing information
  • Edit modules_override.tf - This uses the overrides feature from Terraform
    • Uncomment the vpc module, this will override the reference to the regular VPC module and instead use the stub vpc-existing module which just pulls in the variables from vpc-existing.tfvars
  • Edit the Makefile as necessary for CIDR_PODS, CIDR_SERVICE_CLUSTER, etc to match what you need (e.g. avoid collisions with existing IP ranges in your VPC or extended infrastructure)

Testing Existing VPC Support from Scratch

In order to test existing VPC support, we need to generate a VPC and then try the overrides with it. After that we can clean it all up. These instructions are meant for someone wanting to ensure that the tack existing VPC code works properly.

  • Run make all to generate a VPC with Terraform
  • Edit terraform.tfstate
    • Search for the VPC block and cut it out and save it somewhere. Look for "path": ["root","vpc"]
  • Run make clean to remove everything but the VPC and associated networking (we preserved it in the previous step)
  • Edit as per instructions above
  • Run make all to test out using an existing VPC
  • Cleaning up:
    • Re-insert the VPC block into terraform.tfstate
    • Run make clean to clean up everything

Additional Configuration

Inspiration

Other Terraform Projects

References