Kubernetes Operations (kops) currently supports 4 networking modes:
- `kubenet` - Kubernetes native networking via a CNI plugin. This is the default.
- `cni` - Container Network Interface (CNI) style networking, often installed via a Daemonset.
- `classic` - Kubernetes native networking, done in-process.
- `external` - networking is done via a Daemonset. This is used in some custom implementations.
Kubernetes Operations (kops) uses `kubenet` networking by default. This sets up networking on AWS using VPC
networking, where the master allocates a /24 CIDR to each Node, drawing from the Node network.
Using `kubenet` mode, routes for each node are then configured in the AWS VPC routing tables.
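For illustration, here is a minimal sketch of how those per-node routes could be inspected with the AWS CLI; the `KubernetesCluster` tag filter and the cluster name are assumptions and may differ in your account:

# List the /24 routes kubenet has written into the cluster's VPC route table
$ aws ec2 describe-route-tables \
    --filters "Name=tag:KubernetesCluster,Values=myclustername.mydns.io" \
    --query "RouteTables[].Routes[].[DestinationCidrBlock,InstanceId]" \
    --output table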
One important limitation when using `kubenet` networking is that an AWS routing table cannot have more than
50 entries, which sets a limit of 50 nodes per cluster. AWS support will sometimes raise the limit to 100,
but their documentation notes that routing tables with more than 50 entries may take a performance hit.
Because Kubernetes modifies the AWS routing table, this means that realistically Kubernetes needs to own the
routing table, and thus it requires its own subnet. It is theoretically possible to share a routing table
with other infrastructure (but not a second cluster!), but this is not really recommended. Certain
`cni` networking solutions claim to address these problems.
Users running `--topology private` will not be able to choose `kubenet` networking because `kubenet`
requires a single routing table. These advanced users are usually running in multiple availability zones,
and since NAT gateways are single-AZ, multiple route tables are needed to use each NAT gateway.
Container Network Interface provides a specification and libraries for writing plugins to configure network interfaces in Linux containers. Kubernetes has built-in support for CNI networking components.
Several different CNI providers are currently built into kops:
- Calico
- Canal (Flannel + Calico)
- flannel - use `--networking flannel-vxlan` (recommended) or `--networking flannel-udp` (legacy). `--networking flannel` now selects `flannel-vxlan`.
- kopeio-vxlan
- kube-router
- romana
- weave
- amazon-vpc-routed-eni
The manifests for the providers are included with kops, and you simply use `--networking provider-name`.
Replace the provider name with one of the names listed above in your `kops create cluster` command. For instance,
to install `kopeio-vxlan`, execute the following:
$ kops create cluster --networking kopeio-vxlan
This project has no bias toward any particular CNI provider that you run; our concern is providing the correct setup to run CNI providers.
Both `kubenet` and `classic` networking options are completely baked into kops. Since
CNI networking providers are not part of the Kubernetes project, we do not maintain
their installation processes. With that in mind, we do not support problems with
different CNI providers, but we do support configuring Kubernetes to run CNI providers.
You are able to specify your networking type via a command line switch or in your yaml file.
The `--networking` option accepts the four different values defined above: `kubenet`, `cni`,
`classic`, and `external`. If `--networking` is left undefined, `kubenet` is installed.
The following command sets up a cluster, in HA mode, that is ready for a CNI installation.
$ export ZONES=mylistofzones
$ kops create cluster \
--zones $ZONES \
--master-zones $ZONES \
--master-size m4.large \
--node-size m4.large \
--networking cni \
--yes \
--name myclustername.mydns.io
Once the cluster is stable, which you can check with a `kubectl cluster-info` command, the next
step is to install CNI networking. Most of the CNI network providers are
moving to installing their components via a Daemonset. For instance, weave will
install with the following command:
Daemonset installation for K8s 1.6.x or above.
$ kubectl create -f https://git.io/weave-kube-1.6
Daemonset installation for K8s 1.4.x or 1.5.x.
$ kubectl create -f https://git.io/weave-kube
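Once the daemonset is applied, a quick sanity check is to confirm that the weave pods come up in `kube-system`; the `name=weave-net` label below is an assumption based on the stock weave manifest and may differ between versions:

# Each node (and master) should show one Running weave pod
$ kubectl --namespace kube-system get pods -l name=weave-net -o wide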
The Weave MTU is configurable by editing the cluster and setting the `mtu` option in the weave configuration.
AWS VPCs support jumbo frames, so on cluster creation kops sets the weave MTU to 8912 bytes (9001 minus overhead).
spec:
  networking:
    weave:
      mtu: 8912
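A rough sketch of applying such a change to an existing cluster, using the standard kops workflow (the cluster name is illustrative):

$ kops edit cluster myclustername.mydns.io   # set spec.networking.weave.mtu
$ kops update cluster --yes                  # apply the configuration change
$ kops rolling-update cluster --yes          # roll instances so weave picks up the new MTU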
The following command sets up a cluster, in HA mode, with Calico as the CNI and Network Policy provider.
$ export ZONES=mylistofzones
$ kops create cluster \
--zones $ZONES \
--master-zones $ZONES \
--master-size m4.large \
--node-size m4.large \
--networking calico \
--yes \
--name myclustername.mydns.io
The above will deploy a daemonset installation which requires K8s 1.4.x or above.
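To confirm that Calico came up, you can check the daemonset pods in `kube-system`; the `k8s-app=calico-node` label is an assumption based on the bundled manifest and may vary between versions:

# One calico-node pod per instance should reach Running
$ kubectl --namespace kube-system get pods -l k8s-app=calico-node -o wide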
Calico [since 2.1] supports a new option for IP-in-IP mode where traffic is only encapsulated when it’s destined to subnets with intermediate infrastructure lacking Calico route awareness – for example, across heterogeneous public clouds or on AWS where traffic is crossing availability zones/ regions.
With this mode, IP-in-IP encapsulation is only performed selectively. This provides better performance in AWS multi-AZ deployments, and in general when deploying on networks where pools of nodes with L2 connectivity are connected via a router.
Reference: Calico 2.1 Release Notes
Note that with Calico, by default, routes between nodes within a subnet are distributed using a full node-to-node BGP mesh. Each node automatically sets up a BGP peering with every other node within the same L2 network. This full node-to-node mesh per L2 network has its scaling challenges for larger deployments. BGP route reflectors can be used as a replacement for a full mesh, and are useful for scaling up a cluster. The setup of BGP route reflectors is currently out of the scope of kops.
Read more here: BGP route reflectors
To enable this mode in a cluster, with Calico as the CNI and Network Policy provider, you must edit the cluster after the previous `kops create ...` command.
`kops edit cluster` will show you a block like this:
networking:
  calico: {}
You will need to change that block, and add an additional field, to look like this:
networking:
  calico:
    crossSubnet: true
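After saving that change, a sketch of the usual flow to apply it to a running cluster (flags as used elsewhere in this document):

$ kops update cluster --yes            # push the crossSubnet change
$ kops rolling-update cluster --yes    # roll instances so Calico switches mode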
This `crossSubnet` field can also be defined within a cluster specification file, and the entire cluster can be created by running:
kops create -f k8s-cluster.example.com.yaml
In the case of AWS, EC2 instances have source/destination checks enabled by default.
When you enable cross-subnet mode in kops, an addon controller (k8s-ec2-srcdst)
will be deployed as a Pod (which will be scheduled on one of the masters) to facilitate the disabling of said source/destination address checks.
Only the masters have the IAM policy (`ec2:*`) to allow k8s-ec2-srcdst to execute `ec2:ModifyInstanceAttribute`.
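If you want to verify the result by hand, the attribute can be read back with the AWS CLI; the instance ID below is a placeholder:

# sourceDestCheck should report "Value": false once cross-subnet mode is active
$ aws ec2 describe-instance-attribute --instance-id <instance-id> --attribute sourceDestCheck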
For Calico-specific documentation please visit the Calico Docs.
For help with Calico or to report any issues:
Calico currently uses etcd as a backend for storing information about workloads and policies. Calico does not interfere with normal etcd operations and does not require special handling when upgrading etcd. For more information please visit the etcd Docs.
Canal is a project that combines Flannel and Calico for CNI networking. It uses Flannel for networking pod traffic between hosts via VXLAN and Calico for network policy enforcement and pod-to-pod traffic.
The following command sets up a cluster, in HA mode, with Canal as the CNI and networking policy provider:
$ export ZONES=mylistofzones
$ kops create cluster \
--zones $ZONES \
--master-zones $ZONES \
--master-size m4.large \
--node-size m4.large \
--networking canal \
--yes \
--name myclustername.mydns.io
The above will deploy a daemonset installation which requires K8s 1.4.x or above.
For problems with deploying Canal please post an issue to Github:
For support with Calico Policies you can reach out on Slack or Github:
For support with Flannel you can submit an issue on Github:
Kube-router is a project that provides one cohesive solution: CNI networking for pods, an IPVS-based network service proxy, and iptables-based network policy enforcement.
The following command sets up a cluster with Kube-router as the CNI, service proxy, and networking policy provider:
$ kops create cluster \
--node-count 2 \
--zones us-west-2a \
--master-zones us-west-2a \
--dns-zone aws.cloudnativelabs.net \
--node-size t2.medium \
--master-size t2.medium \
--networking kube-router \
--yes \
--name myclustername.mydns.io
Currently kube-router supports Kubernetes 1.6 and above. Please note that kube-router also provides the service proxy, so kube-proxy will not be deployed into the cluster.
No additional configuration is required from the user. Kube-router automatically disables the source/destination check on all AWS EC2 instances. For traffic within a subnet there is no overlay or tunneling used. For cross-subnet pod traffic, IP-in-IP tunneling is used implicitly and no configuration is required.
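A quick way to sanity-check this behaviour after the cluster comes up (the `kube-router` daemonset name is an assumption and may differ by version):

$ kubectl --namespace kube-system get daemonset kube-router      # should exist and be fully scheduled
$ kubectl --namespace kube-system get pods | grep kube-proxy     # should return nothing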
The following command sets up a cluster with Romana as the CNI.
$ export ZONES=mylistofzones
$ kops create cluster \
--zones $ZONES \
--master-zones $ZONES \
--master-size m4.large \
--node-size m4.large \
--networking romana \
--yes \
--name myclustername.mydns.io
Currently Romana supports Kubernetes 1.6 and above.
For problems with deploying Romana please post an issue to Github:
You can also contact the Romana team on Slack
- Romana Slack (invite required - email [email protected])
Romana uses the cluster's etcd as a backend for storing information about routes, hosts, host-groups and IP allocations. This does not affect normal etcd operations or require special treatment when upgrading etcd. The etcd port (4001) is opened between masters and nodes when using this networking option.
The Amazon VPC CNI plugin requires no additional configuration from the user.
Important: the pods use the VPC CIDR, i.e. there is no isolation between the master, the nodes, and the internal k8s network.
Note: The following permissions are added to all nodes by kops to run the provider:
{
  "Sid": "kopsK8sEC2NodeAmazonVPCPerms",
  "Effect": "Allow",
  "Action": [
    "ec2:CreateNetworkInterface",
    "ec2:AttachNetworkInterface",
    "ec2:DeleteNetworkInterface",
    "ec2:DetachNetworkInterface",
    "ec2:DescribeNetworkInterfaces",
    "ec2:DescribeInstances",
    "ec2:ModifyNetworkInterfaceAttribute",
    "ec2:AssignPrivateIpAddresses",
    "tag:TagResources"
  ],
  "Resource": [
    "*"
  ]
}
In case of any issues, the directory `/var/log/aws-routed-eni` contains the log files of the CNI plugin. This directory is present on all the nodes in the cluster.
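For example, a quick look on a node (the exact log file names are an assumption and depend on the plugin version):

$ ls /var/log/aws-routed-eni/
$ tail -n 50 /var/log/aws-routed-eni/ipamd.log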
You will notice that `kube-dns` fails to start properly until you deploy your CNI provider.
Pod networking and IP addresses are provided by the CNI provider.
Here are some items that will confirm a good CNI install (see the example commands after this list):
- `kubelet` is running with the `--network-plugin=cni` option.
- The CNI provider started without errors.
- The `kube-dns` daemonset starts.
- Logging on a node will display messages on pod create and delete.
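Minimal sketches of those checks; the label selectors and log locations are assumptions and vary by provider and OS image:

# kubelet flag (run on a node); expect --network-plugin=cni
$ ps -ef | grep kubelet | grep -o -- '--network-plugin=\S*'
# kube-dns pods should reach Running once the CNI provider is up
$ kubectl --namespace kube-system get pods -l k8s-app=kube-dns
# watch for CNI add/del messages around pod create and delete
$ journalctl -u kubelet --since "10 minutes ago" | grep -i cni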
The sig-networking and sig-cluster-lifecycle channels on K8s slack are always good starting places for Kubernetes specific CNI challenges.
Run `kops edit cluster` and you will see a block like:

networking:
  classic: {}

That means you are running with `classic` networking. The `{}` means there are
no configuration options, beyond the setting `classic`.
To switch to kubenet, change the word `classic` to `kubenet`.

networking:
  kubenet: {}
Now follow the normal update / rolling-update procedure:
$ kops update cluster # to preview
$ kops update cluster --yes # to apply
$ kops rolling-update cluster # to preview the rolling-update
$ kops rolling-update cluster --yes # to roll all your instances
Your cluster should be ready in a few minutes. It is not trivial to see that this
has worked; the easiest way seems to be to SSH to the master and verify
that kubelet has been run with `--network-plugin=kubenet`.
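A sketch of that check; the SSH user and address are placeholders for your master's login and public IP:

# expected output: --network-plugin=kubenet
$ ssh admin@<master-public-ip> 'ps -ef | grep kubelet | grep -o -- "--network-plugin=\S*"'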
Switching from `kubenet` to a CNI network provider has not been tested at this time.